Header logo is


2015


no image
Fabrication and X-ray testing of true kinoform lenses with high efficiencies

Keskinbora, K., Sanli, U., Grévent, C., Schütz, G.

{Proceedings of SPIE}, 9592, SPIE, Bellingham, Washington, 2015 (article)

mms

DOI [BibTex]

2015


DOI [BibTex]


no image
Signaling equilibria in sensorimotor interactions

Leibfried, F, Grau-Moya, J, Braun, DA

Cognition, 141, pages: 73-86, August 2015 (article)

Abstract
Although complex forms of communication like human language are often assumed to have evolved out of more simple forms of sensorimotor signaling, less attention has been devoted to investigate the latter. Here, we study communicative sensorimotor behavior of humans in a two-person joint motor task where each player controls one dimension of a planar motion. We designed this joint task as a game where one player (the sender) possesses private information about a hidden target the other player (the receiver) wants to know about, and where the sender's actions are costly signals that influence the receiver's control strategy. We developed a game-theoretic model within the framework of signaling games to investigate whether subjects' behavior could be adequately described by the corresponding equilibrium solutions. The model predicts both separating and pooling equilibria, in which signaling does and does not occur respectively. We observed both kinds of equilibria in subjects and found that, in line with model predictions, the propensity of signaling decreased with increasing signaling costs and decreasing uncertainty on the part of the receiver. Our study demonstrates that signaling games, which have previously been applied to economic decision-making and animal communication, provide a framework for human signaling behavior arising during sensorimotor interactions in continuous and dynamic environments.

ei

DOI [BibTex]

DOI [BibTex]


no image
Focused ion beam micromachining enables novel optics for X-ray microscopy

Keskinbora, K., Sanli, U., Grévent, C., Hirscher, M., Schütz, G.

{Microscopy and Microanalysis}, 21(Suppl 3):1983-1984, Springer-Verlag New York, New York, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Selectable nanopattern arrays for nanolithographic imprint and etch-mask applications

Jeong, H., Mark, A. G., Lee, T., Son, K., Chen, W., Alarcón-Correa, M., Kim, I., Schütz, G., Fischer, P.

{Advanced Science}, 2(2), Wiley-VCH, Weinheim, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Grain boundaries as a source of ferromagnetism and increased solubility of Ni in nanograined ZnO

Straumal, B. B., Mazilkin, A. A., Protasova, S. G., Stakhanova, S. V., Straumal, P. B., Bulatov, M. F., Schütz, G., Tietze, T., Goering, E., Baretzky, B.

{Reviews on Advanced Materials Science}, 41, pages: 61-71, 2015 (article)

mms

[BibTex]

[BibTex]


no image
Gyrational modes of benzenelike magnetic vortex molecules

Adolff, C. F., Hänze, M., Pues, M., Weigand, M., Meier, G.

{Physical Review B}, 92(2), American Physical Society, Woodbury, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
"Job-Sharing" storage of hydrogen in Ru/Li2O nanocomposites

Fu, L., Tang, K., Oh, H., Kandavel, M., Bräuniger, T., Vinod Chandran, C., Menzel, A., Hirscher, M., Samuelis, D., Maier, J.

{Nano Letters}, 15(6):4170-4175, American Chemical Society, Washington, DC, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Overview of the multilayer-Fresnel zone plate and the kinoform lens development at MPI for Intelligent Systems

Sanli, U., Keskinbora, K., Grévent, C., Schütz, G.

{Proceedings of SPIE}, 9510, SPIE, Bellingham, Washington, 2015 (article)

mms

DOI [BibTex]


no image
Transition matrix elements for electron-phonon scattering: Phenomenological theory and ab initio electron theory

Illg, C., Haag, M., Müller, B. Y., Czycholl, G., Fähnle, M.

{Physical Review B}, 92(19), American Physical Society, Woodbury, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Phase evolution in single-crystalline LiFePO4 followed by in situ scanning X-ray microscopy of a micrometre-sized battery

Ohmer, N., Fenk, B., Samuelis, D., Chen, C., Maier, J., Weigand, M., Goering, E., Schütz, G.

{Nature Communications}, 6, Nature Publishing Group, London, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Nitrogen-rich covalent triazine frameworks as high-performance platforms for selective carbon capture and storage

Hug, S., Stegbauer, L., Oh, H., Hirscher, M., Lotsch, B. V.

{Chemistry of Materials}, 27(23):8001-8010, American Chemical Society, Washington, D.C., 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
From Humans to Robots and Back: Role of Arm Movement in Medio-lateral Balance Control

Huber, M, Chiovetto, E, Schaal, S., Giese, M., Sternad, D

In Annual Meeting of Neural Control of Movement, Charleston, NC, 2015 (inproceedings)

am

[BibTex]

[BibTex]


no image
Novel plasticity rule can explain the development of sensorimotor intelligence

Der, R., Martius, G.

Proceedings of the National Academy of Sciences, 112(45):E6224-E6232, 2015 (article)

Abstract
Grounding autonomous behavior in the nervous system is a fundamental challenge for neuroscience. In particular, self-organized behavioral development provides more questions than answers. Are there special functional units for curiosity, motivation, and creativity? This paper argues that these features can be grounded in synaptic plasticity itself, without requiring any higher-level constructs. We propose differential extrinsic plasticity (DEP) as a new synaptic rule for self-learning systems and apply it to a number of complex robotic systems as a test case. Without specifying any purpose or goal, seemingly purposeful and adaptive rhythmic behavior is developed, displaying a certain level of sensorimotor intelligence. These surprising results require no system-specific modifications of the DEP rule. They rather arise from the underlying mechanism of spontaneous symmetry breaking, which is due to the tight brain body environment coupling. The new synaptic rule is biologically plausible and would be an interesting target for neurobiological investigation. We also argue that this neuronal mechanism may have been a catalyst in natural evolution.

al

link (url) DOI Project Page [BibTex]

link (url) DOI Project Page [BibTex]


no image
Multilayer Fresnel zone plates for X-ray microscopy

Sanli, U. T., Keskinbora, K., Grévent, C., Szeghalmi, A., Knez, M., Schütz, G.

{Microscopy and Microanalysis}, 21(Suppl 3):1987-1988, Springer-Verlag New York, New York, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Interfacial dominated ferromagnetism in nanograined ZnO: a \muSR and DFT study

Tietze, T., Audehm, P., Chen, Y., Schütz, G., Straumal, B. B., Protasova, S. G., Mazilkin, A. A., Straumal, P. B., Prokscha, T., Luetkens, H., Salman, Z., Suter, A., Baretzky, B., Fink, K., Wenzel, W., Danilov, D., Goering, E.

{Scientific Reports}, 5, pages: 8871-8876, Nature Publishing Group, London, UK, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Preparation of a ferromagnetic barrier in YBa2Cu3O7-delta thinner than the coherence length

Soltan, S., Albrecht, J., Goering, E., Schütz, G., Mustafa, L., Keimer, B., Habermeier, H.

{Journal of Applied Physics}, 118(22), AIP Publishing, New York, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Microanalytical methods for in-situ high-resolution analysis of rock varnish at the micrometer to nanometer scale

Macholdt, D. S., Jochum, K. P., Pöhlker, C., Stoll, B., Weis, U., Weber, B., Müller, M., Kapl, M., Buhre, S., Kilcoyne, A. L. D., Weigand, M., Scholz, D., Al-Amri, A. M., Andreae, M. O.

{Chemical Geology}, 411, pages: 57-68, Elsevier, Amsterdam, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Chemical composition, microstructure, and hygroscopic properties of aerosol particles at the Zotino Tall Tower Observatory (ZOTTO), Siberia, during a summer campaign

Mikhailov, E. F., Mironov, G. N., Pöhlker, C., Chi, X., Krüger, M., Shiraiwa, M., Förster, J., Pöschl, U., Vlasenko, S. S., Ryshkevich, T. I., Weigand, M., Kilcoyne, A. L. D., Andreae, M.

{Atmospheric Chemistry and Physics}, 15(15):8847-8869, European Geosciences Union, Katlenburg-Lindau, Germany, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Orbital reflectometry of PrNiO3/PrAlO3 superlattices

Wu, M., Benckiser, E., Audehm, P., Goering, E., Wochner, P., Christiani, G., Logvenov, G., Habermeier, H., Keimer, B.

{Physical Review B}, 91(19), American Physical Society, Woodbury, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Dynamic domain wall chirality rectification by rotating magnetic fields

Bisig, A., Mawass, M., Stärk, M., Moutafis, C., Rhensius, J., Heidler, J., Gliga, S., Weigand, M., Tyliszczak, T., Van Waeyenberge, B., Stoll, H., Schütz, G., Kläui, M.

{Applied Physics Letters}, 106(12), American Institute of Physics, Melville, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Ultrafast demagnetization after laser pulse irradiation in Ni: Ab-initio electron-phonon scattering and phase space calculations

Illg, C., Haag, M., Fähnle, M.

In Ultrafast Magnetism I. Proceedings of the International Conference UMC 2013, 159, pages: 131-133, Springer Proceedings in Physics, Springer, Strasbourg, 2015 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]


no image
Imaging spin dynamics on the nanoscale using X-ray microscopy

Stoll, H., Noske, M., Weigand, M., Richter, K., Krüger, B., Reeve, R. M., Hänze, M., Adolff, C. F., Stein, F., Meier, G., Kläui, M., Schütz, G.

{Frontiers in Physics}, 3, Frontiers Media, Lausanne, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Trajectory generation for multi-contact momentum control

Herzog, A., Rotella, N., Schaal, S., Righetti, L.

In 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids), pages: 874-880, IEEE, Seoul, South Korea, 2015 (inproceedings)

Abstract
Simplified models of the dynamics such as the linear inverted pendulum model (LIPM) have proven to perform well for biped walking on flat ground. However, for more complex tasks the assumptions of these models can become limiting. For example, the LIPM does not allow for the control of contact forces independently, is limited to co-planar contacts and assumes that the angular momentum is zero. In this paper, we propose to use the full momentum equations of a humanoid robot in a trajectory optimization framework to plan its center of mass, linear and angular momentum trajectories. The model also allows for planning desired contact forces for each end-effector in arbitrary contact locations. We extend our previous results on linear quadratic regulator (LQR) design for momentum control by computing the (linearized) optimal momentum feedback law in a receding horizon fashion. The resulting desired momentum and the associated feedback law are then used in a hierarchical whole body control approach. Simulation experiments show that the approach is computationally fast and is able to generate plans for locomotion on complex terrains while demonstrating good tracking performance for the full humanoid control.

am mg

link (url) DOI [BibTex]

link (url) DOI [BibTex]


no image
Structure Learning in Bayesian Sensorimotor Integration

Genewein, T, Hez, E, Razzaghpanah, Z, Braun, DA

PLoS Computational Biology, 11(8):1-27, August 2015 (article)

Abstract
Previous studies have shown that sensorimotor processing can often be described by Bayesian learning, in particular the integration of prior and feedback information depending on its degree of reliability. Here we test the hypothesis that the integration process itself can be tuned to the statistical structure of the environment. We exposed human participants to a reaching task in a three-dimensional virtual reality environment where we could displace the visual feedback of their hand position in a two dimensional plane. When introducing statistical structure between the two dimensions of the displacement, we found that over the course of several days participants adapted their feedback integration process in order to exploit this structure for performance improvement. In control experiments we found that this adaptation process critically depended on performance feedback and could not be induced by verbal instructions. Our results suggest that structural learning is an important meta-learning component of Bayesian sensorimotor integration.

ei

DOI [BibTex]

DOI [BibTex]


no image
Unique high-temperature performance of highly consensed MnBi permanent magnets

Chen, Y., Gregori, G., Leineweber, A., Qu, F., Chen, C., Tietze, T., Kronmüller, H., Schütz, G., Goering, E.

{Scripta Materialia}, 107, pages: 131-135, Pergamon, Tarrytown, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Quantifying Emergent Behavior of Autonomous Robots

Martius, G., Olbrich, E.

Entropy, 17(10):7266, 2015 (article)

al

link (url) DOI [BibTex]

link (url) DOI [BibTex]


no image
Electrical determination of vortex state in submicron magnetic elements

Gangwar, A., Bauer, H. G., Chauleau, J., Noske, M., Weigand, M., Stoll, H., Schütz, G., Back, C. H.

{Physical Review B}, 91(9), American Physical Society, Woodbury, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Mechanisms for the symmetric and antisymmetric switching of a magnetic vortex core: Differences and common aspects

Noske, M., Stoll, H., Fähnle, M., Hertel, R., Schütz, G.

{Physical Review B}, 91(1), American Physical Society, Woodbury, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Automotive domain wall propagation in ferromagnetic rings

Richter, K., Mawass, M., Krone, A., Krüger, B., Weigand, M., Schütz, G., Stoll, H., Kläui, M.

In IEEE International Magnetics Conference (INTERMAG 2015), IEEE, Beijing, China, 2015 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]


no image
High resolution, high efficiency mulitlayer Fresnel zone plates for soft and hard X-rays

Sanli, U., Keskinbora, K., Gregorczyk, K., Leister, J., Teeny, N., Grévent, C., Knez, M., Schütz, G.

{Proceedings of SPIE}, 9592, SPIE, Bellingham, Washington, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Macroscopic drift current in the inverse Faraday effect

Hertel, R., Fähnle, M.

{Physical Review B}, 91(2), American Physical Society, Woodbury, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Single-step 3D nanofabrication of kinoform optics via gray-scale focused ion beam lithography for efficient X-ray focusing

Keskinbora, K., Grévent, C., Hirscher, M., Weigand, M., Schütz, G.

{Advanced Optical Materials}, 3, pages: 792-800, WILEY-VCH Verlag GmbH Co. KGaA, Weinheim, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Band structure engineering of two-dimensional magnonic vortex crystals

Behncke, C., Hänze, M., Adolff, C. F., Weigand, M., Meier, G.

{Physical Review B}, 91(22), American Physical Society, Woodbury, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Towards denoising XMCD movies of fast magnetization dynamics using extended Kalman filter

Kopp, M., Harmeling, S., Schütz, G., Schölkopf, B., Fähnle, M.

{Ultramicroscopy}, 148, pages: 115-122, North-Holland, Amsterdam, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Humanoid Momentum Estimation Using Sensed Contact Wrenches

Rotella, N., Herzog, A., Schaal, S., Righetti, L.

In 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids), pages: 556-563, IEEE, Seoul, South Korea, 2015 (inproceedings)

Abstract
This work presents approaches for the estimation of quantities important for the control of the momentum of a humanoid robot. In contrast to previous approaches which use simplified models such as the Linear Inverted Pendulum Model, we present estimators based on the momentum dynamics of the robot. By using this simple yet dynamically-consistent model, we avoid the issues of using simplified models for estimation. We develop an estimator for the center of mass and full momentum which can be reformulated to estimate center of mass offsets as well as external wrenches applied to the robot. The observability of these estimators is investigated and their performance is evaluated in comparison to previous approaches.

am mg

link (url) DOI [BibTex]

link (url) DOI [BibTex]


no image
A Reward-Maximizing Spiking Neuron as a Bounded Rational Decision Maker

Leibfried, F, Braun, DA

Neural Computation, 27(8):1686-1720, July 2015 (article)

Abstract
Rate distortion theory describes how to communicate relevant information most efficiently over a channel with limited capacity. One of the many applications of rate distortion theory is bounded rational decision making, where decision makers are modeled as information channels that transform sensory input into motor output under the constraint that their channel capacity is limited. Such a bounded rational decision maker can be thought to optimize an objective function that trades off the decision maker's utility or cumulative reward against the information processing cost measured by the mutual information between sensory input and motor output. In this study, we interpret a spiking neuron as a bounded rational decision maker that aims to maximize its expected reward under the computational constraint that the mutual information between the neuron's input and output is upper bounded. This abstract computational constraint translates into a penalization of the deviation between the neuron's instantaneous and average firing behavior. We derive a synaptic weight update rule for such a rate distortion optimizing neuron and show in simulations that the neuron efficiently extracts reward-relevant information from the input by trading off its synaptic strengths against the collected reward.

ei

DOI [BibTex]

DOI [BibTex]


no image
Magnetic moments induce strong phonon renormalization in FeSi

Krannich, S., Sidis, Y., Lamago, D., Heid, R., Mignot, J., von Löhneysen, H., Ivanov, A., Steffens, P., Keller, T., Wang, L., Goering, E., Weber, F.

{Nature Communications}, 6, Nature Publishing Group, London, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
What is epistemic value in free energy models of learning and acting? A bounded rationality perspective

Ortega, PA, Braun, DA

Cognitive Neuroscience, 6(4):215-216, December 2015 (article)

Abstract
Free energy models of learning and acting do not only care about utility or extrinsic value, but also about intrinsic value, that is, the information value stemming from probability distributions that represent beliefs or strategies. While these intrinsic values can be interpreted as epistemic values or exploration bonuses under certain conditions, the framework of bounded rationality offers a complementary interpretation in terms of information-processing costs that we discuss here.

ei

DOI [BibTex]

DOI [BibTex]


no image
Perpendicular magnetisation from in-plane fields in nano-scaled antidot lattices

Gräfe, J., Haering, F., Tietze, T., Audehm, P., Weigand, M., Wiedwald, U., Ziemann, P., Gawronski, P., Schütz, G., Goering, E. J.

{Nanotechnology}, 26(22), IOP Pub., Bristol, UK, 2015 (article)

mms

DOI Project Page [BibTex]

DOI Project Page [BibTex]


no image
Theory of ultrafast demagnetization after femtosecond laser pulses

Fähnle, M., Illg, C., Haag, M., Teeny, N.

{Acta Physica Polonica A}, 127(2):170-175, Państwowe Wydawnictwo Naukowe, Warszawa, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Non-linear radial spinwave modes in thin magnetic disks

Helsen, M., Gangwar, Ajay, De Clercq, J., Vansteenkiste, A., Weigand, M., Back, C. H., Van Waeyenberge, B.

{Applied Physics Letters}, 106(3), American Institute of Physics, Melville, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Hydrogen isotope separation in metal-organic frameworks: Kinetic or chemical affinity quantum-sieving?

Savchenko, I., Mavrandonakis, A., Heine, T., Oh, H., Teufel, J., Hirscher, M.

{Microporous and Mesoporous Materials}, 216, pages: 133-137, Elsevier, Amsterdam, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
High-resolution dichroic imaging of magnetic flux distributions in superconductors with scanning x-ray microscopy

Ruoß, S., Stahl, C., Weigand, M., Schütz, G., Albrecht, J.

{Applied Physics Letters}, 106, American Institute of Physics, Melville, NY, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
The third dimension: Vortex core reversal by interaction with \textquotesingleflexure modes’

Noske, M., Stoll, H., Fähnle, M., Weigand, M., Dieterle, G., Förster, J., Gangwar, A., Slavin, A., Back, C. H., Schütz, G.

In IEEE International Magnetics Conference (INTERMAG 2015), IEEE, Beijing, China, 2015 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]


no image
Preparation and characterisation of epitaxial Pt/Cu/FeMn/Co thin films on (100)-oriented MgO single crystals

Schmidt, M., Gräfe, J., Audehm, P., Phillipp, F., Schütz, G., Goering, E.

{Physica Status Solidi A}, 212(10):2114-2123, Wiley-VCH, Weinheim, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Probing the magnetic moments of [MnIII6CrIII]3+ single-molecule magnets - A cross comparison of XMCD and spin-resolved electron spectroscopy

Helmstedt, A., Dohmeier, N., Müller, N., Gryzia, A., Brechling, A., Heinzmann, U., Hoeke, V., Krickemeyer, E., Glaser, T., Leicht, P., Fonin, M., Tietze, T., Joly, L., Kuepper, K.

{Journal of Electron Spectroscopy and Related Phenomena}, 198, pages: 12-19, Elsevier B.V., Amsterdam, 2015 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Skyrmions at room temperature in magnetic multilayers

Moreau-Luchaire, C., Reyren, N., Moutafis, C., Sampaio, J., Van Horne, N., Vaz, C. A., Warnicke, P., Garcia, K., Weigand, M., Bouzehouane, K., Deranlot, C., George, J., Raabe, J., Cros, V., Fert, A.

In IEEE International Magnetics Conference (INTERMAG 2015), IEEE, Beijing, China, 2015 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]

1997


no image
Locally weighted learning

Atkeson, C. G., Moore, A. W., Schaal, S.

Artificial Intelligence Review, 11(1-5):11-73, 1997, clmc (article)

Abstract
This paper surveys locally weighted learning, a form of lazy learning and memory-based learning, and focuses on locally weighted linear regression. The survey discusses distance functions, smoothing parameters, weighting functions, local model structures, regularization of the estimates and bias, assessing predictions, handling noisy data and outliers, improving the quality of predictions by tuning fit parameters, interference between old and new data, implementing locally weighted learning efficiently, and applications of locally weighted learning. A companion paper surveys how locally weighted learning can be used in robot learning and control. Keywords: locally weighted regression, LOESS, LWR, lazy learning, memory-based learning, least commitment learning, distance functions, smoothing parameters, weighting functions, global tuning, local tuning, interference.

am

link (url) [BibTex]

1997


link (url) [BibTex]


no image
Locally weighted learning for control

Atkeson, C. G., Moore, A. W., Schaal, S.

Artificial Intelligence Review, 11(1-5):75-113, 1997, clmc (article)

Abstract
Lazy learning methods provide useful representations and training algorithms for learning about complex phenomena during autonomous adaptive control of complex systems. This paper surveys ways in which locally weighted learning, a type of lazy learning, has been applied by us to control tasks. We explain various forms that control tasks can take, and how this affects the choice of learning paradigm. The discussion section explores the interesting impact that explicitly remembering all previous experiences has on the problem of learning to control. Keywords: locally weighted regression, LOESS, LWR, lazy learning, memory-based learning, least commitment learning, forward models, inverse models, linear quadratic regulation (LQR), shifting setpoint algorithm, dynamic programming.

am

link (url) [BibTex]

link (url) [BibTex]


no image
Learning from demonstration

Schaal, S.

In Advances in Neural Information Processing Systems 9, pages: 1040-1046, (Editors: Mozer, M. C.;Jordan, M.;Petsche, T.), MIT Press, Cambridge, MA, 1997, clmc (inproceedings)

Abstract
By now it is widely accepted that learning a task from scratch, i.e., without any prior knowledge, is a daunting undertaking. Humans, however, rarely attempt to learn from scratch. They extract initial biases as well as strategies how to approach a learning problem from instructions and/or demonstrations of other humans. For learning control, this paper investigates how learning from demonstration can be applied in the context of reinforcement learning. We consider priming the Q-function, the value function, the policy, and the model of the task dynamics as possible areas where demonstrations can speed up learning. In general nonlinear learning problems, only model-based reinforcement learning shows significant speed-up after a demonstration, while in the special case of linear quadratic regulator (LQR) problems, all methods profit from the demonstration. In an implementation of pole balancing on a complex anthropomorphic robot arm, we demonstrate that, when facing the complexities of real signal processing, model-based reinforcement learning offers the most robustness for LQR problems. Using the suggested methods, the robot learns pole balancing in just a single trial after a 30 second long demonstration of the human instructor. 

am

link (url) [BibTex]

link (url) [BibTex]