Header logo is


2017


On the Design of {LQR} Kernels for Efficient Controller Learning
On the Design of LQR Kernels for Efficient Controller Learning

Marco, A., Hennig, P., Schaal, S., Trimpe, S.

Proceedings of the 56th IEEE Annual Conference on Decision and Control (CDC), pages: 5193-5200, IEEE, IEEE Conference on Decision and Control, December 2017 (conference)

Abstract
Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As is shown herein, GPs with a common kernel choice can, however, lead to poor learning outcomes on standard quadratic control problems. For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning. Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.

am ics pn

arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI Project Page [BibTex]

2017


arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI Project Page [BibTex]


Optimizing Long-term Predictions for Model-based Policy Search
Optimizing Long-term Predictions for Model-based Policy Search

Doerr, A., Daniel, C., Nguyen-Tuong, D., Marco, A., Schaal, S., Toussaint, M., Trimpe, S.

Proceedings of 1st Annual Conference on Robot Learning (CoRL), 78, pages: 227-238, (Editors: Sergey Levine and Vincent Vanhoucke and Ken Goldberg), 1st Annual Conference on Robot Learning, November 2017 (conference)

Abstract
We propose a novel long-term optimization criterion to improve the robustness of model-based reinforcement learning in real-world scenarios. Learning a dynamics model to derive a solution promises much greater data-efficiency and reusability compared to model-free alternatives. In practice, however, modelbased RL suffers from various imperfections such as noisy input and output data, delays and unmeasured (latent) states. To achieve higher resilience against such effects, we propose to optimize a generative long-term prediction model directly with respect to the likelihood of observed trajectories as opposed to the common approach of optimizing a dynamics model for one-step-ahead predictions. We evaluate the proposed method on several artificial and real-world benchmark problems and compare it to PILCO, a model-based RL framework, in experiments on a manipulation robot. The results show that the proposed method is competitive compared to state-of-the-art model learning methods. In contrast to these more involved models, our model can directly be employed for policy search and outperforms a baseline method in the robot experiment.

am ics

PDF Project Page [BibTex]

PDF Project Page [BibTex]


Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers
Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers

Doerr, A., Nguyen-Tuong, D., Marco, A., Schaal, S., Trimpe, S.

In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pages: 5295-5301, IEEE, Piscataway, NJ, USA, IEEE International Conference on Robotics and Automation (ICRA), May 2017 (inproceedings)

am ics

PDF arXiv DOI Project Page [BibTex]

PDF arXiv DOI Project Page [BibTex]


Virtual vs. {R}eal: Trading Off Simulations and Physical Experiments in Reinforcement Learning with {B}ayesian Optimization
Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

Marco, A., Berkenkamp, F., Hennig, P., Schoellig, A. P., Krause, A., Schaal, S., Trimpe, S.

In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pages: 1557-1563, IEEE, Piscataway, NJ, USA, IEEE International Conference on Robotics and Automation (ICRA), May 2017 (inproceedings)

am ics pn

PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI Project Page [BibTex]

PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI Project Page [BibTex]

2011


no image
An Experimental Demonstration of a Distributed and Event-based State Estimation Algorithm

(Best Interactive Paper Award (top out of 450))

Trimpe, S., D’Andrea, R.

In Proceedings of the 18th IFAC World Congress, 2011 (inproceedings)

am ics

PDF DOI [BibTex]

2011


PDF DOI [BibTex]


no image
Reduced Communication State Estimation for Control of an Unstable Networked Control System

Trimpe, S., D’Andrea, R.

In Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011 (inproceedings)

am ics

PDF Supplementary material DOI [BibTex]

PDF Supplementary material DOI [BibTex]


no image
Amorphous grain boundary layers in the ferromagnetic nanograined ZnO films

Straumal, B. B., Mazilkin, A. A., Protasova, S. G., Myatiev, A. A., Straumal, P. B., Goering, E., Baretzky, B.

In 520, pages: 1192-1194, Hersonissos, Greece, 2011 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]


no image
Inversed solid-phase grain boundary wetting in the Al-Zn system

Protasova, S. G., Kogtenkova, O. A., Straumal, B. B., Zieba, P., Baretzky, B.

In 46, pages: 4349-4353, Mie, Japan, 2011 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]


no image
First measurement of the heat effect of the grain boundary wetting phase transition

Straumal, B. B., Kogtenkova, O. A., Protasova, S. G., Zieba, P., Czeppe, T., Baretzky, B., Valiev, R. Z.

In 46, pages: 4243, Mie, Japan, 2011 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]


no image
Transmission electron microscopy investigation of boundaries between amorphous "grains" in Ni50Nb20Y30 alloy

Mazilkin, A. A., Abrosimova, G. E., Protasova, S. G., Straumal, B. B., Schütz, G., Dobatkin, S. V., Bakai, A. S.

In 46, pages: 4336-4342, Mie, Japan, 2011 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]

2009


no image
A Limiting Property of the Matrix Exponential with Application to Multi-loop Control

Trimpe, S., D’Andrea, R.

In Proceedings of the Joint 48th IEEE Conference on Decision (CDC) and Control and 28th Chinese Control Conference, 2009 (inproceedings)

am ics

PDF DOI [BibTex]

2009


PDF DOI [BibTex]

2007


no image
Less Conservative Polytopic LPV Models for Charge Control by Combining Parameter Set Mapping and Set Intersection

Kwiatkowski, A., Trimpe, S., Werner, H.

In Proceedings of the 46th IEEE Conference on Decision and Control, 2007 (inproceedings)

am ics

DOI [BibTex]

2007


DOI [BibTex]

2002


no image
Pressure Isotherms of Hydrogen Adsorption in Carbon Nanostructures

Chen, X., Dettlaff-Weglikowska, U., Haluska, M., Hulman, M., Roth, S., Hirscher, M., Becher, M.

In Making Functional Materials with Nanotubes, pages: Z9.11.1-Z9.11.6, Materials Research Society Symposium Proceedings, MRS, Boston [Mass.], 2002 (inproceedings)

mms

[BibTex]

2002


[BibTex]


no image
Hydrogen Storage in Carbon SWNTs: Atomic or Molecular?

Haluska, M., Hirscher, M., Becher, M., Dettlaff-Weglikowska, U., Chen, X., Roth, S.

In Structural and Electronic Properties of Molecular Nanostructures, pages: 601-605, AIP Conference Proceedings, AIP, Kirchberg, Tirol [Austria], 2002 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Hydrogen Storage in Nanostructured Carbon Materials at Room Temperature

Chen, X., Dettlaff-Weglikowska, U., Haluska, M., Hirscher, M., Becher, M., Roth, S.

In Structural and Electronic Properties of Molecular Nanostructures, pages: 597-600, AIP Conference Proceedings, AIP, Kirchberg, Tirol [Austria], 2002 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Micromagnetism and the microstructure of the cell walls in Sm2Co17 based permanent magnets

Goll, D., Hadjipanayis, G. C., Kronmüller, H.

In Proceedings of the 17th International Workshop on Rare-Earth Magnets and their Applications, pages: 696-703, Rinton Press, Newark, Delaware, USA, 2002 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Ab-initio study of the influence of epitaxial strain on magnetoelastic properties

Komelj, M., Fähnle, M.

In Atomistic Aspects of Epitaxial Growth, pages: 439-447, NATO Science series: Series 2, Mathematics, Physics, and Chemistry, Kluwer Academic Publishers, Dassia, Corfu [Greece], 2002 (inproceedings)

mms

[BibTex]

[BibTex]

2000


no image
High-performance nanocrystalline PrFeB-based bonded permanent magnets

Goll, D., Kleinschroth, I., Kronmüller, H.

In Proceedings of the 16th International Workshop on Rare-Earth Magnets and Their Applications, pages: 641-650, Japan Institute of Metals, 2000 (inproceedings)

mms

[BibTex]

2000


[BibTex]


no image
Experimental and theoretical study of the Verwey transition in magnetite

Brabers, V. A. M., Brabers, J. H. V. J., Walz, F., Kronmüller, H.

In Proceedings 8th International Conference on Ferrites, pages: 123-125, Japan Society of Powder and Powder Metallurgy, 2000 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Evolution of microstructure and microchemistry in the high-temperature Sm(Co, Fe, Cu, Zr)z magnets

Zhang, Y. W., Hadjipanayis, G. C., Goll, D., Kronmüller, H., Chen, C., Nelson, C., Krishnan, K.

In Proceedings of the 16th International Workshop on Rare-Earth Magnets and Their Applications, pages: 169-178, Sendai, Japan, 2000 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Fundamental investigations and industrial applications of magnetostriction

Hirscher, M., Fischer, S. F., Reininger, T.

In Modern Trends in Magnetostriction Study and Application. Proceedings of the NATO Advanced Study Institute on Modern Trends in Magnetostriction, 5, pages: 307-329, NATO Science Series: II: Mathematics, Physics and Chemistry, Kluwer Academic Publishers, Kyiv, Ukraine, 2000 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Micromagnetic and microstructural analysis of the temperature dependence of the coercive field of Sm2(Co, Cu, Fe, Zr)17 permanent magnets

Goll, D., Sigle, W., Hadjipanayis, G. C., Kronmüller, H.

In Proceedings of the 16th International Workshop on Rare-Earth Magnets and Their Applications, pages: 61-70, Kaneko, H.; Homma, M.; Okada, M., 2000 (inproceedings)

mms

[BibTex]

[BibTex]