2017


On the Design of LQR Kernels for Efficient Controller Learning

Marco, A., Hennig, P., Schaal, S., Trimpe, S.

Proceedings of the 56th IEEE Annual Conference on Decision and Control (CDC), pages: 5193-5200, IEEE, IEEE Conference on Decision and Control, December 2017 (conference)

Abstract
Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As is shown herein, GPs with a common kernel choice can, however, lead to poor learning outcomes on standard quadratic control problems. For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning. Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.

am ics pn

arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI Project Page [BibTex]



Optimizing Long-term Predictions for Model-based Policy Search

Doerr, A., Daniel, C., Nguyen-Tuong, D., Marco, A., Schaal, S., Toussaint, M., Trimpe, S.

Proceedings of 1st Annual Conference on Robot Learning (CoRL), 78, pages: 227-238, (Editors: Sergey Levine and Vincent Vanhoucke and Ken Goldberg), 1st Annual Conference on Robot Learning, November 2017 (conference)

Abstract
We propose a novel long-term optimization criterion to improve the robustness of model-based reinforcement learning in real-world scenarios. Learning a dynamics model to derive a solution promises much greater data-efficiency and reusability compared to model-free alternatives. In practice, however, model-based RL suffers from various imperfections such as noisy input and output data, delays and unmeasured (latent) states. To achieve higher resilience against such effects, we propose to optimize a generative long-term prediction model directly with respect to the likelihood of observed trajectories as opposed to the common approach of optimizing a dynamics model for one-step-ahead predictions. We evaluate the proposed method on several artificial and real-world benchmark problems and compare it to PILCO, a model-based RL framework, in experiments on a manipulation robot. The results show that the proposed method is competitive compared to state-of-the-art model learning methods. In contrast to these more involved models, our model can directly be employed for policy search and outperforms a baseline method in the robot experiment.

am ics

PDF Project Page [BibTex]



Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers

Doerr, A., Nguyen-Tuong, D., Marco, A., Schaal, S., Trimpe, S.

In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pages: 5295-5301, IEEE, Piscataway, NJ, USA, IEEE International Conference on Robotics and Automation (ICRA), May 2017 (inproceedings)

am ics

PDF arXiv DOI Project Page [BibTex]



Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

Marco, A., Berkenkamp, F., Hennig, P., Schoellig, A. P., Krause, A., Schaal, S., Trimpe, S.

In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pages: 1557-1563, IEEE, Piscataway, NJ, USA, IEEE International Conference on Robotics and Automation (ICRA), May 2017 (inproceedings)

am ics pn

PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI Project Page [BibTex]


2006


Ab-initio calculations: I. Basic principles of the density functional electron theory and combination with phenomenological theories

Fähnle, M.

In Structural defects in ordered alloys and intermetallics. Characterization and modelling, pages: IX-1-IX-10, COST and CNRS, Bonascre [Ariege, France], 2006 (inproceedings)

mms

[BibTex]



Hard magnetic FePt thin films and nanostructures in L1(0) phases

Goll, D., Breitling, A., Goo, N. H., Sigle, W., Hirscher, M., Schütz, G.

In 13, pages: 97-101, Beijing, PR China, 2006 (inproceedings)

mms

[BibTex]



Ab-initio calculations: II. Application to atomic defects, phase diagrams, dislocations

Fähnle, M.

In Structural defects in ordered alloys and intermetallics. Characterization and modelling, pages: XIV-1-XIV-11, COST and CNRS, Bonascre [Ariege, France], 2006 (inproceedings)

mms

[BibTex]



Residual stress analysis in reed pipe brass tongues of historic organs

Manescu, A., Giuliani, A., Fiori, F., Baretzky, B.

In Residual Stresses VII. 7th European Conference on Residual Stresses (ECRS7), pages: 969-974, Trans Tech, Berlin [Germany], 2006 (inproceedings)

mms

[BibTex]



High-pressure influence on the kinetics of grain boundary segregation in the Cu-Bi system

Chang, L.-S., Straumal, B., Rabkin, E., Lojkowski, W., Gust, W.

In 258-260, pages: 390-396, Aveiro (Portugal), 2006 (inproceedings)

mms

[BibTex]


2005


Magnetization reversal behavior of nanogranular CoCrPt alloy thin films studied with magnetic transmission X-ray microscopy

Fischer, P., Im, M., Eimüller, T., Schütz, G., Shin, S.

In 286, pages: 311-314, Boulder, CO, USA, 2005 (inproceedings)

mms

[BibTex]



Defects distribution of Pr2Fe14B hard magnetic magnet from amorphous to nanostructures characterized by positron annihilation spectroscopy

Wu, Y. C., Sprengel, W., Reimann, K., Reichle, K. J., Goll, D., Würschum, R., Schaefer, H. E.

In PRICM 5. Proceedings of the Fifth Pacific RIM International Conference on Advanced Materials and Processing, 475-479, pages: 2123-2126, Materials Science Forum, Trans Tech, Beijing, China, 2005 (inproceedings)

mms

[BibTex]



Implementing sub-ns time resolution into magnetic X-ray microscopies

Puzic, A., Stoll, H., Fischer, P., Van Waeyenberge, B., Raabe, J., Denbeaux, G., Haug, T., Weiss, D., Schütz, G.

In T115, pages: 1029-1031, Malmö/Lund, Sweden, 2005 (inproceedings)

mms

[BibTex]
