2017


On the Design of LQR Kernels for Efficient Controller Learning

Marco, A., Hennig, P., Schaal, S., Trimpe, S.

Proceedings of the 56th IEEE Annual Conference on Decision and Control (CDC), pages: 5193-5200, IEEE, IEEE Conference on Decision and Control, December 2017 (conference)

Abstract
Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As is shown herein, GPs with a common kernel choice can, however, lead to poor learning outcomes on standard quadratic control problems. For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning. Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.

am ics pn

arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI Project Page [BibTex]

Optimal gamification can help people procrastinate less

Lieder, F., Griffiths, T. L.

Annual Meeting of the Society for Judgment and Decision Making, Annual Meeting of the Society for Judgment and Decision Making, November 2017 (conference)

re

Project Page [BibTex]

Coupling Adaptive Batch Sizes with Learning Rates

Balles, L., Romero, J., Hennig, P.

In Proceedings Conference on Uncertainty in Artificial Intelligence (UAI) 2017, pages: 410-419, (Editors: Gal Elidan and Kristian Kersting), Association for Uncertainty in Artificial Intelligence (AUAI), Conference on Uncertainty in Artificial Intelligence (UAI), August 2017 (inproceedings)

Abstract
Mini-batch stochastic gradient descent and variants thereof have become standard for large-scale empirical risk minimization like the training of neural networks. These methods are usually used with a constant batch size chosen by simple empirical inspection. The batch size significantly influences the behavior of the stochastic optimization algorithm, though, since it determines the variance of the gradient estimates. This variance also changes over the optimization process; when using a constant batch size, stability and convergence are thus often enforced by means of a (manually tuned) decreasing learning rate schedule. We propose a practical method for dynamic batch size adaptation. It estimates the variance of the stochastic gradients and adapts the batch size to decrease the variance proportionally to the value of the objective function, removing the need for the aforementioned learning rate decrease. In contrast to recent related work, our algorithm couples the batch size to the learning rate, directly reflecting the known relationship between the two. On three image classification benchmarks, our batch size adaptation yields faster optimization convergence, while simultaneously simplifying learning rate tuning. A TensorFlow implementation is available.
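
The idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' TensorFlow implementation: the function names, the clipping bounds `m_min` and `m_max`, and the specific rule "batch size ≈ learning rate × gradient variance / objective value" are assumptions chosen to mirror the description (variance is estimated from per-example gradients, and the batch size grows as the objective value shrinks and is coupled to the learning rate).

```python
import math

import numpy as np


def estimate_gradient_variance(per_example_grads):
    """Sum of per-coordinate sample variances of per-example gradients.

    `per_example_grads` has shape (batch_size, n_params); at least two
    examples are required for the unbiased estimate (ddof=1).
    """
    grads = np.asarray(per_example_grads, dtype=float)
    return float(np.var(grads, axis=0, ddof=1).sum())


def adapt_batch_size(learning_rate, grad_variance, loss, m_min=16, m_max=2048):
    """Hypothetical coupling rule (an assumption, not taken from the source):
    propose ceil(learning_rate * grad_variance / loss), then clip the
    proposal to the range [m_min, m_max].
    """
    if loss <= 0.0:
        # Objective already at its assumed lower bound of zero: use the cap.
        return m_max
    proposal = math.ceil(learning_rate * grad_variance / loss)
    return int(min(max(proposal, m_min), m_max))


# Usage: four per-example gradients of a two-parameter model.
grads = [[1.0, 0.0], [3.0, 0.0], [1.0, 2.0], [3.0, 2.0]]
variance = estimate_gradient_variance(grads)
next_batch_size = adapt_batch_size(0.1, variance, loss=0.001)
print(next_batch_size)
```

Under this assumed rule, a larger learning rate or a noisier gradient calls for a larger batch, and the batch size rises automatically as the objective value falls, which is the role a decreasing learning rate schedule would otherwise play.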

ps pn

Code link (url) Project Page [BibTex]

Dynamic Time-of-Flight

Schober, M., Adam, A., Yair, O., Mazor, S., Nowozin, S.

Proceedings IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017, pages: 170-179, IEEE, Piscataway, NJ, USA, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017 (conference)

ei pn

DOI [BibTex]

Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

Marco, A., Berkenkamp, F., Hennig, P., Schoellig, A. P., Krause, A., Schaal, S., Trimpe, S.

In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pages: 1557-1563, IEEE, Piscataway, NJ, USA, IEEE International Conference on Robotics and Automation (ICRA), May 2017 (inproceedings)

am ics pn

PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI Project Page [BibTex]

Fast Bayesian Optimization of Machine Learning Hyperparameters on Large Datasets

Klein, A., Falkner, S., Bartels, S., Hennig, P., Hutter, F.

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS 2017), 54, pages: 528-536, Proceedings of Machine Learning Research, (Editors: Singh, Aarti and Zhu, Jerry), PMLR, April 2017 (conference)

pn

pdf link (url) Project Page [BibTex]

A reward shaping method for promoting metacognitive learning

Lieder, F., Krueger, P. M., Callaway, F., Griffiths, T. L.

In Proceedings of the Third Multidisciplinary Conference on Reinforcement Learning and Decision-Making, 2017 (inproceedings)

re

Project Page [BibTex]

The moderating role of arousal on the seductive detail effect

Schneider, S., Wirzberger, M., Augustin, Y., Rey, G. D.

In Abstracts of the 59th Conference of Experimental Psychologists (TeaP), pages: 96, Pabst Science Publishers, Lengerich, 2017 (inproceedings)

re

[BibTex]

Influences of cognitive load on learning performance, speech and physiological parameters in a dual-task setting

Wirzberger, M., Herms, R., Esmaeili Bijarsari, S., Rey, G. D., Eibl, M.

In Abstracts of the 20th Conference of the European Society for Cognitive Psychology, pages: 161, Potsdam, Germany, 2017 (inproceedings)

re

[BibTex]

Time – Space – Content? Interrupting features of hyperlinks in multimedia learning

Wirzberger, M., Schneider, S., Dlouhy, S., Rey, G. D.

In Abstracts of the 59th Conference of Experimental Psychologists (TeaP), pages: 97, Pabst Science Publishers, Lengerich, 2017 (inproceedings)

re

[BibTex]

Computer Science meets Cognition: Möglichkeiten und Herausforderungen interdisziplinärer Kognitionsforschung [Computer science meets cognition: Chances and challenges in interdisciplinary research on cognition]

Wirzberger, M., Truschzinski, M., Schmidt, R., Barlag, M.

In INFORMATIK 2017, Lecture Notes in Informatics (LNI), pages: 2273-2277, Gesellschaft für Informatik, Bonn, 2017 (inproceedings)

re

DOI [BibTex]

When does bounded-optimal metareasoning favor few cognitive systems?

Milli, S., Lieder, F., Griffiths, T. L.

In AAAI Conference on Artificial Intelligence, 31, 2017 (inproceedings)

re

[BibTex]

The Structure of Goal Systems Predicts Human Performance

Bourgin, D., Lieder, F., Reichman, D., Talmon, N., Griffiths, T.

In Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017 (inproceedings)

re

[BibTex]

Learning to (mis)allocate control: maltransfer can lead to self-control failure

Bustamante, L., Lieder, F., Musslick, S., Shenhav, A., Cohen, J.

In The 3rd Multidisciplinary Conference on Reinforcement Learning and Decision Making. Ann Arbor, Michigan, 2017 (inproceedings)

re

[BibTex]

Inspecting cognitive load factors in digital learning settings with ACT-R

Wirzberger, M.

In Dagstuhl 2017. Proceedings of the 11th Joint Workshop of the German Research Training Groups in Computer Science, pages: 62, 2017 (inproceedings)

re

[BibTex]

Lernförderliche Gestaltung computerbasierter Instruktionen zur Roboterkonstruktion [Enhancing design of computer-based instructions in a robot construction task]

Esmaeili Bijarsari, S., Wirzberger, M., Rey, G. D.

In INFORMATIK 2017, Lecture Notes in Informatics (LNI), pages: 2279-2286, Gesellschaft für Informatik, Bonn, 2017 (inproceedings)

re

DOI [BibTex]

An automatic method for discovering rational heuristics for risky choice

Lieder, F., Krueger, P. M., Griffiths, T. L.

In Proceedings of the 39th Annual Meeting of the Cognitive Science Society. Austin, TX: Cognitive Science Society, 2017 (inproceedings)

re

Project Page [BibTex]

Mouselab-MDP: A new paradigm for tracing how people plan

Callaway, F., Lieder, F., Krueger, P. M., Griffiths, T. L.

In The 3rd multidisciplinary conference on reinforcement learning and decision making, 2017 (inproceedings)

re

[BibTex]

A dynamic process model for predicting workload in an air traffic controller task

Truschzinski, M., Wirzberger, M.

In Proceedings of the 39th Annual Meeting of the Cognitive Science Society, pages: 1224-1229, Cognitive Science Society, Austin, TX, 2017 (inproceedings)

re

link (url) [BibTex]

Auswirkung systeminduzierter Delays auf die menschliche Gedächtnisleistung in einem virtuellen agentenbasierten Trainingssetting [Influence of system-induced delays on human memory performance in a virtual agent-based training scenario]

Wirzberger, M., Schmidt, R., Rey, G. D., Hardt, W.

In INFORMATIK 2017, Lecture Notes in Informatics (LNI), pages: 2287-2294, Gesellschaft für Informatik, Bonn, 2017 (inproceedings)

re

DOI [BibTex]

Enhancing metacognitive reinforcement learning using reward structures and feedback

Krueger, P. M., Lieder, F., Griffiths, T. L.

In Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017 (inproceedings)

re

Project Page Project Page [BibTex]

Helping people choose subgoals with sparse pseudo rewards

Callaway, F., Lieder, F., Griffiths, T. L.

In Proceedings of the Third Multidisciplinary Conference on Reinforcement Learning and Decision Making, 2017 (inproceedings)

re

[BibTex]

Modeling cognitive load effects in an interrupted learning task: An ACT-R approach

Wirzberger, M., Rey, G. D., Krems, J.

In Proceedings of the 39th Annual Meeting of the Cognitive Science Society, pages: 3540-3545, Cognitive Science Society, Austin, TX, 2017 (inproceedings)

re

link (url) [BibTex]
