Publications

Thalmeier D., Gómez V., Kappen H.J.
Action selection in growing state spaces: control of network structure growth.
Journal of Physics A, vol. 50, pp. 1-21, 2017

Ruiz H.C., Kappen H.J.
Particle smoothing for hidden diffusion processes: adaptive path integral smoother.
IEEE Transactions on Signal Processing, vol. 65, pp. 3191-3203, 2017

Bierkens J., Chernyak V.Y., Chertkov M., Kappen H.J.
Journal of Statistical Mechanics: Theory and Experiment, vol. 013206, 2016

Satoh Satoshi, Kappen H.J.
An iterative method for nonlinear stochastic optimal control based on path integrals.
IEEE Transactions on Automatic Control, vol. 62, pp. 262-276, 2016

McGuire K.N., Croon G. de, Remes B., De Wachter C., Tuyls K., Kappen H.J.
IEEE International Conference on Robotics and Automation, 2016

Kappen H.J., Ruiz H.C.
Journal of Statistical Physics, vol. 162, pp. 1244-1266, 2016

Gómez V., Thijssen S.A., Symington A., Hailes Stephen, Kappen H.J.
Proceedings ICAPS, vol. 26, 2016

Thalmeier D., Uhlmann M., Memmesheimer R., Kappen H.J.
PLoS Computational Biology, vol. 12, no. 6, pp. 29-58, 2016

Thalmeier D., Gómez V., Kappen H.J.
Optimal control of network structure growth.
NIPS workshop on Advances in Approximate Bayesian Inference, 2016

Ruiz Euler, Kappen H.J.
Smoothing estimates of diffusion processes.
NIPS workshop on Advances in Approximate Bayesian Inference, 2016

Thijssen S.A., Kappen H.J.
Physical Review E, vol. 91, no. 032104, pp. 1-6, 2015

Gómez V., Thijssen S.A., Symington A., Hailes Stephen, Kappen H.J.
Real-time stochastic optimal control for multi-agent quadrotor swarm.
RSS Workshop R4Sim2015 Rome, 2015

Kappen H.J., Ruiz H.C., Christian H.
Adaptive importance sampling for control and inference.
Journal of Statistical Physics, 2015

Thalmeier D., Uhlmann M., Memmesheimer R., Kappen H.J.
Learning universal computations with spikes.
UNKNOWN, 2015

Chernyak V., Chertkov M., Bierkens J., Kappen H.J.
Journal of Physics A: Mathematical and Theoretical as a Fast Track Communication, vol. 47, no. 2, pp. 022001, 2014

Bierkens J., Ran A.
Linear Algebra and its Applications, vol. 457, pp. 191-208, 2014

Gómez V., Kappen H.J., Peters J., Neumann G.
LNAI conference proceedings, pp. 1-16, 2014

Bierkens J., Gaans O. van
Journal of Differential Equations, vol. 257, no. 7, pp. 2418-2429, 2014

Bierkens J., Kappen H.J.
Systems and Control Letters, pp. 1-16, 2014

Matsubara T., Gómez V., Kappen H.J.
Latent Kullback-Leibler control for continuous-state systems using probabilistic graphical models.
Proceedings UAI, vol. 30, pp. 1-10, 2014

Gheshlaghi Azar M., Munos R., Ghavamzadeh M., Kappen H.J.
Speedy Q-learning: a computationally efficient reinforcement learning algorithm with a near optimal rate of convergence.
Journal of Machine Learning Research, 2013

Gheshlaghi Azar M., Munos R., Kappen H.J.
Machine Learning Journal, vol. 91, no. 3, pp. 325-349, 2013

Kappen H.J., Gómez V.
Technical Report, invited paper, 2013

Kappen H.J.
Comment: causal entropic forces.
Technical Report, http://arxiv.org/abs/1312.4185, 2013

Thijssen S.A., Kappen H.J.
Stochastic path integral control.
International Journal of Control, 2013

Kappen H.J., Gómez V., Opper M.
Machine Learning, vol. 87, no. 2, pp. 159-182, 2012

Gómez V., Kappen H.J., Litvak N., Kaltenbrunner A.
World Wide Web, vol. 16, no. 5-6, pp. 645-675, 2012

Llera A., Gómez V., Kappen H.J.
Adaptive classification on brain computer interfaces using reinforcement signals.
Neural Computation, vol. 24, no. 11, pp. 2900-2923, 2012

Gheshlaghi Azar M., Munos R., Kappen H.J.
Proceedings of the International Conference on Machine Learning, vol. 29, pp. 1-11, 2012

Gheshlaghi Azar M., Gómez V., Kappen H.J.
Journal of Machine Learning Research, no. 13, pp. 3207-3245, 2012

Gheshlaghi Azar M.
On the theory of reinforcement learning methods, convergence analysis and sample complexity.
PhD Thesis, pp. 1-143, 2012

Gómez V., Chertkov M., Backhaus S., Kappen H.J.
SmartGrid Comm 2012, Symposium on Architectures and Models for the SmartGrid, invited paper, 2012

Tramper J.J., Broek J.L. van den, Kappen H.J., Gielen C.C.A.M.
PLoS One, vol. 7, no. 3, pp. e33724, 2012

Llera A., Gerven M. van, Gómez V., Jensen O., Kappen H.J.
Neural Networks, vol. 24, pp. 1120-1127, 2011

Kappen H.J.
Inference and Learning in Dynamic Models, pp. 363-387, 2011

Tramper J.J., Broek J.L. van den, Wiegerinck W.A.J.J., Kappen H.J., Gielen C.C.A.M.
Stochastic optimal control predicts human motor behavior in time-constrained sensorimotor tasks.
Biological Cybernetics, pp. xx, 2011

Gheshlaghi Azar M., Munos R., Ghavamzadeh M., Kappen H.J.
NIPS 2011, Advances in Neural Information Processing Systems 24, vol. 25, pp. 2411-2419, 2011

Bierkens J., Kappen H.J.
NIPS 2011, 4th International Workshop on Optimization for Machine Learning, vol. 25, pp. 1-6, 2011

Broek J.L. van den, Wiegerinck W.A.J.J., Kappen H.J.
Stochastic optimal control of state constrained systems.
International Journal of Control, vol. 84, no. 3, pp. 597-615, 2011

Gheshlaghi Azar M., Gómez V., Kappen H.J.
JMLR: Workshop and Conference Proceedings: AISTATS 2011, vol. 15, pp. 119-127, 2011

Mensink T., Verbeek J., Kappen H.J.
ECAI, pp. 1-6, 2010

Kappen H.J., Tonk S.
UNKNOWN, no. TR1001, pp. 1-5, 2010

Broek J.L. van den, Wiegerinck W.A.J.J., Kappen H.J.
UAI, vol. 26, pp. 1-8, 2010

Gheshlaghi Azar M., Kappen H.J.
Dynamic policy programming with KL-divergence minimization.
NIPS Workshop on Probabilistic Approaches for Stochastic Optimal Control and Robotics, 2009

Broek J.L. van den, Wiegerinck W.A.J.J., Kappen H.J.
Journal of Artificial Intelligence Research, vol. 32, pp. 95-122, 2008

Broek J.L. van den, Wiegerinck W.A.J.J., Kappen H.J.
Adaptive Agents and Multi-Agent Systems III. Adaptation and Multi-Agent Learning, vol. 4865, pp. 15-26, 2008

Broek J.L. van den, Wiegerinck W.A.J.J., Kappen H.J.
ALAMAS 07, Maastricht, 2-3 April, pp. 9-20, 2008

Wiegerinck W.A.J.J., Broek J.L. van den, Kappen H.J.
AAMAS'07, vol. website, pp. 1-8, 2007

Kappen H.J.
In 9th Granada seminar on Computational Physics: Computational and Mathematical Modeling of Cooperative Behavior in Neural Systems, pp. 149-181, 2007

Wiegerinck W.A.J.J., Broek J.L. van den, Kappen H.J.
UAI, vol. 22, pp. 528-535, 2006

Kappen H.J.
Physical Review Letters, vol. 95, pp. 200201, 2005

Kappen H.J.
Journal of Statistical Mechanics: Theory and Experiment, pp. P11011, 2005

Glasius R., Komoda A., Gielen C.C.A.M.
Neural Networks, vol. 8 (1), pp. 125-133, 1995