C.T. Ponnambalam, F.A. Oliehoek, M.T.J. Spaan (2021), Abstraction-Guided Policy Recovery from Expert Demonstrations, In 31th International Conference on Automated Planning and Scheduling p.560-568, American Association for Artificial Intelligence (AAAI).

T. D. Simão, Nils Jansen, M.T.J. Spaan (2021), AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training, In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems p.1226-1235, International Foundation for Autonomous Agents and Multiagent Systems.

Frits de Nijs, Erwin Walraven, Mathijs M. de Weerdt, Matthijs T.J. Spaan (2021), Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms, In Journal of Artificial Intelligence Research Volume 70 p.955-1001.

Jordi Smit, Canmanie Ponnambalam, Matthijs T.J. Spaan, Frans A. Oliehoek (2021), PEBL: Pessimistic Ensembles for Offline Deep Reinforcement Learning, In Robust and Reliable Autonomy in the Wild Workshop at the 30th International Joint Conference of Artificial Intelligence.

Nils H. van der Blij, Pavel Purgat, Thiago B. Soeiro, Laura M. Ramirez Elizondo, Matthijs T.J. Spaan, Pavol Bauer (2021), Protection Framework for Low Voltage DC Grids, In Proceedings - 2021 IEEE 19th International Power Electronics and Motion Control Conference, PEMC 2021 p.331-337, IEEE .

Steven Carr, Nils Jansen, Suda Bharadwaj, M.T.J. Spaan, Ufuk Topcu (2021), Safe Policies for Factored Partially Observable Stochastic Games, Dylan A. Shell, Marc Toussaint, M. Ani Hsieh (Eds.), In Robotics: Science and System XVII.

Q. Yang, T. D. Simão, S.H. Tindemans, M.T.J. Spaan (2021), WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning, In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI-21) p.10639-10646.

Jennifer Renoux, Tiago S. Veiga, Pedro U. Lima, Matthijs T.J. Spaan (2020), A Unified Decision-Theoretic Model for Information Gathering and Communication Planning, In 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) p.67-74, IEEE .

C.T. Ponnambalam, F.A. Oliehoek, M.T.J. Spaan (2020), Abstraction-Guided Policy Recovery from Expert Demonstrations.

Joris Scharpff, Daan Schraven, Leentje Volker, Matthijs T.J. Spaan, Mathijs M. de Weerdt (2020), Can multiple contractors self-regulate their joint service delivery?: A serious gaming experiment on road maintenance planning, In Construction Management and Economics Volume 39 (2021) p.99–116.