Miguel Suau, Jinke He, Matthijs T.J. Spaan, Frans A. Oliehoek (2022), Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators, In International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2022 p.1735-1737, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS).

Johan Los, Frederik Schulte, Matthijs T.J. Spaan, Rudy R. Negenborn (2022), Strategic Bidding in Decentralized Collaborative Vehicle Routing, Michael Freitag, Aseem Kinra, Herbert Kotzab, Nicole Megow (Eds.), In Dynamics in Logistics p.261-274, Springer.

Q. Yang, T. D. Simão, Nils Jansen, Simon H. Tindemans, M.T.J. Spaan (2022), Training and Transferring Safe Policies in Reinforcement Learning, Hayes Cruz, Santos da Silva (Eds.), In Proceedings of the Adaptive and Learning Agents Workshop.

C.T. Ponnambalam, F.A. Oliehoek, M.T.J. Spaan (2021), Abstraction-Guided Policy Recovery from Expert Demonstrations, In 31st International Conference on Automated Planning and Scheduling p.560-568, American Association for Artificial Intelligence (AAAI).

T. D. Simão, Nils Jansen, M.T.J. Spaan (2021), AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training, In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems p.1226-1235, International Foundation for Autonomous Agents and Multiagent Systems.

Frits de Nijs, Erwin Walraven, Mathijs M. de Weerdt, Matthijs T.J. Spaan (2021), Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms, In Journal of Artificial Intelligence Research Volume 70 p.955-1001.

Jordi Smit, Canmanie Ponnambalam, Matthijs T.J. Spaan, Frans A. Oliehoek (2021), PEBL: Pessimistic Ensembles for Offline Deep Reinforcement Learning, In Robust and Reliable Autonomy in the Wild Workshop at the 30th International Joint Conference on Artificial Intelligence.

Nils H. van der Blij, Pavel Purgat, Thiago B. Soeiro, Laura M. Ramirez Elizondo, Matthijs T.J. Spaan, Pavol Bauer (2021), Protection Framework for Low Voltage DC Grids, In Proceedings - 2021 IEEE 19th International Power Electronics and Motion Control Conference, PEMC 2021 p.331-337, IEEE.

Steven Carr, Nils Jansen, Suda Bharadwaj, M.T.J. Spaan, Ufuk Topcu (2021), Safe Policies for Factored Partially Observable Stochastic Games, Dylan A. Shell, Marc Toussaint, M. Ani Hsieh (Eds.), In Robotics: Science and Systems XVII.

Q. Yang, T. D. Simão, S.H. Tindemans, M.T.J. Spaan (2021), WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning, In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI-21) p.10639-10646.