Johan Los, Frederik Schulte, Margaretha Gansterer, Richard F. Hartl, Matthijs T.J. Spaan, Rudy R. Negenborn (2022), Large-scale collaborative vehicle routing, In Annals of Operations Research.

Katia Sycara, Vasant Honavar, M.T.J. Spaan (Eds.) (2022), Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, Association for the Advancement of Artificial Intelligence (AAAI).

Q. Yang, T. D. Simão, Simon H. Tindemans, M.T.J. Spaan (2022), Refined Risk Management in Safe Reinforcement Learning with a Distributional Safety Critic, David Bossens, Stephen Giguere, Roderick Bloem, Bettina Koenighofer (Eds.), In Safe RL Workshop at IJCAI 2022.

Qisong Yang, Thiago D Simão, Simon H. Tindemans, Matthijs T.J. Spaan (2022), Safety-constrained reinforcement learning with a distributional safety critic, In Machine Learning Volume 112 p.859-887.

Miguel Suau, Jinke He, Matthijs T.J. Spaan, Frans A. Oliehoek (2022), Speeding up Deep Reinforcement Learning through Influence-Augmented Local Simulators, In International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2022 p.1735-1737, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS).

Johan Los, Frederik Schulte, Matthijs T.J. Spaan, Rudy R. Negenborn (2022), Strategic Bidding in Decentralized Collaborative Vehicle Routing, Michael Freitag, Aseem Kinra, Herbert Kotzab, Nicole Megow (Eds.), In Dynamics in Logistics p.261-274, Springer.

Q. Yang, T. D. Simão, Nils Jansen, Simon H. Tindemans, M.T.J. Spaan (2022), Training and Transferring Safe Policies in Reinforcement Learning, Hayes Cruz , Santos da Silva (Eds.), In Proceedings of the Adaptive and Learning Agents Workshop.

C.T. Ponnambalam, F.A. Oliehoek, M.T.J. Spaan (2021), Abstraction-Guided Policy Recovery from Expert Demonstrations, In 31th International Conference on Automated Planning and Scheduling p.560-568, American Association for Artificial Intelligence (AAAI).

T. D. Simão, Nils Jansen, M.T.J. Spaan (2021), AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training, In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems p.1226-1235, International Foundation for Autonomous Agents and Multiagent Systems.

Frits de Nijs, Erwin Walraven, Mathijs M. de Weerdt, Matthijs T.J. Spaan (2021), Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms, In Journal of Artificial Intelligence Research Volume 70 p.955-1001.