T. D. Simão, Nils Jansen, M.T.J. Spaan (2021), AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training, In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems p.1226-1235, International Foundation for Autonomous Agents and Multiagent Systems.
Greg Neustroev, Mathijs de Weerdt (2020), Generalized Optimistic Q-Learning with Provable Efficiency, Bo An, Amal El Fallah Seghrouchni, Gita Sukthankar (Eds.), In Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 p.913-921, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS).
Greg Neustroev, Canmanie Ponnambalam, Mathijs de Weerdt, Matthijs Spaan (2020), Interval Q-Learning: Balancing Deep and Wide Exploration, In Adaptive and Learning Agents Workshop.
Thiago D. Simão, Romain Laroche, Rémi Tachet des Combes (2020), Safe Policy Improvement with an Estimated Baseline Policy, In Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems p.1269–1277.
Thiago D. Simão, Matthijs T.J. Spaan (2019), Safe Policy Improvement with Baseline Bootstrapping in Factored Environments, In 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 p.4967-4974, American Association for Artificial Intelligence (AAAI).
Thiago D. Simão (2019), Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments, Sarit Kraus (Eds.), In Proceedings of the 28th International Joint Conference on Artificial Intelligence, IJCAI 2019 p.6460-6461, International Joint Conferences on Artifical Intelligence (IJCAI).
Thiago D. Simão, Matthijs T.J. Spaan (2019), Structure Learning for Safe Policy Improvement, S. Kraus (Eds.), In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence p.3453-3459, International Joint Conferences on Artifical Intelligence (IJCAI).