T. D. Simão (2023), Safe Online and Offline Reinforcement Learning, PhD Thesis, Delft University of Technology.

Alberto Castellini, Federico Bianchi, Edoardo Zorzi, Thiago D. Simão, Alessandro Farinelli, Matthijs T.J. Spaan (2023), Scalable Safe Policy Improvement via Monte Carlo Tree Search, In Proceedings of Machine Learning Research p.3732-3756.

Eghonghon Aye Eigbe, Bart De Schutter, Mitra Nasri, Neil Yorke-Smith (2023), Sequence- and time-dependent maintenance scheduling in twice re-entrant flow shops, In IEEE Access Volume 11 p.103461-103475.

Alexander Chebykin, Arkadiy Dushatskiy, Tanja Alderliesten, Peter Bosman (2023), Shrink-Perturb Improves Architecture Mixing During Population Based Training for Neural Architecture Search, Kobi Gal, Ann Nowe, Grzegorz J. Nalepa, Roy Fairstein, Roxana Radulescu (Eds.), In ECAI 2023 - 26th European Conference on Artificial Intelligence, including 12th Conference on Prestigious Applications of Intelligent Systems, PAIS 2023 - Proceedings p.381-388, IOS Press.

Matthias Horn, Emir Demirovic, Neil Yorke-Smith (2023), Solving the Multi-Choice Two Dimensional Shelf Strip Packing Problem with Time Windows, In Proceedings International Conference on Automated Planning and Scheduling, ICAPS p.491-499.

Yueqi Hou, Xiaolong Liang, Maolong Lv, Qisong Yang, Yang Li (2023), Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making, In Engineering Applications of Artificial Intelligence Volume 125.

Arthur Guijt, Dirk Thierens, Tanja Alderliesten, Peter A.N. Bosman (2023), The Impact of Asynchrony on Parallel Model-Based EAs, In GECCO 2023 - Proceedings of the 2023 Genetic and Evolutionary Computation Conference p.910-918, Association for Computing Machinery (ACM).

Yun Li, Neil Yorke-Smith, Tamas Keviczky (2023), Unlocking Energy Flexibility From Thermal Inertia of Buildings: A Robust Optimization Approach, In Proceedings of the 62nd IEEE Conference on Decision and Control (CDC 2023) p.2555-2562, IEEE.

G. Veviurko, J.W. Böhmer, M.M. de Weerdt (2023), You Shall not Pass: the Zero-Gradient Problem in Predict and Optimize for Convex Optimization.

Junhan Wen, Thomas Abeel, Mathijs de Weerdt (2023), “How sweet are your strawberries?”: Predicting sugariness using non-destructive and affordable hardware, In Frontiers in Plant Science Volume 14.