G. Neustroev, Sytze P.E. Andringa, Remco A. Verzijlbergh, Mathijs M. de Weerdt (2022), Deep Reinforcement Learning for Active Wake Control, In International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2022 p.944-953, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS).

G. Neustroev (2022), Generalized Models of Sequential Decision-Making under Uncertainty, PhD Thesis Delft University of Technology.

Greg Neustroev, Mathijs de Weerdt (2020), Generalized Optimistic Q-Learning with Provable Efficiency, Bo An, Amal El Fallah Seghrouchni, Gita Sukthankar (Eds.), In Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 p.913-921, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS).

Greg Neustroev, Canmanie Ponnambalam, Mathijs de Weerdt, Matthijs Spaan (2020), Interval Q-Learning: Balancing Deep and Wide Exploration, In Adaptive and Learning Agents Workshop.

Greg Neustroev, Mathijs de Weerdt, Remco Verzijlbergh (2019), Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards, J. Benton, Nir Lipovetzky, Eva Onaindia, David E. Smith, Siddharth Srivastava (Eds.), In Proceedings of the 29th International Conference on Automated Planning and Scheduling, ICAPS 2019 Volume 29 p.292-300, Association for the Advancement of Artificial Intelligence (AAAI).