Doctoral Dissertations Advised by Michael Littman

A list of my graduated Ph.D. students.

Jun Ki Lee (Brown, 2022)	Explaining Reinforcement Learning Agents by Policy Comparison	Research professor at the AI Institute of Seoul National University.
Carl Trimbach (Brown, 2021)	Methods for Teaching Hard Learning Problems	Applied Machine Learning Scientist at Grainger. Was software Engineer at American Water.
Sam Saarinen (Brown, 2021)	Query Strategies for Directed Graphical Models and their Application to Adaptive Testing	Founder of Edapt Technologies.
Lucas Lehnert (Brown, 2021)	Encoding Reusable Knowledge in State Representations	Assistant Professor at the University of Saskatchewan. Was postdoctoral researcher at the Mila-Quebec AI Institute.
Kavosh Asadi (Brown, 2020)	Smoothness in Reinforcement Learning with Large State and Action Spaces	Researcher at Amazon.
Guan ("Royal") Wang (Brown, 2020)	Interactive Reinforcement Learning from Human Language and Evaluative Feedback Through Task Decomposition	Co-Founder of Learnable.ai.
David Abel (Brown, 2020)	A Theory of Abstraction in Reinforcement Learning	Researcher at Google Deep Mind.
Stephen Brawner (Brown, 2018)	Algorithms for the Personalization of AI for Robots and the Smart Home	Independent consultant. Was employee at Heliotrope Energy.
Vukosi Marivate (Rutgers, 2014)	Improved Empirical Methods in Reinforcement-Learning Evaluation	Chair of Data Science, University of Pretoria (South Africa). Was employee at Council for Scientific and Industrial Research (CSIR)
Monica Babeș-Vroman (Rutgers, 2014)	Maximum Likelihood Inverse Reinforcement Learning	Visiting professor at The Master's University.
Sergiu Goschin (Rutgers, 2014, co-advised with Haym Hirsh)	Stochastic Dilemmas: Foundations and Applications	Engineer at Google NYC.
Ari Weinstein (Rutgers, 2013)	Local Planning for Continuous Markov Decision Processes	Researcher at Google Deep Mind. Was postdoc at Princeton University.
Michael Wunder (Rutgers, 2013, co-advised with Matthew Stone)	Transferable Strategic Meta-Reasoning Models	Engineer at Google NYC. Was engineer at Care.com. Was engineer at Consumr.
John Asmuth (Rutgers, 2013)	Model-based Bayesian Reinforcement Learning with Generalized Priors	Vice president at Two Sigma. Was engineer at Google NYC.
Ali Nouri (Rutgers, 2010)	Efficient Model-based Exploration in Continuous State-space Environments	Engineer at Google LAX. Was Solution Architect at Bank of America.
Carlos Diuk Wasser (Rutgers, 2010)	An Object-oriented Representation for Efficient Reinforcement Learning	Data Science team member at Facebook. Was postdoc at Princeton University.
Tom Walsh (Rutgers, 2010)	Efficient Learning of Relational Models for Sequential Decision Making	Research scientist at Sony AI. Was engineer at Kronos. Was postdoctoral researcher with the Laboratory for Information and Decision Systems (LIDS) at MIT. Was Research assistant with the Center for Educational Testing and Evaluation (CETE) at the University of Kansas. Was researcher in Accenture. Was postdoc at the University of Arizona.
Lihong Li (Rutgers, 2009)	A Unifying Framework for Computational Reinforcement Learning Theory	Researcher at Amazon. Was researcher at Google Cloud in Kirkland. Was researcher at Microsoft Research. Was researcher at Yahoo! Research.
Bethany Edmunds (nee Leffler) (Rutgers, 2008)	Perception-Based Generalization in Model-Based Reinforcement Learning	Professor at Northeastern Vancouver Campus. Was instructor at British Columbia Institute of Technology and technical director of Leffler Software Services.
Brian Russell (Rutgers, 2008, co-advised with Wade Trappe)	Learning-based Route Management in Wireless Ad Hoc Networks	On sabbatical. Was lecturer at Rutgers University. Was staff member at AT&T Labs.
Alexander L. Strehl (Rutgers, 2007)	Probably Approximately Correct (PAC) Exploration in Reinforcement Learning	Was researcher at Facebook.
Fancong Zeng (Rutgers, 2007)	Just-in-time and Just-in-place Deadlock Resolution	Software Engineer at Microsoft. Was Engineer at Amazon. Was engineer at Citigroup. Was engineer at Microsoft.
Stephen Michael Majercik [NASA Fellow] (Duke, 2000)	Planning Under Uncertainty via Stochastic Satisfiability	Emeritus Professor at Bowdoin. Was associate Professor of Computer Science at Bowdoin.
Fan Jiang (Duke, 2000)	Matrix Computations for Query Expansion in Information Retrieval	Developer at Virtu Financial. Was developer at Getco LLC. Was employee at Blue Capital Group.