Jun Ki Lee
(Brown, 2022) |
Explaining Reinforcement Learning
Agents by Policy Comparison |
Research professor at the AI Institute of Seoul National University. |
Carl Trimbach
(Brown, 2021) |
Methods for Teaching Hard Learning
Problems |
Applied machine learning scientist at Grainger. Was software engineer at American Water. |
Sam Saarinen
(Brown, 2021) |
Query Strategies for Directed
Graphical Models and their Application to Adaptive
Testing |
Founder of Edapt Technologies. |
Lucas Lehnert
(Brown, 2021) |
Encoding Reusable Knowledge in
State Representations |
Assistant Professor at the University of Saskatchewan. Was postdoctoral researcher at the Mila-Quebec AI Institute. |
Kavosh Asadi
(Brown, 2020) |
Smoothness in Reinforcement
Learning with Large State and Action Spaces |
Researcher at Amazon. |
Guan ("Royal") Wang
(Brown, 2020) |
Interactive Reinforcement
Learning from Human Language and Evaluative Feedback Through
Task Decomposition |
Co-Founder of Learnable.ai. |
David Abel
(Brown, 2020) |
A Theory of Abstraction in
Reinforcement Learning |
Researcher at Google DeepMind. |
Stephen Brawner
(Brown, 2018) |
Algorithms for the Personalization
of AI for Robots and the Smart Home |
Independent consultant. Was employee at Heliotrope Energy. |
Vukosi Marivate
(Rutgers, 2014) |
Improved Empirical Methods in
Reinforcement-Learning Evaluation |
Chair of Data Science, University of Pretoria
(South Africa). Was employee at the Council for Scientific and Industrial Research (CSIR). |
Monica Babeș-Vroman
(Rutgers, 2014) |
Maximum Likelihood Inverse Reinforcement Learning |
Visiting professor at The Master's University. |
Sergiu
Goschin (Rutgers, 2014, co-advised
with Haym Hirsh) |
Stochastic Dilemmas:
Foundations and Applications |
Engineer at Google NYC. |
Ari Weinstein (Rutgers, 2013) |
Local Planning for
Continuous Markov Decision Processes |
Researcher at Google DeepMind. Was postdoc at Princeton University. |
Michael
Wunder (Rutgers, 2013, co-advised
with Matthew Stone) |
Transferable Strategic
Meta-Reasoning Models |
Engineer at Google NYC. Was engineer at Care.com. Was
engineer at Consumr. |
John
Asmuth (Rutgers, 2013) |
Model-based Bayesian
Reinforcement Learning with Generalized Priors |
Vice president at Two Sigma. Was engineer at Google NYC. |
Ali
Nouri (Rutgers, 2010) |
Efficient Model-based
Exploration in Continuous State-space Environments |
Engineer at Google LAX. Was solution architect at Bank of America. |
Carlos
Diuk Wasser (Rutgers, 2010) |
An Object-oriented
Representation for Efficient Reinforcement Learning |
Data Science team member at Facebook. Was postdoc at Princeton University. |
Tom
Walsh (Rutgers, 2010) |
Efficient Learning of Relational
Models for Sequential Decision Making |
Research scientist at Sony AI. Was engineer at
Kronos. Was postdoctoral researcher with the Laboratory for
Information and Decision Systems (LIDS) at MIT. Was
research assistant with the Center for Educational
Testing and Evaluation (CETE) at the University of
Kansas. Was researcher at Accenture. Was postdoc at the University of Arizona. |
Lihong
Li (Rutgers, 2009) |
A Unifying Framework for Computational Reinforcement
Learning Theory |
Researcher at Amazon. Was researcher at Google Cloud in Kirkland. Was researcher at Microsoft Research. Was researcher at Yahoo! Research. |
Bethany
Edmunds (née Leffler) (Rutgers, 2008) |
Perception-Based
Generalization in Model-Based Reinforcement Learning |
Professor at Northeastern Vancouver Campus. Was instructor at British Columbia
Institute of Technology and technical director of
Leffler Software Services. |
Brian Russell
(Rutgers, 2008, co-advised with Wade Trappe) |
Learning-based Route Management in
Wireless Ad Hoc Networks |
On sabbatical. Was lecturer at Rutgers
University. Was staff member at AT&T Labs. |
Alexander
L. Strehl (Rutgers, 2007) |
Probably Approximately Correct
(PAC) Exploration in Reinforcement Learning |
Was researcher at Facebook. |
Fancong Zeng
(Rutgers, 2007) |
Just-in-time and Just-in-place Deadlock Resolution |
Software engineer at Microsoft. Was engineer at Amazon.
Was engineer at Citigroup. Was engineer at Microsoft. |
Stephen Michael Majercik [NASA Fellow] (Duke,
2000) |
Planning Under Uncertainty via Stochastic
Satisfiability |
Emeritus professor at Bowdoin. Was associate professor of computer science at
Bowdoin. |
Fan Jiang (Duke, 2000) |
Matrix Computations for Query
Expansion in Information Retrieval |
Developer at Virtu Financial. Was developer at Getco LLC. Was employee at Blue Capital Group. |