Home
Home
Publications
Michael L Littman
Latest
Deep Radial-basis Value Functions for Continuous Control
Lipschitz Lifelong Reinforcement Learning
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning
DeepMellow: Removing the Need for a Target Network in Deep Q-Learning
State Abstraction as Compression in Apprenticeship Learning
Equivalence between wasserstein and value-aware model-based reinforcement learning
Lipschitz Continuity for Model-based Reinforcement Learning
Mean Actor-Critic
An Alternative Softmax Operator for Reinforcement Learning
Cite
×