Reinforcement learning: A survey LP Kaelbling, ML Littman, AW Moore Journal of artificial intelligence research 4, 237-285, 1996 | 12286 | 1996 |
Planning and acting in partially observable stochastic domains LP Kaelbling, ML Littman, AR Cassandra Artificial intelligence 101 (1-2), 99-134, 1998 | 5943 | 1998 |
Markov games as a framework for multi-agent reinforcement learning ML Littman Machine learning proceedings 1994, 157-163, 1994 | 4190 | 1994 |
Measuring praise and criticism: Inference of semantic orientation from association PD Turney, ML Littman acm Transactions on Information Systems (tois) 21 (4), 315-346, 2003 | 2427 | 2003 |
Activity recognition from accelerometer data N Ravi, N Dandekar, P Mysore, ML Littman Aaai 5 (2005), 1541-1546, 2005 | 2330 | 2005 |
Packet routing in dynamically changing networks: A reinforcement learning approach J Boyan, M Littman Advances in neural information processing systems 6, 1993 | 1227 | 1993 |
Learning policies for partially observable environments: Scaling up ML Littman, AR Cassandra, LP Kaelbling Machine Learning Proceedings 1995, 362-370, 1995 | 1047 | 1995 |
Convergence results for single-step on-policy reinforcement-learning algorithms S Singh, T Jaakkola, ML Littman, C Szepesvári Machine learning 38, 287-308, 2000 | 1044 | 2000 |
Acting optimally in partially observable stochastic domains AR Cassandra, LP Kaelbling, ML Littman Aaai 94, 1023-1028, 1994 | 1036 | 1994 |
Friend-or-foe Q-learning in general-sum games ML Littman ICML 1 (2001), 322-328, 2001 | 941 | 2001 |
Graphical models for game theory M Kearns, ML Littman, S Singh arXiv preprint arXiv:1301.2281, 2013 | 824 | 2013 |
On the complexity of solving Markov decision problems ML Littman, TL Dean, LP Kaelbling arXiv preprint arXiv:1302.4971, 2013 | 757 | 2013 |
Interactions between learning and evolution D Ackley, M Littman Artificial life II 10, 487-509, 1991 | 744 | 1991 |
Predictive representations of state M Littman, RS Sutton Advances in neural information processing systems 14, 2001 | 735 | 2001 |
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes AR Cassandra, ML Littman, NL Zhang arXiv preprint arXiv:1302.1525, 2013 | 701 | 2013 |
Computerized cross-language document retrieval using latent semantic indexing TK Landauer, ML Littman US Patent 5,301,109, 1994 | 666 | 1994 |
An analysis of model-based interval estimation for Markov decision processes AL Strehl, ML Littman Journal of Computer and System Sciences 74 (8), 1309-1331, 2008 | 658 | 2008 |
PAC model-free reinforcement learning AL Strehl, L Li, E Wiewiora, J Langford, ML Littman Proceedings of the 23rd international conference on Machine learning, 881-888, 2006 | 654 | 2006 |
Towards a unified theory of state abstraction for MDPs. L Li, TJ Walsh, ML Littman AI&M 1 (2), 3, 2006 | 641 | 2006 |
Algorithms for sequential decision-making ML Littman Brown University, 1996 | 608 | 1996 |