User profiles for Michael Littman
Michael L. LittmanProfessor of Computer Science, Brown University Verified email at cs.brown.edu Cited by 27125 |
Reinforcement learning: A survey
LP Kaelbling, ML Littman, AW Moore - Journal of artificial intelligence …, 1996 - jair.org
Abstract This paper surveys the field of reinforcement learning from a computer-science
perspective. It is written to be accessible to researchers familiar with machine learning. Both
the historical basis of the field and a broad selection of current work are summarized. ...
perspective. It is written to be accessible to researchers familiar with machine learning. Both
the historical basis of the field and a broad selection of current work are summarized. ...
Cited by 5396 Related articles All 118 versions Cite SaveSaving...Error saving. Try again? More Fewer
[HTML][HTML] Planning and acting in partially observable stochastic domains
LP Kaelbling, ML Littman, AR Cassandra - Artificial intelligence, 1998 - Elsevier
In this paper, we bring techniques from operations research to bear on the problem of
choosing optimal actions in partially observable stochastic domains. We begin by
introducing the theory of Markov decision processes (mdps) and partially observable ...
choosing optimal actions in partially observable stochastic domains. We begin by
introducing the theory of Markov decision processes (mdps) and partially observable ...
Cited by 2689 Related articles All 68 versions Cite SaveSaving...Error saving. Try again? More EBSCOhost Full Text Fewer
Markov games as a framework for multi-agent reinforcement learning
ML Littman - … of the eleventh international conference on machine …, 1994 - books.google.com
Abstract In the Markov decision process (MDP) formalization of reinforcement learning, a
single adaptive agent interacts with an environment defined by a probabilistic transition
function. In this solipsistic view, secondary agents can only be part of the environment and ...
single adaptive agent interacts with an environment defined by a probabilistic transition
function. In this solipsistic view, secondary agents can only be part of the environment and ...
Cited by 1378 Related articles All 35 versions Cite SaveSaving...Error saving. Try again? More Fewer
Measuring praise and criticism: Inference of semantic orientation from association
PD Turney, ML Littman - ACM Transactions on Information Systems ( …, 2003 - dl.acm.org
Abstract The evaluative character of a word is called its semantic orientation. Positive
semantic orientation indicates praise (eg," honest"," intrepid") and negative semantic
orientation indicates criticism (eg," disturbing"," superfluous"). Semantic orientation varies ...
semantic orientation indicates praise (eg," honest"," intrepid") and negative semantic
orientation indicates criticism (eg," disturbing"," superfluous"). Semantic orientation varies ...
Cited by 1400 Related articles All 33 versions Cite SaveSaving...Error saving. Try again? More EBSCOhost Full Text Fewer
[PDF][PDF] Activity recognition from accelerometer data
N Ravi, N Dandekar, P Mysore, ML Littman - AAAI, 2005 - aaai.org
Abstract Activity recognition fits within the bigger framework of context awareness. In this
paper, we report on our efforts to recognize user activity from accelerometer data. Activity
recognition is formulated as a classification problem. Performance of base-level classifiers ...
paper, we report on our efforts to recognize user activity from accelerometer data. Activity
recognition is formulated as a classification problem. Performance of base-level classifiers ...
Cited by 1074 Related articles All 14 versions Cite SaveSaving...Error saving. Try again? More View as HTML Fewer
Spectrally narrow pulsed dye laser without beam expander
MG Littman, HJ Metcalf - Applied optics, 1978 - osapublishing.org
We have developed a simplified version of the side-pumped pulsed dye laser which has a
spectral halfwidth of 1.25 GHz and a peak power of 10 kW at 600 nm. The basic laser
consists of only four components (output mirror, dye cell, diffraction grating, and tuning ...
spectral halfwidth of 1.25 GHz and a peak power of 10 kW at 600 nm. The basic laser
consists of only four components (output mirror, dye cell, diffraction grating, and tuning ...
Cited by 796 Related articles All 12 versions Cite SaveSaving...Error saving. Try again? More EBSCOhost Full Text Fewer
[PDF][PDF] Acting optimally in partially observable stochastic domains
AR Cassandra, LP Kaelbling, ML Littman - AAAI, 1994 - cs.brown.edu
Abstract In this paper, we describe the partially observable Markov decision process
(pomdp) approach to nding optimal or near-optimal control strategies for partially observable
stochastic environments, given a complete model of the environment. The pomdp ...
(pomdp) approach to nding optimal or near-optimal control strategies for partially observable
stochastic environments, given a complete model of the environment. The pomdp ...
Cited by 671 Related articles All 27 versions Cite SaveSaving...Error saving. Try again? More View as HTML Fewer
Stark structure of the Rydberg states of alkali-metal atoms
ML Zimmerman, MG Littman, MM Kash, D Kleppner - Physical Review A, 1979 - APS
Abstract The authors describe practical methods for calculating the Stark structure of
Rydberg states of the alkali metals based on diagnolization of the energy matrix. A survey of
Stark structures is presented for all of the alkali metals in the vicinity of n= 15. Topics ...
Rydberg states of the alkali metals based on diagnolization of the energy matrix. A survey of
Stark structures is presented for all of the alkali metals in the vicinity of n= 15. Topics ...
Cited by 661 Related articles All 2 versions Cite SaveSaving...Error saving. Try again? More EBSCOhost Full Text Fewer
[PDF][PDF] Packet routing in dynamically changing networks: A reinforcement learning approach
JA Boyan, ML Littman - Advances in neural information …, 1994 - www-2.cs.cmu.edu
Abstract This paper describes the Q-routing algorithm for packet routing, in which a
reinforcement learning module is embedded into each node of a switching network. Only
local communication is used by each node to keep accurate statistics on which routing ...
reinforcement learning module is embedded into each node of a switching network. Only
local communication is used by each node to keep accurate statistics on which routing ...
Cited by 598 Related articles All 25 versions Cite SaveSaving...Error saving. Try again? More View as HTML Fewer
[PDF][PDF] Interactions between learning and evolution
D Ackley, M Littman - Artificial life II, 1991 - hawaii.edu
A program of research into weakly supervised learning algorithms led us to ask if learning
could occur given only natural selection as feedback. We developed an algorithm that
combined evolution and learning, and tested it in an artificial environment populated with ...
could occur given only natural selection as feedback. We developed an algorithm that
combined evolution and learning, and tested it in an artificial environment populated with ...