Tag: reinforcementlearning [62 articles]

Recent papers classified by the tag reinforcementlearning.
  • Technical Note: Q-Learning
    Machine Learning, Vol. 8, No. 3. (1 May 1992), pp. 279-292.
    by Christopher JCH Watkins, Peter Dayan
    posted to reinforcementlearning pad by mangesh on 2007-02-02 18:27:24 as read along with 1 person Bc91
  • Multi-agent reinforcement learning: a critical survey
    (2003)
  • Multiagent reinforcement learning: theoretical framework and an algorithm
    (1998), pp. 242-250.
    by Junling Hu, Michael P Wellman
    posted to reinforcementlearning nashequilibrium games by mangesh on 2007-02-02 15:54:10 as ***
  • Reinforcement Learning: A Survey
    Journal of Artificial Intelligence Research, Vol. 4 (1996), pp. 237-285.
    by Leslie P Kaelbling, Michael L Littman, Andrew P Moore
  • Midbrain dopamine neurons encode decisions for future action
    Nature Neuroscience, Vol. 9, No. 8. (23 July 2006), pp. 1057-1063.
    by Genela Morris, Alon Nevet, David Arkadir, Eilon Vaadia, Hagai Bergman
  • Model-based fMRI and its application to reward-learning and decision making.
    Ann N Y Acad Sci (7 April 2007)
    by John P O'Doherty, Alan N Hampton, Hackjin Kim
  • A unified model for perceptual learning
    Trends in Cognitive Sciences, In Press, Corrected Proof
    by Aaron Seitz, Takeo Watanabe
  • Effects of Magnitude of Reward and Percentage of Reinforcement on a Lever Movement Response
    Child Development, Vol. 35, No. 1. (1964), pp. 281-285.
    by James L Bruning
    posted to reinforcementlearning by brian on 2005-02-08 00:55:06 as ** along with 1 person dep
  • Computational roles for dopamine in behavioural control
    Nature, Vol. 431, No. 7010. (14 October 2004), pp. 760-767.
    by P Read Montague, Steven E Hyman, Jonathan D Cohen
  • Representation of action-specific reward values in the striatum.
    Science, Vol. 310, No. 5752. (25 November 2005), pp. 1337-1340.
    by K Samejima, Y Ueda, K Doya, M Kimura
  • Nonreinforced Trial Procedure for Probability Learning
    by SH Revusky
    posted to probabilitymatching reinforcementlearning by brian on 2005-02-08 00:53:15 as **
  • Functional differences between macaque prefrontal cortex and caudate nucleus during eye movements with and without reward.
    Exp Brain Res (9 August 2006)
    by Shunsuke Kobayashi, Reiko Kawagoe, Yoriko Takikawa, Masashi Koizumi, Masamichi Sakagami, Okihide Hikosaka
  • Reinforcement learning and decision making in monkeys during a competitive game
    Cognitive Brain Research, Vol. 22, No. 1. (December 2004), pp. 45-58.
    by Daeyeol Lee, Michelle L Conroy, Benjamin P McGreevy, Dominic J Barraclough
  • Intracranial Reinforcement Compared with Sugar-Water Reinforcement
    by William E Gibson, Larry D Reid, Makoto Sakai, Paul B Porter
    posted to reinforcementlearning by brian on 2005-02-08 00:51:56 as **
  • Dopamine, learning and motivation
    Nat Rev Neurosci, Vol. 5, No. 6. (June 2004), pp. 483-494.
    by Roy A Wise
  • The misbehavior of value and the discipline of the will.
    Neural Netw, Vol. 19, No. 8. (October 2006), pp. 1153-1160.
    by P Dayan, Y Niv, B Seymour, ND Daw
  • Learning Behavior in an Experimental Matching Pennies Game
    Games and Economic Behavior, Vol. 7, No. 1. (July 1994), pp. 62-91.
    by Dilip Mookherjee, Barry Sopher
    posted to gametheory human logistic logit pastchoices recency reinforcementlearning by brian on 2005-10-17 03:52:50 as **
  • Learning and Decision Costs in Experimental Constant Sum Games
    Games and Economic Behavior, Vol. 19, No. 1. (April 1997), pp. 97-132.
    by Dilip Mookherjee, Barry Sopher
    posted to gametheory human logistic logit pastchoices recency reinforcementlearning by brian on 2005-10-17 03:51:59 as **
  • Reward or reinforcement: what's the difference?
    Neurosci Biobehav Rev, Vol. 13, No. 2-3. (1989), pp. 181-186.
    by NM White
  • Comparison of reward modulation in the frontal eye field and caudate of the macaque.
    J Neurosci, Vol. 26, No. 25. (21 June 2006), pp. 6695-6703.
    by L Ding, O Hikosaka
  • How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues.
    J Neurosci, Vol. 19, No. 23. (1 December 1999), pp. 10502-10511.
    posted to actorcritic basalganglia dopamine model pptn reinforcementlearning td by brian on 2006-11-27 19:41:12 as **
  • Learning to Predict by the Methods of Temporal Differences
    Machine Learning, Vol. 3 (1988), pp. 9-44.
    by Richard S Sutton
  • Addiction as a computational process gone awry.
    Science, Vol. 306, No. 5703. (10 December 2004), pp. 1944-1947.
    by AD Redish
  • Hold Your Horses: Impulsivity, Deep Brain Stimulation, and Medication in Parkinsonism
    Science (25 October 2007), 1146157.
    by Michael J Frank, Johan Samanta, Ahmed A Moustafa, Scott J Sherman
  • Markov Games as a Framework for Multi-Agent Reinforcement Learning
    (1994), pp. 157-163.
    by Michael L Littman
  • Learning in spiking neural networks by reinforcement of stochastic synaptic transmission.
    Neuron, Vol. 40, No. 6. (18 December 2003), pp. 1063-1073.
    by HS Seung
  • Consumer brand choice: A random walk?
    Journal of Marketing Research, Vol. 12, No. 3. (1975), pp. 314-324.
    by Raymond J Lawrence
    posted to consumer economics randomwalk reinforcementlearning by brian on 2005-02-21 18:28:26 as **
  • Temporal difference models describe higher-order learning in humans.
    Nature, Vol. 429, No. 6992. (10 June 2004), pp. 664-667.
    posted to aversive fmri hmm markov pain reinforcementlearning rpe by brian on 2006-05-30 19:23:54 as **
  • By carrot or by stick: cognitive reinforcement learning in parkinsonism.
    Science, Vol. 306, No. 5703. (10 December 2004), pp. 1940-1943.
    by MJ Frank, LC Seeberger, RC O'Reilly
  • A more biologically plausible learning rule for neural networks.
    Proc Natl Acad Sci U S A, Vol. 88, No. 10. (15 May 1991), pp. 4433-4437.
    by P Mazzoni, RA Andersen, MI Jordan
  • Extended habit training reduces dopamine mediation of appetitive response expression.
    J Neurosci, Vol. 25, No. 29. (20 July 2005), pp. 6729-6733.
    by WY Choi, PD Balsam, JC Horvitz
  • Haloperidol, dynamics of choice, and the parameters of the matching law.
    Behav Processes (3 March 2007)
    by Carlos F Aparicio
  • A re-examination of probability matching and rational choice
    Journal of Behavioral Decision Making, Vol. 15, No. 3. (18 March 2002), pp. 233-250.
    by David R Shanks, Richard J Tunney, John D McCarthy
  • Different neural correlates of reward expectation and reward expectation error in the putamen and caudate nucleus during stimulus-action-reward association learning.
    J Neurophysiol, Vol. 95, No. 2. (February 2006), pp. 948-959.
    by M Haruno, M Kawato
  • Prediction error during retrospective revaluation of causal associations in humans: fMRI evidence in favor of an associative model of learning.
    Neuron, Vol. 44, No. 5. (2 December 2004), pp. 877-888.
    by PR Corlett, MR Aitken, A Dickinson, DR Shanks, GD Honey, RA Honey, TW Robbins, ET Bullmore, PC Fletcher
  • Generality, repetition, and the role of descriptive learning models
    Journal of Mathematical Psychology, Vol. 49, No. 5. (October 2005), pp. 357-371.
    by Ido Erev, Ernan Haruvy
  • Reinforcement Learning Signals in the Human Striatum Distinguish Learners from Nonlearners during Reward-Based Decision Making
    J. Neurosci., Vol. 27, No. 47. (21 November 2007), pp. 12860-12867.
    by Tom Schonberg, Nathaniel D Daw, Daphna Joel, John P O'Doherty
    posted to caudate fmri reinforcementlearning striatum by brian on 2008-01-24 15:59:04 as ** along with 1 person nelmor
  • Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens
    Nature Neuroscience, Vol. 10, No. 8. (01 July 2007), pp. 1020-1028.
    by Jeremy J Day, Mitchell F Roitman, R Mark Wightman, Regina M Carelli
    posted to dopamine rat reinforcementlearning td-model voltammetry by brian on 2007-10-25 23:09:54 as **
  • Immediate changes in anticipatory activity of caudate neurons associated with reversal of position-reward contingency.
    J Neurophysiol, Vol. 94, No. 3. (September 2005), pp. 1879-1887.
  • A contribution of cognitive decision models to clinical assessment: decomposing performance on the Bechara gambling task.
    Psychol Assess, Vol. 14, No. 3. (September 2002), pp. 253-262.
    by JR Busemeyer, JC Stout
  • PVLV: the primary value and learned value Pavlovian learning algorithm.
    Behav Neurosci, Vol. 121, No. 1. (February 2007), pp. 31-49.
    by RC O'Reilly, MJ Frank, TE Hazy, B Watz
  • Functional magnetic resonance imaging of reward prediction.
    Curr Opin Neurol, Vol. 18, No. 4. (August 2005), pp. 411-417.
    by B Knutson, JC Cooper
    posted to fmri human reinforcementlearning review reward by brian on 2005-10-05 17:44:17 as **
  • Distinguishing Whether Dopamine Regulates Liking, Wanting, and/or Learning About Rewards.
    Behav Neurosci, Vol. 119, No. 1. (February 2005), pp. 5-15.
  • A cellular mechanism of reward-related learning.
    Nature, Vol. 413, No. 6851. (6 September 2001), pp. 67-70.
    by JN Reynolds, BI Hyland, JR Wickens
  • Dopamine-dependent plasticity of corticostriatal synapses.
    Neural Netw, Vol. 15, No. 4-6. (2002), pp. 507-521.
    by JN Reynolds, JR Wickens
  • Kalman filter control embedded into the reinforcement learning framework.
    Neural Comput, Vol. 16, No. 3. (March 2004), pp. 491-499.
    by I Szita, A Lorincz
    posted to kalmanfilter reinforcementlearning by brian on 2005-02-13 05:38:55 as **
  • Opponent appetitive-aversive neural processes underlie predictive learning of pain relief
    Nature Neuroscience, Vol. 8, No. 9. (21 August 2005), pp. 1234-1240.
    by Ben Seymour, John P O'Doherty, Martin Koltzenburg, Katja Wiech, Richard Frackowiak, Karl Friston, Raymond Dolan
  • Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning
    Proceedings of the National Academy of Sciences, Vol. 104, No. 41. (9 October 2007), pp. 16311-16316.
    by Michael J Frank, Ahmed A Moustafa, Heather M Haughey, Tim Curran, Kent E Hutchison
  • Neural correlates of decision variables in parietal cortex.
    Nature, Vol. 400, No. 6741. (15 July 1999), pp. 233-238.
    by ML Platt, PW Glimcher
  • Single dose of a dopamine agonist impairs reinforcement learning in humans: Behavioral evidence from a laboratory-based measure of reward responsiveness.
    Psychopharmacology (Berl) (2 October 2007)
    by Diego A Pizzagalli, A E Evins, Erika C Schetter, Michael J Frank, Petra E Pajtas, Diane L Santesso, Melissa Culhane
Note: To cite this page, please use the following URL: http://www.citeulike.org/tag/reinforcementlearning
