Adaptive learning via selectionism and Bayesianism, Part I: connection between the two

Jun Zhang

doi:10.1016/j.neunet.2009.03.018

Neural Netw. 2009 Apr;22(3):220-8. doi: 10.1016/j.neunet.2009.03.018. Epub 2009 Apr 5.

Author

Jun Zhang¹

Affiliation

¹ Department of Psychology, University of Michigan, 530 Church Street, Ann Arbor 48109-1043, USA. junz@umich.edu

PMID: 19386469

Abstract

According to the selection-by-consequence characterization of operant learning, individual animals or species increase or decrease the future probability of their action choices based on the consequence (i.e., reward or punishment) of the currently selected action (the so-called "Law of Effect"). Under Bayesianism, on the other hand, evidence is evaluated based on likelihood functions, so that action probability is modified from a priori to a posteriori according to the Bayes formula. Viewed as hypothesis testing, a selectionist framework attributes evidence exclusively to the selected, focal hypothesis, whereas a Bayesian framework distributes the support from a piece of evidence across all hypotheses. Here, an intimate connection between the two theoretical frameworks is revealed. Specifically, it is proven that when individuals modify their action choices based on the selectionist's Law of Effect, the learning population, on the ensemble level, evolves according to Bayesian-like dynamics. The learning equation of the linear operator model [Bush, R. R., & Mosteller, F. (1955). Stochastic models for learning. New York: John Wiley and Sons], under ensemble averaging, yields the class of predictive reinforcement learning models (e.g., [Busemeyer, J. R., & Myung, I. J. (1992). An adaptive approach to human decision making: Learning theory, decision theory, and human performance. Journal of Experimental Psychology: General, 121, 177-194; Montague, P. R., Dayan, P., & Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. Journal of Neuroscience, 16, 1936-1947]).
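The ensemble-averaging claim can be illustrated with a small numerical sketch. It is not taken from the paper. It assumes a linear-operator step in which only the selected action is reinforced, with a hypothetical step size `rates[k]` standing in for the consequence of action `k`, and it assumes likelihoods of the form `1 + rates[k]` for the Bayes step. All function names and numbers are illustrative. Under these assumptions, averaging the selectionist step over which action is chosen gives a replicator-type update, which agrees with the Bayes update to first order in the step sizes.

```python
def linear_operator_update(p, chosen, rate):
    """One linear-operator step: shift probability toward the chosen action only."""
    return [(1 - rate) * pj + (rate if j == chosen else 0.0)
            for j, pj in enumerate(p)]


def expected_update(p, rates):
    """Ensemble average of one step: each possible choice k is weighted by p[k]."""
    avg = [0.0] * len(p)
    for k, pk in enumerate(p):
        new_p = linear_operator_update(p, k, rates[k])
        for j, value in enumerate(new_p):
            avg[j] += pk * value
    return avg


def replicator_step(p, rates):
    """Closed form of the average above: p_j * (1 + rate_j - mean rate)."""
    mean_rate = sum(pk * rk for pk, rk in zip(p, rates))
    return [pj * (1 + rj - mean_rate) for pj, rj in zip(p, rates)]


def bayes_step(p, rates):
    """Bayes formula with assumed (hypothetical) likelihoods 1 + rate_j."""
    weights = [pj * (1 + rj) for pj, rj in zip(p, rates)]
    total = sum(weights)
    return [w / total for w in weights]


# Illustrative prior over three actions and small consequence-dependent step sizes.
p = [0.5, 0.3, 0.2]
rates = [0.02, 0.05, 0.01]

averaged = expected_update(p, rates)
replicator = replicator_step(p, rates)
posterior = bayes_step(p, rates)

for j in range(len(p)):
    print(j, averaged[j], replicator[j], posterior[j])
```

In this sketch the averaged selectionist step and the replicator form coincide up to floating-point error, while the Bayes posterior differs from both only by terms of second order in the step sizes. Each individual step credits the chosen action alone, yet the average redistributes probability across all actions.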

MeSH terms

Adaptation, Psychological / physiology*
Algorithms
Animals
Artificial Intelligence
Bayes Theorem*
Brain / physiology
Choice Behavior
Computer Simulation
Conditioning, Operant / physiology*
Decision Making
Dopamine
Humans
Learning / physiology*
Likelihood Functions
Linear Models*
Models, Neurological
Models, Psychological
Neural Networks, Computer
Probability
Probability Learning*
Psychomotor Performance
Stochastic Processes

Substances

Dopamine