Adaptive learning via selectionism and Bayesianism, Part I: connection between the two

Neural Netw. 2009 Apr;22(3):220-8. doi: 10.1016/j.neunet.2009.03.018. Epub 2009 Apr 5.

Abstract

According to the selection-by-consequence characterization of operant learning, individual animals/species increase or decrease their future probability of action choices based on the consequence (i.e., reward or punishment) of the currently selected action (the so-called "Law of Effect"). Under Bayesianism, on the other hand, evidence is evaluated based on likelihood functions so that action probability is modified from a priori to a posteriori according to the Bayes formula. Viewed as hypothesis testing, a selectionist framework attributes evidence exclusively to the selected, focal hypothesis, whereas a Bayesian framework distributes across all hypotheses the support from a piece of evidence. Here, an intimate connection between the two theoretical frameworks is revealed. Specifically, it is proven that when individuals modify their action choices based on the selectionist's Law of Effect, the learning population, on the ensemble level, evolves according to a Bayesian-like dynamics. The learning equation of the linear operator model [Bush, R. R., & Mosteller, F. (1955). Stochastic models for learning, New York: John Wiley and Sons], under ensemble averaging, yields the class of predictive reinforcement learning models (e.g., [Busemeyer, J. R., & Myung, I. J. (1992). An adaptive approach to human decision making: Learning theory, decision theory, and human performance. Journal of Experimental Psychology: General, 121, 177-194; Montague, P. R., Dayan, P., & Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. Journal of Neuroscience, 16, 1936-1947]).

MeSH terms

  • Adaptation, Psychological / physiology*
  • Algorithms
  • Animals
  • Artificial Intelligence
  • Bayes Theorem*
  • Brain / physiology
  • Choice Behavior
  • Computer Simulation
  • Conditioning, Operant / physiology*
  • Decision Making
  • Dopamine
  • Humans
  • Learning / physiology*
  • Likelihood Functions
  • Linear Models*
  • Models, Neurological
  • Models, Psychological
  • Neural Networks, Computer
  • Probability
  • Probability Learning*
  • Psychomotor Performance
  • Stochastic Processes

Substances

  • Dopamine