Can the site-frequency spectrum distinguish exponential population growth from multiple-merger coalescents?

Genetics. 2015 Mar;199(3):841-56. doi: 10.1534/genetics.114.173807. Epub 2015 Jan 9.

Abstract

The ability of the site-frequency spectrum (SFS) to reflect the particularities of gene genealogies exhibiting multiple mergers of ancestral lines as opposed to those obtained in the presence of population growth is our focus. An excess of singletons is a well-known characteristic of both population growth and multiple mergers. Other aspects of the SFS, in particular, the weight of the right tail, are, however, affected in specific ways by the two model classes. Using an approximate likelihood method and minimum-distance statistics, our estimates of statistical power indicate that exponential and algebraic growth can indeed be distinguished from multiple-merger coalescents, even for moderate sample sizes, if the number of segregating sites is high enough. A normalized version of the SFS (nSFS) is also used as a summary statistic in an approximate Bayesian computation (ABC) approach. The results give further positive evidence as to the general eligibility of the SFS to distinguish between the different histories.

Keywords: approximate Bayesian computation; approximate maximum likelihood test; coalescent; multiple mergers; population growth; site-frequency spectrum.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Genetics, Population / methods*
  • Likelihood Functions
  • Models, Genetic*
  • Population Growth