Population genetics without intraspecific data

Jeffrey L Thorne; Sang Chul Choi; Jiaye Yu; Paul G Higgs; Hirohisa Kishino

doi:10.1093/molbev/msm085

Mol Biol Evol. 2007 Aug;24(8):1667-77. doi: 10.1093/molbev/msm085. Epub 2007 Apr 29.

Authors

Jeffrey L Thorne¹, Sang Chul Choi, Jiaye Yu, Paul G Higgs, Hirohisa Kishino

Affiliation

¹ Wissenschaftskolleg zu Berlin, Institute for Advanced Study, Berlin, Germany. thorne@statgen.ncsu.edu

PMID: 17470435

Abstract

A central goal of computational biology is the prediction of phenotype from DNA and protein sequence data. Recent models of sequence change use in silico prediction systems to incorporate the effects of phenotype on evolutionary rates. These models have been designed for analyzing sequence data from different species and have been accompanied by statistical techniques for estimating model parameters when the incorporation of phenotype induces dependent change among sequence positions. A difficulty with these efforts to link phenotype and interspecific evolution is that evolution occurs within populations, and parameters of interspecific models should have population genetic interpretations. We show, with two examples, how population genetic interpretations can be assigned to evolutionary models. The first example considers the impact of RNA secondary structure on sequence change, and the second reflects the tendency for protein tertiary structure to influence nonsynonymous substitution rates. We argue that statistical fit to data should not be the sole criterion for assessing models of sequence change. A good interspecific model should also yield a clear and biologically plausible population genetic interpretation.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Animals
Annexin A5 / genetics*
Computational Biology
Computer Simulation
Evolution, Molecular
Genetic Variation
Genetics, Population*
Likelihood Functions
Mice
Models, Biological
Models, Genetic
Nucleic Acid Conformation
Phenotype
Phylogeny
RNA / chemistry
RNA / genetics*
Rats
Selection, Genetic

Substances

Annexin A5
RNA
