• We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Logo of geneticsGeneticsCurrent IssueInformation for AuthorsEditorial BoardSubscribeSubmit a Manuscript
Genetics. Jul 2003; 164(3): 1229–1236.
PMCID: PMC1462615

Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites.


Maximum-likelihood methods based on models of codon substitution accounting for heterogeneous selective pressures across sites have proved to be powerful in detecting positive selection in protein-coding DNA sequences. Those methods are phylogeny based and do not account for the effects of recombination. When recombination occurs, such as in population data, no unique tree topology can describe the evolutionary history of the whole sequence. This violation of assumptions raises serious concerns about the likelihood method for detecting positive selection. Here we use computer simulation to evaluate the reliability of the likelihood-ratio test (LRT) for positive selection in the presence of recombination. We examine three tests based on different models of variable selective pressures among sites. Sequences are simulated using a coalescent model with recombination and analyzed using codon-based likelihood models ignoring recombination. We find that the LRT is robust to low levels of recombination (with fewer than three recombination events in the history of a sample of 10 sequences). However, at higher levels of recombination, the type I error rate can be as high as 90%, especially when the null model in the LRT is unrealistic, and the test often mistakes recombination as evidence for positive selection. The test that compares the more realistic models M7 (beta) against M8 (beta and omega) is more robust to recombination, where the null model M7 allows the positive selection pressure to vary between 0 and 1 (and so does not account for positive selection), and the alternative model M8 allows an additional discrete class with omega = d(N)/d(S) that could be estimated to be >1 (and thus accounts for positive selection). Identification of sites under positive selection by the empirical Bayes method appears to be less affected than the LRT by recombination.

Full Text

The Full Text of this article is available as a PDF (94K).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.
  • Hudson RR, Kaplan NL. Deleterious background selection with recombination. Genetics. 1995 Dec;141(4):1605–1617. [PMC free article] [PubMed]
  • Kuhner MK, Yamato J, Felsenstein J. Maximum likelihood estimation of recombination rates from population data. Genetics. 2000 Nov;156(3):1393–1401. [PMC free article] [PubMed]
  • Maynard Smith J, Smith NH. Detecting recombination from gene trees. Mol Biol Evol. 1998 May;15(5):590–599. [PubMed]
  • McVean GA. What do patterns of genetic variability reveal about mitochondrial recombination? Heredity (Edinb) 2001 Dec;87(Pt 6):613–620. [PubMed]
  • McVean Gil, Awadalla Philip, Fearnhead Paul. A coalescent-based method for detecting and estimating recombination from gene sequences. Genetics. 2002 Mar;160(3):1231–1241. [PMC free article] [PubMed]
  • Nielsen R. Estimation of population parameters and recombination rates from single nucleotide polymorphisms. Genetics. 2000 Feb;154(2):931–942. [PMC free article] [PubMed]
  • Nielsen R, Yang Z. Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics. 1998 Mar;148(3):929–936. [PMC free article] [PubMed]
  • Peek AS, Souza V, Eguiarte LE, Gaut BS. The interaction of protein structure, selection, and recombination on the evolution of the type-1 fimbrial major subunit (fimA) from Escherichia coli. J Mol Evol. 2001 Feb;52(2):193–204. [PubMed]
  • Anisimova M, Bielawski JP, Yang Z. Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol Biol Evol. 2001 Aug;18(8):1585–1592. [PubMed]
  • Anisimova Maria, Bielawski Joseph P, Yang Ziheng. Accuracy and power of bayes prediction of amino acid sites under positive selection. Mol Biol Evol. 2002 Jun;19(6):950–958. [PubMed]
  • Przeworski M, Charlesworth B, Wall JD. Genealogies and weak purifying selection. Mol Biol Evol. 1999 Feb;16(2):246–252. [PubMed]
  • Schierup MH, Hein J. Consequences of recombination on traditional phylogenetic analysis. Genetics. 2000 Oct;156(2):879–891. [PMC free article] [PubMed]
  • Burke DS. Recombination in HIV: an important viral evolutionary strategy. Emerg Infect Dis. 1997 Jul-Sep;3(3):253–259. [PMC free article] [PubMed]
  • Schierup MH, Hein J. Recombination and the molecular clock. Mol Biol Evol. 2000 Oct;17(10):1578–1579. [PubMed]
  • Charlesworth B, Morgan MT, Charlesworth D. The effect of deleterious mutations on neutral molecular variation. Genetics. 1993 Aug;134(4):1289–1303. [PMC free article] [PubMed]
  • Slade PF. Simulation of selected genealogies. Theor Popul Biol. 2000 Feb;57(1):35–49. [PubMed]
  • Suzuki Y, Gojobori T. A method for detecting positive selection at single amino acid sites. Mol Biol Evol. 1999 Oct;16(10):1315–1328. [PubMed]
  • Swanson WJ, Yang Z, Wolfner MF, Aquadro CF. Positive Darwinian selection drives the evolution of several female reproductive proteins in mammals. Proc Natl Acad Sci U S A. 2001 Feb 27;98(5):2509–2514. [PMC free article] [PubMed]
  • Fares MA, Moya A, Escarmís C, Baranowski E, Domingo E, Barrio E. Evidence for positive selection in the capsid protein-coding region of the foot-and-mouth disease virus (FMDV) subjected to experimental passage regimens. Mol Biol Evol. 2001 Jan;18(1):10–21. [PubMed]
  • Fearnhead P, Donnelly P. Estimating recombination rates from population genetic data. Genetics. 2001 Nov;159(3):1299–1318. [PMC free article] [PubMed]
  • Wall JD. A comparison of estimators of the population recombination rate. Mol Biol Evol. 2000 Jan;17(1):156–163. [PubMed]
  • Fitch WM, Bush RM, Bender CA, Cox NJ. Long term trends in the evolution of H(3) HA1 human influenza type A. Proc Natl Acad Sci U S A. 1997 Jul 22;94(15):7712–7718. [PMC free article] [PubMed]
  • Williamson Scott, Orive Maria E. The genealogy of a sequence subject to purifying selection at multiple sites. Mol Biol Evol. 2002 Aug;19(8):1376–1384. [PubMed]
  • Worobey M. A novel approach to detecting and measuring recombination: new insights into evolution in viruses, bacteria, and mitochondria. Mol Biol Evol. 2001 Aug;18(8):1425–1434. [PubMed]
  • Wu JC, Chiang TY, Shiue WK, Wang SY, Sheen IJ, Huang YH, Syu WJ. Recombination of hepatitis D virus RNA sequences and its implications. Mol Biol Evol. 1999 Nov;16(11):1622–1632. [PubMed]
  • Yamaguchi-Kabata Y, Gojobori T. Reevaluation of amino acid variability of the human immunodeficiency virus type 1 gp120 envelope glycoprotein and prediction of new discontinuous epitopes. J Virol. 2000 May;74(9):4335–4350. [PMC free article] [PubMed]
  • Griffiths RC, Marjoram P. Ancestral inference from samples of DNA sequences with recombination. J Comput Biol. 1996 Winter;3(4):479–502. [PubMed]
  • Yang Z. Maximum likelihood analysis of adaptive evolution in HIV-1 gp120 env gene. Pac Symp Biocomput. 2001:226–237. [PubMed]
  • Yang Z, Bielawski JP. Statistical methods for detecting molecular adaptation. Trends Ecol Evol. 2000 Dec 1;15(12):496–503. [PubMed]
  • Hey J, Wakeley J. A coalescent estimator of the population recombination rate. Genetics. 1997 Mar;145(3):833–846. [PMC free article] [PubMed]
  • Yang Ziheng, Swanson Willie J. Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes. Mol Biol Evol. 2002 Jan;19(1):49–57. [PubMed]
  • Holmes Edward C, Woelk Christopher H, Kassis Raid, Bourhy Hervé Genetic constraints and the adaptive evolution of rabies virus in nature. Virology. 2002 Jan 20;292(2):247–257. [PubMed]
  • Yang Z, Nielsen R, Goldman N, Pedersen AM. Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics. 2000 May;155(1):431–449. [PMC free article] [PubMed]
  • Hudson RR. Properties of a neutral allele model with intragenic recombination. Theor Popul Biol. 1983 Apr;23(2):183–201. [PubMed]
  • Yang Z, Swanson WJ, Vacquier VD. Maximum-likelihood analysis of molecular adaptation in abalone sperm lysin reveals variable selective pressures among lineages and sites. Mol Biol Evol. 2000 Oct;17(10):1446–1455. [PubMed]
  • Hudson RR. Two-locus sampling distributions and their application. Genetics. 2001 Dec;159(4):1805–1817. [PMC free article] [PubMed]

Articles from Genetics are provided here courtesy of Genetics Society of America


Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


  • MedGen
    Related information in MedGen
  • PubMed
    PubMed citations for these articles

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...