Logo of bmcgeneBioMed Centralsearchsubmit a manuscriptregisterthis articleBMC Genetics
BMC Genet. 2004; 5: 25.
Published online Aug 27, 2004. doi:  10.1186/1471-2156-5-25
PMCID: PMC517495

Molecular clock in neutral protein evolution

Abstract

Background

A frequent observation in molecular evolution is that amino-acid substitution rates show an index of dispersion (that is, ratio of variance to mean) substantially larger than one. This observation has been termed the overdispersed molecular clock. On the basis of in silico protein-evolution experiments, Bastolla and coworkers recently proposed an explanation for this observation: Proteins drift in neutral space, and can temporarily get trapped in regions of substantially reduced neutrality. In these regions, substitution rates are suppressed, which results in an overall substitution process that is not Poissonian. However, the simulation method of Bastolla et al. is representative only for cases in which the product of mutation rate μ and population size Ne is small. How the substitution process behaves when μNe is large is not known.

Results

Here, I study the behavior of the molecular clock in in silico protein evolution as a function of mutation rate and population size. I find that the index of dispersion decays with increasing μNe, and approaches 1 for large μNe . This observation can be explained with the selective pressure for mutational robustness, which is effective when μNe is large. This pressure keeps the population out of low-neutrality traps, and thus steadies the ticking of the molecular clock.

Conclusions

The molecular clock in neutral protein evolution can fall into two distinct regimes, a strongly overdispersed one for small μNe, and a mostly Poissonian one for large μNe. The former is relevant for the majority of organisms in the plant and animal kingdom, and the latter may be relevant for RNA viruses.

Background

Kimura has argued that the majority of nucleotide substitutions that accumulate in genes over time are selectively neutral, and go to fixation purely by chance [1]. One major prediction of Kimura's neutral theory is that the substitution process should be a Poisson process, with the mean number of substitutions per unit time equal to the variance. In contrast to this theory, empirical studies often find the variance to be significantly larger than the mean [2-8]. This observation has been termed the "overdispersed molecular clock". (For an excellent review of both empirical evidence and mathematical theories, see Ref. [9].) It is possible to reconcile Kimura's theory with the overdispersed molecular clock via Takahata's fluctuating neutral space model [10-12]. If the neutral mutation rate fluctuates slowly, then the substitution process ceases to be Poissonian, and becomes indeed overdispersed. However, the problem with the fluctuating neutral space model is that it does not offer any argument for why the neutral mutation rate should fluctuate, and thus ultimately fails to explain the observed substitution patterns.

An explanation for fluctuations in neutral mutation rate was recently proposed by Bastolla et al. [13-16]. Different proteins with identical structure naturally vary in their neutrality, that is, in the fraction of single-point mutants that are viable. Therefore, as a gene slowly drifts through sequence space, the neutral mutation rate will fluctuate in correlation to the changing neutrality, and this fluctuation alone could be sufficient to explain the overdispersed molecular clock. Bastolla et al. studied the substitution process in a variety of models of neutral protein evolution in silico, and found significant overdispersion in all cases they considered.

However, the simulations that Bastolla et al. carried out were limited to cases in which the product of mutation rate μ and population size Ne is small (because Bastolla et al. used only a single sequence as the representative of the whole population, an approach that is justified for μNe [less, similar] 1). Since population size and mutation rate can be substantial in some species (most notably in RNA viruses), it is justified to ask how general this result is for arbitrary values of μNe. It is known that large populations and populations evolving under high mutation pressure experience a strong selective pressure to avoid regions of low neutrality, an effect that has been termed "evolution of mutational robustness" [17-20]. In equilibrium, such populations settle in areas of sequence space that have above-average neutrality. As a result, regions of low neutrality are not represented in the population, and the distribution of neutralities in the population is much narrower than the total distribution of neutralities in sequence space. Therefore, we should expect that the neutral mutation rate does not fluctuate strongly under these conditions, and that the molecular clock will not be significantly overdispersed.

For the present paper, I have studied the behavior of the substitution process under neutral protein evolution as a function of mutation rate μ and population size Ne. I have found that the accumulation of non-synonymous mutations is substantially overdispersed for small μNe, in agreement with the results of Bastolla et al., but approaches a Poisson process when μNe [dbl greater-than sign] 1. The accumulation of synonymous substitutions is always Poissonian, regardless of the value of μNe.

Results

I carried out simulations with DNA sequences of length L = 75. I determined the fitness of a DNA sequence by translating it into the corresponding amino-acid sequence, and determining its native fold within the framework of a lattice-protein model. (A sequence would have fitness 1 if it folded into a pre-determined target structure, and fitness 0 otherwise.) I used a simple model of maximally compact proteins on a 5 × 5 lattice. This protein-folding model is much simpler than the ones used by Bastolla et al. [13,14], but has been shown to produce realistic distributions of folding free energies and neutralities [21-23]. The advantage of the simpler model is that entire populations of evolving sequences can be simulated, instead of just individual sequences.

First, I have found that my model produces overdispersion (that is, an index of dispersion R substantially above 1) for non-synonymous substitutions, but not for synonymous substitutions. The finding for synonymous mutations is not surprising, because changes in the protein's neutrality do not affect the probability with which a synonymous mutation is neutral (which is always one). Neutral evolution could produce overdispersion in the synonymous substitutions only if the number of synonymous sites in the sequence were undergoing significant fluctuations. While these fluctuations do occur, they are apparently not large enough to affect the index of dispersion.

Second, I have found that for non-synonymous substitutions, R decays quickly with increasing population size Ne at fixed μ (Fig. (Fig.1).1). Since one reason for a decaying index of dispersion could be a reduced number of accumulated mutations, I have studied how the mean number of accumulated mutations behaves as a function of population size. Instead of staying constant or decreasing, the mean increases with increasing Ne, while the variance decreases (Fig. (Fig.2).2). This result shows that the reduction in R is not caused by a mere reduction in the accumulated mutations, and that the substitution process does indeed shift from overdispersed to Poissonian as the population size increases.

Figure 1
Index of dispersion as a function of population size Ne for synonymous and non-synonymous substitutions (τ = 1000, μ = 0.075).
Figure 2
Mean, An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i1.gif and variance, An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i2.gif of lineage-adjusted number of non-synonymous substitutions as a function of population size Ne (τ = 1000, μ = 0.075). Quantities were calculated from all 500 replicates at each population size.

For non-synonymous substitutions, R decays with Ne because of evolution of mutational robustness. However, mutational robustness is caused by large μNe, rather than large Ne alone, and the parameter region in which mutational robustness becomes relevant is μNe [greater, similar] 10 [17]. Therefore, it is more instructive to plot R as a function of μNe. The only problem with a naive plot of that sort is that R increases as a function of μτ, where τ is the length of the time window during which mutations accumulate [9]. Thus, in Fig. Fig.3,3, I show R for constant μτ as a function of μNe. Note that in this figure, instead of the sequence-wide mutation rate μ, I use the non-synonymous mutation rate μn = 0.76 μ, which is corrected for the fact that only approximately 76% of mutations hit non-synonymous sites. (76% is the expected fraction of non-synonymous sites in a random DNA sequence.) Figure Figure33 shows that the transition from an overdispersed to a Poissonian substitution process occurs for μNe between approximately 10 and 100, in agreement with Ref. [17], and that the transition region seems to be largely independent of the value of μτ.

Figure 3
Index of dispersion for non-synonymous mutations as a function of the product of non-synonymous mutation rate μn (= 0.76 μ) and population size Ne.

Figure Figure33 also shows that R increases with μτ. This dependency becomes clearer in Fig. Fig.4,4, where I display R as a function of μτ for fixed μNe. The figure shows substantial increase in R with increasing μτ for small to moderate μNe. However, even for μNe well above 50, there is still a slight increase in R with μτ. Therefore, my results do not settle the question of whether the substitution process becomes truly Poissonian for sufficiently large μNe, or whether it just approaches a Poisson process but always remains slightly overdispersed. To settle this question, one would have to carry out simulations with much larger τ and Ne. Unfortunately, the protein folding model I use is still too computationally intensive to permit such simulations with current computational resources.

Figure 4
Index of dispersion for non-synonymous mutations as a function of the product of non-synonymous mutation rate μn (= 0.76 μ) and divergence time τ.

Discussion

My results show that the size of the product μNe has a substantial effect on the index of dispersion under neutral evolution. The substitution process is strongly overdispersed for small μNe, but approaches a Poisson process as μNe grows large. Therefore, the next question is which of the two regimes has more biological relevance. As discussed by Cutler [9], the biggest problem in explaining the overdispersed molecular clock is not to come up with mechanisms that produce overdispersion, but to find a general mechanism that does not depend on special conditions or finely-tuned parameters.

To assess the likelihood that fluctuations in neutrality contribute to the overdispersed molecular clock, we have to know the mutation rate and population size for the species of interest. It is notoriously difficult to obtain accurate data for these parameters, and only a few species have been studied in depth. One of the best data sets available is probably the one for Drosophila. Keightley and Eyre-Walker estimated the per-nucleotide substitution rate in Drosophila to be u = 2.2 × 10-9 [24]. If we assume that the average gene in Drosophila is 1770 bp long [24], and that 76% of the nucleotides are non-synonymous (this number stems from averaging the number of non-synonymous sites over all codons with equal weight), then the average number of non-synonymous sites per gene is 1345 bp. Thus, the average rate of non-synonymous mutations per gene is μn = 3.0 × 10-6. With an effective population size of approximately 3 × 105 [25], we get a product of population size and per-gene-non-synonymous mutation rate of approximately 1. Since selection for mutational robustness starts to take effect when this product is substantially larger than 1, Drosophila lies well within the parameter region in which we expect overdispersion to be caused by neutral evolution. For other higher organisms, in particular mammals, which tend to have comparatively small population sizes, we can expect that the product μnNe falls into the same parameter region. On the other hand, for microorganisms, which can have very large population sizes, mutational robustness may play a role in their evolution. In particular, RNA viruses have genomic mutation rates on the order of one [26,27] and their genomes consist typically of only a handful of genes. Because RNA viruses undergo severe bottlenecks on a regular basis, their effective population size Ne is much smaller than the number of virus particles in infected individuals (which can exceed 1012), and is more closely related to the number of infected individuals. For HIV-1, Ne has been estimated to be approximately 102 for subtype A, and 105 for subtype B [28].

The preceeding paragraph shows that neutral evolution of proteins is probably one source of overdispersed non-synonymous substitutions in Drosophila and other organisms. However, overdispersion has been observed in synonymous substitutions as well. For example, Zeng et al. [29] found an index of dispersion R significantly above one for synonymous, but not for non-synonymous substitutions in Drosophila. For mammals, some studies found R significantly above one for both synonymous and non-synonymous substitutions [8], while others found only the non-synonymous substitution process to be overdispersed [30]. Therefore, it is likely that other processes than neutral protein evolution also contribute to overdispersion. Such processes can be selection for optimal codon usage in the case of synonymous mutations, and positive selection on the amino acid level in the case of non-synonymous mutations.

I have demonstrated that large μNe results in a substitution process with little overdispersion. However, I have not yet given an explanation for how overdispersion is reduced in populations with large μNe. There are two elements: First, selection for mutational robustness reduces the fraction of sequences with low neutrality, and increases the fraction of sequences with high neutrality, thus making the population more homogeneous and reducing the overall range of neutralities [17-20]. Second, a sequence with low neutrality will experience a real selective disadvantage in comparison to a sequence with high neutrality for large μNe, and will therefore have a reduced probability to end up on the line of descent. While this selective disadvantage is often small, it can nevertheless determine the evolutionary fate of a sequence in a large population. The larger the population, the more sensitive it becomes to small fitness differences, so that in a very large population a sequence with only a moderate reduction in neutrality will have a small probability to end up on the line of descent. (The fact that the mean substitution rate increases with the population size, as seen in Fig. Fig.2,2, is also consistent with this reasoning. The larger the population size, the more high-neutrality sequences end up on the line of descent, which is reflected in the increase in the mean substitution rate.)

Throughout this paper, I have considered only neutral or lethal mutations. It is a reasonable question to ask if and how deleterious mutations would change my results. The answer is that they probably have only a minor impact, and the less so the larger Ne, unless they are very slightly deleterious. In order to affect the molecular clock, the deleterious mutations must end up on the line of descent, that is, they must go to fixation. The probability of fixation pfix of deleterious mutants drops exponentially with the population size, pfix = [1 - exp(2s)]/ [1 - exp(2sNe)], where s is the selective disadvantage of the deleterious mutation [31]. Therefore, for reasonable population sizes, only very slightly deleterious mutations can go to fixation and thus affect the molecular clock. This reasoning is independent of the size of μNe, as long as Ne is large in comparison to s.

Conclusions

The present study supports the following conclusions:

• Neutral drift of proteins can lead to an overdispersed substitution process for non-synonymous mutations, but not for synonymous mutations.

• The amount of overdispersion in the non-synonymous substitution process depends strongly on the product of mutation rate and population size. As this product increases, the substitution process becomes more and more Poissonian. The transition region starts at μNe ≈ 10, and extends to values well above 100.

• It is not clear whether there are any species that have a sufficiently large population size and mutation rate to prevent overdispersion through neutral drift. In Drosophila, the product of mutation rate and population size is close to one, which is well below the parameter region in which the substitution process turns Poissonian.

Methods

Lattice protein model

I implemented a version of the 5 × 5 lattice protein model put forward by Goldstein and coworkers [21-23,32]. In this model, proteins are sequences of n = 25 residues that fold into a maximally compact structure on a two-dimensional grid of 5 × 5 lattice points. There are 1081 distinct possible conformations in this model, and the partition function can be evaluated exactly by summing over the contact energies of all distinct conformations.

The contact energy of a conformation i is

An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i3.gif

where An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i4.gif is the contact energy between amino acids An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i5.gif at location j and An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i6.gif at location k in the sequence, and An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i7.gif is 1 if the two amino acids are in contact in conformation k, and 0 otherwise. The partition function is

An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i8.gif

where the sum runs over all 1081 conformations. A sequence folds into conformation f if the contact energy for that conformation is lower than the contact energies of all other formations, Ef <Ei for all i ≠ f, and if the free energy of folding, which is defined as

An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i9.gif

is smaller than some cutoff ΔGcut. Throughout this study, I used kT = 0.6 and ΔGcut = 0. The contact energies An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i4.gif where taken from Table VI in Ref. [33].

Sequence evolution

I simulated the evolution of populations of DNA sequences in discrete, non-overlapping generations. Population size is denoted by Ne. The fitness of a sequence was 1 if the DNA sequence translated into a peptide sequence that could fold into a chosen target structure, and had a free energy of folding smaller than Gcut. Otherwise, the fitness of the sequence was 0. All sequences had length L = 75. In each successive generation, sequences with fitness 1 were randomly chosen to reproduce, until the new generation had Ne members. At reproduction, the sequences were mutated, with an average of μ base pair substitutions per sequence. I let each population evolve for several thousand generations, and kept track of the full genealogic information of all sequences in the population. In order to measure the molecular clock of fixed mutations only, I studied the pattern of base substitutions in a window of τ generations along the line of descent backwards in time, starting from the most recent common ancestor of the final population.

I varied the parameters Ne (10, 33, 100, 330, 1000, 3300), μ (0.0075, 0.075, 0.75), and τ (500, 1000). For each set of parameters, I carried out 500 replicates (each with a different, randomly chosen target structure), to obtain a distribution for the number of synonymous and non-synonymous substitutions Sd and Nd. Since there was some variation in the number of synonymous and non-synonymous sites across different target structures (on the order of approximately ± 5% variation from the mean), I then applied a correction factor to Sd and Nd to bring them into comparable units: I calculated the corrected number of synonymous substitutions An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i10.gif as An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i11.gif Here, S is the mean number of synonymous sites for the given replicate, and (S) is the average of S over all 500 replicates. Likewise, I calculated An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i12.gif (Indices of dispersion calculated without this correction factor are slightly larger than the ones reported here, because the variation in S and N creates additional variance in Sd and Nd). Similar correction factors have been used in sequence analysis [7], and are generally referred to as lineage adjustments. They control for differences among lineages that are primarily related to the expected number of substitutions in a lineage, and thus should not enter the index of dispersion.

To obtain an estimate for mean and standard error of the index of dispersion, I subdivided the 500 results into 10 blocks of 50 each, and calculated mean and variance of the number of substitutions for each block. The ratio of variance to mean for a given set of substitutions (synonymous or non-synonymous) in a block is the index of dispersion for this data set. I then calculated mean and standard error for the index of dispersion from the individual results of the 10 blocks.

The total CPU time needed to carry out all simulations was several months on a small cluster of Pentium II 500 MHz machines.

Calculation of synonymous and non-synonymous substitutions and sites

I calculated the number of synonymous and non-synonymous sites S and N and the number of synonymous and non-synonymous substitutions Sd and Nd according to the method proposed by Nei and Gojobori [34]. In short, under this method the number of synonymous sites si of a codon i is the fraction of possible substitutions to that codon that leave the residue unchanged. The number of non-synonymous sites ni for the same codon is ni = 3 - si. For the complete sequence, S and N are calculated as An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i13.gif and An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i14.gif where i runs over all codons in the sequence. The number of synonymous or non-synonymous substitutions sd,i or nd,i between two codons is the average number of such substitutions, where the average is taken over all paths that lead from one codon to the other. The total number of synonymous or non-synonymous substitutions between two sequences is the sum over all individual constributions, An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i15.gif and An external file that holds a picture, illustration, etc.
Object name is 1471-2156-5-25-i16.gif (again, i runs over all codons in the sequence).

To calculate the number of synonymous or non-synonymous substitutions along the line of descent, I simply summed up all synonymous or non-synonymous substitutions that occurred from generation to generation. Because the full evolutionary history was known, a correction for multiple mutations such as the Jukes-Cantor correction [35] was not necessary. I also averaged the number of synonymous and non-synonymous sites over all sequences along the line of descent, to get the mean number of synonymous and non-synonymous sites for the given evolutionary trajectory.

Authors' contributions

COW carried out all aspects of this study.

Acknowledgements

This work was supported in part by the NSF under Contract No. DEB-9981397. I thank Ugo Bastolla for helpful comments on an earlier version of this paper.

References

  • Kimura M. The Neutral Theory of Molecular Evolution. Cambridge: Cambridge University Press; 1983.
  • Ohta T, Kimura M. On the constancy of the evolutionary rate of cistrons. J Mol Evol. 1971;1:18–25. [PubMed]
  • Langley CH, Fitch WM. An estimation of the constancy of the rate of molecular evolution. J Mol Evol. 1974;3:161–177. [PubMed]
  • Gillespie JH. The molecular clock may be an episodic clock. Proc Natl Acad Sci USA. 1984;81:8009–8013. [PMC free article] [PubMed]
  • Gillespie JH. Natural selection and the molecular clock. Mol Biol Evol. 1986;3:138–155. [PubMed]
  • Gillespie JH. Variability of evolutionary rates of DNA. Genetics. 1986;113:1077–1091. [PMC free article] [PubMed]
  • Gillespie JH. Lineage effects and the index of dispersion of molecular evolution. Mol Biol Evol. 1989;6:636–647. [PubMed]
  • Ohta T. Synonymous and non-synonymous substitutions in mammalian genes and the nearly neutral theory. J Mol Evol. 1995;40:56–63. [PubMed]
  • Cutler DJ. Understanding the overdispersed molecular clock. Genetics. 2000;154:1403–1417. [PMC free article] [PubMed]
  • Takahata N. On the overdispersed molecular clock. Genetics. 1987;116:169–179. [PMC free article] [PubMed]
  • Takahata N. Statistical models of the overdispersed molecular clock. Theor Popul Biol. 1991;39:329–344. [PubMed]
  • Cutler DJ. The index of dispersion of molecular evolution: slow fluctuations. Theor Popul Biol. 2000;57:177–186. doi: 10.1006/tpbi.1999.1445. [PubMed] [Cross Ref]
  • Bastolla U, Roman HE, Vendruscolo M. Neutral evolution of model proteins: Diffusion in sequence space and overdispersion. J Theor Biol. 1999;200:49–64. doi: 10.1006/jtbi.1999.0975. [PubMed] [Cross Ref]
  • Bastolla U, Porto M, Roman HE, Vendruscolo M. Lack of self-averaging in neutral evolution of proteins. Phys Rev Lett. 2002;89:20801. doi: 10.1103/PhysRevLett.89.208101. [PubMed] [Cross Ref]
  • Bastolla U, Porto M, Roman HE, Vendruscolo M. Connectivity of neutral networks, overdispersion, and structural conservation in protein evolution. J Mol Evol. 2003;56:243–254. doi: 10.1007/s00239-002-2350-0. [PubMed] [Cross Ref]
  • Bastolla U, Porto M, Roman HE, Vendruscolo M. Statistical properties of neutral evolution. J Mol Evol. 2003;57:S103–S119. doi: 10.1007/s00239-003-0013-4. [PubMed] [Cross Ref]
  • van Nimwegen E, Crutchfield JP, Huynen M. Neutral evolution of mutational robustness. Proc Natl Acad Sci USA. 1999;96:9716–9720. doi: 10.1073/pnas.96.17.9716. [PMC free article] [PubMed] [Cross Ref]
  • Bornberg-Bauer E, Chan HS. Modeling evolutionary landscapes: Mutational stability, topology, and superfunnels in sequence space. Proc Natl Acad Sci USA. 1999;96:10689–10694. doi: 10.1073/pnas.96.19.10689. [PMC free article] [PubMed] [Cross Ref]
  • Wilke CO. Adaptive evolution on neutral networks. Bull Math Biol. 2001;63:715–730. doi: 10.1006/bulm.2001.0244. [PubMed] [Cross Ref]
  • Wilke CO, Adami C. Evolution of mutational robustness. Mutat Res. 2003;522:3–11. doi: 10.1016/S0027-5107(02)00307-X. [PubMed] [Cross Ref]
  • Taverna DM, Goldstein RA. The distribution of structures in evolving protein populations. Biopolymers. 2000;53:1–8. doi: 10.1002/(SICI)1097-0282(200001)53:1<1::AID-BIP1>3.3.CO;2-O. [PubMed] [Cross Ref]
  • Taverna DM, Goldstein RA. Why are proteins so robust to site mutations? J Mol Biol. 2002;315:479–484. doi: 10.1006/jmbi.2001.5226. [PubMed] [Cross Ref]
  • Taverna DM, Goldstein RA. Why are proteins marginally stable? Proteins. 2002;46:105–109. doi: 10.1002/prot.10016. [PubMed] [Cross Ref]
  • Keightley PD, Eyre-Walker A. Deleterious mutations and the evolution of sex. Science. 2000;290:331–333. doi: 10.1126/science.290.5490.331. [PubMed] [Cross Ref]
  • Li YJ, Satta Y, Takahata N. Paleo-demography of the Drosophila melanogaster subgroup: application of the maximum likelihood method. Genes Genet Syst. 1999;74:117–127. doi: 10.1266/ggs.74.117. [PubMed] [Cross Ref]
  • Drake JW. Rates of spontaneous mutation among RNA viruses. Proc Natl Acad Sci USA. 1993;90:4171–4175. [PMC free article] [PubMed]
  • Drake JW, Holland JJ. Mutation rates among RNA viruses. Proc Natl Acad Sci USA. 1999;96:13910–13913. doi: 10.1073/pnas.96.24.13910. [PMC free article] [PubMed] [Cross Ref]
  • Grasslya NC, Harveya PH, Holmes EC. Population dynamics of HIV-1 inferred from gene sequences. Genetics. 1999;151:427–438. [PMC free article] [PubMed]
  • Zeng LW, Comeron JM, Chen B, Kreitman M. The molecular clock revisited: the rate of synonymous vs. replacement change in Drosophila. Genetica. 1998;102/103:369–382. doi: 10.1023/A:1017035109224. [PubMed] [Cross Ref]
  • Smith NG, Eyre-Walker A. Partitioning the variation in mammalian substitution rates. Mol Biol Evol. 2003;20:10–17. [PubMed]
  • Kimura M. On the probability of fixation of mutant genes in a population. Genetics. 1962;47:713–719. [PMC free article] [PubMed]
  • Buchler NEG, Goldstein RA. Effect of alphabet size and foldability requirements on protein structure designability. Proteins. 1999;34:113–124. doi: 10.1002/(SICI)1097-0134(19990101)34:1<113::AID-PROT9>3.3.CO;2-A. [PubMed] [Cross Ref]
  • Miyazawa S, Jernigan RL. Estimation of effective inter-residue contact energies from protein crystal structures: quasichemical approximation. Macromolecules. 1985;18:534–552.
  • Nei M, Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 1986;3:418–426. [PubMed]
  • Jukes TH, Cantor CR. Evolution of protein molecules. In: Munro HN, editor. In Mammalian protein metabolism III. New York: Academic Press; 1969. pp. 21–132.

Articles from BMC Genetics are provided here courtesy of BioMed Central
PubReader format: click here to try

Formats:

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...

Links

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...