Format

Send to

Choose Destination
BMC Genomics. 2006 Feb 16;7:28.

Highly expressed proteins have an increased frequency of alanine in the second amino acid position.

Author information

1
Department of Bioinformatics, Institute of Molecular and Cell Biology, University of Tartu, Riia 23, Tartu 51010, Estonia. age.tats@ut.ee

Abstract

BACKGROUND:

Although the sequence requirements for translation initiation regions have been frequently analysed, usually the highly expressed genes are not treated as a separate dataset.

RESULTS:

To investigate this, we analysed the mRNA regions downstream of initiation codons in nine bacteria, three archaea and three unicellular eukaryotes, comparing the dataset of highly expressed genes to the dataset of all genes. In addition to the detailed analysis of the nucleotide and codon frequencies we compared the N-termini of highly expressed proteins to the N-termini of all proteins coded in the genome.

CONCLUSION:

The most conserved pattern was observed at the amino acid level: strong alanine over-representation was observed at the second amino acid position of highly expressed proteins. This pattern is well conserved in all three domains of life.

PMID:
16483368
PMCID:
PMC1397820
DOI:
10.1186/1471-2164-7-28
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center