Send to

Choose Destination
Plant J. 1997 May;11(5):1127-40.

Identification of members of gene families in Arabidopsis thaliana by contig construction from partial cDNA sequences: 106 genes encoding 50 cytoplasmic ribosomal proteins.

Author information

Laboratoire de Physiologie et Biologie Moléculaires Végétales, UMR5545 du CNRS, Université de Perpignan, France.


Partial cDNA sequencing to obtain expressed sequence tags (ESTs) has led to the identification of tags to about 8,000 of the estimated 20,000 genes on Arabidopsis thaliana. This figure represents four to five times the number of complete coding sequences from this organism available in international databases. In contrast to mammals, many proteins are encoded by multigene families in A. thaliana. Using ribosomal protein gene families as an example, it is possible to construct relatively long sequences from overlapping ESTs which are of sufficiently high quality to be able to unambiguously identify tags to individual members of multigene families, even when the sequences are highly conserved. A total of 106 genes encoding 50 different cytoplasmic ribosomal protein types have been identified, most proteins being encoded by at least two and up to four genes. Coding sequences of members of individual gene families are almost always very highly conserved and derived amino acid sequences are almost, if not completely, identical in the vast majority of cases. Sequence divergence is observed in untranslated regions which allows the definition of gene-specific probes. The method can be used to construct high-quality tags to any protein.

[Indexed for MEDLINE]
Free full text

Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center