Mol Genet Genomics. 2009 Jun;281(6):579-90. doi: 10.1007/s00438-009-0433-y. Epub 2009 Feb 26.

Patterns of tandem repetition in plant whole genome assemblies.

Author information

1 Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA. rnavajas@ugr.es

Abstract

Tandem repeats often confound large genome assemblies. A survey of tandemly arrayed repetitive sequences was carried out in whole genome sequences of the green alga Chlamydomonas reinhardtii, the moss Physcomitrella patens, the monocots rice and sorghum, and the dicots Arabidopsis thaliana, poplar, grapevine, and papaya, to test how these assemblies deal with this fraction of DNA. Our results suggest that plant genome assemblies preferentially include tandem repeats composed of shorter monomeric units (especially dinucleotide and 9-30-bp repeats), while higher repetitive units are more difficult to assemble. Nevertheless, although currently available sequencing technologies struggle with higher arrays of repeated DNA, major well-known repetitive elements, including centromeric and telomeric repeats as well as high copy-number genes, were found to be reasonably well represented. A database including all tandem repeat sequences characterized here was created to benefit future comparative genomic analyses.
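
The abstract does not describe how tandem repeats were detected, so the following is only an illustrative sketch of what a tandem array of a given monomer length looks like in sequence data. It is not the authors' pipeline. The function name `find_tandem_repeats` and its parameters are hypothetical, and it finds only perfect (mismatch-free) repeats, which real surveys would not be limited to.

```python
import re

def find_tandem_repeats(seq, unit_len, min_copies=3):
    """Return (start, unit, copies) for perfect tandem arrays of a fixed unit length.

    Hypothetical helper for illustration only: it ignores mismatches and indels,
    and it will also report homopolymer runs (e.g. "AA" repeated) as arrays.
    """
    # ([ACGT]{k}) captures a candidate monomer of length k;
    # \1{n,} requires at least n further back-to-back copies of that monomer.
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_copies - 1))
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        copies = len(m.group(0)) // unit_len  # whole copies in the matched array
        hits.append((m.start(), unit, copies))
    return hits

# A dinucleotide array (AT repeated five times) flanked by unrelated bases.
example = "GGG" + "AT" * 5 + "CCC"
print(find_tandem_repeats(example, unit_len=2))
```

Calling the function once per monomer length (for example 2, then each length from 9 to 30) would give a rough per-size tally of the kind the abstract summarizes, under the perfect-repeat assumption stated above.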

PMID: 19242726
DOI: 10.1007/s00438-009-0433-y
[Indexed for MEDLINE]
