Genome Biol. 2006;7(1):R7. Epub 2006 Jan 31.

Identifying repeat domains in large genomes.

Bioinformatics Program, University of California, San Diego, CA 92093-0419, USA. dzhi@ucsd.edu

We present a graph-based method for the analysis of repeat families in a repeat library. We build a repeat domain graph that decomposes a repeat library into repeat domains, short subsequences shared by multiple repeat families, and reveals the mosaic structure of repeat families. Our method recovers documented mosaic repeat structures and suggests additional putative ones. Our method is useful for elucidating the evolutionary history of repeats and annotating de novo generated repeat libraries.
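The idea of subsequences shared across repeat families can be illustrated with a toy sketch. It is not the method of the paper: it assumes exact k-mer matching in place of whatever sequence comparison the authors use, and the function names (`shared_kmers`, `family_graph`), the family names, and the sequences are all hypothetical. It only shows how a family that shares different segments with two otherwise unrelated families appears as a mosaic in a graph.

```python
from collections import defaultdict
from itertools import combinations


def shared_kmers(library, k):
    """Map each k-mer found in two or more families to those families."""
    index = defaultdict(set)
    for name, seq in library.items():
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].add(name)
    # Keep only k-mers present in at least two families.
    return {kmer: fams for kmer, fams in index.items() if len(fams) >= 2}


def family_graph(shared):
    """Count shared k-mers for each pair of families (undirected edges)."""
    edges = defaultdict(int)
    for fams in shared.values():
        for a, b in combinations(sorted(fams), 2):
            edges[(a, b)] += 1
    return dict(edges)


# Hypothetical toy library: famA shares one segment with famB
# and a different segment with famC; famB and famC share nothing.
library = {
    "famA": "ACGTACGTTTGACC",
    "famB": "GGGACGTACGTCCC",
    "famC": "TTGACCAAAAAAAA",
}

shared = shared_kmers(library, 6)
graph = family_graph(shared)
print(graph)
```

In this toy input, famA is connected to both famB and famC while those two have no edge between them, which is the kind of pattern a mosaic family would produce. A real analysis would need approximate matching and would merge overlapping k-mers into longer domains.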

PMID: 16507140 [PubMed - indexed for MEDLINE]

PMCID: PMC1431705