Bioinformatics. 2009 May 15;25(10):1329-30. doi: 10.1093/bioinformatics/btp084. Epub 2009 Apr 5.

TEclass--a tool for automated classification of unknown eukaryotic transposable elements.

Author information

  • Katholieke Universiteit Leuven, Department of Biology, Laboratory of Aquatic Ecology and Evolutionary Biology, Ch. Deberiotstraat 32, 3000 Leuven, Belgium. abrusan@uni-muenster.de

Abstract

MOTIVATION:

The large number of sequenced genomes has required the development of software that reconstructs the consensus sequences of transposons and other repetitive elements. However, the available tools usually focus on the accurate identification of raw repeats and provide no information about the taxonomic position of the reconstructed consensus sequences. TEclass is a tool that classifies unknown transposable elements into four main functional categories, which reflect their mode of transposition: DNA transposons, long terminal repeats (LTRs), long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs). TEclass uses a machine-learning method, the support vector machine (SVM), for classification based on oligomer frequencies. It achieves 90-97% accuracy in the classification of novel DNA and LTR repeats, and 75% for LINEs and SINEs.
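As a rough illustration of what "oligomer frequencies" could look like as classifier input, the following Python sketch turns a DNA sequence into a fixed-length vector of relative k-mer frequencies. It is not TEclass code: the function name `oligomer_frequencies`, the choice of k, and the handling of ambiguous bases are all assumptions made for this example, and the abstract does not state which oligomer lengths the tool uses. SVM training itself is not shown; vectors of this kind would be passed, together with class labels, to an SVM library.

```python
from collections import Counter
from itertools import product


def oligomer_frequencies(sequence, k=3):
    """Return relative frequencies of all 4**k DNA k-mers.

    Hypothetical helper for illustration only. The vector is ordered
    lexicographically over the alphabet A, C, G, T.
    """
    sequence = sequence.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    # Count every overlapping window of length k in the sequence.
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    # Normalise only over windows made purely of A, C, G, T, so windows
    # containing N or other ambiguity codes are ignored (an assumption).
    total = sum(counts[kmer] for kmer in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[kmer] / total for kmer in kmers]


if __name__ == "__main__":
    toy_repeat = "ACGTACGTNNACGT"  # made-up sequence, not real data
    vector = oligomer_frequencies(toy_repeat, k=2)
    print(len(vector), vector[:4])
```

Because every sequence maps to a vector of the same length regardless of its own length, consensus sequences of very different sizes can be compared by the same classifier.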

AVAILABILITY:

http://www.compgen.uni-muenster.de/teclass; the stand-alone program is available upon request.

PMID: 19349283
[PubMed - indexed for MEDLINE]