Mob DNA. 2019 Dec 29;10:52. doi: 10.1186/s13100-019-0192-1. eCollection 2019.

Tools and best practices for retrotransposon analysis using high-throughput sequencing data.

Author information

1. Institut Curie, PSL Research University, 75005 Paris, France.
2. INSERM U900, 75005 Paris, France.
3. MINES ParisTech, PSL Research University, 75005 Paris, France.
4. INSERM U934, CNRS UMR 3215, 75005 Paris, France.

Abstract

Background:

Sequencing technologies give access to a precise picture of the molecular mechanisms that regulate the genome. One of the biggest technical challenges with sequencing data is mapping millions of reads to a reference genome. This problem is exacerbated for repetitive sequences such as transposable elements, which occupy half of the mammalian genome. Reads sequenced from these regions introduce ambiguities in the mapping step. Dedicated parameters and algorithms should therefore be considered when the regulation of transposable elements is investigated with sequencing datasets.
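The mapping ambiguity described above can be illustrated with a toy example. The sketch below is not taken from the article: the genome, the repeated element and the reads are all made up, and exact string matching stands in for a real aligner. It only shows why a read drawn entirely from a repeated element has several equally good placements, while a read overlapping unique flanking sequence has one.

```python
def find_all(genome: str, read: str) -> list[int]:
    """Return every 0-based start position where `read` matches `genome` exactly."""
    positions = []
    start = genome.find(read)
    while start != -1:
        positions.append(start)
        start = genome.find(read, start + 1)
    return positions


# Hypothetical toy genome: two identical copies of a made-up element (TE)
# separated and flanked by unique sequence.
TE = "ACGTTGCA"
genome = "GGATCC" + TE + "TTAGGC" + TE + "CCATGG"

reads = {
    "unique": "GGATCC",    # lies entirely in a unique flank
    "repeat": "CGTTGC",    # lies entirely inside the repeated element
    "spanning": "TCCACG",  # overlaps the junction between flank and element
}

hits = {name: find_all(genome, seq) for name, seq in reads.items()}

for name, positions in hits.items():
    status = "ambiguous" if len(positions) > 1 else "unique"
    print(f"{name}: positions={positions} ({status})")
```

In this toy setting only the read contained entirely within the repeated element is ambiguous; the read that reaches into unique flanking sequence can still be placed at a single position.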

Results:

Here, we used simulated reads from the mouse and human genomes to define the best parameters for aligning transposable element-derived reads to a reference genome. We compared the efficiency of the most commonly used aligners and further evaluated how transposable element representation should be estimated using available methods. We also calculated the mappability of the different transposon families in the mouse and human genomes, which provides an overview of their evolution.
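As an illustration of how simulated reads can be used to score a transposon family, the sketch below computes, per family, the fraction of simulated reads that an aligner placed back at their position of origin. This is an assumed, simplified scoring scheme and not necessarily the mappability definition used in the article; the record layout, the family names and the positions are hypothetical.

```python
from collections import defaultdict
from typing import Iterable, Optional

# Hypothetical record layout: (family, true_start, mapped_start or None if unmapped).
Record = tuple[str, int, Optional[int]]


def family_mappability(records: Iterable[Record]) -> dict[str, float]:
    """Fraction of simulated reads per family mapped back to their true start."""
    total: dict[str, int] = defaultdict(int)
    correct: dict[str, int] = defaultdict(int)
    for family, true_start, mapped_start in records:
        total[family] += 1
        if mapped_start == true_start:
            correct[family] += 1
    return {family: correct[family] / total[family] for family in total}


# Made-up results: reads from an older, diverged family are all recovered,
# while reads from a younger, near-identical family are often misplaced or lost.
simulated = [
    ("old_family", 100, 100),
    ("old_family", 250, 250),
    ("young_family", 400, 900),    # placed on another copy
    ("young_family", 900, 900),
    ("young_family", 1300, None),  # left unmapped
    ("young_family", 1700, 1700),
]

scores = family_mappability(simulated)
print(scores)
```

A score of this kind depends on the read length, the aligner and its parameters, so it would need to be recomputed for each configuration being compared.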

Conclusions:

Based on simulated data, we provide recommendations for the alignment and quantification steps to perform when transposon expression or regulation is studied, and we identify the limits in detecting specific young transposon families of the mouse and human genomes. These principles may help the community adopt standard procedures and raise awareness of the difficulties encountered in the study of transposable elements.

KEYWORDS:

Data analysis; High-throughput sequencing; Mapping; Quantification; Retrotransposon

Conflict of interest statement

Competing interests: The authors declare that they have no competing interests.
