Format

Send to

Choose Destination
See comment in PubMed Commons below
Database (Oxford). 2014 Mar 18;2014:bau020. doi: 10.1093/database/bau020. Print 2014.

Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes.

Author information

1
Centre for Diabetes Research, The Western Australian Institute for Medical Research, Western Australia, Australia, Centre of Medical Research, University of Western Australia, Perth, Western Australia, Australia and Southern Cross Plant Science, Southern Cross University, Lismore, New South Wales, Australia.

Abstract

The laboratory mouse is the most widely used mammalian model organism in biomedical research, so a thorough annotation of functional variation in the mouse genome would be of significant value. In this study, we compared sequence variation in a comprehensive list of functional elements (e.g. promoters, enhancers and CTCF binding sites) across 17 inbred mouse strains. Sequences were derived for ∼300,000 functional elements experimentally identified by the mouse ENCODE project as regulating gene expression in 19 different tissue sources. We aligned sequences for each predicted cis-regulatory element to genomes of 17 mouse strains. This yielded a database comprising ∼5 million aligned sequences, allowing interrogation of sequence variation of functional elements for each of the 19 tissues/cell types in commonly used mouse strains. We also developed an online tool to visualize the genome around each predicted cis-regulatory element in each tissue context and which allows efficient comparison of variation between any two sets of strains. This will be particularly useful in the context of the Collaborative Cross (CC), which was conceived as a powerful new systems genetics resource to accelerate gene discovery. Comprising a large number of inbred strains derived from eight genetically diverse founders, the CC offers rapid mapping and identification of genes that mediate complex traits. We show that, among the 17 sequenced strains, the set of CC founder strains captures the most variability in the ENCODE elements, further emphasizing the value of this resource. Database URL: www.sysgen.org/ecco.

PMID:
24647628
PMCID:
PMC3958616
DOI:
10.1093/database/bau020
[Indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Silverchair Information Systems Icon for PubMed Central
    Loading ...
    Support Center