Format

Send to

Choose Destination
Gigascience. 2019 Jun 1;8(6). pii: giz073. doi: 10.1093/gigascience/giz073.

A large interactive visual database of copy number variants discovered in taurine cattle.

Author information

1
Department of Agricultural, Food and Nutritional Science (AFNS), University of Alberta, Edmonton, AB, Canada.
2
Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, Alberta, Canada.
3
Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada.
4
Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, Switzerland.
5
Teagasc, Animal & Grassland Research and Innovation Centre, Moorepark, Fermoy, Ireland.

Abstract

BACKGROUND:

Copy number variants (CNVs) contribute to genetic diversity and phenotypic variation. We aimed to discover CNVs in taurine cattle using a large collection of whole-genome sequences and to provide an interactive database of the identified CNV regions (CNVRs) that includes visualizations of sequence read alignments, CNV boundaries, and genome annotations.

RESULTS:

CNVs were identified in each of 4 whole-genome sequencing datasets, which together represent >500 bulls from 17 breeds, using a popular multi-sample read-depth-based algorithm, cn.MOPS. Quality control and CNVR construction, performed dataset-wise to avoid batch effects, resulted in 26,223 CNVRs covering 107.75 unique Mb (4.05%) of the bovine genome. Hierarchical clustering of samples by CNVR genotypes indicated clear separation by breeds. An interactive HTML database was created that allows data filtering options, provides graphical and tabular data summaries including Hardy-Weinberg equilibrium tests on genotype proportions, and displays genes and quantitative trait loci at each CNVR. Notably, the database provides sequence read alignments at each CNVR genotype and the boundaries of constituent CNVs in individual samples. Besides numerous novel discoveries, we corroborated the genotypes reported for a CNVR at the KIT locus known to be associated with the piebald coat colour phenotype in Hereford and some Simmental cattle.

CONCLUSIONS:

We present a large comprehensive collection of taurine cattle CNVs in a novel interactive visual database that displays CNV boundaries, read depths, and genome features for individual CNVRs, thus providing users with a powerful means to explore and scrutinize CNVRs of interest more thoroughly.

KEYWORDS:

CNV; beef; cattle; dairy; database; sequence visualization; structural variants; whole-genome sequencing

PMID:
31241156
PMCID:
PMC6593363
DOI:
10.1093/gigascience/giz073
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center