Format

Send to

Choose Destination
Sci Data. 2017 Feb 14;4:170011. doi: 10.1038/sdata.2017.11.

11,670 whole-genome sequences representative of the Han Chinese population from the CONVERGE project.

Author information

1
Wellcome Trust Centre for Human Genetics, OX3 7BN Oxford, UK.
2
Wellcome Trust Sanger Institute, Wellcome Genome Campus, CB10 1SA Hinxton, Cambridge, UK.
3
European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, CB10 1SD Hinxton, Cambridge, UK.
4
Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, Virginia 23298, USA.
5
BGI-Shenzhen, Shenzhen, Guangdong 518083, China.
6
Department of Statistics, University of Oxford, Oxford OX1 3TG, UK.
7
UCL Genetics Institute, University College London, London WC1E 6BT, UK.
8
Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California, Los Angeles, Los Angeles, California 90095-1761, USA.

Abstract

The China, Oxford and Virginia Commonwealth University Experimental Research on Genetic Epidemiology (CONVERGE) project on Major Depressive Disorder (MDD) sequenced 11,670 female Han Chinese at low-coverage (1.7X), providing the first large-scale whole genome sequencing resource representative of the largest ethnic group in the world. Samples are collected from 58 hospitals from 23 provinces around China. We are able to call 22 million high quality single nucleotide polymorphisms (SNP) from the nuclear genome, representing the largest SNP call set from an East Asian population to date. We use these variants for imputation of genotypes across all samples, and this has allowed us to perform a successful genome wide association study (GWAS) on MDD. The utility of these data can be extended to studies of genetic ancestry in the Han Chinese and evolutionary genetics when integrated with data from other populations. Molecular phenotypes, such as copy number variations and structural variations can be detected, quantified and analysed in similar ways.

PMID:
28195579
PMCID:
PMC5308202
DOI:
10.1038/sdata.2017.11
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Nature Publishing Group Icon for PubMed Central
Loading ...
Support Center