Jump to: Authorized Access | Attribution | Authorized Requests

Study Description

THE ALZHEIMER'S DISEASE SEQUENCING PROJECT (ADSP) DATA ARE HOSTED AT NIAGADS DSS: Please go to NIAGADS DSS to apply for build 38 ADSP genetic and phenotypic data. See Background below for more details. For instructions on how to access the additional ADSP data that are shared through NIAGADS DSS, visit the Application Instructions page.

Background: Additional sequencing data are continuously generated through the ADSP. These data are mapped to the latest Genome Reference Consortium human genome build GRCh38 (hg38) and are being shared through the NIA Genetics of Alzheimer's Disease Data Storage Site (NIAGADS) Data Sharing Service (DSS). As of May 14, 2025 there are 58,507 whole genomes and 19,922 whole exomes from 66 cohorts available to the research community through NIAGADS DSS (ng00067). Released data types include basic phenotypes, raw individual level files (CRAMs/gVCFs/SV VCFs), and project level joint-genotype called VCFs (both raw and quality-controlled). Additionally, through efforts of the ADSP Phenotype Harmonization Consortium (ADSP-PHC), harmonized endophenotypic data from select cohorts are available. Please visit the ADSP website for more information.

Participant Data Availability by Cohort at NIAGADS DSS

SequencingHarmonized Phenotypes from ADSP-PHC
CohortWES R2*WGS R5*NeuropathologyVascular Risk FactorsCognitionFluid BiomarkerDiffusion Tensor Imaging (DTI)Fluid Attenuated Inversion Recovery (FLAIR)Magnetic Resonance Imaging (MRI) FreesurferMagnetic Resonance Imaging (MRI) MUSEPositron Emission Tomography (PET) AmyloidPositron Emission Tomography (PET) Tau
A4 Study (A4)03345003,345009491,38603,340338
Adult Changes in Thought (ACT)127527847501,3370000000
Alzheimer's Disease Neuroimaging Initiative (ADNI)0216802,1642,1661,2109871,5302,1092,1581,648808
Amish Protective Variant Study (AMISH_PV)011320000000000
Arizona APOE Cohort Study0880000000000
ASPrin in Reducing Events in the Elderly (ASPREE)030670000000000
Atherosclerosis Risk in Communities Study (ARIC)5700000000000
Brain Amyloid Cognitive Normal Elders Study (BACNE)01290000000000
Cache County Study on Memory in Aging (CCS)02070000000000
Cardiff EOAD03950000000000
Case Western Reserve University (CWRU) Autopsy Cohort01360000000000
Case Western Reserve University (CWRU) Rapid Decline Cohort02360000000000
Center for Cognitive Neuroscience and Aging (CNSA)02710000000000
Corticobasal Degeneration Study (CBD)34700000000000
Chicago Health and Aging Project (CHAP)23100000000000
Cardiovascular Health Study (CHS)76600000000000
Cuban American Alzheimer's Disease Initiative (CuAADI)01210000000000
Erasmus Rucphen Family (ERF)451220000000000
Estudio Familiar de la Influencia Genetica en Alzheimer (EFIGA)330530004,2964,5630000000
Framingham Heart Study (FHS)57700000000000
Genetic and Environmental Risk Factors for Alzheimer's Disease Among African Americans (GenerAAtions)45500000000000
Genetic Differences (GDF)20700000000000
Gwangju Alzheimer's and Related Dementia (GARD)019950000000000
Harmonized Diagnostic Assessment of Dementia for the Longitudinal Aging Study of India (LASI-DAD)026820000000000
Health and Aging Brain Study - Health Disparities (HABS-HD)0135901,319001,2111,3121,2761,293922629
Healthy Elderly Active Longevity (HEAL) Cohort "Wellderly"011470000000000
Heart SCORE (HS)01330000000000
Hillblom'Aging Network (HAN)0150000000000
Human Connectome Project (HCP)01890000000000
Indianapolis-Ibadan Dementia Project (IBADAN)09640000000000
Indianapolis African Americans (IndyAA)03950000000000
KnightADRC - Charles F. and Joanne Knight Alzheimer's Disease Research Center (KGAD)65018330084254315526740641300
Korean Brain Aging Study for the Early Diagnosis and Prediction of AD (KBASE)0603006030000000
LonGenity0270000000000
Longevity Genes Project (LGP)050000000000
Mayo Clinic (MAYO)3461690000000000
Memory and Aging Project (MAP)4343605005705860481198811900
Mexican Health and Aging Study (MHAS)026170000000000
Mexico-Southern California Autosomal Dominant Alzheimer's Disease Consortium0620000000000
Minority Aging Research Study (MARS)6209424401618161800
Mount Sinai Brain Bank (MSBB)03440000000000
Multi-Institutional Research in Alzheimer's Genetic Epidemiology (MIRAGE)55300000000000
National Cell Repository for Alzheimer's Disease (NCRAD)3605210000000000
National Institute on Aging Alzheimer's Disease Family Based Study (NIA AD-FBS)1394129239301,6510000000
NIA Alzheimer Disease Research Centers (ADRC/ADC)4591125744,99711,71511,9568362421,4951,4301,75438588
National Institute of Mental Health (NIMH)1700000000000
Northern Manhattan Study (NOMAS)08070000000000
Peru Alzheimer's Disease Initiative (PeADI)04930000000000
Progressive Supranuclear Palsy Study (PSP)54915840000000000
Progressive Supranuclear Palsy Study - UCLA (PSP-UCLA)04080000000000
Puerto Rican 10/66 Study (PR1066)015170000000000
Puerto Rican Alzheimer's Disease Initiative (PRADI)07380000000000
Religious Orders Study (ROS)372370536576581051341300
Research in African-American Alzheimer's Disease Initiative (REAAADI)07150000000000
Resource for Early-onset Alzheimer's Disease Research (READR)0390000000000
Rotterdam Study (RS)108700000000000
Stanford Extreme Phenotypes in AD (StEP-AD)01950000000000
Texas Alzheimer's Research and Care Consortium (TARCC)144101701,1611,1610000000
The Ginkgo Evaluation of Memory Study (GEMS)01610000000000
The Monongahela-Youghiogheny Healthy Aging Team study (MYHAT)0610000000000
The University of Alabama Birmingham Alzheimer's Disease Research Center (UAB_ADRC)0170000000000
University of Miami (MIAMI)123812460000000000
University of Miami Brain Bank (MBB)03120000000000
University of Pittsburgh ADRC- Kamboh05410000000000
University of Toronto (TOR)900000000000
University of Washington Families (RAS)83890000000000
Vanderbilt University (VAN)333750000000000
Washington Heights-Inwood Columbia Aging Project (WHICAP)399182103,1423,795001,30666776700
Wisconsin Registry for Alzheimer's Prevention (WRAP)0102009861,000271275267399499335390

*counts based off of samples, not participants

ADSP whole exome and whole genome sequence data that were shared through dbGaP were mapped to the GRCh37 (build 37). These data are from the Discovery Phase of the project (described below) and while archived at dbGaP, are no longer available for request. All new ADSP data access requests must go through NIAGADS DSS.

STUDY DESCRIPTION FOR HISTORICAL dbGaP BUILD 37 ADSP DATA: The overarching goals of the Alzheimer's Disease Sequencing Project (ADSP) are to: (1) identify new genomic variants contributing to increased risk of developing Alzheimer's Disease (AD), (2) identify new genomic variants contributing to protection against developing AD, and (3) provide insight as to why individuals with known risk factor variants escape from developing AD. These factors will be studied in multi-ethnic populations in order to identify new pathways for disease prevention. Such a study of human genomic variation and its relationship to health and disease requires examination of a large number of study participants and needs to capture information about common and rare variants (both single nucleotide and copy number) in well phenotyped individuals.

Using existing samples from NIH funded and other studies, three NHGRI funded Large Scale Sequencing and Analysis Centers (LSAC) - Broad, Baylor, and Washington University - produced the DNA sequence data. Variant call data are being made available to the scientific community through NIH-approved data repositories. Statistical analysis of the sequence data is anticipated to identify new genetic risk and protective factors. The ADSP will conduct and facilitate analysis of sequence data to extend previous discoveries that may ultimately result in new directions for AD therapeutics. Analysis of ADSP data will be done in two phases.

The Discovery Phase analysis (2014-2018) is funded under PAR-12-183. The entire Discovery dataset contains whole-genome sequencing data on 584 subjects from 113 families, and pedigree data for > 4000 subjects; whole exome sequencing data on 5096 cases 4965 controls; and whole exome sequence data on an additional 853 (682 Cases [510 Non-Hispanic, 172 Hispanic]), and 171 Hispanic Control subjects from families that are multiply affected with AD.

The Replication Phase (2016-2021) analysis will be funded under RFA-AG-16-001 and RFA-AG-16-002 and is expected to include a combination of genotyping and sequencing approaches on at least 30,000 subjects. Targeted sequencing will be done by the LSACs.

GRCh37 Data Releases

  • The first ADSP data release occurred on November 25, 2013. It included the whole-genome sequencing data in BAM file format on 410 individuals.
  • The second ADSP data release occurred on March 31, 2014, and included the whole-genome sequencing data in BAM file format for an additional 168 individuals.
  • The third ADSP data release occurred on November 03, 2014 and included whole-exome sequencing data in BAM file format for 10,939 individuals.
  • The fourth ADSP data release occurred on February 13, 2015 and included revised ethnic data for subjects with whole-exome sequencing data.
  • The fifth ADSP data release occurred on July 13, 2015 and included whole-genome genotypes and updated phenotypes as well as changes to pedigree structures and sample IDs.
  • The sixth ADSP data release occurred on December 8, 2015, and included whole-exome genotypes and updated phenotypes as well as changes to subject IDs.

This seventh ADSP data release on April 12, 2016 includes:

    (1) WES and WGS SNV VCF files

    (2) WES and WGS Indel PLINK files

ADSP Data Archived at dbGaP:

ADSP - Whole Genome Sequencing ADSP - Whole Exome Sequencing Comments
DNA-Seq (BAM) n=578 n=10913 Sequence data available (plus n=38 replications w/out genotype data)
Concordant SNV Genotypes (PLINK format) N/A n=10913 QC'ed genotypes that are concordant between the Atlas (Baylor's) and GATK (Broad's) calling pipelines (a subset of the consensus genotype set)
Consensus Genotypes (PLINK and VCF format) n=578 n=10913 QC'ed genotypes that are concordant between Atlas and GATK pipelines as well as those that that were called uniquely by Atlas or GATK
Concordant Indel Genotypes (PLINK format) n=578 n=10913 QC'ed genotypes that are concordant between the Atlas and GATK calling pipelines
Phenotype Data n=4735 n=10913 Data of n=53 phenotype variables available (plus administrative data), including APOE genotype. WGS phenotypes include data of connecting family members.

Please use the release notes provided by dbGaP to obtain detailed information about study release updates.

Authorized Access
Publicly Available Data
  Link to other NCBI resources related to this study
Study Inclusion/Exclusion Criteria

The samples for the ADSP have been selected from well-characterized cohorts of individuals characterized for AD diagnosis as well as having known AD genetic risk factors. Investigators in the ADSP will obtain from the NIH approved data repositories: (1) quality control checked and 'cleaned' sequence data. 'Quality control checked and cleaned' means a set of routine checks have been performed for sample information, phenotype, and GWAS data to ensure the sequence data are of high quality and are ready for downstream genetic analysis and that likely sources of false-positives have been ruled out, and that samples that are outliers which may skew project-level analyses have been identified; (2) information on the composition of the study cohorts (e.g. case-control, family based, and epidemiology cohorts); (3) descriptions of the study cohorts included in the study; and (4) accompanying phenotypic information such as age at disease onset, self-reported race/ethnicity, gender, diagnostic status, and cognitive measures. The ADSP will determine what additional information, if any, is needed by its members to facilitate the project.

Study History

On February 7, 2012, a new Presidential Initiative was announced to fight Alzheimer's Disease (AD). As part of this effort, the National Human Genome Research Institute (NHGRI) was asked by the Director of the National Institutes of Health (NIH) to use $25M already committed to its Large-Scale Sequencing Centers (LSSC) for genomic studies in AD. The NIH director asked the National Institute on Aging (NIA) and the NHGRI to work together to develop and execute a large scale sequencing project to analyze the genomes of a large number of well characterized individuals in order to identify a broad range of AD risk and protective gene variants, with the ultimate goal of facilitating the identification of new pathways for therapeutic approaches and prevention. The analysis will also provide insight as to why individuals with known risk factor genes escape from developing AD. The project, developed jointly by NIA and NHGRI, is called the Alzheimer's Disease Sequencing Project (ADSP).

Selected Publications
Diseases/Traits Related to Study (MeSH terms)
Authorized Data Access Requests
Study Attribution
  • Principal Investigators
    • Richard A. Gibbs, PhD. Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
    • Eric S. Lander, PhD. Broad Institute, Boston, MA, USA.
    • Richard K. Wilson, PhD. The Genome Institute, Washington University, St. Louis, MO, USA.
    • Gerard D. Schellenberg, PhD. Alzheimer's Disease Genetics Consortium (ADGC), University of Pennsylvania, Philadelphia, PA, USA.
    • Sudha Seshadri, MD. Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium (CHARGE), Boston University, Boston, MA, USA.
  • Funding Sources
    • U24 AG041689. The National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
    • U01 AG032984. Alzheimer's Disease Genetics Consortium, University of Pennsylvania, Philadelphia, PA, USA.
    • R01 AG033193. Cohorts for Heart and Aging Research in Genomic Epidemiology, Boston University, Boston, MA, USA.
    • U24 AG021886. National Cell Repository for Alzheimer's Disease, Indiana University, Bloomington, IN, USA.
  • Funding Sources for Sequencing
    • U54HG003079. National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
    • U54HG003273. National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
    • U54HG003076. National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.