NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Chang SM, Matchar DB, Smetana GW, et al., editors. Methods Guide for Medical Test Reviews [Internet]. Rockville (MD): Agency for Healthcare Research and Quality (US); 2012 Jun.

Chapter 11. Challenges in and Principles for Conducting Systematic Reviews of Genetic Tests Used as Predictive Indicators

In this chapter we discuss common challenges in and principles for conducting systematic reviews of genetic tests. The types of genetic tests discussed are those used to (1) determine risk or susceptibility in asymptomatic individuals; (2) reveal prognostic information to guide clinical management in those with a condition; or (3) predict response to treatments or environmental factors. This chapter is not intended to provide comprehensive guidance on evaluating all genetic tests. Rather, it focuses on issues that have been of particular concern to analysts and stakeholders and on areas that are of particular relevance for the evaluation of studies of genetic tests.


With recent advances in genotyping, it is expected that whole genome sequencing will soon be available for less than $1,000. Consequently, the number of studies of genetic tests will likely increase substantially, as will the need to evaluate studies of genetic tests. The general principles for evaluating genetic tests are similar to those for interpreting other prognostic or predictive tests, but there are differences in how the principles need to be applied and the degree to which certain issues are relevant, particularly when considering genetic test results that provide predictive rather than diagnostic information.

This chapter focuses on issues of particular concern to analysts and stakeholders and areas of particular relevance for the evaluation of studies of genetic tests. It is not intended to provide comprehensive guidance on evaluating all genetic tests. We reflect on genetic tests used to (1) determine risk or susceptibility in asymptomatic individuals (e.g., to identify individuals at risk for future health conditions, such as BRCA1 and BRCA2 for breast and ovarian cancer); (2) reveal prognostic information to guide clinical management and treatment in those with a condition (e.g., Oncotype Dx® for breast cancer recurrence, a test to evaluate the tumor genome of surgically excised tumors from patients with breast cancer); or (3) predict response to treatments or environmental factors including diet (nutrigenomics), drugs (pharmacogenomics, such as CYP2C9 and VKORC1 tests to inform warfarin dosing), infectious agents, chemicals, physical agents, and behavioral factors. We do not address genetic tests used for diagnostic purposes. We address issues related to both heritable mutations and somatic mutations (e.g., genetic tests for tumors).

Clinicians, geneticists, analysts, policymakers, and other stakeholders may have varying definitions of what is considered a “genetic test.” We have chosen to use a broad definition in agreement with that of the Centers for Disease Control and Prevention (CDC)–sponsored Evaluation of Genomic Applications in Practice and Prevention (EGAPP) and the Secretary’s Advisory Committee on Genetics, Health, and Society,1 namely: “A genetic test involves the analysis of chromosomes, deoxyribonucleic acid (DNA), ribonucleic acid (RNA), genes, or gene products (e.g., enzymes and other proteins) to detect heritable or somatic variations related to disease or health. Whether a laboratory method is considered a genetic test also depends on the intended use, claim, or purpose of a test.”1 The same technologies are used for diagnostic and predictive genetic tests; it is the intended use of the test result that determines whether it is a diagnostic or predictive test.

In this chapter we discuss principles for addressing challenges related to developing the topic and structuring a genetic test review (context and scoping), as well as performing the review. This chapter is meant to complement the Methods Guide for Comparative Effectiveness Reviews.2 We do not attempt to reiterate the challenges and principles described in earlier sections of this Medical Test Methods Guide, but focus instead on issues of particular relevance for evaluating studies of genetic tests. Although we have written this chapter to serve as guidance for the Agency for Healthcare Research and Quality (AHRQ) Evidence-based Practice Centers (EPCs), we also intend it as a useful resource for other investigators interested in conducting systematic reviews on genetic tests.

Common Challenges

Genetic tests differ from other medical tests in their relationship to the outcomes measured. Reviewers need to take into account the penetrance of the disease, time lag to outcomes, variable expressivity, and pleiotropy (as defined below). These aspects of genetic tests call for specific actions at various stages of planning and performing the review. Both single-gene and polygenic disorders are known. Single-gene disorders are the result of a single mutated gene and may be passed on to subsequent generations in various well-described ways (e.g., autosomal dominant, autosomal recessive, X-linked). Polygenic disorders are the result of the combined action of more than one gene and are not inherited according to simple Mendelian patterns; examples include heart disease and diabetes. Some of the terms described below (penetrance, variable expressivity, and pleiotropy) are generally used to describe single-gene disorders.


Penetrance

Evaluations of predictive genetic tests should always consider penetrance, defined as “the proportion of people with a particular genetic change who exhibit signs and symptoms of a disorder.”3 Penetrance is a key factor in determining the future risk of developing disease and assessing the overall clinical utility of predictive genetic tests. Sufficient data to determine precise estimates of penetrance are sometimes lacking.4,5 This can be due to a lack of reliable prevalence data or of long-term outcomes data. In such cases, determining the overall clinical utility of a genetic test is difficult. In some cases, modeling with sensitivity analyses can help develop estimates.4
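The modeling approach mentioned above can be illustrated with a short sketch. It is not a method taken from this guide: it assumes penetrance can be approximated with Bayes' rule as P(disease | carrier) = P(carrier | disease) × P(disease) / P(carrier), and all function names and input values are hypothetical.

```python
def penetrance(carrier_freq_in_cases, disease_risk, carrier_freq_in_population):
    """Approximate P(disease | carrier) with Bayes' rule.

    carrier_freq_in_cases      -- P(carrier | disease)
    disease_risk               -- P(disease) over the time horizon of interest
    carrier_freq_in_population -- P(carrier)
    """
    for p in (carrier_freq_in_cases, disease_risk, carrier_freq_in_population):
        if not 0 < p <= 1:
            raise ValueError("all inputs must be probabilities in (0, 1]")
    value = carrier_freq_in_cases * disease_risk / carrier_freq_in_population
    if value > 1:
        raise ValueError("inputs are mutually inconsistent (penetrance > 1)")
    return value


def sensitivity_analysis(carrier_freq_in_cases, disease_risks, carrier_freq_in_population):
    """Recompute penetrance across a range of plausible disease risks."""
    return {
        risk: penetrance(carrier_freq_in_cases, risk, carrier_freq_in_population)
        for risk in disease_risks
    }


# Hypothetical inputs: 5% of cases carry the variant, 1% of the population does.
for risk, estimate in sensitivity_analysis(0.05, [0.05, 0.10, 0.15], 0.01).items():
    print(f"assumed disease risk {risk:.2f} -> penetrance estimate {estimate:.2f}")
```

Varying the uncertain input (here, the disease risk) shows how strongly the penetrance estimate depends on prevalence data that may be unreliable.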

Time Lag

The time lag between genetic testing and clinically important events should be assessed in critical appraisal of studies of such tests. Whether the duration of studies is sufficient to characterize the relationship between positive tests and clinical outcomes is an important consideration. In addition, it should be determined whether or not subjects have reached the age beyond which clinical expression is likely.

Variable Expressivity

Variable expressivity refers to the range of severity of the signs and symptoms that can occur in different people with the same condition.3 For example, the features of hemochromatosis vary widely. Some individuals have mild symptoms, while others experience life-threatening complications such as liver failure. The degree of expressivity should be considered in the evaluation of genetic tests.


Pleiotropy

Pleiotropy occurs when a single gene influences multiple phenotypic traits. For example, the genetic mutation causing Marfan syndrome results in cardiovascular, skeletal, and ophthalmologic abnormalities. Similarly, BRCA mutations can increase the risk of a number of cancers, including breast, ovarian, and prostate cancer and melanoma.

Other Common Challenges

Another common challenge for the evaluation of predictive genetic tests is that direct evidence is often lacking on the impact of the test results on health outcomes. The evidence base is often too limited in scope to evaluate the clinical utility of the test. In addition, it is often difficult to find published information on various aspects of genetic tests, especially data related to analytic validity. For example, laboratory-developed tests (LDTs) are regulated by the Centers for Medicare & Medicaid Services (CMS) under Clinical Laboratory Improvement Amendments (CLIA) regulations for clinical laboratories. CLIA does not require clinical validation, and many LDTs have had no clinical validation or clinical utility studies.

Genetic tests also raise a number of technical issues particularly relevant to the assessment of their analytic validity. These technical issues may differ according to the type of genetic test and may influence the interpretation of a genetic test result. Technical issues may also differ depending on the specimen being tested. For example, there are different considerations when assessing tumor genomes as opposed to human genomes.

Several common challenges arise in using genetic tests to determine susceptibility or risk in asymptomatic individuals. The utility of such tests may depend on the ability of respondents, such as the patient or their relative, to report and identify certain clinical factors. For instance, if patients cannot accurately recall the family history of a heritable disease, it can be difficult to assess their risk of developing the disease.

Finally, statistical issues must be taken into account when evaluating studies of genetic tests. For example, genetic test results are often derived from analytically complex studies that have undergone a very large number of statistical tests, creating a high risk of Type I error (i.e., when a spurious association is deemed significant).

Principles for Addressing the Challenges

The eight principles described in this section can be used to address the challenges related to developing, structuring, and performing a genetic test review (Table 11-1).

Table 11-1. Principles for addressing common challenges when evaluating genetic tests used as predictive indicators.


Principle 1. Use an organizing framework appropriate for genetic tests

Organizing frameworks for evaluating genetic tests have been developed by the United States Preventive Services Task Force (USPSTF), the CDC, and EGAPP.1,6,7 The model endorsed by the EGAPP initiative1 was based on a previous report of the NIH Task Force on Genetic Testing8 and developed through a CDC-sponsored project, which piloted an evidence evaluation framework that applied the following three criteria: (1) analytic validity (technical accuracy and reliability); (2) clinical validity (ability to detect or predict an outcome, disorder, or phenotype); and (3) clinical utility (whether use of the test to direct clinical management improves patient outcomes). A fourth criterion was added: (4) ethical, legal, and social implications.6 The ACCE model (Analytic validity, Clinical validity, Clinical utility, and Ethical, legal and social implications) includes a series of 44 questions that are useful for analysts in defining the scope of a review, as well as for critically appraising studies of genetic tests (Table 11-2). The initial seven questions help to guide an understanding of the disorder, the setting, and the type of testing. A detailed description of the methods of the EGAPP Working Group is published elsewhere.1

Table 11-2. ACCE model questions for reviews of genetic tests.


Principle 2. Develop analytic frameworks that reflect the predictive nature of genetic tests and incorporate appropriate outcomes

It is important to have a clear definition of the clinical scenario and analytic framework when evaluating any test, including a predictive genetic test. Prior to performing a review, analysts should develop clearly defined key questions and understand the needs of decisionmakers and the context in which the tests are used. They should consider whether this is a test used for determining future risk of disease in asymptomatic individuals, establishing prognostic information that will influence treatment decisions, or predicting response to treatments (either effectiveness or harms)—or whether it is used for some other purpose. They should clarify the type of specimens used for the genetic test under evaluation (i.e., patient genome or tumor genome). The PICOTS typology (Patient population, Intervention, Comparator, Outcomes, Timing, Setting) should be clearly described, as it will inform the development of the analytic framework and vice versa.

In constructing an analytic framework, it may be useful for analysts to consider preanalytic, analytic, and postanalytic factors particularly applicable to genetic tests (described later in this chapter), as well as the key outcomes of interest. Analytic frameworks should incorporate the factors and outcomes of greatest interest to decision makers. Figure 11-1 illustrates a generic analytic framework for evaluating predictive genetic tests that can be modified as necessary for various situations.

This flowchart illustrates a generic analytic framework for evaluating predictive genetic tests that can be modified as necessary for various situations. An asymptomatic person or a person with a diagnosis/disease is the first element; next is the genetic test; and from there treatment decisions with attendant benefits and harms (intermediate or process outcomes), and finally benefits and harms (health outcomes). Pre-analytic, analytic, and post-analytic factors can be inserted into the analysis at appropriate points in the study. Pre-analytic factors focus on the person tested, analytic factors on the genetic test itself, and post-analytic factors focus on the predicted outcomes. A timeline is depicted across the top of the chart.

Figure 11-1. Generic analytic framework for evaluating predictive genetic tests.

In addition to effects on family members, psychological distress and possible stigmatization or discrimination are potential harms that may result from predictive genetic tests. These harms are of particular concern when the test result predicts a high likelihood of disease, especially if no proven preventive or ameliorative measures are available. For these potential harms, analysts should take into account whether the testing is for inherited or acquired genetic mutations, since this distinction influences the potential for harms. In addition, whether the condition related to the test is multifactorial or follows classic Mendelian inheritance will affect the potential for these harms.

Other important outcomes to consider when evaluating genetic tests include, but are not limited to, cost, quality of life, long-term morbidity, and indirect impact. Genetic tests may have an impact that is difficult to measure, such as impact on important decisions regarding pregnancy.

Depending on the context, the impact of genetic testing on family members may be important, particularly in cases that involve testing for heritable conditions. One approach to including family members in the analytic framework is illustrated in Figure 11-2.

This figure repeats the analysis in Figure 11-1, adding an element of testing of family members. The analysis takes into account potential treatment of family members and resultant benefits and harms to them.

Figure 11-2. Generic analytic framework for evaluating predictive genetic tests when the impact on family members is important.

Principle 3. Search databases appropriate for genetic tests

The Human Genome Epidemiology Network (HuGE Net) Web site can provide a helpful supplement to searches, as it includes many meta-analyses of genetic association studies as well as a source called the HuGE Navigator that can identify all types of available studies related to a genetic test.9

Package inserts for genetic tests approved by the U.S. Food and Drug Administration (FDA) contain summaries of the analytic validity data. These summaries can be retrieved through searches of the gray literature. Package inserts are available on the FDA and manufacturer Web sites. Laboratory-developed tests do not require FDA clearance, and there is no requirement for publicly available data on analytic validity. When there are no published data on analytic validity of a genetic test, the external proficiency testing program carried out jointly by the American College of Medical Genetics (ACMG) and the College of American Pathologists (CAP) can be useful in establishing the degree of laboratory-to-laboratory variability, as well as some sense of reproducibility.10–12 Other potentially useful sources of unpublished data include conference publications from professional societies (e.g., the College of American Pathologists), the GeneTests Web site, the Association for Molecular Pathology Web site, CDC programs (e.g., the Genetic Testing Reference Materials Coordination Program and the Newborn Screening Quality Assurance Program), and international proficiency testing programs.13

An AHRQ “horizon scan” found two databases, LexisNexis® and Cambridge Healthtech Institute (CHI), that had high utility in identifying genetic tests in development for clinical cancer care. A number of others had low-to-moderate utility, and some were not useful.14

Principle 4. Consult with experts to determine which technical issues are important to address in assessing genetic tests

There are a number of technical issues related to analytic validity that can influence the interpretation of a genetic test result, including preanalytic, analytic, and postanalytic factors.15,16 In general, preanalytic steps are those involved in obtaining, fixing or preserving, and storing samples prior to staining and analysis. Important analytic variables include the type of assay chosen and its reliability, types of samples, the specific analyte investigated, specific genotyping methods, timing of sample analysis, and complexity of performing the assay. Postanalytic variables relate to the complexity of interpreting the test result, variability from laboratory to laboratory, and quality control.15,16 To determine which of these technical issues are pertinent for a given review, comparative effectiveness review teams should include or consult with molecular pathologists, geneticists, or others familiar with the issues related to the process of performing and reporting genetic tests. Table 11-3 summarizes some of the preanalytic, analytic, and postanalytic questions that should be addressed.

Table 11-3. Questions for assessing preanalytic, analytic, and postanalytic factors for evaluating predictive genetic tests.


For genetic testing of tumor specimens, it is important to understand that the tumor genome may be in a dynamic state, with mutations emerging over time (e.g., due to drug exposure or disruption of cellular repair). Tumor specimens will often contain normal cells from the patient as well as tumor cells. To accurately assess for somatic mutations using tumor specimens, particular strategies may be needed, such as enriching samples for tumor cells (e.g., by microscopic evaluation and dissection of the cells).

Principle 5. Distinguish between functional assays and DNA-based assays to determine important technical issues

Some studies may utilize DNA-based assays, whereas others may utilize functional assays with different sensitivities and specificities. Functional assays, in which a substrate or product of a metabolic process affected by a particular genetic polymorphism is measured, may have the advantage of showing potentially more important information than the presence of the genetic polymorphism itself. However, they may be affected by a number of factors and do not necessarily reflect the polymorphism alone. Unmeasured environmental factors, other genetic polymorphisms, and various disease states may influence the results of functional assays. In addition, functional assays that measure enzyme activity are taken at a single point in time. Depending on the enzyme and polymorphism being evaluated, the variation in enzyme activity over time should be considered in critical appraisal. Inconsistent results have been reported between studies using DNA-based molecular methods and those using phenotypic assays.16–18

For DNA-based tests, a variety of sample sources are available (e.g., blood, cheek swab, hair) that should in principle yield identical genotype results.16,19–23 However, DNA may be more difficult to obtain and purify from some tissues than from blood, particularly if the tissues have been fixed in paraffin rather than sampled fresh. (DNA extraction from formalin-fixed tissue is difficult, but sometimes possible.)16 Some studies utilize different sources of DNA for cases and controls, introducing potential measurement bias from differences in ease of technique and test accuracy. Extraction of DNA from tumors in oncology studies may raise additional issues that influence analytic validity, including the quantity of tissue, admixture of normal and cancerous tissue, amount of necrosis, timing of collection, and storage technique (e.g., fresh, frozen, paraffin, formalin).16

When evaluating DNA-based molecular tests, complexity of the test method, laboratory-to-laboratory variability, and quality control should be assessed. A number of methods are available for genotyping single nucleotide polymorphisms that vary in complexity and potential for polymorphism misclassification.16,24–26 Considering laboratory reporting of internal controls and repetitive experiments can be useful in assessment of overall analytic validity. The method of interpreting test results may influence complexity as well. For example, some tests require visual inspection of electrophoresis gels. Inter-observer variability should be considered for such tests.16,27

Principle 6. Evaluate case-control studies carefully for potential selection bias

In critical appraisal of any case-control study, it is important to determine whether cases and controls were selected from the same source population. In the case of genetic studies, the geographic location of the population does not suffice. Rather, cases and controls should be matched for ethnicity/race or ancestry to avoid confounding by population stratification, since the frequencies of DNA polymorphisms vary from population to population. It has been noted that many case-control studies of gene-disease associations have selected controls from a population that does not represent the population from which the cases arose.16,17,28–30 In general, only nested case-control studies could have low enough potential for selection bias to provide reliable information.
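The effect of drawing cases and controls from populations with different polymorphism frequencies can be shown with a small worked example. The sketch below uses invented counts for two ancestry groups in which the variant is unrelated to disease within each group, and compares the crude odds ratio with the Mantel-Haenszel stratified odds ratio. The counts and function names are hypothetical and serve only to illustrate the arithmetic.

```python
def odds_ratio(case_carriers, case_noncarriers, control_carriers, control_noncarriers):
    """Odds ratio from a single 2x2 table."""
    return (case_carriers * control_noncarriers) / (case_noncarriers * control_carriers)


def mantel_haenszel_or(strata):
    """Mantel-Haenszel pooled odds ratio across strata.

    Each stratum is a tuple (a, b, c, d) =
    (case carriers, case non-carriers, control carriers, control non-carriers).
    """
    numerator = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    denominator = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    return numerator / denominator


# Invented counts: the variant is common in group 1 and rare in group 2,
# and most cases were recruited from group 1, most controls from group 2.
group_1 = (40, 10, 8, 2)    # within-group odds ratio = 1
group_2 = (2, 8, 10, 40)    # within-group odds ratio = 1
strata = [group_1, group_2]

# Pool the two groups as if ancestry had been ignored.
pooled = tuple(sum(values) for values in zip(*strata))

crude = odds_ratio(*pooled)
adjusted = mantel_haenszel_or(strata)
print(f"crude OR = {crude:.2f}, ancestry-stratified OR = {adjusted:.2f}")
```

In this constructed example the crude analysis suggests a strong association even though none exists within either group, which is why the source population of controls matters.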

Principle 7. Determine the added value of the genetic test over existing risk assessment approaches

For some scenarios, a number of clinical factors associated with risk assessment or susceptibility may already be well characterized. In such cases, comparative effectiveness reviews should determine the added value of using genetic testing along with known factors, compared with using the known factors alone. For example, age, sex, smoking, hypertension, diabetes, and cholesterol are all well-established risk factors for cardiovascular disease. Risk stratification of individuals to determine cholesterol-lowering targets is based on these factors.31 Newly identified polymorphisms that may confer increased risk of cardiovascular disease and have potential implications for medical interventions, such as those on chromosome 9p21 described previously,32 should be evaluated in the context of these known risk factors. In this scenario, investigators should determine the added value of testing for polymorphisms of chromosome 9p21 in addition to known clinical risk factors.

Multiple polymorphisms may be associated with risk of disease, prognosis, or prediction of drug response. In such cases, the effect of multiple polymorphisms can be explored using a multiple regression model. Once this is done, prospective studies would usually be needed to determine whether the model, including the genetic tests, has clinical utility. For example, VKORC1 and CYP2C9 genotypes have been associated with warfarin dose requirements in multiple regression models. In order to determine whether tests for VKORC1 and CYP2C9 have clinical utility, studies would need to compare the use of a prediction model that contains the genetic tests in combination with known clinical factors that affect warfarin dose (e.g., age, BMI) with the use of clinical factors alone.33–35
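One way a reviewer might summarize added value for a binary outcome is to compare the discrimination of a risk score built from clinical factors alone with that of a score that also includes genotype. The sketch below computes the area under the ROC curve (c-statistic) for two sets of predicted risks. This metric is an assumption chosen for illustration rather than an approach prescribed by this chapter, the predicted risks are invented, and improved discrimination would not by itself establish clinical utility.

```python
def auc(scores, outcomes):
    """Area under the ROC curve: the probability that a randomly chosen
    person with the outcome has a higher score than one without it
    (ties count as one half)."""
    with_outcome = [s for s, y in zip(scores, outcomes) if y == 1]
    without_outcome = [s for s, y in zip(scores, outcomes) if y == 0]
    if not with_outcome or not without_outcome:
        raise ValueError("need at least one person with and one without the outcome")
    wins = 0.0
    for p in with_outcome:
        for n in without_outcome:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(with_outcome) * len(without_outcome))


# Invented predicted risks for six people (1 = outcome occurred).
outcomes = [1, 1, 1, 0, 0, 0]
clinical_only = [0.6, 0.4, 0.3, 0.5, 0.3, 0.2]
clinical_plus_genotype = [0.7, 0.5, 0.4, 0.4, 0.3, 0.1]

auc_clinical = auc(clinical_only, outcomes)
auc_combined = auc(clinical_plus_genotype, outcomes)
print(f"clinical factors alone: AUC = {auc_clinical:.3f}")
print(f"clinical + genotype:    AUC = {auc_combined:.3f}")
print(f"difference:             {auc_combined - auc_clinical:.3f}")
```

The quantity of interest is the difference between the two models, not the performance of the combined model in isolation.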

Principle 8. Understand statistical issues of particular relevance to genetic tests

Hardy-Weinberg Equilibrium

In population genetics, genotype frequencies in a large, randomly mating population are expected to follow a predictable distribution determined by the allele frequencies, known as the Hardy-Weinberg equilibrium (HWE). Genetic association studies should generally report whether the genotype frequencies of the polymorphisms being evaluated follow HWE. There are a number of reasons that distributions may deviate from HWE, including new mutations, selection, migration, genetic drift, and inbreeding.36 In addition, when numerous polymorphisms are tested for associations with diseases or outcomes, as in many genome-wide association studies, some of them (about 5 percent at the conventional 0.05 significance level) will deviate from HWE based on chance alone (related to multiple testing).37 Deviation from HWE may be a clue to bias and genotyping error, but it is not specific and possibly not sensitive.37 Analysts should consider whether studies have tested for and reported HWE. A more detailed discussion of this topic as it relates to genetic association studies has been published elsewhere.36,37
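When a study reports genotype counts but not an HWE test, a reviewer can check the counts directly. The sketch below applies the standard chi-square goodness-of-fit test with one degree of freedom for a biallelic polymorphism. It is a minimal illustration with hypothetical function names. The chi-square approximation can be unreliable when expected counts are small, where an exact test may be preferable.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) of observed genotype counts against
    Hardy-Weinberg proportions p^2, 2pq, q^2.

    Returns (chi_square_statistic, p_value).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1 - p                        # frequency of allele B
    observed = (n_aa, n_ab, n_bb)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Counts that match HWE exactly, and counts with no heterozygotes at all.
for counts in [(25, 50, 25), (50, 0, 50)]:
    chi2, p_value = hwe_chi_square(*counts)
    print(counts, f"chi-square = {chi2:.1f}, p = {p_value:.3g}")
```

As noted above, a small p value flags a possible problem (for example, genotyping error) but does not identify its cause.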

Sample Size Calculations

When assessing internal validity of studies, it is important to assess whether sample size calculations appropriately accounted for the number of variant alleles and the prevalence of variants in the population of interest. This is particularly relevant for pharmacogenomic studies evaluating the functional relevance of genetic polymorphisms.38 Such studies often enroll an insufficient number of subjects to account for the number of variant alleles and the prevalence of variants in the population.38
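A quick feasibility check can show whether a study could plausibly have enrolled enough subjects with the variant genotype. The sketch below assumes Hardy-Weinberg proportions to convert a variant allele frequency into expected genotype counts, and estimates the enrollment needed to expect a given number of subjects with a genotype. It is not a formal power calculation, and the function names and numbers are hypothetical.

```python
import math


def expected_genotype_counts(n_subjects, variant_allele_freq):
    """Expected genotype counts under Hardy-Weinberg proportions."""
    q = variant_allele_freq
    p = 1 - q
    return {
        "wild_type": n_subjects * p * p,
        "heterozygous": n_subjects * 2 * p * q,
        "homozygous_variant": n_subjects * q * q,
    }


def subjects_needed(min_expected, genotype_freq):
    """Smallest enrollment whose expected count for a genotype
    reaches min_expected."""
    if not 0 < genotype_freq <= 1:
        raise ValueError("genotype_freq must be in (0, 1]")
    return math.ceil(min_expected / genotype_freq)


# Hypothetical study: 200 subjects, variant allele frequency of 10%.
counts = expected_genotype_counts(200, 0.10)
for genotype, count in counts.items():
    print(f"{genotype}: about {count:.0f} subjects expected")
```

With a rare variant, the expected number of homozygous subjects can be in the single digits even in a study of a few hundred people, which limits what the study can say about that genotype.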

Genetic Association Studies and Multiple Comparisons

Genetic test results are sometimes derived from analytically complex studies that have undergone a very large number of statistical tests. These may be in the form of genome-wide association studies searching for associations between a huge number of genetic polymorphisms and health conditions. Such association studies may enhance understanding of the importance of genetics in relation to a variety of health conditions, but should generally be used to generate hypotheses rather than to test hypotheses or to confirm cause-effect relationships.16 Close scrutiny should be applied to ensure that the evidence for the association has been validated in multiple studies to minimize both potential confounding and potential publication bias issues. In addition, reviewers should note whether appropriate adjustments for multiple comparisons were used. Many investigators recommend using a P value of less than 5 × 10⁻⁸ for the threshold of significance in large genome-wide studies.37,39,40 Other approaches include assessing the false positive report probability and controlling the false discovery rate.41–43
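Two of the adjustments mentioned above can be sketched briefly. The first function shows that a Bonferroni correction of a 0.05 significance level across one million tests gives the 5 × 10⁻⁸ threshold. The figure of one million tests is an assumption commonly used to motivate that threshold, not a number stated in this chapter. The second function is a minimal implementation of the Benjamini-Hochberg step-up procedure for controlling the false discovery rate. The P values in the example are invented.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold that keeps the family-wise
    error rate at or below alpha."""
    return alpha / n_tests


def benjamini_hochberg(p_values, fdr=0.05):
    """Indices of hypotheses rejected by the Benjamini-Hochberg procedure.

    Find the largest rank k with p_(k) <= k * fdr / m, then reject the
    k hypotheses with the smallest P values.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    largest_k = 0
    for rank, index in enumerate(order, start=1):
        if p_values[index] <= rank * fdr / m:
            largest_k = rank
    return sorted(order[:largest_k])


print(bonferroni_threshold(0.05, 1_000_000))

# Invented P values for ten polymorphisms.
p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
print(benjamini_hochberg(p_values, fdr=0.05))
```

When appraising a study, the useful check is whether the reported threshold is consistent with the number of tests actually performed.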

When a genetic mutation associated with increased risk is present, evaluating potential causality can be difficult, as many factors other than the mutation may influence associations. These include environmental exposures, behaviors, and other genes. Many genetic variants identified that are thought to influence susceptibility to diseases are associated with low relative and absolute risk.16,44 Thus, exclusion of non-causal explanations for associations and consideration of potential confounders are central to critical appraisal of such associations. It may also be important to explore biologic plausibility (e.g., from in vitro studies) to help support or oppose theories of causation.16

Overlapping Data Sets

Be cautious of publications that report prevalence estimates for genetic variants that have actually arisen from overlapping data sets.16 For example, genome-wide association studies or other large collaborative efforts, such as the International Warfarin Pharmacogenomics Consortium, may pool samples of patients that were previously included in other published studies.3 To the degree possible, investigators should identify overlapping data sets and avoid double-counting. It may be useful to organize evidence tables by study time period and geographic area to identify potential overlapping data sets.16
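The suggestion to organize evidence tables by time period and geographic area can be turned into a simple screening step. The sketch below flags pairs of studies that recruited in the same area during overlapping years so that a reviewer can check them by hand. The study records and field names are hypothetical, and a flagged pair is only a prompt for closer reading, not proof of shared participants.

```python
from itertools import combinations


def potential_overlaps(studies):
    """Return pairs of study IDs that share a geographic area and have
    overlapping enrollment periods."""
    flagged = []
    for a, b in combinations(studies, 2):
        same_area = a["area"] == b["area"]
        periods_overlap = a["start"] <= b["end"] and b["start"] <= a["end"]
        if same_area and periods_overlap:
            flagged.append((a["id"], b["id"]))
    return flagged


# Hypothetical evidence-table rows.
studies = [
    {"id": "Study A", "area": "Region 1", "start": 2001, "end": 2004},
    {"id": "Study B", "area": "Region 1", "start": 2003, "end": 2006},
    {"id": "Study C", "area": "Region 2", "start": 2003, "end": 2006},
    {"id": "Consortium D", "area": "Region 1", "start": 2000, "end": 2008},
]

for first, second in potential_overlaps(studies):
    print(f"check for shared participants: {first} / {second}")
```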

Assessing Tumor Genetics

As mentioned under Principle 4, it is important to understand that a tumor genome may be in a dynamic state. In addition, tumor specimens will often contain normal cells from the patient. The characteristics of the specimen will influence the sensitivity and operating characteristics of the test. Tests with greater sensitivity may be required when specimens contain both normal cells and tumor cells.
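The relationship between normal-cell admixture and the sensitivity required of an assay can be made concrete with a simple calculation. The sketch below estimates the fraction of sequenced alleles expected to carry a somatic mutation, given the proportion of tumor cells in the specimen. It assumes by default a heterozygous mutation in a diploid region with no copy-number change. The function names and the detection limits in the example are hypothetical.

```python
def expected_variant_allele_fraction(tumor_purity, mutant_copies=1,
                                     tumor_copy_number=2, normal_copy_number=2):
    """Expected fraction of alleles carrying a somatic mutation in a
    specimen that mixes tumor and normal cells.

    tumor_purity -- proportion of cells in the specimen that are tumor cells
    """
    if not 0 <= tumor_purity <= 1:
        raise ValueError("tumor_purity must be between 0 and 1")
    mutant_alleles = tumor_purity * mutant_copies
    total_alleles = (tumor_purity * tumor_copy_number
                     + (1 - tumor_purity) * normal_copy_number)
    return mutant_alleles / total_alleles


def is_detectable(tumor_purity, assay_limit_of_detection, **kwargs):
    """True if the expected variant fraction reaches the assay's limit."""
    return expected_variant_allele_fraction(tumor_purity, **kwargs) >= assay_limit_of_detection


# Hypothetical specimen with 30% tumor cells, and two hypothetical assays.
fraction = expected_variant_allele_fraction(0.30)
print(f"expected variant allele fraction: {fraction:.2f}")
print("assay with 20% detection limit:", is_detectable(0.30, 0.20))
print("assay with 5% detection limit: ", is_detectable(0.30, 0.05))
```

Under these assumptions the expected fraction is half the tumor purity, so a specimen with few tumor cells may need either enrichment or a more sensitive assay.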


Since the completion of the Human Genome Project, the HapMap Project, and related works, there have been a great number of publications describing the clinical validity of genetic test results (e.g., gene-disease associations), but far fewer studies of their clinical utility. A review of genetic testing for cytochrome P450 polymorphisms in adults with depression treated with selective serotonin reuptake inhibitors (SSRIs) developed an analytic framework and five corresponding key questions which, taken together, provide an example of a well-defined predictive genetic test scenario that explores a potential chain of evidence relating to intermediate outcomes (Figure 11-3).45 The authors found no prospective studies with clinical outcomes that used genotyping to guide treatment. They constructed a chain of questions to assess whether sufficient indirect evidence could answer the overarching question by evaluating the links between genotype and metabolism of SSRIs (phenotype), metabolism and SSRI efficacy, and metabolism and adverse drug reactions to SSRIs.

This flowchart is an analytic framework for evidence gathering on CYP450 genotype testing for SSRI treatment of depression. The numbers in the flowchart correspond to the five key questions to be answered through the systematic review. These questions are listed in the legend below the chart. The chart begins with the patient population—adults with non-psychotic depression entering therapy with SSRI. “Incorrect genotype assignment” is a result of a negative answer to Question 2 regarding the validity of available CYP450 genotype tests. The next element is CYP450 genotype testing, addressed by question 3a (“How well do particular CYP450 genotypes predict metabolism of particular SSRIs?”) Two elements follow from the answer to the question about testing. Question 3b addresses the element “predicted drug efficacy.” Question 3c addresses the element “predicted risk for adverse drug reactions.” Questions 4a, 4b, and 4c address issues regarding treatment decisions. Hopefully, these decisions lead to positive outcomes such as improvements in depression status, quality of life, and other outcomes such as work or absenteeism. Question 5 addresses the harms of subsequent management options.

Figure 11-3. Analytic framework for evidence gathering on CYP450 genotype testing for SSRI treatment of depression. CYP450 = cytochrome P450; SSRI = selective serotonin reuptake inhibitor. Numbers in this figure represent the research questions addressed in the systematic review.

An EPC report on HER2 testing to manage patients with breast cancer and other solid tumors provides a detailed assessment of challenges in conducting a definitive evaluation of preanalytic, analytic, and postanalytic factors when there is substantial heterogeneity or lack of available information related to the methods of testing.46 The authors noted that it had been only very recently that many aspects of HER2 assays were standardized, and that the effects of widely varying testing methods could not be isolated. Thus, they approached this challenge by providing a narrative review for their first key question (What is the evidence on concordance and discrepancy rates for methods [e.g., FISH, IHC, etc.] used to analyze HER2 status in breast tumor tissue?).

Additional considerations arise when evaluating genetic test results used to determine susceptibility or risk in asymptomatic individuals. The utility of such tests may depend on the ability of patients and providers to report and identify certain clinical factors. For example, a review of genetic risk assessment and BRCA mutation testing underscores the importance of accurately determining family history.[4,47] The analytic framework begins by classifying asymptomatic women into high, moderate, or average risk categories. This is a good example of incorporating a key preanalytic factor (family history) that has an important influence on analytic validity. Tests for BRCA mutations may be used to predict the risk for breast and ovarian cancer in high-risk women (i.e., those with a family history suggesting increased risk). However, because not all of the genes that contribute to hereditary breast and ovarian cancer are known, and because analytic methods to detect mutations in the known genes are not perfect, population-based testing for hereditary susceptibility to breast and ovarian cancer is currently not an appropriate strategy. Rather, family history-based testing is the recommended paradigm for guiding the use of BRCA testing.[4,47]

Thus, family history is a genetic/genomics tool that is used to (1) identify people with possible inherited disease susceptibilities, (2) guide genetic testing strategies, (3) help interpret genetic test results, and (4) assess disease risk. The ability of providers to accurately determine a family history that confers increased risk is a key prerequisite to the utility of BRCA mutation testing and other predictive genetic testing. It is sometimes difficult for people to accurately recall the presence of a condition in their relatives. The sensitivity and specificity of self-reported family history are therefore important in determining the overall usefulness of predictive genetic testing.[4]
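To make the role of self-report accuracy concrete, the following minimal Python sketch computes the sensitivity and specificity of self-reported family history against a verified reference (for example, relatives' medical records), along with the share of positive self-reports that are confirmed. The function name and all counts are hypothetical and chosen only for illustration; they are not taken from the reviews cited above.

```python
def self_report_accuracy(tp, fn, fp, tn):
    """Summarize accuracy of self-reported family history from 2x2 counts.

    tp: reported positive, verified positive
    fn: reported negative, verified positive
    fp: reported positive, verified negative
    tn: reported negative, verified negative
    """
    sensitivity = tp / (tp + fn)   # verified-positive histories that were reported
    specificity = tn / (tn + fp)   # verified-negative histories reported as negative
    ppv = tp / (tp + fp)           # positive reports that were confirmed
    return sensitivity, specificity, ppv


# Hypothetical counts for 1,000 people (illustration only, not real data).
tp, fn, fp, tn = 80, 20, 50, 850

sens, spec, ppv = self_report_accuracy(tp, fn, fp, tn)
print(f"sensitivity = {sens:.2f}")
print(f"specificity = {spec:.2f}")
print(f"confirmed among positive reports = {ppv:.2f}")
```

In a family history-based testing strategy, low sensitivity means that some people at increased risk would never be offered testing, whereas low specificity means that some people at average risk would be referred unnecessarily.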


Analysts should understand common challenges, and apply the principles for addressing those challenges, when conducting systematic reviews of genetic tests used as predictive indicators. Key points include:

  1. The general principles that apply in evaluating genetic tests are similar to those for other prognostic or predictive tests, but there are differences in how the principles need to be applied or the degree to which certain issues are relevant.
  2. A clear definition of the clinical scenario and an analytic framework are important when evaluating any test, including genetic tests.
  3. Organizing frameworks and analytic frameworks are useful constructs for approaching the evaluation of genetic tests.
  4. In constructing an analytic framework for evaluating a genetic test, analysts should consider preanalytic, analytic, and postanalytic factors; such factors are useful when assessing analytic validity.
  5. Predictive genetic tests are generally characterized by a delay between testing and clinically important events.
  6. Published information on the analytic validity of some genetic tests may be difficult to find. Web sites (e.g., those of the FDA or diagnostic companies) and gray literature may be important sources.
  7. In situations where clinical factors associated with risk are well characterized, comparative effectiveness reviews should assess the added value of using genetic testing along with known factors, compared with using the known factors alone.
  8. For genome-wide association studies, reviewers should determine whether the association has been validated in multiple studies to minimize both potential confounding and publication bias. In addition, reviewers should note whether appropriate adjustments for multiple comparisons were used.
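
As an illustration of key point 8, the sketch below shows two widely used adjustments for multiple comparisons: a Bonferroni-corrected significance threshold and the Benjamini-Hochberg procedure for controlling the false discovery rate. These two methods are chosen here as examples; this chapter does not prescribe a specific adjustment. The function names and the p-values are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


def benjamini_hochberg(p_values, q=0.05):
    """Return a list of booleans marking which tests are declared
    significant by the Benjamini-Hochberg step-up procedure at FDR level q."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= k * q / m.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * q / m:
            max_k = rank
    # Reject every hypothesis ranked at or below max_k.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_k:
            rejected[idx] = True
    return rejected


# Hypothetical p-values for 10 tested variants (illustration only).
p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]

threshold = bonferroni_threshold(0.05, len(p_values))
bonferroni_hits = [p <= threshold for p in p_values]
bh_hits = benjamini_hochberg(p_values, q=0.05)

print("Bonferroni threshold:", threshold)
print("Significant after Bonferroni:", sum(bonferroni_hits))
print("Significant after Benjamini-Hochberg:", sum(bh_hits))
```

With one million tests, the same Bonferroni calculation (0.05 divided by 1,000,000) gives a per-test threshold of about 5 × 10^-8, which shows why a nominal p < 0.05 in a study testing many variants is weak evidence on its own.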


References

  1. Teutsch SM, Bradley LA, Palomaki GE, et al. The Evaluation of Genomic Applications in Practice and Prevention (EGAPP) initiative: methods of the EGAPP Working Group. Genet Med. 2008 [PMC free article: PMC2743609] [PubMed: 18813139]
  2. Methods Guide for Effectiveness and Comparative Effectiveness Reviews. Rockville, MD: Agency for Healthcare Research and Quality; Mar, 2011. [Accessed August 22, 2011]. AHRQ Publication No. 10(11)-EHC063-EF. Available at: www
  3. Lister Hill National Center for Biomedical Communications: Collections of the National Library of Medicine. What are reduced penetrance and variable expressivity? 2008. [Accessed August 22, 2011]. [electronic resource] Available at: http://ghr/handbook/inheritance/penetranceexpressivity.
  4. Nelson HD, Huffman LH, Fu R, Harris EL. Genetic risk assessment and BRCA mutation testing for breast and ovarian cancer susceptibility: systematic evidence review for the U.S. Preventive Services Task Force. Annals of Internal Medicine. 2005;143(5):362–79. [PubMed: 16144895]
  5. Whitlock EP, Garlitz BA, Harris EL, Beil TL, Smith PR. Screening for hereditary hemochromatosis: a systematic review for the U.S. Preventive Services Task Force. Annals of Internal Medicine. 2006;145(3):209–23. [PubMed: 16880463]
  6. National Office of Public Health Genomics C. ACCE Model Process for Evaluating Genetic Tests. 2007. [Accessed August 22, 2011]. Available at: http://www/gtesting/ACCE/index.htm.
  7. Harris RP, Helfand M, Woolf SH, et al. Current methods of the US Preventive Services Task Force: a review of the process. American Journal of Preventive Medicine. 2001;20(3 Suppl):21–35. [PubMed: 11306229]
  8. Task Force on Genetic Testing (NIH). Promoting Safe and Effective Genetic Testing in the United States. Final Report of the Task Force on Genetic Testing. 1997. [Accessed August 22, 2011]. Available at: http://www
  9. Khoury MJ, Dorman JS. The Human Genome Epidemiology Network. American Journal of Epidemiology. 1998;148(1):1–3. [PubMed: 9663396]
  10. Palomaki GE, Bradley LA, Richards CS, Haddow JE. Analytic validity of cystic fibrosis testing: a preliminary estimate. Genet Med. 2003;5(1):15–20. [PubMed: 12544471]
  11. Palomaki GE, Haddow JE, Bradley LA, FitzSimmons SC. Updated assessment of cystic fibrosis mutation frequencies in non-Hispanic Caucasians. Genet Med. 2002;4(2):90–4. [PubMed: 11882786]
  12. Palomaki GE, Haddow JE, Bradley LA, Richards CS, Stenzel TT, Grody WW. Estimated analytic validity of HFE C282Y mutation testing in population screening: the potential value of confirmatory testing. Genet Med. 2003;5(6):440–3. [PubMed: 14614395]
  13. Sun F, Bruening W, Erinoff E, Schoelles KM. Methods Research Report. Rockville, MD: Agency for Healthcare Research and Quality; Jun, 2011. Addressing Challenges in Genetic Test Evaluation. Evaluation Frameworks and Assessment of Analytic Validity. (Prepared by the ECRI Institute Evidence-based Practice Center under Contract No. 290-2007-10063-I.) AHRQ Publication No. 11-EHC048-EF. Available at: www.effectivehealthcare [PubMed: 21834175]
  14. Agency for Healthcare Research and Quality. Genetic Tests for Cancer Technology Assessment. Rockville, MD: Agency for Healthcare Research and Quality; 2006. [Accessed August 22, 2011]. Available at: http://archive/clinic/ta/gentests/
  15. Burke W, Atkins D, Gwinn M, et al. Genetic test evaluation: information needs of clinicians, policy makers, and the public. American Journal of Epidemiology. 2002;156(4):311–8. [PubMed: 12181100]
  16. Little J, Bradley L, Bray MS, et al. Reporting, appraising, and integrating data on genotype prevalence and gene-disease associations. American Journal of Epidemiology. 2002;156(4):300–10. [PubMed: 12181099]
  17. Brockton N, Little J, Sharp L, Cotton SC. N-acetyltransferase polymorphisms and colorectal cancer: a HuGE review. American Journal of Epidemiology. 2000;151(9):846–61. [PubMed: 10791558]
  18. d’Errico A, Malats N, Vineis P, Boffetta P. IARC scientific publications; Review of studies of selected metabolic polymorphisms and cancer. 1999;(148):323–93. [PubMed: 10493265]
  19. Yang M, Hendrie HC, Hall KS, Oluwole OS, Hodes ME, Sahota A. Improved procedure for eluting DNA from dried blood spots. Clinical Chemistry. 1996;42(7):1115–6. [PubMed: 8674202]
  20. Gale KB, Ford AM, Repp R, et al. Backtracking leukemia to birth: identification of clonotypic gene fusion sequences in neonatal blood spots. Proceedings of the National Academy of Sciences of the United States of America. 1997;94(25):13950–4. [PMC free article: PMC28413] [PubMed: 9391133]
  21. Walker AH, Najarian D, White DL, Jaffe JF, Kanetsky PA, Rebbeck TR. Collection of genomic DNA by buccal swabs for polymerase chain reaction-based biomarker assays. Environmental Health Perspectives. 1999;107(7):517–20. [PMC free article: PMC1566681] [PubMed: 10378997]
  22. Harty LC, Shields PG, Winn DM, Caporaso NE, Hayes RB. Self-collection of oral epithelial cell DNA under instruction from epidemiologic interviewers. American Journal of Epidemiology. 2000;151(2):199–205. [PubMed: 10645823]
  23. Garcia-Closas M, Egan KM, Abruzzo J, et al. Collection of genomic DNA from adults in epidemiological studies by buccal cytobrush and mouthwash. Cancer Epidemiol Biomarkers Prev. 2001;10(6):687–96. [PubMed: 11401920]
  24. Hixson JE, Vernier DT. Restriction isotyping of human apolipoprotein E by gene amplification and cleavage with HhaI. Journal of Lipid Research. 1990;31(3):545–8. [PubMed: 2341813]
  25. Tobe VO, Taylor SL, Nickerson DA. Single-well genotyping of diallelic sequence variations by a two-color ELISA-based oligonucleotide ligation assay. Nucleic Acids Research. 1996;24(19):3728–32. [PMC free article: PMC146169] [PubMed: 8871551]
  26. Lee LG, Connell CR, Bloch W. Allelic discrimination by nick-translation PCR with fluorogenic probes. Nucleic Acids Research. 1993;21(16):3761–6. [PMC free article: PMC309885] [PubMed: 8367293]
  27. Bogardus ST Jr, Concato J, Feinstein AR. Clinical epidemiological quality in molecular genetic research: the need for methodological standards. JAMA. 1999;281(20):1919–26. [PubMed: 10349896]
  28. Botto LD, Yang Q. 5,10-Methylenetetrahydrofolate reductase gene variants and congenital anomalies: a HuGE review. American Journal of Epidemiology. 2000;151(9):862–77. [PubMed: 10791559]
  29. Dorman JS, Bunker CH. HLA-DQ locus of the human leukocyte antigen complex and type 1 diabetes mellitus: a HuGE review. Epidemiologic Reviews. 2000;22(2):218–27. [PubMed: 11218373]
  30. Cotton SC, Sharp L, Little J, Brockton N. Glutathione S-transferase polymorphisms and colorectal cancer: a HuGE review. American Journal of Epidemiology. 2000;151(1):7–32. [PubMed: 10625170]
  31. National Cholesterol Education Program. Third Report of the National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults (Adult Treatment Panel III) final report. Dec 17, 2002. Report No.: 1524-4539 (Electronic) [PubMed: 12485966]
  32. Schunkert H, Gotz A, Braund P, et al. Repeated replication and a prospective meta-analysis of the association between chromosome 9p21.3 and coronary artery disease. Circulation. 2008;117(13):1675–84. [PMC free article: PMC2689930] [PubMed: 18362232]
  33. Gage BF, Eby C, Milligan PE, Banet GA, Duncan JR, McLeod HL. Use of pharmacogenetics and clinical factors to predict the maintenance dose of warfarin. Thromb Haemost. 2004;91(1):87–94. [PubMed: 14691573]
  34. Gage BF, Lesko LJ. Pharmacogenetics of warfarin: regulatory, scientific, and clinical issues. J Thromb Thrombolysis. 2008;25(1):45–51. [PubMed: 17906972]
  35. Jonas DE, McLeod HL. Genetic and clinical factors relating to warfarin dosing. Trends Pharmacol Sci. 2009;30(7):375–86. [PubMed: 19540002]
  36. Attia J, Ioannidis JP, Thakkinstian A, et al. How to use an article about genetic association: A: Background concepts. JAMA. 2009;301(1):74–81. [PubMed: 19126812]
  37. Attia J, Ioannidis JP, Thakkinstian A, et al. How to use an article about genetic association: B: Are the results of the study valid? JAMA. 2009;301(2):191–7. [PubMed: 19141767]
  38. Williams JA, Johnson K, Paulauskis J, Cook J. So many studies, too few subjects: establishing functional relevance of genetic polymorphisms on pharmacokinetics. Journal of Clinical Pharmacology. 2006;46(3):258–64. [PubMed: 16490801]
  39. Hoggart CJ, Clark TG, De Iorio M, Whittaker JC, Balding DJ. Genome-wide significance for dense SNP and resequencing data. Genetic Epidemiology. 2008;32(2):179–85. [PubMed: 18200594]
  40. McCarthy MI, Abecasis GR, Cardon LR, et al. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nature Reviews. 2008;9(5):356–69. [PubMed: 18398418]
  41. Benjamini Y, Drai D, Elmer G, Kafkafi N, Golani I. Controlling the false discovery rate in behavior genetics research. Behav Brain Res. 2001;125(1–2):279–84. [PubMed: 11682119]
  42. Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, Rothman N. Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. J Natl Cancer Inst. 2004;96(6):434–42. [PubMed: 15026468]
  43. Ziegler A, Konig IR, Thompson JR. Biostatistical aspects of genome-wide association studies. Biom J. 2008;50(1):8–28. [PubMed: 18217698]
  44. Caporaso N. Selection of candidate genes for population studies. In: Vineis P, Malats N, Lang M, et al., editors. Metabolic polymorphisms and susceptibility to cancer. Lyon, France: IARC Monogr Eval Carcinog Risks Hum; 1999. pp. 23–36.
  45. Matchar DB, Thakur ME, Grossman I, et al. Testing for cytochrome P450 polymorphisms in adults with non-psychotic depression treated with selective serotonin reuptake inhibitors (SSRIs). Evid Rep Technol Assess (Full Rep). 2007;(146):1–77. [PMC free article: PMC4781099] [PubMed: 17764209]
  46. Seidenfeld J, Samson DJ, Rothenberg BM, Bonnell CJ, Ziegler KM, Aronson N. HER2 Testing to Manage Patients With Breast Cancer or Other Solid Tumors/Technology Assessment No. 172. Rockville, MD: 2008. (Prepared by Blue Cross and Blue Shield Association Technology Evaluation Center Evidence-based Practice Center, under Contract No. 290-02-0026) [PMC free article: PMC4781031] [PubMed: 19408965]
  47. Genetic risk assessment: recommendation statement. Genetic risk assessment and BRCA mutation testing for breast and ovarian cancer susceptibility: recommendation statement. Annals of Internal Medicine. 2005;143(5):355–61. [PubMed: 16144894]

Acknowledgements: We would like to thank Halle R. Amick (University of North Carolina, Cecil G. Sheps Center for Health Services Research) and Crystal M. Riley (Duke-NUS Graduate Medical School Singapore) for their assistance with preparation of this manuscript, insightful editing, and outstanding attention to detail. We deeply appreciate the considerable support, commitment, and contributions of Stephanie Chang, MD, MPH, the AHRQ Task Order Officer for this project and the Evidence-based Practice Center Director.

Funding: Funded by the Agency for Healthcare Research and Quality (AHRQ) under the Effective Health Care Program.

Disclaimer: The findings and conclusions expressed here are those of the authors and do not necessarily represent the views of AHRQ. Therefore, no statement should be construed as an official position of AHRQ or of the U.S. Department of Health and Human Services.

Accessibility: Persons using assistive technology may not be able to fully access information in this report. For assistance contact EffectiveHealthCare@ahrq.hhs.gov.

Conflict of interest: None of the authors has any affiliations or financial involvement that conflicts with the information presented in this chapter.

Jonas DE, Wilt TJ, Taylor BC, Wilkins TM, Matchar DB. Challenges in and principles for conducting systematic reviews of genetic tests used as predictive indicators. AHRQ Publication No. 12-EHC083-EF. Chapter 11 of Methods Guide for Medical Test Reviews (AHRQ Publication No. 12-EHC017). Rockville, MD: Agency for Healthcare Research and Quality; June 2012. www​.effectivehealthcare​ Also published in a special supplement to the Journal of General Internal Medicine, July 2012.

