NCBI » Bookshelf » Human Molecular Genetics 2 » Genetic testing in individuals and populations
 
hmg
Human Molecular Genetics 2
2nd
Tom Strachan1 and Andrew P Read2
1University of Newcastle, Newcastle-upon-Tyne, UK
2University of Manchester, Manchester, UK
BIOS Scientific Publishers Ltd1-85996-202-51999
genetics

 Chapter 17:  Genetic testing in individuals and populations

A2178

Geneticists have no monopoly on DNA-based diagnosis. For microbiologists and virologists, for example, PCR is a central tool for identifying pathogens. Hematologists, oncologists and other pathologists all use DNA testing as a basis for diagnosis. For reviews of a range of applications, see the book by Newton and Graham (Further reading). However, for the purposes of this chapter, we will define genetic testing as testing for mendelian factors. The factors indicate a person's risk of developing or transmitting a disease, or identify her, or indicate her relationship to somebody else.

Wherever possible, we shall use two of the most common mendelian diseases, cystic fibrosis and Duchenne muscular dystrophy, to illustrate the various testing methods. Between them these two diseases exemplify many of the situations that arise in genetic testing for mendelian disorders. As always in this book, we concentrate on the principles and not the practical details. The reader interested in specific procedures can find a series of ‘best practice’ guidelines for laboratory diagnosis of the commoner mendelian diseases at http//:www.cmgs.org. These have been drawn up by consensus workshops of the UK Clinical Molecular Genetics Society.

How much information a genetic test can give depends on the state of knowledge about the gene(s) involved, but in principle laboratory genetic diagnosis can be made in two essentially different ways.

  • Direct testing: a sample (DNA, RNA, protein, etc.) from a consultand is tested to see whether or not he has a certain genotype - typically, a pathogenic mutation in a certain gene. The test is of an individual, and gives information about that individual.

  • Gene tracking: linked markers are used in family studies to discover whether or not the consultand inherited the high-risk chromosome from a heterozygous parent. The test is of a family, and gives information about the segregation of a chromosomal segment in the family.

With each year that passes, the role of gene tracking shrinks and the applications of direct testing grow. However, direct testing is not always possible, and even when it is scientifically possible, it may not always be practical in the context of a routine diagnostic service.

17.1. Direct testing is like any other path lab investigation: a sample from the patient is tested to see if it is normal or abnormal

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f1.jpg.

Figure 17.1

.

   Genetic diagnosis, counseling and prediction

The flow diagram shows how a clinical diagnosis is made starting from the patient's phenotype and then, depending on the mode of inheritance and the state of genetic knowledge, various means are deployed for counseling and predictive tests.

The optimal, though not always practical, method of laboratory genetic diagnosis is to test a person's gene or gene product directly to see whether the sequence is normal or mutant. We must of course know which gene to examine and we must know the relevant ‘normal’ (wild-type) sequence (Figure 17.1).

17.1.1. Direct testing can use a variety of methods, almost all based on PCR, applied to a wide range of sample types

Direct testing is almost always done by PCR, applying the methods described in Section 6.2. The few applications of Southern blotting include testing for major gene rearrangements or disruptions and for Fragile X and myotonic dystrophy full mutations (Boxes 16.7 and 16.8). The sensitivity of PCR allows us to use a wide range of tissue samples. These can include the following:

  • Blood samples - the most widely used source of DNA from adults.

  • Mouthwashes or buccal scrapes - being noninvasive, they are especially favored for population screening programs. Mouthwashes yield sufficient DNA for a few dozen tests, and by using whole genome amplification (Section 6.2.4) more extensive testing of a single sample may be possible.

  • Chorionic villus biopsy samples - the best source of fetal DNA (better than amniocentesis specimens).

  • One or two cells removed from eight-cell stage embryos, for pre-implantation diagnosis after in vitro fertilization.

  • Hair, semen, etc. for criminal investigations.

  • Archived pathological specimens, for typing dead people when no DNA has been stored, or testing tumors for genetic changes. Only short sequences, 250 bp or less, can be reliably amplified from fixed tissue specimens.

  • Guthrie cards - these are the cards on which a spot of dried blood is sent to a laboratory for neonatal screening for phenylketonuria (PKU) in the UK and elsewhere; not all the blood spot is used for the screening test. They are a possible source of DNA from a dead child.

RNA has advantages over DNA, but is more difficult to obtain and handle

Table 17.5

Methods for scanning a gene for mutations
MethodAdvantagesDisadvantages
Southern blot, hybridize to cDNA probeOnly way to detect major deletions and rearrangementsLaborious, expensive
Needs several μg of DNA
SequencingDetects all changesExpensive; can be hard to interpret
Mutations fully characterized
Heteroduplex gel mobilityVery simple, cheapSequences <200 bp only
Limited sensitivity
Does not reveal position of change
Denaturing HPLCQuick, high throughput; quantitativeExpensive equipment
Does not reveal position of change
Single-strand conformation polymorphism (SSCP) analysisSimple, cheap equipmentSequences <200 bp only
Limited sensitivity
Does not reveal position of change
Denaturing gradient gel electrophoresis (DGGE)High sensitivityChoice of primers is critical
Expensive primers
Does not reveal position of change
Dideoxy fingerprintingHigh sensitivityComplicated to interpret
Mismatch cleavage
(i) chemicalHigh sensitivityToxic chemicals
Shows position of changeExperimentally difficult
(ii) enzymaticNo nasty reagentsPoor quality results
Protein truncation test (PTT)High sensitivity for chain terminating mutationsChain terminating mutations only
Shows position of changeExpensive
Experimentally difficult
Usually needs RNA
Oligonucleotide arrays (gene chips):Both types:Both types:
(i) hybridization arraysQuickNovel technology still under development
(ii) minisequencing arraysHigh throughputExpensive equipment
Might detect and define all changesLimited range of genes

The table summarizes the advantages and disadvantages of each method for use in a routine diagnostic service. Heteroduplex gel mobility and SSCP can be performed simultaneously on a single gel (see Figure 17.6).

If a gene has to be scanned for unknown mutations, testing by RT-PCR (see Figure 6.5) offers several advantages. DNA testing usually involves amplifying and testing each exon separately, and this can be a major chore in a gene with many exons. Most of the mutation-scanning methods (see Table 17.5) can scan fragments larger than the average sized exon, so that an RT-PCR product can be examined using a smaller number of reactions. Also, only RT-PCR can reliably detect aberrant splicing, which is sometimes hard to predict from a DNA sequence change, or may be caused by activation of a cryptic splice site deep within an intron. However, RNA is much less convenient to obtain and work with. Samples must be handled with extreme care and processed rapidly to avoid degrading mRNA, and the gene of interest may not be expressed in readily accessible tissues. In addition, many mutations result in unstable mRNA (Section 9.4.6), so that the RT-PCR product from a heterozygous person may show only the normal allele.

Functional assays of proteins have a role in genetic testing

A protein-based functional assay might classify the products of a highly heterogeneous allelic series into two simple groups, functional and nonfunctional - which is, after all, the essential question in most diagnoses. The problem with functional assays is that they are specific to a particular protein. DNA technology by contrast is generic. This has obvious advantages for the diagnostic lab, but in addition it encourages technical development, since any new technique can be applied to many problems.

17.1.2. Some diseases show limited allelic heterogeneity, and genetic testing involves testing for specific mutations

Table 17.1

Examples of diseases that show a limited range of mutations
DiseaseCauseComments
Huntington disease, myotonic dystrophyGain of function mutationUnstable expanded repeat (see Box 16.7)
Fragile XCommon molecular mechanism: expansion of an unstable repeatSee Boxes 16.7 and 16.8; other mutations occur, but are rare
AchondroplasiaOnly G380R produces this particular phenotype; very high mutation rateTwo distinct changes, both causing G380R in FGFR3 gene (Figure 16.8)
Sickle cell diseaseOnly this particular mutation produces the sickle-cell phenotypeE6V in HBB gene (see Figure 5.11)
α- and β-thalassemiaSelection for heterozygotes leads to different ancestral mutations being common in different populationsSee Figure 16.2 (α-thalassemia) and Table 17.2 (β-thalassemia)
Cystic fibrosisCommon ancestral mutations in northern European populations; ancient heterozygote advantageSee Table 17.3 and Section 3.3.2
Charcot-Marie-Tooth disease (HMSN1)Common molecular mechanism: recombination between misaligned repeatsDuplication of 1.5 Mb at 17p11.2 (Figure 16.7); point mutations also occur
21-Hydroxylase deficiencyCommon molecular mechanism: sequence exchange with adjacent closely related pseudogeneSee Figure 9.17
Tay-Sachs diseaseFounder effect in Ashkenazi Jews; ancient heterozygote advantageTwo common HEXA mutations in Ashkenazi: 4-bp insertion in exon11 (73%); exon 11 donor splice site G>C (15%)

See Section 16.3 for further discussion of the reasons why some diseases show a limited range of mutations, while others have extensive allelic heterogeneity.

For some diseases, most or all affected people have the same mutation, or one of a small number of different mutations. Table 17.1 lists some examples. We saw why this should happen in Section 16.3.3; in brief it is when:

  • the disease depends on a specific molecular mechanism;

  • the nature of the gene is such that one particular mutation occurs repeatedly;

  • affected people mostly carry the same ancestral disease mutation, or one of a limited number of ancestral mutations.

Testing for these diseases starts with checking for specific mutations, using one of the methods described in Section 17.1.3.

β-Thalassemia and cystic fibrosis illustrate different situations of limited mutational diversity

For both these conditions, a very large number of mutations in the relevant gene have been described, but in each case a handful of mutations account for the majority of cases in any particular population. With β-thalassemia different mutations are predominant in different populations (Table 17.2). DNA testing is not needed to diagnose carriers or affected people (orthodox hematology does this perfectly well) but it is the method of choice for prenatal diagnosis. Provided one has DNA samples from the parents and knows their ethnic origin, the parental mutations can often be found using only a small cocktail of direct tests, after which the fetus can be readily checked.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f14.jpg.

Figure 17.14

.

   Gene tracking for prenatal diagnosis of an autosomal recessive disease

Four families each have a child affected with a recessive disease. Direct mutation testing is not possible (either because the gene has not been cloned, or because the mutations could not be found). (A) No diagnosis is possible if there is no sample from the affected child. (B) If everybody has the same heterozygous genotype for the marker, the result is not clinically useful. (C) If the parents are homozygous for the marker, no prediction is possible with this marker. (D) Successful prediction. The error rates shown are the risk of predicting an unaffected pregnancy when the fetus is affected, or vice versa, if the marker used shows a recombination fraction θ with the disease locus. These examples emphasize the need for both an appropriate pedigree structure (DNA must be available from the affected child) and informative marker types.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f18.jpg.

Figure 17.18

.

   Flowchart for CF population screening

Results of screening 10 000 people, 1:23 of whom is a carrier. If a person tests positive, his/her spouse is then tested. Black figures show results using a test which detects 70% of CF mutations (i.e. testing for F508del only); blue figures show results for a test with 90% sensitivity. Blue boxes represent cases which would be seen as successes for the screening program (regardless of what action they then take), gray boxes represent failures. PND, prenatal diagnosis.

In cystic fibrosis, by contrast, the F508del mutation is the commonest in all European populations, and is believed to be of ancient origin. However, the proportion of all mutations that are F508del varies, being generally high in the north and west of Europe and lower in the south. Direct testing for cystic fibrosis mutations divides into two phases. First, a limited number of specified mutations, always including F508del, are sought using the methods described in Section 17.1.3. As Table 17.3 shows, there is no obvious natural cut-off in terms of diminishing returns on testing for specific mutations. If this phase fails to reveal the mutations, then if resources allow, a screen for unknown mutations may be instituted, using the methods described in Section 17.1.4, or alternatively gene tracking (see Figure 17.14) may be used. The impact of this diversity on proposals for population screening is discussed below (see Figure 17.18).

Some combination of founder effects and heterozygote advantage must be the cause of this relative mutational homogeneity in recessive disorders. However, surprisingly often, when a recessive disease is particularly common in a certain population, it turns out that more than one mutation is responsible. An example is Tay-Sachs disease among Ashkenazi Jews, where there are two common HEXA mutations (Table 17.1). It is difficult to explain this situation except by assuming there was substantial heterozygote advantage some time when the founding population was small.

17.1.3. Some genetic testing methods test for presence or absence of a specified DNA sequence change

Table 17.4

Methods of testing for a specified mutation
MethodComments
Restriction digestion of PCR-amplified DNA; check size of products on a gel (Figure 6.6)Only when the mutation creates or abolishes a natural restriction site, or one engineered by use of special PCR primers (Figure 17.2)
Hybridize PCR-amplified DNA to allele-specific oligonucleotides (ASO) on a dot-blot or gene chipGeneral method for specified point mutations; large arrays allow scanning for any mutation
PCR using allele-specific primers (ARMS test)General method for point mutations; primer design critical (Figure 17.3)
Can be adapted to chip technology. Can provide real-time quantitative readout, using TaqMan technology (Figure 6.10)
Oligonucleotide ligation assay (OLA)General method for specified point mutations (Figure 17.4)
PCR with primers located either side of a translocation breakpointSuccessful amplification shows presence of the suspected deletion or specified rearrangement
Check size of expanded repeatDynamic repeat diseases (Box 16.7) only; large expansions require Southern blots, smaller ones can be done by PCR

See Section 6.2.3 for description of the principles.

Testing for the presence or absence of a known sequence change is a different and much simpler problem than scanning a gene for the presence of any mutation. Some of the main methods were described in Section 6.2.3, and are summarized in Table 17.4. Many variants of these and other methods have been developed as kits by biotechnology companies. Testing for a known change is useful for:

  • diseases with limited allelic heterogeneity, such as those discussed above;

  • diagnosis within a family. Mutation scanning methods may be needed to define the family mutation but, once it is characterized, other family members normally need be tested only for that particular mutation;

  • in research, for testing control samples. A common problem in positional cloning is that a candidate gene has been identified, and a patient has a sequence change in this gene. The question then arises, is this change pathogenic (confirming that this gene is indeed the disease gene), or might it be a nonpathogenic polymorphism (see Section 16.3.4)? One common approach is to screen a panel of 100 or so normal control samples for the presence of the change (which does not, of course, solve the problem of rare neutral variants).

Testing for the presence or absence of a restriction site

When a base substitution mutation creates or abolishes the recognition site of a restriction enzyme, this allows a simple direct PCR test for the mutation (Figure 6.6). Although hundreds of restriction enzymes are known, they almost all recognize symmetrical palindromic sites, and many point mutations will not happen to affect such sequences. Also, sites for rare and obscure restriction enzymes are unsuitable for routine diagnostic use because the enzymes are expensive and often of poor quality. Sometimes, however, a diagnostic restriction site can be introduced by a form of PCR mutagenesis using carefully designed primers. Figure 17.2 shows an example.

Use of allele-specific oligonucleotide (ASO) hybridization

Under suitably stringent hybridization conditions, these short synthetic probes hybridize only to a perfectly matched sequence (see Figure 5.10). Figure 5.11 demonstrates the use of dot-blot hybridization with ASO probes to detect the single base substitution that causes sickle cell disease. For diagnostic purposes, a reverse dot-blot procedure has often been used. A screen for a series of defined cystic fibrosis mutations, for example, would use a series of ASOs specific for each mutant allele, spotted onto a single membrane which is then hybridized to labeled PCR-amplified test DNA. Recently reverse dot-blotting has developed from manually-spotted arrays of small numbers of ASOs to very large ASO arrays on ‘gene chips’ that can potentially detect all possible mutations in a gene (Section 17.1.4).

Allele-specific PCR amplification (ARMS test)

The principle of the ARMS (amplification refractory mutation system) method was shown in Figure 6.9. Paired PCR reactions are carried out. One primer (the common primer) is the same in both reactions, the other exists in two slightly different versions, one specific for the normal sequence and the other specific for the mutant sequence. Additional control primers are usually included, to amplify some unrelated sequence from every sample as a check that the PCR reaction has worked. The location of the common primer can be chosen to give different sized products for different mutations, so that the PCR products of multiplexed reactions form a ladder on a gel. With careful primer design, the mutation-specific primers can also be made to give distinguishable products. For example, they can be labeled with different fluorescent or other labels, or given 5′ extensions of different sizes. Multiplexed mutation-specific PCR is well suited to screening large numbers of samples for a given panel of mutations (Figure 17.3).

The ARMS principle can be adapted to allow the course of a quantitative PCR reaction to be followed in real time, using TaqMan or related fluorescer-quencher methods (Section 6.2.3).

Oligonucleotide ligation assay (OLA)

In the OLA test for base substitution mutations, two oligonucleotides are constructed that hybridize to adjacent sequences in the target, with the join sited at the position of the mutation. DNA ligase will covalently join the two oligonucleotides only if they are perfectly hybridized (Nickerson et al., 1990). Various formats for the test are possible, for example as an ELISA reaction or, as in Figure 17.4, for analysis on a fluorescence sequencer.

17.1.4. Mutation scanning methods are used to check whether a gene carries any mutation

For the great majority of diseases there is extensive allelic heterogeneity, and genetic testing requires a search for any mutation anywhere within or near the relevant gene. The biggest current problem in laboratory genetic diagnosis is the lack of any quick, cheap and reliable method for doing this. Table 17.5 lists a number of methods that can be used to seek mutations in a gene, and these are briefly described below; for laboratory details, see the book by Cotton et al.. (Further reading). The table also summarizes their advantages and disadvantages, from the point of view of a diagnostic laboratory.

The requirements for routine diagnostic and for research use are rather different. Researchers usually want to screen a small number of samples for mutations in several candidate genes as quickly as possible. They are not worried if the methods require several days of benchwork or high levels of skill, but they do not want to invest much effort in fine-tuning a method for a particular gene. Sequencing and heteroduplex/SSCP analysis have been the methods most frequently used. Diagnostic laboratories grew up in a research culture and inherited the same attitudes, but increasingly now they need methods for testing a large number of samples for mutations in just a few genes, with as near 100% sensitivity as possible. Time can be spent optimizing a test for a particular gene, provided the method once developed is quick and simple. Methods particularly suited to this approach include DGGE, two-dimensional gels, the protein truncation test, and many commercial kits.

For routine diagnostic use, all these methods suffer in varying degree from two limitations:

  • They are quite laborious and expensive for use in a diagnostic service that needs to produce answers quickly and within a modest budget.

  • They detect differences between the patient's sequence and the published ‘normal’ sequence - but they do not generally distinguish between pathogenic and chance nonpathogenic changes.

It seems likely that eventually oligonucleotide arrays of one sort or another (DNA chips) will replace most other methods for routine mutation scanning in the commoner diseases, and automated sequencing will be increasingly used for the rarer diseases, where the investment to produce custom chips is not warranted. Widespread use of these methods will highlight the problem of deciding whether a sequence change is pathogenic or not (Box 16.4). Hopefully, advancing knowledge of gene function and the development of disease-specific mutation databases will make this task easier. Nevertheless, at least for the next few years, testing for unknown mutations in a service laboratory remains a considerable problem.

Sequencing, particularly of RT-PCR product, is increasingly the method of choice for mutation scanning

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f5.jpg.

Figure 17.5

.

   CF mutations revealed by sequencing

(A) A base substitution in exon 3. The double peak (arrow) in the upper trace shows the sample contains the mutation 332C>T (amino acid substitution P67L) in heterozygous form. Careful quality control is needed to allow heterozygous base substitutions to be reliably distinguished from noise. (B) A single base deletion 3659delC in exon 19 (lower trace). Sequence downstream of the deletion is confused, reflecting overlapping sequence of the two alleles in this heterozygote. The change would be confirmed by sequencing the reverse strand. Courtesy of Dr Andrew Wallace, St Mary's Hospital, Manchester.

As automated fluorescence sequencers become standard items of laboratory equipment, sequencing becomes more and more attractive as a means of mutation scanning (Figure 17.5). Mutations picked up by other methods are often confirmed by sequencing, so it is tempting to do the sequencing straight away. Nevertheless, sequencing is expensive, especially for scanning many exons of a gene in a genomic DNA sample, and much time can be wasted investigating artefacts, especially if the sequencing template is not of the highest quality.

Several methods rely on formation of heteroduplexes

Many tests use the properties of heteroduplexes to detect differences between two sequences. Most mutations occur in heterozygous form, and heteroduplexes can be formed simply by heating the test PCR product to denature it, and then cooling slowly. For homozygous mutations, or X-linked mutations in males, it is necessary to add some reference wild-type DNA. Several properties of heteroduplexes can be exploited:

  • Heteroduplexes often have abnormal mobility on nondenaturing polyacrylamide gels (Figure 17.6, lower panel). Special gels (Hydrolink™, MDE™) are supposed to improve the resolution. This is a particularly simple method to use. If fragments no more than 200 bp long are tested, insertions, deletions and most but not all single-base substitutions are detectable (Keen et al., 1991).

  • An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f7.jpg.

    Figure 17.7

    .

       CF mutation detection by denaturing gradient gel electrophoresis

    Each exon of the CFTR gene (except exon 9, which in this protocol is sequenced directly) is PCR-amplified in one or more segments and run on 9% polyacrylamide gels containing a gradient of urea-formaldehyde denaturant. Bands contain exons as labeled. The band from any amplicon that contains a heterozygous variant splits into usually four sub-bands (arrows). Individual 1 (left lane in each panel) has variants in amplicons 6, 10 and 14; individual B (right lane) has variants in amplicons 3, 10, 13, 16, 17 and 24. Characterization of the variants showed that individual 1 was heterozygous for F508del (exon 10), and individual 2 was a compound heterozygote E60X (exon 3) / R1070Q (exon 17). Other variants were nonpathogenic. Courtesy of Dr Hans Scheffer, University of Groningen, Netherlands.

    Heteroduplexes have abnormal denaturing profiles. This is exploited in denaturing gradient gel electrophoresis (DGGE, Figure 17.7) and denaturing high performance liquid chromatography (dHPLC). In both cases the mobility of a fragment changes markedly when it denatures. These methods require tailoring to the particular DNA sequence under test, and so are best suited to routine analysis of a given fragment in many samples. DGGE requires special primers with a 5′ poly(G;C) extension (a GC clamp). Once optimized, these methods have very high sensitivity. Two-dimensional DGGE gels have near 100% sensitivity for mutation detection (Dhanda et al., 1998). As with any method that finds every sequence variant in a large gene, it is necessary to sort out pathogenic from nonpathogenic changes (Figure 17.7).

  • An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f8.jpg.

    Figure 17.8

    .

       CF mutation detection by chemical cleavage of mismatch

    The test sample, PCR-amplified exon 13 of the CFTR gene, was denatured and allowed to renature. Because a mutation is present in heterozygous form, heteroduplexes are formed, and these are cleaved by treatment with OsO4 (or KMnO4) or hydroxylamine. Running on a fluorescence sequencer shows the full-length fragment (906 bp) plus cleavage products of 357 and 549 bp. Sequencing revealed the mutation 2184delA. Courtesy of Dr Julie Wu, St Mary's Hospital, Manchester.

    Mismatched bases in heteroduplexes are sensitive to cleavage by chemicals or enzymes. The chemical cleavage of mismatch (CCM) method (Figure 17.8) is a sensitive method for mutation detection, with the advantages that quite large fragments (over 1 kb) can be analyzed, and the location of the mismatch is pinpointed by the size of the fragments generated. Its disadvantages are that it uses very toxic chemicals, particularly osmium tetroxide (though this can be substituted by potassium permanganate), and some practice is needed before it works well. An alternative is enzymic cleavage of mismatches, which uses enzymes such as T4 phage resolvase or endonuclease VII to achieve the same result without the toxic chemicals. Unfortunately in most people's hands, the quality of the gels produced leaves much to be desired.

Single-strand conformational polymorphism (SSCP) analysis is one of the most popular methods for mutation scanning

Single-stranded DNA has a tendency to fold up and form complex structures stablilized by weak intramolecular bonds, notably base-pairing hydrogen bonds. The electrophoretic mobilities of such structures on nondenaturing gels will depend not only on their chain lengths but also on their conformations, which are dictated by the DNA sequence. For SSCP (Figure 17.6), amplified DNA samples (which may be RT-PCR products) are denatured and loaded on a nondenaturing polyacrylamide gel. Primers can be radiolabeled, or unlabeled products can be detected by silver staining. Control samples must be run, so that differences from the wild-type pattern can be noticed. SSCP is simple and adequately sensitive for fragments up to 200 bp long, but it does not reveal the nature or position of any mutation detected (Sheffield et al., 1993). SSCP and heteroduplex analysis can be combined on a single gel, as in Figure 17.6. The precise pattern of bands seen is very dependent on details of the conditions. An elaboration of SSCP, dideoxy fingerprinting, analyzes each band in a sequencing ladder by SSCP, and is claimed to give 100% sensitivity (Sarkar et al., 1992).

The protein truncation test (PTT) provides an efficient means to check for mutations that produce premature termination codons

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f9.jpg.

Figure 17.9

.

   The protein truncation test (PTT)

Coding sequence without introns (cDNA or large exons in genomic DNA) is PCR amplified using a special forward primer that includes a T7 promoter, a eukaryotic translation initiator with an ATG start codon, and a gene-specific 3′ sequence designed so that the sequence amplified reads in-frame from the ATG. A coupled transcription-translation system is used to produce polypeptide from the PCR product, and the protein is checked for size by SDS-PAGE gel electrophoresis. A truncated polypeptide points to the presence of a premature stop codon. In this example, RT-PCR product from two males with DMD has been analyzed. Patient 1 exons 58–68 give normal (47 kDa) product only (lane 2) but exons 67–79 (lane 3) encode a 30 kDa truncated product (arrow) - the band is very faint in this example. In patient 2 exons 58–68 give a truncated product of 22 kDa (lane 5); exons 67–79 give normal (48 kDa) product only (lane 6). Sequencing revealed mutations 10431 + 1G>A and 9405C>A in the two patients. Photo courtesy of Drs Steve Abbs and Zandra Hatton, Medical and Molecular Genetics, Guy's Hospital, London.

The PTT (Figure 17.9) is a specific test for frameshifts, splice site or nonsense mutations that truncate a protein product (van der Luijt et al., 1994). Clearly, the strength and weakness of the PTT is that it detects only certain classes of mutation. It would not be useful for cystic fibrosis, where only a minority of mutations introduce premature termination codons. But in Duchenne muscular dystrophy, adenomatous polyposis coli or BRCA1-related breast cancer, missense mutations are infrequent, and any such change found may well be coincidental and nonpathogenic. For such diseases, the PTT has several advantages. It conveniently ignores silent or missense base substitutions, and (like mismatch cleavage methods, but unlike SSCP) it reveals the approximate location of any mutation. Large exons, such as exon 15 of the APC gene (6.5 kb), can be tested using genomic DNA rather than by RT-PCR. Several variants have been developed to give cleaner results, usually by incorporating an immunoprecipitation step.

High-density oligonucleotide (GeneChip®) arrays promise to make high-throughput mutation scanning feasible

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f10.jpg.

Figure 17.10

.

   Oligonucleotide arrays

(A) Principles of mutation detection by hybridization to an oligonucleotide array. Oligonucleotides are arrayed in sets of four, with each set corresponding to part of the wild-type sequence and to each of the three possible base substitutions at a central position in the oligo. The mismatches to the wild-type sequence are underlined. The mutant sequence has an A>C substitution at position 13. Nucleotides with mismatches to the mutant sequence are in lower case. The number of mismatches and strength of hybridization to the wild-type and mutant sequences are shown on the right. Oligos with one mismatch hybridize weakly; those with two mismatches do not hybridize. When hybridized to the normal DNA, the sequence can be read off as a series of strongly hybridizing cells. With the mutant DNA, the chip will show an area of diminished hybridization with one strongly hybridizing cell marking the substituted base (lower panel). (B) Principles of miniseqencing array. Each cell of the array contains an oligonucleotide anchored by its 5′end, corresponding to part of the target sequence. PCR-amplified target DNA is hybridized to the array. It acts as a template for extension of the primers. Each primer is extended by a single nucleotide, using color-labeled dideoxynucleotides. The result of using the A>C mutation as above is shown. The blue label added to primer 4 identifies the changed base. Primer 5 will fail to add any label because of the mismatched 3′ end.

Trials to date have involved custom-designed gene-specific arrays of 20–25-mer oligonucleotide probes, in either of two basic designs (Figure 17.10):

  • Hybridization chips contain oligonucleotides matching all wild-type and single nucleotide substitution sequences in a gene. Test DNA is PCR-amplified, fluorescently labeled and hybridized to the array, either alone or (preferably) in competition with a reference wild-type sequence, labeled with a different color. Trials of these designs have achieved over 90% mutation detection in blind trials of BRCA1 and ATM samples (Hacia, 1999). In general these designs detect homozygous base substitutions well, but miss a few heterozygous substitutions and have major problems with insertion mutations.

  • Minisequencing chips use arrayed oligonucleotide primers with free 3′-OH groups. Unlabeled PCR-amplified test DNA is hybridized to the array, and DNA polymerase plus four differently labeled dideoxynucleotides are added. The test DNA acts as template for addition of a single labeled dideoxynucleotide to each array primer. As in an ARMS reaction, addition will occur only if the 3′ end of the primer exactly matches the template. The array could be made with primers specific not only for the wild-type sequence but also for all possible mutations. This technology has been less extensively tested than hybridization array technology because the chemistry used to produce most large arrays to date anchors the 3′ end of the oligonucleotide to the support, leaving a free 5′ end.

The emerging technology of gene chips promises to revolutionize mutation scanning, as well as other branches of human molecular genetics, but the revolution has not yet happened. Mutation detection arrays face a more difficult problem than expression arrays - the anchored probes are shorter, so specific hybridization is more difficult to achieve, and the test DNA must be PCR-amplified, unlike expression arrays, which can use total cellular poly(A) RNA. The current state of the art has been well reviewed by Hacia (1999).

17.1.5. Diagnosis of Duchenne muscular dystrophy: approaches to detecting structural rearrangements or point mutations in a giant gene that has high rates of mutation and recombination

Table 17.6

The contrasting genetics of cystic fibrosis and Duchenne muscular dystrophy
Cystic fibrosisDuchenne muscular dystrophy
Autosomal recessiveX-linked recessive
Loss of function mutationsLoss of function mutations
Fairly large gene:Giant gene:
 250 kb genomic DNA 2400 kb genomic DNA
 27 exons 79 exons
 6.5 kb mRNA 14 kb mRNA
Almost all mutations are single nucleotide changes65% of mutations are deletions encompassing one or more complete exons
5% duplications
30% nonsense, splice site, etc. mutations
Missense mutations are very unusual
New mutations are extremely rareNew mutations are very frequent
Mosaicism is not a problemMosaicism is common
Little intragenic recombinationRecombination hotspot (12% between markers at either end of the gene)

Genetic testing in DMD and CF require different sets of approaches

As stated at the start of this chapter, we are using cystic fibrosis and Duchenne muscular dystrophy to illustrate many of the applications of genetic testing. Both involve large genes with extensive allelic heterogeneity, but beyond that, CF and DMD pose rather different sets of problems for DNA diagnosis (Table 17.6). Between them, they show many of the issues involved in testing for mendelian diseases.

A deletion screen reveals the majority of mutations in affected males

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f11.jpg.

Figure 17.11

.

   Multiplex deletion screen for dystrophin

(A) Products of multiplex PCR amplification of nine exons, using samples from 10 unrelated patients with Duchenne/Becker muscular dystrophy. PCR primers have been designed so that each exon, with some flanking intron sequence, gives a different sized PCR product. Courtesy of Dr R. Mountford, Liverpool Women's Hospital. (B) Interpretation: solid lines show exons definitely deleted, dotted lines show possible extent of deletion running into untested exons. No deletion is seen in samples 7 and 9 - these patients may have point mutations, or deletions of exons not examined in this test. Exon sizes and spacing are not to scale.

Although dystrophin mutations are extremely heterogeneous, 60–65% of all mutations are deletions of one or more exons (see Figure 16.3), and these preferentially affect certain exons. In affected males, two multiplex PCR reactions (that shown in Figure 17.11 and one testing exons in the 5′ part of the gene) will reveal 98% of all deletions. Most deletions remove more than one exon; deletions that appear to affect noncontiguous exons and deletions of just a single exon may need confirming (because of the risk of spurious results due to PCR failure).

Detecting deletions in females, for carrier testing, is much more difficult because the normal X chromosome masks any deletion present on the other X. Sequences deleted on one X will show a 50% reduction in dosage compared to nondeleted sequences, and this can be detected by quantitative PCR or quantitative Southern blotting. Very careful work is needed to get a result sufficiently unambiguous that a woman could prudently base reproductive decisions on it. A principal requirement for quantitative PCR is to restrict the number of cycles, so that the reaction does not reach a plateau; methods that use competitive reactions are also more likely to be reliably quantitative. Fluorescence sequencing machines, with their inherently quantitative mode of operation and their ability to measure small quantities of product, are probably the best tool for detecting carriers of deletions and duplications (Yau et al., 1996).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f12.jpg.

Figure 17.12

.

   A family with Duchenne muscular dystrophy. (A) pedigree; (B) results of typing with the intragenic marker STR45

The affected boy III-1 has a deletion which includes STR45 (lane 7 of the gel is blank). His mother II-2 and his aunt II-3 inherited no allele of STR45 from their mother I-2, showing that the deletion is being transmitted in the family. I-2 is apparently homozygous for this highly polymorphic marker (lane 2), but in fact is hemizygous. The other aunt II-4 and the sister III-2 are heterozygous for the marker, and therefore do not carry the deletion.

If a deletion is segregating in a family, typing females for microsatellites mapping within the deletion may reveal apparent nonmaternity, where a mother has transmitted no marker allele to her daughter because of the deletion (see Figure 17.12). In such families nonmaternity proves a woman is a carrier, while heterozygosity (in the daughter or sister of a carrier) proves a woman is not a carrier. Several markers suitable for this purpose have been identified in the introns at deletion hotspots. A final option for detecting deletion carriers is fluorescence in situ hybridization to metaphase chromosomes (Section 10.1.4), using a probe that does not hybridize to the deleted chromosome. Both the FISH and microsatellite methods work best in families where there is an affected male in whom the deletion can first be defined.

Mutation scanning in nondeletion families

If the multiplex screen fails to show a deletion, then defining the mutation is difficult. With 79 exons, screening the DNA exon by exon using SSCP or other standard methods is excessively laborious. Missense mutations very seldom cause DMD, so the best method is probably to use the protein truncation test, as illustrated in Figure 17.9. Because the dystrophin gene consists of so many scattered small exons (averaging only 180 bp), PTT testing requires RNA. Dystrophin is primarily expressed in muscle, but in skilled hands low-frequency ectopic (‘illegitimate’) transcripts can be amplified from lymphocytes.

Gene tracking in nondeletion families

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f13.jpg.

Figure 17.13

.

   Gene tracking in Duchenne muscular dystrophy using flanking markers

The family has been typed for two polymorphisms A and B that flank the dystrophin locus. III-2 can have inherited DMD only if she has one recombination between marker A and DMD and another between DMD and marker B. If the recombination fractions are θA and θB respectively, then the probability of a double recombinant is of the order θAθB, which typically will be well under 1%. III-1 has a recombination between marker locus A and DMD.

In the setting of a routine diagnostic service, if the multiplex deletion screen fails to show a deletion, or if there is no sample available from an affected male, the best answer may be gene tracking, as described below. DMD presents special problems for gene tracking because of the extremely high recombination frequency across the gene. Even intragenic markers show an average 5% recombination with the disease. Therefore it is prudent to use flanking markers for gene tracking, as in Figure 17.13.

New mutations and mosaicism in DMD

To complete the list of problems posed by DMD, there is a high frequency of new mutations. The mutation-selection equilibrium calculations in Box 3.5 show that for any lethal X-linked recessive condition (f = 0), 1/3 of cases are fresh mutations. Therefore the mother of an isolated DMD boy has only a 2/3 chance of being a carrier. This greatly complicates the risk calculations that are necessary for interpreting gene tracking results. The interested reader should consult the book by Bridge (Further reading) for example calculations. Moreover, as shown in Figure 3.7, the first mutation carrier in a DMD pedigree is very often a mosaic (male or female). This raises yet more problems, both for risk estimation and for interpretation of the results of direct testing.

17.2. Gene tracking

Gene tracking was historically the first type of DNA diagnostic method to be widely used. Most of the mendelian diseases that form the bread-and-butter work of diagnostic laboratories went through a phase of gene tracking, then moved on to direct tests once the genes were cloned. Huntington disease, cystic fibrosis and myotonic dystrophy are familiar examples. A similar progression is likely with any disease that is studied by the classic approach of linkage analysis followed by positional cloning. However, the progression is not inevitable. With some diseases, even though the gene has been cloned, mutations are hard to find. In some cases the known mutations are scattered widely over a large gene and, for others, mutation detection, for unknown reasons, has not so far been very successful. Thus gene tracking using linked markers still has its place in modern molecular diagnosis.

17.2.1. Gene tracking involves three logical steps

Box 17.1 illustrates the essential logic of gene tracking. This logic can be applied to diseases with any mode of inheritance. Always there is at least one parent who could have passed on the disease allele to the proband, and who may or may not have done so. The process always follows the three steps:

  1. distinguish the two chromosomes in the relevant parent(s) - i.e. find a closely linked marker for which they are heterozygous;

  2. determine phase - i.e. work out which chromosome carries the disease allele;

    work out which chromosome the consultand received.

The prerequisites for gene tracking are:

  1. the disease should be adequately mapped, so that markers can be used that are known to be tightly linked to the disease locus; and

    the pedigree structure and sample availability must allow determination of phase.

Nowadays, informativeness of the marker is not usually a big problem. With over 10 000 highly polymorphic microsatellites mapped across the human genome, it is almost always possible to find an informative marker that maps close to the disease locus.

Figure 17.14 shows gene tracking for an autosomal recessive disease. The pedigrees emphasize the need for both an appropriate pedigree structure (DNA must be available from the affected child) and informative marker types. Even if the affected child is dead, if the Guthrie card (Section 17.1.1) can be retrieved, sufficient DNA for PCR typing can usually be extracted from the dried blood spot.

17.2.2. Recombination sets a fundamental limit on the accuracy of gene tracking

Because the DNA marker used for gene tracking is not the sequence that causes the disease, there is always the possibility of making a wrong prediction if recombination separates the disease and the marker. The recombination fraction, and hence the error rate, can be estimated from family studies by standard linkage analysis (Chapter 11). With almost any disease there should be a good choice of markers showing less than 1% recombination with the disease locus. This follows from the observations that one nucleotide in 500 is polymorphic, and that loci 1 Mb apart show approximately 1% recombination (Section 11.1.4). Ideally one uses an intragenic marker, such as a microsatellite within an intron. The special problem of the DMD recombination hotspot has been mentioned above.

Recombination between marker and disease can never be ruled out, even for very tightly linked markers, but the error rate can be greatly reduced by using two marker loci, situated on opposite sides of the disease locus. With such flanking or bridging markers, a recombination between either marker and the disease will also produce a marker-marker recombinant, which can be detected (e.g. III-1, Figure 17.13). If a marker-marker recombinant is seen in the consultand, then no prediction can be made about inheritance of the disease, but at least a false prediction has been avoided. Provided no marker-marker recombinant is seen, the only residual risk is that of double recombinants. The true probability of a double recombinant is very low because of interference (Section 11.1.3). With a suitable choice of markers, the risk of an error due to unnoticed recombination is likely to be much smaller than the risk of a wrong prediction due to human error in obtaining and processing the DNA samples.

17.2.3. Calculating risks in gene tracking

Unlike direct testing, gene tracking always involves a calculation. Factors to be taken into account in assessing the final risk include:

  • the probability of disease-marker and marker-marker recombination;

  • uncertainty, due to imperfect pedigree structure or limited informativeness of the markers, about who transmitted what marker allele to whom (see Figure 11.4C for an example);

  • uncertainty as to whether somebody in the pedigree carries a newly mutant disease allele (see Figure 3.7 for an example of this problem in DMD).

Two alternative methods are available for performing the calculation.

Bayesian calculations

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f15.jpg.

Figure 17.15

.

   A Bayesian calculation of genetic risk

III-2 wishes to know her risk of being a carrier of DMD, which affected her brother III-1 and uncle II-1. Serum creatine kinase testing (an indicator of subclinical muscle damage common in DMD carriers ) gave carrier:noncarrier odds of 0.7:1. A DNA marker that shows on average 5% recombination with DMD gave the types shown. The risk calculation, following the guidelines in Box 17.2, gives her overall carrier risk as 3.6%.

Bayes' theorem provides a general method for combining probabilities into a final overall probability. The theory and procedure are shown in Box 17.2, and a sample calculation is set out in Figure 17.15. A very detailed set of calculations covering almost every conceivable situation in DNA diagnostics can be found in the book by Bridge (Further reading), which the interested reader should consult.

For simple pedigrees, Bayesian calculations give a quick answer, but for more complex pedigrees the calculations can get very elaborate. Few people feel fully confident of their ability to work through a complex pedigree correctly, although the attempt is a valuable mental exercise for teasing out the factors contributing to the final risk. An alternative is to use a linkage analysis program.

Using linkage programs for calculating genetic risks

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f16.jpg.

Figure 17.16

.

   Use of linkage analysis programs for calculating genetic risks

Given information on any two of these subjects, the program can calculate the third. For linkage analysis, the program is given (A) and (B), and calculates (C). For calculating genetic risks, the program is given (B) and (C), and calculates (A).

At first sight it may seem surprising that a program designed to calculate lod scores can also calculate genetic risks - but in fact the two are closely related (Figure 17.16). Linkage analysis programs are general-purpose engines for calculating the likelihood of a pedigree, given certain data and assumptions. For calculating the likelihood of linkage we calculate the ratio:

graphic element

For estimating the risk that a proband carries a disease gene, we calculate the ratio:

graphic element

As in Box 17.2, the vertical line | means ‘given’.

17.3. Population screening

Table 17.7

Requirements for a population screening program
RequirementExamples and comments
A positive result must lead to some useful action
  • Preventive treatment, e.g. special diet for PKU

  • Review and choice of reproductive options in CF carrier screening

The whole program must be socially and ethically acceptable
  • Subjects must opt in with informed consent

  • Screening without counseling is unacceptable

  • There must be no pressure to terminate affected pregnancies

  • Screening must not be seen as discriminatory

The test must have high sensitivity and specificity
  • Tests with many false negatives undermine confidence in the program

  • Tests with many false positives, even if these are subsequently filtered out by a definitive diagnostic test, can create unacceptably high levels of anxiety among normal people

The benefits of the program must outweigh its costs
  • It is unethical to use limited health care budgets in an inefficient way

Population screening follows naturally from the ability to test directly for the presence of a mutation. Traditionally a distinction is drawn between screening and diagnosis. A screening test defines a high-risk group, who are then given a definitive diagnostic test. DNA tests are rather different because there are no separate screening and diagnostic tests. However, proposals to introduce any population screening test still need to satisfy the same criteria (Table 17.7), regardless of the technology used.

17.3.1. Acceptable screening programs must fit certain criteria

What would screening achieve?

The most important single function of any screening program is to produce some useful outcome. It is quite unacceptable to tell people out of the blue that they are at risk of something unpleasant, unless the knowledge enables them to do something about the risk. Proposals to screen for genes conferring susceptibility to breast cancer or heart attacks must be assessed stringently against this criterion. Predictive testing for HD might appear to break this rule - but it is offered only to people who are at high risk of HD and who are suffering such agonies of uncertainty that they request a predictive test, and persist despite counseling in which all the disadvantages are pointed out.

Ideally the useful outcome is treatment, as in neonatal screening for phenylketonuria. Increased medical surveillance is a useful outcome only if it greatly improves the prognosis. A special case is screening for carrier status, where the outcome is the possibility of avoiding the birth of an affected child. People unwilling to accept prenatal diagnosis and termination of affected pregnancies would not see this as a useful outcome, and in general should not be screened.

An ethical framework for screening

Ethical issues in genetic population screening have been discussed by a committee of distinguished American geneticists, clinicians, lawyers and theologians, and the reader is referred to their report for a very detailed survey (Andrews et al., 1994). It is in the nature of ethical problems that they have no solutions, but certain principles emerge.

  • Any program must be voluntary, with subjects taking the positive decision to opt in.

  • Programs must respect the autonomy and privacy of the subject.

  • People who score positive on the test must not be pressured into any particular course of action. For example, in countries with insurance-based health care systems, it would be unacceptable for insurance companies to put pressure on carrier couples to accept prenatal diagnosis, or financial pressure or inducements to terminate affected pregnancies.

  • Information should be confidential. This may seem obvious, but it can be a difficult issue - we like to think that drivers of heavy trucks or jumbo jets have been tested for all possible risks. Societies with insurance-based health care systems have particular problems about the confidentiality of genetic data, since insurance companies will argue that they are penalizing low-risk people by not loading the premiums of high-risk people.

17.3.2. Specificity and sensitivity measure the technical performance of a screening test

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f17.jpg.

Figure 17.17

.

   Sensitivity and specificity of a screening test

Compared with the ethical problems, the technical questions in population screening are fairly simple. The performance of a test can be measured by its sensitivity and specificity (Figure 17.17).

Specificity of a test

Table 17.8

A test that performs well in the laboratory may be useless for population screening
Prevalence of conditionTrue positives in population screenedTrue positives detected by screeningTrue negatives in population screenedFalse positives detected by screeningRatio of true:total positives
1/10001000990999 00099900.09
1/10 00010099999 90099990.0098
1/100 0001010999 99010 0000.001

In a laboratory trial on a panel of 100 affected and 100 control people, this hypothetical test was 99% accurate: it gave a positive result for 99% of true positives, and a negative result for 99% of true negatives. The table shows results of screening 1 million people. The great majority of all people positive on the test are false positives. Such a test is unlikely to be acceptable socially or viable financially for any mendelian disease (these typically affect less than 1 person in 1000).

Unexpectedly, perhaps, false positive test results can pose a more serious problem than false negatives. If the specificity is low, then a positive test result does not mean much. Even if the false positives can be filtered out subsequently by a diagnostic test, many people will have been worried unnecessarily. Table 17.8 shows that if a test does have a significant false positive rate, then the specificity is hopelessly low except when testing for very common conditions. DNA tests are potentially valuable for population screening because, compared with biochemical tests, they should generate very few false positives. The most likely causes of false positives in DNA testing are laboratory or clerical errors.

Sensitivity of a test

A test must pick up a reasonable proportion of its intended target (i.e. the sensitivity must be high). While the specificity of DNA tests looks encouraging, the sensitivity usually depends on the degree of allelic heterogeneity. Unless a disease is unusually homogeneous (Table 17.1), it is not practicable to test for every conceivable mutation, especially in a large-throughput population screening program. Normally only a subset of mutations will be tested for. Figure 17.18 shows how the choice of mutations can affect the outcome of a CF carrier screening program.

It is clear that simply testing for the commonest mutation, F508del, would not produce an acceptable program. Almost as many affected children would be born to couples negative on the screening program as would be detected by the screening. Whether or not such a program were financially cost-effective, it would surely be socially unacceptable. What constitutes an acceptable program is harder to define. One suggestion focuses on ‘+/-’ couples (i.e. couples with one known carrier and the partner negative on all the tests). The partner still might be a carrier of a rare mutation. Professor LP Ten Kate suggested that an acceptable screening program is one in which the risk for such +/- couples is no higher than the general population risk before screening. That would require a sensitivity of about 95%.

17.3.3. Organization of a genetic screening program

Assuming the proposed program looks ethically acceptable and cost-effective, who should be screened? Three examples highlight some of the options.

Neonatal screening: screening for phenylketonuria

All babies in the UK are tested a few days after birth for PKU. A blood spot from a heel-prick is collected on a card (the Guthrie card) during a home visit and sent to a central laboratory. The phenylalanine level in the blood is measured by chromatography or a bacterial growth test. This is the screening test. Babies whose level is above a threshold are called in for a definitive diagnostic test. Only a small proportion eventually turn out to have PKU. The lack of informed consent on the part of the infant is justified by the benefit it receives from dietary treatment (Smith, 1993).

Prenatal screening: screening for β-thalassemia

Carriers of β-thalassemia can be detected by conventional hematological testing, either before marriage or in the antenatal clinic. Carrier-carrier couples can be offered prenatal diagnosis by DNA analysis. Pre-implantation diagnosis or fetal stem-cell transplants may become alternatives to termination of affected pregnancies. Two ethnic groups in the UK have a high incidence of β-thalassemia: Cypriots and Pakistanis. Screening was quickly accepted by Cypriots in the UK but uptake has been slower among Pakistanis. The comparison illustrates the complex social questions surrounding genetic screening and the relevance of cultural background (Gill and Modell, 1998). Importantly, long-term studies of the Cypriot community show how the success of screening can be measured, not by counts of affected fetuses aborted, but by counts of couples having normal families. Before screening was available, many carrier couples opted to have no children; now they are using screening and having normal families (Modell et al., 1984).

Population screening for carriers: proposals for cystic fibrosis

It is now technically feasible and financially worthwhile to screen northern European populations to detect CF carriers. Surveys in the UK suggest that most carrier-carrier couples would opt for prenatal diagnosis and would value the opportunity to ensure that they did not have affected children. This view might change if treatment becomes more effective, for example using gene therapy.

Table 17.9

Possible ways of organizing population screening for carriers of CF
Group testedAdvantagesDisadvantages
NeonatesEasily organizedNo consequences for 20 years
Many families would forget the result
Unethical to test children
School leaversEasily organizedDifficult to conduct ethically
Inform people before they start relationshipsRisk of stigmatization of carriers
Couples from physician's listsCouple is unit of riskDifficult to control quality of counseling
Stresses physician's role in preventitive medicine
Allows time for decisions
Women in antenatal clinicEasily organizedBombshell effect for carriers
Rapid resultsPartner may be unavailable
Time pressure on laboratory
Adult volunteers (‘drop-in CF center’)Few ethical problemsBad framework for counseling
No targeting to suitable users
May be inefficient use of resources
If a screening program is to be introduced, two sets of questions must be considered. How many mutations should the laboratory test for, and who should be offered the test? The problems raised by allelic heterogeneity have been discussed above (see Figure 17.18). On the question of who to screen, Table 17.9 shows some possibilities considered in the UK. Naturally the way health care delivery is organized in each country will determine the range of possibilities. Preliminary results from controlled pilot studies suggest that none of the methods has had the negative effects (increased anxiety) sometimes predicted.

17.4. DNA profiling can be used for identifying individuals and determining relationships

We use the term DNA profiling to refer to the general use of DNA tests to establish identity or relationships. DNA fingerprinting is reserved for the technique invented by Jeffreys et al. (1985) using multilocus probes. For more detail on this whole area, the reader should consult the book by Evett and Weir (Further reading).

17.4.1. A variety of different DNA polymorphisms have been used for profiling

DNA fingerprinting using minisatellite probes

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is permission.jpg.

Figure 17.19

.

   Legal and forensic use of DNA fingerprinting

(A) A paternity test. ‘Fingerprints’ are shown from the mother (M), child (C) and two possible fathers (F1, F2). The DNA fingerprint of F1 contains all the paternal bands found in the child, whereas that of F2 contains only one of the paternal bands. (B) A rape case. The ‘fingerprint’ of suspect 1 exactly matches that from the semen sample S on a vaginal swab from the victim. As a result of this evidence, Suspect 1 was charged with rape and found guilty. Photograph courtesy of Cellmark Diagnostics, Abingdon, Oxfordshire.

These probes contain the common core sequence of a hypervariable dispersed repetitive sequence GGGCAGGAXG, first discovered by Jeffreys et al. (1985) in the myoglobin gene (see Section 7.4.2). When hybridized to Southern blots they give an individual-specific fingerprint of bands (Figure 17.19). Their chief disadvantage is that it is not possible to tell which pairs of bands in a fingerprint represent alleles. Thus, when matching DNA fingerprints, one matches each band individually by position and intensity. Other hypervariable repeated sequences have been used in the same way, for example those detected by the synthetic oligonucleotide (CAC)5 (Krawczak and Schmidtke, 1998).

DNA profiling using single-locus minisatellite markers

Minisatellite probes recognize single-locus variable tandem repeats on Southern blots. Each probe should reveal two bands in any person's DNA, representing the two alleles. Profiling is based on four to ten different polymorphisms. These probes allow exact calculations of probabilities (of paternity, of the suspect not being the rapist, etc.), if the gene frequency of each allele in the population is known. For matching alleles between different gel tracks, the continuously variable distance along the gel has to be divided into a number of ‘bins’. Bands falling within the same bin are deemed to match. It is imperative that the criteria used for judging matches in each profiling test should be the same binning criteria that were used to calculate the population frequencies of each allele. The binning criteria can be arbitrary within certain limits, but they must be consistent. Minor variations within repeated units of some minisatellites potentially allow an almost infinite variety of alleles to be discriminated, so that the genotype at a single locus might suffice to identify an individual (Jeffreys et al., 1991).

DNA profiling using microsatellite markers

Microsatellite polymorphisms (Section 7.4.3) are based on short tandem repeats, usually di-, tri- or tetranucleotides. They have the advantages over minisatellites that they can be typed by PCR and that discrete alleles can be defined unambiguously by the precise repeat number. This avoids the binning problem and makes it easier to relate the results to population gene frequencies.

The use of Y-chromosome and mitochondrial polymorphisms

For tracing relationships to dead persons, Y-chromosome and mitochondrial DNA polymorphisms are especially useful because of their sex-specific pattern of transmission. An interesting example was the identification of the remains of the Russian Tsar and his family, killed by the Bolsheviks in 1917, by comparing DNA profiles of excavated remains with living distant relatives (Gill et al., 1994).

17.4.2. DNA profiling can be used to determine the zygosity of twins

In studying nonmendelian characters (Chapter 19), and sometimes in genetic counseling, it is important to know whether a pair of twins are monozygotic (MZ, identical) or dizygotic (DZ, fraternal). Traditional methods depended on an assessment of phenotypic resemblance or on the condition of the membranes at birth (twins contained within a single chorion are always MZ, though the converse is not true). Errors in zygosity determination systematically inflate heritability estimates for nonmendelian characters, because very similar DZ twins are wrongly counted as MZ, while very different MZ twins are wrongly scored as DZ.

Genetic markers provide a much more reliable test of zygosity. The extensive literature on using blood groups for this purpose is summarized by Race and Sanger (1975). DNA profiling is nowadays the method of choice. The Jeffreys fingerprinting probe allows a very simple test - samples from MZ twins look like the same sample loaded twice, and samples from DZ twins show some differences. An error rate could be calculated from empirical data on band sharing by unrelated people, using some defined binning strategy (see above).

When single-locus markers are used, if twins give the same types, then for each locus, the probability that DZ twins would type alike is calculated. If the parents have been typed, this follows from mendelian principles; otherwise the probability of DZ twins typing the same must be calculated for each possible parental mating and weighted by the probability of that mating calculated from population gene frequencies. The resultant probabilities for each (unlinked) locus are multiplied, to give an overall likelihood PI that DZ twins would give the same results with all the markers used. The probability that the twins are MZ is then:

graphic element

where m is the proportion of twins in the population who are MZ (about 0.4 for like-sex pairs). Sample calculations are given in Appendix 4 of Vogel and Motulsky (Further reading).

17.4.3. DNA profiling can be used to disprove or establish paternity

Excluding paternity is fairly simple - if the child has a marker allele not present in either the mother or alleged father then, barring new mutations, the alleged father is not the biological father. Proving paternity is, in principle, impossible - one can never prove that there is not another man in the world who could have given the child that particular set of marker alleles. All one can do is establish a probability of nonpaternity that is low enough to satisfy the courts and, if possible, the putative father.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch17f20.jpg.

Figure 17.20

.

   Using single-locus markers for a paternity test

The odds that the alleged father, rather than a random member of the population, is the true father are 1/2:q3, where q3 is the gene frequency of A3. A series of n unlinked markers would be used and, if paternity were not excluded, the odds would be (1/2)n : qA.qB.qC...qN.

DNA fingerprinting probes have been widely used for this purpose (Figure 17.19). Bands must be binned according to an arbitrary but consistent scheme, as explained above, to decide whether or not each nonmaternal band in the child fits a band in the alleged father. Then if, say, 10/10 bands fit, the odds that the suspect, rather than a random man from the population, is the father are 1:p10, where p is the chance that a random man from that population would have a band matching a given band in the child. Even for p = 0.2, p10 is only 10-7. Single-locus probes allow a more explicit calculation of the odds (Figure 17.20). A series of four to ten unlinked single-locus markers can give overwhelming odds favoring paternity if all the bands fit.

17.4.4. DNA profiling is a powerful tool for forensic investigations

DNA profiling for forensic purposes follows the same principles as paternity testing. Scene-of-crime material (bloodstains, hairs or a vaginal swab from a rape victim) are typed and matched to a DNA sample from the suspect. If the bands don't match, the suspect is excluded. One of the most powerful applications of DNA profiling is for preventing miscarriages of justice. If the bands do all match, the odds that the criminal is the suspect rather than a random member of the population can be calculated, based on the allele frequencies in the population. Of course, if the alternative were the suspect's brother, the odds would look very different. The fate of DNA evidence in courts provides a fascinating insight into the difference between scientific and legal cultures. There are at least three stumbling blocks for DNA data.

  • The jury may simply not believe, or perhaps choose to ignore, the DNA data, as evidently happened in the OJ Simpson trial. A fascinating account of the DNA evidence is given by Weir (1995).

  • The jury may be led into a false probability argument, the so-called Prosecutor's Fallacy. Suppose a suspect's DNA profile matches the scene-of-crime sample. The Prosecutor's Fallacy confuses two different probabilities:

    1. the probability the suspect is innocent, given the match;

    2. the probability of a match, given that the suspect is innocent.

    The jury should consider the first probability, not the second.

    Using Bayesian notation (Box 17.2), with M = match, G = suspect is guilty, I = suspect is innocent, we want to calculate PI | M, and not PM | I. If the suspect were guilty, the samples would necessarily match: PM | G = 1. Population genetic arguments might say there is a 1 in 106 chance that a randomly selected person would have the same profile as the crime sample: PM | I = 10-6. Suppose the guilty person could have been any one of 107 men in the local population. If there is no other evidence to implicate him, he is simply a random member of the population and the prior probability that he is guilty (before considering the DNA evidence) is PG = 10-7. The prior probability that he is innocent is PI = 1–10-7, ~ 1. Baye's theorem tells us that

    graphic element

    The prosecutor would no doubt be happy to see the jury use 106 instead of 0.9 for the probability that the suspect is innocent! Given the Bayesian argument, it is clear that a forensic test needs PM | I to be 10-10 or less if it is to be able to convict a suspect on DNA evidence alone.

  • Objections may be raised to some of the principles by which DNA-based probabilities are calculated.

    1. The multiplicative principle, that the overall probabilities can be obtained by multiplying the individual probability for each band or locus, depends on the assumption that bands are independent. If the population were actually stratified into reproductively isolated groups, each of whom tended to have a particular subset of bands or alleles, the calculation would be misleading. This is serious because it is the multiplicative principle that allows such exceedingly definite likelihoods to be given.

    2. For single-locus markers, the probability depends on the gene frequencies. DNA profiling laboratories maintain databases of gene frequencies - but were these determined in an appropriate ethnic group for the case being considered? Taken to extremes, this argument implies that the DNA evidence might identify the criminal as belonging to a particular ethnic group, but would not show which member of the group it was who committed the crime.

These issues have been debated at great length, especially in the American courts. Both objections are valid in principle, but the question is whether they make enough difference to matter. General opinion is that they do not. It would be ironic if courts, seeing opposing expert witnesses giving odds of correct identification differing a million-fold (105:1 versus 1011:1), were to decide that DNA evidence is hopelessly unreliable, and turn instead to eye-witness identification (odds of correct identification < 50:50).

Further reading
Bridge PJ (1994) The Calculation of Genetic Risks - Worked Examples in DNA Diagnostics. Johns Hopkins University Press, Baltimore, MD.
Clinical Molecular Genetics Society Best Practice Guidelines for Molecular Genetics Services. http://www.cmgs.org .
Cotton RGH, Edkins E, Forrest S (eds) (1998) Mutation Detection: a Practical Approach. IRL Press, Oxford.
Evett IW, Weir BS (1998) Interpreting DNA Evidence: Statistical Genetics for Forensic Scientists. Sinauer Associates, Inc., Sunderland, MA.
Newton CR, Graham A (1997) PCR, 2nd edn. BIOS Scientific Publishers, Oxford.
Vogel F, Motulsky AG (1996) Human Genetics, 3rd edn. Springer-Verlag, Berlin.
References
Andrews LB, Fullarton JE, Holtzman NA, Motulsky AG (1994) Assessing Genetic Risks - Implications for Health and Social Policy. National Academy Press, Washington, DC.
Dhanda R K, Smith R M, Scott C B, Eng C, Vijk J. A simple system for automated two-dimensional electrophoresis: applications to genetic testing. Genetic Testing. (1998); 2: 6770. [PubMed]
Gill P, Ivanov P L, Kimpton C. et al. Identification of the remains of the Romanov family by DNA analysis. Nature Genet. (1994); 6: 130135. [PubMed]
Gill P S, Modell B. Thalassaemia in Britain: a tale of two communities. Births are rising among British Asians but falling in Cypriots. Br. Med. J. (1998); 317: 761762. [PubMed]
Hacia J G. Resequencing and mutational analysis using oligonucleotide microarrays. Nat. Genet. (1999); 21 (1 Suppl.): 4247. [PubMed]
Jeffreys A J, Wilson V, Thein L S. Individual-specific fingerprints of human DNA. Nature. (1985); 314: 6773. [PubMed]
Jeffreys A J, MacLeod A, Tamaki K, Neil D L, Monckton D G. Minisatellite repeat coding as a digital approach to DNA typing. Nature. (1991); 354: 204209. [PubMed]
Keen J, Lester D, Inglehearn C, Curtis A, Bhattacharya S. Rapid detection of single base mismatches as heteroduplexes on Hydrolink gels. Trends Genet. (1991); 7: 5. [PubMed]
Krawczak M, Schmidtke J (1998) DNA Fingerprinting, 2nd edn. BIOS Scientific Publishers, Oxford.
Modell B, Petrou M, Ward R H. et al. Effect of fetal diagnostic testing on birth-rate of thalassaemia major in Britain. Lancet. (1984); ii: 13831386. [PubMed]
Nickerson D A, Kaiser R, Lappin S, Stewart J, Hood L. Automated DNA diagnostics using an ELISA-based oligonucleotide ligation assay. Proc. Natl Acad. Sci. USA. (1990); 87: 89238927. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Race RR, Sanger R (1975) Blood Groups in Man, 6th edn. Blackwell, Oxford.
Sarkar G, Yoon H S, Sommer S S. Dideoxy fingerprinting (ddF): a rapid and efficient screen for the presence of mutations. Genomics. (1992); 13: 441443. [PubMed]
Sheffield V C, Beck J S, Kwitek A E, Sandstrom D W, Stone E M. The sensitivity of single strand conformational polymorphism analysis for the detection of single base substitutions. Genomics. (1993); 16: 325332. [PubMed]
Smith I. MRC Working Party on Phenylketonuria. Phenylketonuria due to phenylalanine hydroxylase deficiency: an unfolding story. Br. Med. J. (1993); 306: 115119. [PubMed]
van der Luijt R, Khan P M, Vasen H, van Leeuwen C, Tops C, Roest P A. et al. Rapid detection of translation-terminating mutations at the adenomatous polyposis coli (APC) gene by direct protein truncation test. Genomics. (1994); 20: 14. [PubMed]
Weir B S. DNA statistics in the Simpson matter. Nature Genet. (1995); 11: 365368. [PubMed]
Yau S C, Bobrow M, Mathew C G, Abbs S. Accurate diagnosis of carriers of deletions and duplications in Duchenne/Becker muscular dystrophy by fluorescent dosage analysis. J. Med. Genet. (1996); 33: 550558. [PubMed]
Help ǀ Contact Bookshelf
Human Molecular Genetics 21999
(navigation arrows) Go to previous chapter Go to next chapter Go to top of this page Go to bottom of this page Go to Table of Contents