21.3.1. Gene targeting by homologous recombination in ES cells can be used to produce
mice with a mutation in a predetermined gene
Gene targeting involves engineering
a mutation in a preselected gene within an intact cell. It can
therefore be viewed as a form of artificial site-directed
in vivo
mutagenesis (as opposed to the various methods of site-directed
in vitro mutagenesis described in Section 6.4). The mutation may result in inactivation of
gene expression (a ‘knock-out’ mutation), or
altered gene expression, and so can be useful for studying gene function (see
below). In addition, the same method can be used to
‘correct’ a pathogenic mutation by restoring the normal
phenotype, and so has therapeutic potential (see Section 22.3).
Gene targeting typically involves introducing a mutation by homologous
recombination. A cloned gene (or gene segment) closely related in sequence to
endogenous target gene is transfected into the appropriate cells. In some of the
cells, homologous recombination occurs between the introduced gene and its
chromosomal homolog. Gene targeting by homologous recombination has been
achieved in some somatic mammalian cells, such as myoblasts. However, the most
important application involves mouse ES cells: once a mutation has been
engineered into a specific mouse gene within the ES cells, the modified ES cells
can then be injected into the blastocyst of a foster mother and eventually a
mouse can be produced with the mutation in the desired gene in all nucleated
cells (Capecchi, 1989; Melton, 1994).
Homologous recombination in mammalian cells is a very rare occurrence (unlike in
yeast cells, for example, where it occurs naturally at high frequencies,
enabling sophisticated genetic manipulation). The frequency of homologous
recombination is increased, however, when the degree of sequence homology
between the introduced DNA and the target gene is very high. As a result, the
introduced DNA clone is a mouse sequence which should
preferably be isogenic (derived from the same mouse strain as the strain of
mouse from which the ES cells were derived). Even then, the frequency of genuine
homologous recombination events is very low and may be difficult to identify
against a sizeable background of random integration events.
To assist identification of the desired homologous recombination events, the
targeting vector (often a plasmid vector) contains a marker gene, such as the
neo gene (see Box
10.1), which permits selection for cells that have taken up the
introduced DNA. PCR assays are used to screen for evidence of a homologous
recombination event (by using a marker-specific primer plus a primer derived
from a sequence present in the target gene but absent from the introduced
homologous gene segment). The targeting construct is transferred into cultured
mouse ES cells by electroporation, a method in which pulses of high voltage are
delivered to cells, causing temporary relaxation of the selective permeability
properties of the plasma membranes. Two basic approaches have been used:
-
Figure 21.4
.
Gene targeting by homologous recombination can inactivate a
predetermined chromosomal gene within an intact cell
(A) Insertion vector method. The introduced vector DNA
(blue) is cut at a unique site within a sequence which is identical or
closely related to part of a chromosomal gene (black). Homologous
recombination (X) can occur, leading to integration of the entire vector
sequence including the marker gene (M). Note that the
letters do not represent exons but are simply meant to indicate linear
order within the gene. (B) Replacement vector method. In
this case, the marker gene is contained within the sequence homologous
to the endogenous gene, and the vector is cut at a unique location
outside the homologous sequence. A double recombination or gene
conversion event (X X) can result in replacement of internal sequences
within the chromosomal gene by homologous sequences from the vector,
including the marker gene.
Insertion vectors target the locus of interest by a single
reciprocal recombination, causing insertion of the entire introduced DNA
including the vector sequence (). This is the most reliable way of causing a knock-out
mutation. -
Replacement vectors are designed to replace some of the
sequence in the chromosomal gene by a homologous sequence from the
introduced DNA ().
This can occur as a result of a double reciprocal recombination or by
gene conversion. The replacement method can inactivate a gene when the
introduced sequence contains one or more premature termination codons or
lacks critical coding sequences. It can also be used to correct a
pathogenic mutation.
Figure 21.5
.
Double replacement gene targeting can be used to introduce subtle
mutations
Both the methods in
result in introduction of a substantial amount of exogenous sequence
within the endogenous gene. To introduce a subtle mutation without
leaving residual exogenous sequence, a double replacement method with
positive and negative selection can be used (
Melton, 1994). Exons in the endogenous gene are
represented as numbered large boxes, and introns as long thin boxes. In
order to introduce a subtle mutation, such as a single nucleotide
substitution in
exon 8, a replacement knock-out
vector is used with a
marker gene (e.g. the
HPRT gene) flanked by homologous
sequences from introns 7 and 8, and a second
marker, such as the herpes
simplex thymidine kinase (
TK) gene outside the
homologous region.
Gene conversion, or double crossover within the
flanking
intron sequences, can lead to replacement of
exon 8 by the
HPRT gene, and can be selected for if a mutant
HPRT- ES cell is used. A
positive-negative selection system can be used: selection in the first
step is for
HPRT+
TK- cells. Cells containing random
vector
integrations will contain the
TK gene and can be killed
with the thymidine analog
gancyclovir (see
Figure 22.13). The second
replacement involves introducing an altered
exon 8 with a
point mutation
(*) to replace the
HPRT gene and can be screened by
identifying
HPRT- cells.
Note that mice engineered in this way cannot be
described as transgenic because of the lack of foreign sequences in the
germline.
The replacement
vector approach, as well as the insertion
vector method, often
leaves foreign sequences at the target
locus. In some cases, however, a more
subtle mutation is required. For example, it may be desirable to investigate the
effect of changing a single
codon. Various two-step recombination techniques can
be used to accomplish this method, and the resulting mouse, although genetically
modified lacks any foreign sequences and so can no longer be described as
transgenic (see and
Melton, 1994).
Gene targeting in mice is popularly used for producing artificial mouse models of
human disease (Section 21.4). In
addition, it provides a powerful general method of studying gene function. The
gene in question is selectively inactivated, producing a
‘knock-out’ mouse, and the effect of the
mutation on the development of the mouse is monitored carefully. Sometimes there
is little or no phenotypic consequence after inactivating a gene that would be
expected to be crucially important, such as some genes which encode a
transcription factor known to be expressed in early embryonic development. The
lack of a phenotype in such cases is often thought to be due to genetic redundancy (another gene is
able to carry out the function of the gene that has been knocked out). As a
result, in some cases double or even triple gene knock-outs have been carried
out to analyze gene function, as in the case of some of the Hox
genes (see, for example, Manley and Capecchi,
1997).
Figure 21.6
.
The knock-in method replaces the activity of one chromosomal gene by
that of an introduced gene
The En-1 gene shown at top has two exons and coding
sequences are shown by filled boxes. Its promoter (P) and
polyadenylation site (pA) are also shown. The gene targeting vector
(‘knock-in vector’) contains cloned
En-1 gene sequences comprising an upstream sequence
a which contains the En-1
promoter, and an internal segment b which spans the
3′ end of exon 1, the single intron and the 5′
coding sequence of exon 2. Separating these two sequences is the coding
sequence of the En-2 gene. Two marker cassettes include
a thymidine kinase (TK) gene and a neomycin-resistance gene (Neo) both
driven by a phosphoglycerate kinase promoter (chosen because
phosphoglycerate kinase is expressed in ES cells). The Neo gene is
flanked by loxP sequences. The targeting procedure results in
replacement of endogenous sequences a and
b but the 5′ coding sequence of the
En-1 gene is deleted. The knocked-in
En-2 gene comes under the control of the
En-1 promoter (Hanks et al., 1995). Note
that the term ‘knock-in’ has also been applied to
any procedure where an endogenous gene is inactivated by insertion of a
new gene which is then expressed, even if the latter is only meant to
serve as a reporter gene such as lacZ.
A useful example of investigating functional redundancy concerns studies of the
mouse
Engrailed genes,
En-1 and
En-2. Both of these genes are homeobox genes which had been
considered to play crucial roles in brain formation.
En-1
knock-outs have serious abnormalities but surprisingly
En-2
knock-outs have only minor problems. Expression of the
En-1
gene is switched on 8–10 hours before that of the
En-2 product, suggesting that perhaps the
En-1 product can compensate for the lack of
En-2 product in
En-2 knock-outs. To test
for the possibility of functional redundancy,
Hanks et al. 1995 used a variant of the knock-out
procedure known as the
‘knock-in’ technique. In
this case the transgene used to knock-out the target endogenous gene is itself
designed to be expressed under the control of the
cis-acting
elements of the knocked-out gene. A transgene containing an
En-2 coding sequence was used to knock-out the endogenous
En-1 gene. In so doing, the introduced
En-2 sequence came under control of the
En-1 regulatory sequences and was expressed before the
endogenous
En-2 gene was switched on (see ). The resulting
En-1
knock-out mouse had a normal
phenotype, demonstrating that the knocked-in
En-2 gene was functionally equivalent to
En-1 (
Hanks et
al., 1995).
21.3.2. Site-specific recombination systems, notably the Cre-loxP system, extend the
power of gene targeting
Figure 21.7
.
Structure of the loxP recognition sequence
Note that the central 8 bp sequence which is flanked by
the 13 bp inverted repeats is asymmetric and confers orientation.
Several site-specific recombination systems from bacteriophages and yeasts have
been characterized and are promising tools for
genome engineering (
Kilby et al., 1993).
Thus far, the Cre-
loxP recombination system from bacteriophage
P1 has been the most widely used. The natural function of the
Cre
(
causes
recombination)
recombinase is to mediate recombination between two
loxP
sequences that are in the same orientation (the
loxP sequence
consists of 34 bp and comprises two inverted 13 bp repeats separated by a
central asymmetric 8 bp spacer; ). As a result of recombination, the intervening sequence between
the two
loxP sites is excised (see
Figure 4.15). Using
gene targeting,
loxP
sequences can be stitched into a desired gene or chromosomal location, and the
subsequent provision of a gene encoding the Cre product can result in an
artificially generated site-directed recombination event (see
Chambers, 1994). As described below,
several applications can be envisaged (
Lobe
and Nagy, 1998).
Tissue- and cell type-specific knock-outs
Some genes are vital to early development and simple knock-out experiments
are generally not helpful because death ensues at the early embryonic stage.
To overcome this problem, methods have been developed to inactivate
expression of the target gene in only selected, predetermined cells of the
animal (conditional knock-out).
The animal can therefore survive and the effect of the knock-out can be
studied in a tissue or cell type of interest.
Figure 21.8
.
Gene targeting using the Cre-loxP recombination
system can be used to inactivate a gene in a desired cell
type
(A) Illustration of a standard homologous recombination
method using mouse ES cells, in which three loxP
sites are introduced along with a marker M at a target locus A
(typically a small gene or an internal exon which if deleted would
cause a frameshift mutation). Subsequent transfection of a
Cre recombinase gene and transient expression
of this gene results in recombination between the introduced
loxP sites to give different products. Type I
recombinants are used to generate mice in which the target locus is
flanked by loxP sites. Such mice can be mated with
previously constructed transgenic mice (B) which carry
an integrated construct consisting of the Cre
recombinase gene linked to a tissue-specific promoter. Offspring
which contain both the loxP-flanked target locus
plus the Cre gene will express the
Cre gene in the desired tissue type, and the
resulting recombination between the loxP sites in
these cells results in tissue-specific inactivation of the target
locus A.
An early example of a
conditional knock-out sought to study the role of DNA
polymerase β (an enzyme that is essential for embryonic
development) in T lymphocytes (
Gu
et al., 1994). The
gene targeting procedure
replaced part of the endogenous gene by an introduced homologous gene
segment flanked by
loxP sequences. Mice carrying this
targeted mutation were then mated with a strain of mice which carried a
Cre transgene gene under the control of a T
cell-specific
promoter. Offspring with the
loxP-flanked pol
β sequences plus the
Cre transgene were identified
and survived to adulthood. The Cre product was expressed only in T cells,
leading to inactivation of the target gene in these cells by excision of DNA
polymerase β gene segment between the two
loxP
sequences (see for the
general method).
Tissue- and cell type-specific gene activation
This approach is the opposite to that described above: it involves selective
activation of a gene in certain cells of the animal to
switch on a foreign gene only in predetermined cells of the animal (Barinaga, 1994).
Chromosome engineering
Figure 21.9
.
Chromosome engineering can be accomplished using
cre-loxP systems
(A) Use of targeted insertion of loxP
sites to facilitate a chromosomal translocation. See Smith et al.
1995 for a practical example. (B) Use of
targeted insertion of loxP sites to permit modeling
of a microdeletion by intrachromosomal recombination. See Ramirez-Solis et
al. 1995 for a practical example, and for
other examples of intrachromosomal recombinations.
Another important recent development is a strategy for chromosome engineering
in
ES cells which relies on sequential
gene targeting and
Cre-
loxP recombination.
Gene targeting is used to
integrate
loxP sites at the desired chromosomal locations
and, subsequently, transient expression of Cre recombinase is used to
mediate a selected chromosomal rearrangement (
Ramirez-Solis et al., 1995;
Smith et al., 1995;
). Chromosome
engineering strategies of this type offer the exciting possibility of
creating novel mouse lines with specific chromosomal abnormalities for
genetic studies.
The multiple targeting and selection steps in ES cells used in the above
chromosome engineering methods can be avoided using the novel approach of
Herault et al.
1998. This targeted meiotic recombination method takes advantage
of the homologous chromosome pairing that occurs naturally during meiosis at
the first cell division. A transgene is designed to express Cre recombinase
under the control of a Sycp1 promoter (the
Sycp1 gene encodes the SCYP1 protein which is part of
the synaptonemal complex that facilitates crossing over). As a result Cre
recombinase is produced in male spermatocytes during the zygotene to
pachytene stages when chromosome pairing occurs.
21.3.3. Gene trapping in mouse ES cells allows an efficient approach to functional
analysis
Large-scale approaches to investigating animal gene function have principally
relied on exposing animals to high doses of radiation, or to potent chemical
mutagens such as ethylnitrosurea (ENU) or
ethyl methylsulfonate (EMS). In mouse
mutagenesis programs, for example, males are typically exposed to high levels of
a suitable mutagen to induce a high frequency of mutation in sperm DNA. The
progeny of irradiated mice are then screened for obvious phenotypic
abnormalities. Such mutagenesis screens have been very useful in producing novel
mutants, but a major problem is that the mutations occur essentially at random.
Identification of the structural change(s) in the DNA of a single mutant animal
may therefore require a laborious positional cloning approach.
A mutation that is induced by inserting a known foreign DNA sequence (transgene)
into the mouse genome has a major advantage over one induced by chemical
mutagens or X rays: it leaves a sequence tag at the locus which is
mutated. As a result, rapid molecular characterization of the
mutated locus is possible. Mouse ES cells provide a way of introducing such
mutations into the germ line, and so random insertional mutagenesis in ES cells
using transgenes was considered a useful way of producing a large number of
mouse mutants which could quickly be characterized at the molecular level.
Unlike gene targeting, the transgene should not be related to endogenous
sequences, but should integrate essentially at random by nonhomologous recombination. However,
because large sections of the mouse genome are noncoding, many random insertion
events may not result in gene disruption.
In order to improve the efficiency of recovering mutations that are likely to
have a phenotypic effect (by altering a gene or its expression) the gene trap approach was devised (see
Evans et al.,
1997). The underlying principle is that the transgene which is inserted
into ES cells contains a defective reporter gene or marker gene which
lacks some component needed for gene expression. The reporter or
marker gene is designed to be expressed only after it inserts into a gene
(within an intron or exon) or at a promoter. When it integrates at such
positions it can acquire the expression element that it lacks.
Figure 21.10
.
Gene trapping uses an expression-defective transgene to select for
chromosomal integration events that occur within or close to a
gene
A specimen host cell gene with 4 exons (E1–E4) is shown at top
together with its promoter (P) and polyadenylation signal (pA). Two
possibilities for a gene trap vector are shown. Transgene 1 includes a
reporter gene which lacks a promoter. It has two exonic sequences
a and b, a polyadenylation signal
(pA) and an upstream sequence which contains a splice acceptor
sequence (SA). In this example, transgene 1
integrates into intron 1 of the host cell gene. During transcription
from the endogenous promoter the splice acceptor sequence will help the
transgene exons to be spliced to the sequence from the first exon of the
endogenous gene, producing a fusion transcript. Selection is based on
this transcript having a functional reporter activity (see Evans et al.,
1997). Transgene 2 has a marker gene (drug-resistance, e.g.
puromycin N-acetyl transferase) coupled to a promoter
which works in ES cells (usually a phosphoglycerate kinase promoter) and
a downstream splice donor sequence, but lacks a
polyadenylation signal. Again, integration is intended to permit
expression but this time from the transgene's promoter and the splice
donor sequence helps the RNA transcript to be spliced to downstream host
exons. The location of the integrated transgene can be identified by
RACE-PCR (see Figure 20.1). See
Zambrowicz et al.
1998 for a detailed example.
Different
gene trap strategies are possible. In some cases, the reporter lacks a
functional
promoter and so relies on chance integration next to an appropriate
cis-acting sequence element that can activate its
transcription. Other approaches have used a
marker gene coupled to a suitable
promoter but lacking a
downstream polyadenylation signal. Here, the
marker gene
is designed to be expressed after integrating into a host cell gene such that a
fusion RNA product is made that utilizes 3′ host sequences in order to
acquire a poly (A) tail (see ). Already, this type of approach has been applied on a large
scale, with a recent report describing disruption and sequence identification of
2000 genes in mouse embryonic stem cells (
Zambrowicz et al., 1998).