NCBI » Bookshelf » Molecular Cell Biology » Nucleic Acids, the Genetic Code, and the Synthesis of Macromolecules » 4.4 The Three Roles of RNA in Protein Synthesis
 
mcb
Molecular Cell Biology
4th
Harvey Lodish,1 Arnold Berk,2 Lawrence Zipursky,2 Paul Matsudaira,3 David Baltimore,4 and James Darnell5
1Whitehead Institute for Biomedical Research and Massachusetts Institute of Technology
2Molecular Biology Institute, University of California, Los Angeles
3Howard Hughes Medical Institute, School of Medicine, University of California, Los Angeles
4California Institute of Technology (Caltech)
5Rockefeller University, New York
W. H. Freeman0-7167-3136-32000
cell biologymolecular biology

 4:  4.4 The Three Roles of RNA in Protein Synthesis

Although DNA stores the information for protein synthesis and RNA carries out the instructions encoded in DNA, most biological activities are carried out by proteins. The accurate synthesis of proteins thus is critical to the proper functioning of cells and organisms. We saw in Chapter 3 that the linear order of amino acids in each protein determines its three-dimensional structure and activity. For this reason, assembly of amino acids in their correct order, as encoded in DNA, is the key to production of functional proteins.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f20.jpg.

Figure 4-20

.

   The three roles of RNA in protein synthesis

Messenger RNA (mRNA) is translated into protein by the joint action of transfer RNA (tRNA) and the ribosome, which is composed of numerous proteins and two major ribosomal RNA (rRNA) molecules. [Adapted from A. J. F. Griffiths et al., 1993, An Introduction to Genetics Analysis, 5th ed., W. H. Freeman.]

Three kinds of RNA molecules perform different but cooperative functions in protein synthesis (Figure 4-20):

Messenger RNA Carries Information from DNA in a Three-Letter Genetic Code

RNA contains ribonucleotides of adenine, cytidine, guanine, and uracil; DNA contains deoxyribonucleotides of adenine, cytidine, guanine, and thymine. Because 4 nucleotides, taken individually, could represent only 4 of the 20 possible amino acids in coding the linear arrangement in proteins, a group of nucleotides is required to represent each amino acid. The code employed must be capable of specifying at least 20 words (i.e., amino acids).

Table 4-2

The Genetic Code (RNA to Amino Acids)*
An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4t2.jpg.
If two nucleotides were used to code for one amino acid, then only 16 (or 42) different code words could be formed, which would be an insufficient number. However, if a group of three nucleotides is used for each code word, then 64 (or 43) code words can be formed. Any code using groups of three or more nucleotides will have more than enough units to encode 20 amino acids. Many such coding systems are mathematically possible. However, the actual genetic code used by cells is a triplet code, with every three nucleotides being “read” from a specified starting point in the mRNA. Each triplet is called a codon. Of the 64 possible codons in the genetic code, 61 specify individual amino acids and three are stop codons. Table 4-2 shows that most amino acids are encoded by more than one codon. Only two — methionine and tryptophan — have a single codon; at the other extreme, leucine, serine, and arginine are each specified by six different codons. The different codons for a given amino acid are said to be synonymous. The code itself is termed degenerate, which means that it contains redundancies.

Synthesis of all protein chains in prokaryotic and eukaryotic cells begins with the amino acid methionine. In most mRNAs, the start (initiator) codon specifying this aminoterminal methionine is AUG. In a few bacterial mRNAs, GUG is used as the initiator codon, and CUG occasionally is used as an initiator codon for methionine in eukaryotes. The three codons UAA, UGA, and UAG do not specify amino acids but constitute stop (terminator) signals that mark the carboxyl terminus of protein chains in almost all cells. The sequence of codons that runs from a specific start site to a terminating codon is called a reading frame. This precise linear array of ribonucleotides in groups of three in mRNA specifies the precise linear sequence of amino acids in a protein and also signals where synthesis of the protein chain starts and stops.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f21.jpg.

Figure 4-21

.

   Example of how the genetic code — an overlapping, commaless triplet code — can be read in two different frames

If translation of the mRNA sequence shown begins at two different upstream start sites (not shown), then two overlapping reading frames are possible; in this case, the codons are shifted one base to the right in the lower frame. As a result, different amino acids are encoded by the same nucleotide sequence. Many instances of such overlaps have been discovered in viral and cellular genes of prokaryotes and eukaryotes. It is theoretically possible for the mRNA to have a third reading frame.

Because the genetic code is a commaless, overlapping triplet code, a particular mRNA theoretically could be translated in three different reading frames. Indeed some mRNAs have been shown to contain overlapping information that can be translated in different reading frames, yielding different polypeptides (Figure 4-21). The vast majority of mRNAs, however, can be read in only one frame because stop codons encountered in the other two possible reading frames terminate translation before a functional protein is produced. Another unusual coding arrangement occurs be- cause of frameshifting. In this case the protein-synthesizing machinery may read four nucleotides as one amino acid and then continue reading triplets, or it may back up one base and read all succeeding triplets in the new frame until termination of the chain occurs. These frameshifts are not common events, but a few dozen such instances are known.

Table 4-3

Unusual Codon Usage in Nuclear and Mitochondrial Genes
UniversalUnusual
CodonCodeCodeOccurrence*
UGAStopTrpMycoplasma, Spiroplasma, mitochondria of many species
CUGLeuThrMitochondria in yeasts
UAA, UAGStopGlnAcetabularia, Tetrahymena, Paramecium, etc.
UGAStopCysEuplotes
*

“Unusual code” is used in nuclear genes of the listed organisms and in mitochondrial genes as indicated.

SOURCE: S. Osawa et al., 1992, Microbiol. Rev. 56:229.

The meaning of each codon is the same in most known organisms — a strong argument that life on earth evolved only once. Recently the genetic code has been found to differ for a few codons in many mitochondria, in ciliated protozoans, and in Acetabularia, a single-celled plant. As shown in Table 4-3, most of these changes involve reading of normal stop codons as amino acids, not an exchange of one amino acid for another. It is now thought that these exceptions to the general code are later evolutionary developments; that is, at no single time was the code immutably fixed, although massive changes were not tolerated once a general code began to function early in evolution.

Experiments with Synthetic mRNAs and Trinucleotides Broke the Genetic Code

Having described the genetic code, we briefly recount how it was deciphered — one of the great triumphs of modern biochemistry. The underlying experimental work was carried out largely with cell-free bacterial extracts containing all the necessary components for protein synthesis except mRNA (i.e., tRNAs, ribosomes, amino acids, and the energy-rich nucleotides ATP and GTP).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f22.jpg.

Figure 4-22

.

   Assigning codons using synthetic mRNAs containing a single ribonucleotide

Addition of such a synthetic mRNA to a bacterial extract that contained all the components necessary for protein synthesis except mRNA resulted in synthesis of polypeptides composed of a single type of amino acid as indicated. [See M. W. Nirenberg and J. H. Matthei, 1961, Proc. Nat’l. Acad. Sci. USA 47:1588.]

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f23a.jpg.
An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f23b.jpg.

Figure 4-23

.

   Assigning codons using mixed polynucleotides

(a) When a synthetic mRNA with alternating A and C residues was added to a protein-synthesizing bacterial extract, the resulting polypeptide contained alternating threonine and histidine residues. This finding is compatible with the two alternative codon assignments shown. (Note that alternating residues yield the same sequence of triplets regardless of which reading frame is chosen.) (b) To determine which codon assignment shown in (a) is correct, a second mRNA consisting of AAC repeats was tested. This mRNA, which can be read in three frames, yielded the three types of polypeptides shown. Since only the ACA codon was common to both experiments, it must encode threonine; thus CAC must encode histidine in (a). The assignments AAC = asparagine (Asn) and CAA = glutamine (Gln) were derived from additional experiments. [See H. G. Korana, 1968, reprinted in Nobel Lectures: Physiology or Medicine (19631970), Elsevier (1973), p. 341.]

Initially, researchers added synthetic mRNAs containing a single type of nucleotide to such extracts and then determined the amino acid incorporated into the polypeptide that was formed. In the first successful experiment, synthetic mRNA composed only of U residues [poly(U)] yielded polypeptides made up only of phenylalanine. Thus it was concluded that a codon for phenylalanine consisted entirely of U’s. Likewise, experiments with poly(C) and poly(A) showed that a codon for proline contained only C’s and a codon for lysine only A’s (Figure 4-22). [Poly(G) did not work in this type of experiment because it assumes an unusable stacked structure that is not translated well.] Next, synthetic mRNAs composed of alternating bases were used. The results of these experiments not only revealed more codons but also demonstrated that codons are three bases long. The example of this approach illustrated in Figure 4-23 led to identification of ACA as the codon for threonine and CAC for histidine. Similar experiments with many such mixed polynucleotides revealed a substantial part of the genetic code.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f24.jpg.

Figure 4-24

.

   Breaking the entire genetic code by use of chemically synthesized trinucleotides

Marshall Nirenberg and his collaborators prepared 20 ribosome-free bacterial extracts containing all possible aminoacyl-tRNAs (tRNAs with an amino acid attached). In each sample, a different amino acid was radioactively labeled (green); the other 19 amino acids were bound to tRNAs but were unlabeled. Aminoacyl-tRNAs and trinucleotides passed through a nitrocellulose filter (left), but ribosomes were retained by the filter (center) and would bind trinucleotides and their cognate tRNAs (right). Each possible trinucleotide was tested separately for its ability to attract a specific tRNA by adding it with ribosomes to samples from each of the 20 aminoacyl-tRNA mixtures. The sample was then filtered. If the added trinucleotide caused the radiolabeled aminoacyl-tRNA to bind to the ribosome, then radioactivity would be detected on the filter (a positive test); otherwise, the label would pass through the filter (a negative test). By synthesizing and testing all possible trinucleotides, the researchers were able to match all 20 amino acids with one or more codons (e.g., phenylalanine with UUU as shown here). [See M. W. Nirenberg and P. Leder, 1964, Science 145:1399.]

The entire genetic code was finally worked out by a second type of experiment conducted by Marshall Nirenberg and his collaborators. In this approach, all the possible trinucleotides were tested for their ability to attract tRNAs attached to the 20 different amino acids found in natural proteins (Figure 4-24). In all, 61 of the 64 possible trinucleotides were found to code for a specific amino acid; the trinucleotides UAA, UGA, and UAG did not encode amino acids.

Although synthetic mRNAs were useful in deciphering the genetic code, in vitro protein synthesis from these mRNAs is very inefficient and yields polypeptides of variable size. Successful in vitro synthesis of a naturally occurring protein was achieved first when mRNA from bacteriophage F2 (a virus) was added to bacterial extracts, leading to formation of the coat, or capsid, protein (the “packaging” protein that covers the virus particle). Studies with such natural mRNAs established that AUG encodes methionine at the start of almost all proteins and is required for efficient initiation of protein synthesis, while the three trinucleotides (UAA, UGA, and UAG) that do not encode any amino acid act as stop codons, necessary for precise termination of synthesis.

The Folded Structure of tRNA Promotes Its Decoding Functions

The next step in understanding the flow of genetic information from DNA to protein was to determine how the nucleotide sequence of mRNA is converted into the amino acid sequence of protein. This decoding process requires two types of adapter molecules: tRNAs and enzymes called aminoacyl-tRNA synthetases. First we describe the role of tRNAs in decoding mRNA codons, and then examine how synthetases recognize tRNAs.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f25.jpg.

Figure 4-25

.

   Translation of nucleic acid sequences in mRNA into amino acid sequences in proteins requires a two-step decoding process

First, an aminoacyl-tRNA synthetase couples a specific amino acid to its corresponding tRNA. Second,a three-base sequence in the tRNA (the anticodon) base-pairs with a codon in the mRNA specifying the attached amino acid. If an error occurs in either step, the wrong amino acid may be incorporated into a polypeptide chain.

All tRNAs have two functions: to be chemically linked to a particular amino acid and to base-pair with a codon in mRNA so that the amino acid can be added to a growing peptide chain. Each tRNA molecule is recognized by one and only one of the 20 aminoacyl-tRNA synthetases. Likewise, each of these enzymes links one and only one of the 20 amino acids to a particular tRNA, forming an aminoacyl-tRNA. Once its correct amino acid is attached, a tRNA then recognizes a codon in mRNA, thereby delivering its amino acid to the growing polypeptide (Figure 4-25).

As studies on tRNA proceeded, 30 – 40 different tRNAs were identified in bacterial cells and as many as 50 – 100 in animal and plant cells. Thus the number of tRNAs in most cells is more than the number of amino acids found in proteins (20) and also differs from the number of codons in the genetic code (61). Consequently, many amino acids have more than one tRNA to which they can attach (explaining how there can be more tRNAs than amino acids); in addition, many tRNAs can attach to more than one codon (explaining how there can be more codons than tRNAs). As noted previously, most amino acids are encoded by more than one codon, requiring some tRNAs to recognize more than one codon.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f26a.jpg.
An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is permission.jpg.

Figure 4-26

.

   Structure of tRNAs

(a) The primary structure of yeast alanine tRNA (tRNAAla), the first such sequence determined. This molecule is synthesized from the nucleotides A, C, G, and U, but some of the nucleotides, shown in red, are modified after synthesis: D = dihydrouridine, I = inosine, T = thymine, Ψ = pseudouridine, and m = methyl group. Although the exact sequence varies among tRNAs, they all fold into four base-paired stems and three loops. The partially unfolded molecule is commonly depicted as a cloverleaf. Dihydrouridine is nearly always present in the D loop; likewise, thymidylate, pseudouridylate, cytidylate, and guanylate are almost always present in the TΨCG loop. The triplet at the tip of the anticodon loop base-pairs with the corresponding codon in mRNA. Attachment of an amino acid to the acceptor arm yields an aminoacyl-tRNA. (b) Computergenerated three-dimensional model of the generalized backbone of all tRNAs. Note the L shape of the molecule. [Part (a) see R. W. Holly et al., 1965, Science 147:1462; part (b) from J. G. Arnez and D. Moras, 1997, Trends Biochem. Sci. 22:211.]

The function of tRNA molecules, which are 70 – 80 nucleotides long, depends on their precise three-dimensional structures. In solution, all tRNA molecules fold into a similar stem-loop arrangement that resembles a cloverleaf when drawn in two dimensions (Figure 4-26a). The four stems are short double helices stabilized by Watson-Crick base pairing; three of the four stems have loops containing seven or eight bases at their ends, while the remaining, unlooped stem contains the free 3′ and 5′ ends of the chain. Three nucleotides termed the anticodon, located at the center of one loop, can form base pairs with the three complementary nucleotides forming a codon in mRNA. As discussed later, specific aminoacyl-tRNA synthetases recognize the surface structure of each tRNA for a specific amino acid and covalently attach the proper amino acid to the unlooped amino acid acceptor stem. The 3′ end of all tRNAs has the sequence CCA, which in most cases is added after synthesis and processing of the tRNA are complete. Viewed in three dimensions, the folded tRNA molecule has an L shape with the anticodon loop and acceptor stem forming the ends of the two arms (Figure 4-26b).

Besides addition of CCA at the 3′ terminus after a tRNA molecule is synthesized, several of its nucleic acid bases typically are modified. For example, most tRNAs are synthesized with a four-base sequence of UUCG near the middle of the molecule. The first uridylate is methylated to become a thymidylate; the second is rearranged into a pseudouridylate (abbreviated Ψ), in which the ribose is attached to carbon 5 instead of to nitrogen 1 of the uracil. These modifications produce a characteristic TΨCG loop in an unpaired region at approximately the same position in nearly all tRNAs (see Figure 4-26a).

Nonstandard Base Pairing Often Occurs between Codons and Anticodons

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f27.jpg.

Figure 4-27

.

   The first and second bases in an mRNA codon form Watson-Crick base pairs with the third and second bases, respectively, of a tRNA anticodon

However, the base in the third (or wobble) position of an mRNA codon often forms a nonstandard base pair with the base in the first (or wobble) position of a tRNA anticodon. Wobble pairing allows a tRNA to recognize more than one mRNA codon (top); conversely, it allows a codon to be recognized by more than one kind of tRNA (bottom), although each tRNA will bear the same amino acid. Note that a tRNA with I (inosine) in the wobble position can “read” (become paired with) three different codons (see Figure 4-28), and a tRNA with G or U in the wobble position can read two codons. Although A is theoretically possible in the wobble position of the anticodon, it is almost never found in nature.

If perfect Watson-Crick base pairing were demanded between codons and anticodons, cells would have to contain exactly 61 different tRNA species, one for each codon that specifies an amino acid. As noted above, however, many cells contain fewer than 61 tRNAs. The explanation for the smaller number lies in the capability of a single tRNA anticodon to recognize more than one, but not necessarily every, codon corresponding to a given amino acid. This broader recognition can occur because of nonstandard pairing between bases in the so-called “wobble” position: the third base in a mRNA codon and the corresponding first base in its tRNA anticodon. Although the first and second bases of a codon form standard Watson-Crick base pairs with the third and second bases of the corresponding anticodon, four nonstandard interactions can occur between bases in the wobble position. Particularly important is the G·U base pair, which structurally fits almost as well as the standard G·C pair. Thus, a given anticodon in tRNA with G in the first (wobble) position can base-pair with the two corresponding codons that have either pyrimidine (C or U) in the third position (Figure 4-27). For example, the phenylalanine codons UUU and UUC (5′ → 3′) are both recognized by the tRNA that has GAA (5′ → 3′) as the anticodon. In fact, any two codons of the type NNPyr (N = any base; Pyr = pyrimidine) encode a single amino acid and are decoded by a single tRNA with G in the first (wobble) position of the anticodon.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f28.jpg.

Figure 4-28

.

   The nonstandard, wobble base pairs U·G, C·I, A·I, and U·I

The heavy black lines indicate the bonds by which the nitrogenous bases attach to the 1′ carbon of ribose (C1).

Although adenine rarely is found in the anticodon wobble position, many tRNAs in plants and animals contain inosine (I), a deaminated product of adenine, at this position. Inosine can form nonstandard base pairs with A, C, and U (Figure 4-28). A tRNA with inosine in the wobble position thus can recognize the corresponding mRNA codons with A, C, or U in the third (wobble) position (see Figure 4-27). For this reason, inosine-containing tRNAs are heavily employed in translation of the synonymous codons that specify a single amino acid. For example, four of the six codons for leucine have a 3′ A, C, or U (see Table 4-2); these four codons are all recognized by the same tRNA (3′-GAI-5′), which has inosine in the wobble position of the anticodon (and thus recognizes CUA, CUC, and CUU), and uses a G·U pair in position 1 to recognize the UUA codon.

Aminoacyl-tRNA Synthetases Activate Amino Acids by Linking Them to tRNAs

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f29.jpg.

Figure 4-29

.

   Aminoacylation of tRNA. Amino acids are covalently linked to tRNAs by aminoacyl-tRNA synthetases

Each of these enzymes recognizes one kind of amino acid and all the cognate tRNAs that recognize codons for that amino acid. The two-step aminoacylation reaction requires energy from the hydrolysis of ATP. The equilibrium of the overall reaction favors the indicated products because the pyrophosphate (PPi) released in step 1 is converted to inorganic phosphate (Pi) by a pyrophosphatase. The 3′ end of all tRNAs, to which the amino acid attaches, has the sequence CCA. Class I synthetases (purple) attach the amino acid to the 2′ hydroxyl of the terminal adenylate in tRNA; class II synthetases (green) attach the amino acid to the 3′ hydroxyl. (Ad = adenine; Cyt = cytosine.)

Recognition of the codon or codons specifying a given amino acid by a particular tRNA is actually the second step in decoding the genetic message. The first step, attachment of the appropriate amino acid to a tRNA, is catalyzed by a specific aminoacyl-tRNA synthetase (see Figure 4-25). Each of the 20 different synthetases recognizes one amino acid and all its compatible, or cognate, tRNAs. These coupling enzymes link an amino acid to the free 2′ or 3′ hydroxyl of the adenosine at the 3′ terminus of tRNA molecules by a two-step ATP-requiring reaction (Figure 4-29). About half the aminoacyl-tRNA synthetases transfer the aminoacyl group to the 2′ hydroxyl of the terminal adenosine (class I), and about half to the 3′ hydroxyl (class II). In this reaction, the amino acid is linked to the tRNA by a high-energy bond and thus is said to be activated. The energy of this bond subsequently drives the formation of peptide bonds between adjacent amino acids in a growing polypeptide chain. The equilibrium of the aminoacylation reaction is driven further toward activation of the amino acid by hydrolysis of the high-energy phosphoanhydride bond in pyrophosphate. The overall reaction is
graphic element

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f30.jpg.

Figure 4-30

.

   Recognition of a tRNA by aminoacyl synthetases. Aspartyl-tRNA synthetase (AspRS) is a class II enzyme, and arginyl-tRNA synthetase (ArgRS) is a class I enzyme

Shown here are the outlines of the three-dimensional structures of the two synthetases. The tRNA shown between them as a ribbon diagram will bind to either and is a slightly modified version of tRNAAsp. It is used as an illustration of common surface interactions between tRNA and class I and II enzymes. Sites on the opposite sides of this modified tRNAAsp make contacts with the two enzymes: the blue balls show contacts with the class II AspRS; those that make contact with class I ArgRS are indicated by yellow balls. The synthetases are shown positioned away from the tRNA for clarity, but the fit of the surfaces at close range is obvious. The ability of ArgRS to interact with the noncognate tRNAAsp is lost when residue G37 in tRNAAsp is methylated, a normal modification that occurs in vivo. However, the shape and binding sites of this modified tRNA are characteristic of class I and class II interactions with tRNAs. This molecular graphic picture was produced using the DRAWNA program. [Adapted from M. Sissler et al., 1997, Nucl. Acids Res. 25:4899; courtesy of R. Giegé.]

The amino acid sequences of the aminoacyl-tRNA synthetases (ARSs) from many organisms are now known, and the three-dimensional structures of over a dozen enzymes of both classes have been solved. Each of these enzymes has a rather precise binding site for ATP (GTP is not admitted and CTP and UTP are too small) and binding pockets for its specific amino acid. Class I and class II enzymes bind to opposite faces of the incoming tRNAs. The binding surfaces of class I enzymes tend to be somewhat complementary to those of class II enzymes. These different binding surfaces and the consequent alignment of bound tRNAs probably account in part for the difference in the hydroxyl group to which the aminoacyl group is transferred (Figure 4-30). Because some amino acids are so similar structurally, aminoacyl-tRNA synthetases sometimes make mistakes. These are corrected, however, by the enzymes themselves, which check the fit in the binding pockets and facilitate deacylation of any misacylated tRNAs. This crucial function helps guarantee that a tRNA delivers the correct amino acid to the protein-synthesizing machinery.

Each tRNA Molecule Is Recognized by a Specific Aminoacyl-tRNA Synthetase

The ability of aminoacyl-tRNA synthetases to recognize their correct cognate tRNAs is just as important to the accurate translation of the genetic code as codon-anticodon pairing. Once a tRNA is loaded with an amino acid, codon-anticodon pairing directs the tRNA into the proper ribosome site; if the wrong amino acid is attached to the tRNA, an error in protein synthesis results.

As noted already, each aminoacyl-tRNA synthetase can aminoacylate all the different tRNAs whose anticodons correspond to the same amino acid. Therefore, all these cognate tRNAs must have a similar binding site, or “identity element,” that is recognized by the synthetase. One approach for studying the identity elements in tRNAs that are recognized by aminoacyl-tRNA synthetases is to produce synthetic genes that encode tRNAs with normal and various mutant sequences by techniques discussed in Chapter 7. The normal and mutant tRNAs produced from such synthetic genes then can be tested for their ability to bind purified synthetases.

Very probably no single structure or sequence completely determines a specific tRNA identity. However, some important structural features of several E. coli tRNAs that allow their cognate synthetases to recognize them are known. Perhaps the most logical identity element in a tRNA molecule is the anticodon itself. Experiments in which the anticodons of methionine tRNA (tRNAMet) and valine tRNA (tRNAVal) were interchanged showed that the anticodon is of major importance in determining the identity of these two tRNAs. In addition, x-ray crystallographic analysis of the complex between glutamine aminoacyl-tRNA synthetase (GlnRS) and glutamine tRNA (tRNAGln) showed that each of the anticodon bases neatly fits into a separate, specific “pocket” in the three-dimensional structure of GlnRS. Thus this synthetase specifically recognizes the correct anticodon.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is permission.jpg.

Figure 4-31

.

   Identity elements in tRNA involved in recognition by aminoacyl-tRNA synthetases, as demonstrated by both conservation and experimentation

The 67 known tRNA sequences in E. coli were compared by computer analysis. The conserved nucleotides in different tRNAs that recognize the same amino acid are shown as yellow circles in the left drawing, with the tRNA chain in blue. Increasing size indicates increasing conservation of a base at a given position. It is clear that nucleotides in the anticodon loop and in the acceptor stem are most often similar when a particular amino acid must be recognized. This appreciation is heightened by results shown in the right drawing. Here, nucleotides that have been experimentally demonstrated to have a role in identity (selection of an amino acid by an ARS-tRNA complex) are shown as yellow circles. In this case, the circle size indicates the relative frequency that a given position acts as an identity element. [From W. H. McClain, 1993, J. Mol. Biol. 234:257; also see L. D. H. Schulman and J. Abelson, 1988, Science 240:1590.]

However, the anticodon may not be the principal identity element in other tRNAs (see Figure 4-30). Figure 4-31 shows the extent of base sequence conservation in E. coli tRNAs that become linked to the same amino acid. Identity elements are found in several regions, particularly the end of the acceptor arm. A simple case is presented by tRNAAla: a single G·U base pair (G3·U70) in the acceptor stem is necessary and sufficient for recognition of this tRNA by its cognate aminoacyl-tRNA synthetase. Solution of the three-dimensional structure of additional complexes between aminoacyl-tRNA synthetases and their cognate tRNAs should provide a clear understanding of the rules governing the recognition of tRNAs by specific synthetases.

Ribosomes Are Protein-Synthesizing Machines

If the many components that participate in translating mRNA had to interact in free solution, the likelihood of simultaneous collisions occurring would be so low that the rate of amino acid polymerization would be very slow. The efficiency of translation is greatly increased by the binding of the mRNA and the individual aminoacyl-tRNAs to the most abundant RNA-protein complex in the cell — the ribosome. This two-part machine directs the elongation of a polypeptide at a rate of three to five amino acids added per second. Small proteins of 100 – 200 amino acids are therefore made in a minute or less. On the other hand, it takes 2 to 3 hours to make the largest known protein, titin, which is found in muscle and contains 30,000 amino acid residues. The machine that accomplishes this task must be precise and persistent.

With the aid of the electron microscope, ribosomes were first discovered as discrete, rounded structures prominent in animal tissues secreting large amounts of protein; initially, however, they were not known to play a role in protein synthesis. Once reasonably pure ribosome preparations were obtained, radiolabeling experiments showed that radioactive amino acids first were incorporated into growing polypeptide chains associated with ribosomes before appearing in finished chains.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f32.jpg.

Figure 4-32

.

   The general structure of ribosomes in prokaryotes and eukaryotes

In all cells, each ribosome consists of a large and a small subunit. The two subunits contain rRNAs of different lengths, as well as a different set of proteins. All ribosomes contain two major rRNA molecules (dark red) — 23S and 16S rRNA in bacteria, 28S and 18S rRNA in eukaryotes — and one or two small RNAs (light red). The proteins are named L1, L2, etc., and S1, S2, etc., depending on whether they are found in the large or the small subunit.

A ribosome is composed of several different ribosomal RNA (rRNA) molecules and more than 50 proteins, organized into a large subunit and a small subunit. The proteins in the two subunits differ, as do the molecules of rRNA. The small ribosomal subunit contains a single rRNA molecule, referred to as small rRNA; the large subunit contains a molecule of large rRNA and one molecule each of two much smaller rRNAs in eukaryotes (Figure 4-32). The ribosomal subunits and the rRNA molecules are commonly designated in svedbergs (S), a measure of the sedimentation rate of suspended particles centrifuged under standard conditions (Chapter 3). The lengths of the rRNA molecules, the quantity of proteins in each subunit, and consequently the sizes of the subunits differ in prokaryotic and eukaryotic cells. (The small and large rRNAs are about 1500 and 3000 nucleotides long in bacteria and about 1800 and 5000 nucleotides long in humans.) Perhaps of more interest than these differences are the great structural and functional similarities among ribosomes from all species. This consistency is another reflection of the common evolutionary origin of the most basic constituents of living cells.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is ch4f33.jpg.

Figure 4-33

.

   Two-dimensional map of the secondary structure of the small (16S) rRNA from bacteria, showing the location of base-paired stems and loops

In general, the length and position of the stem-loops are very similar in all species, although the exact sequence varies from species to species. The most highly conserved regions are represented as red lines, and the numbered stem-loops unique to prokaryotes are preceded by a P. Eukaryotic small (18S) rRNAs exhibit a generally similar pattern of stem-loops, although, as with prokaryotes, a few are unique. [Adapted from E. Huysmans and R. DeWachter, 1987, Nucl. Acids Res. 14:73].

The sequences of the small and large rRNAs from several thousand organisms are now known. Although the primary nucleotide sequences of these rRNAs vary considerably, the same parts of each type of rRNA theoretically can form base-paired stem-loops, generating a similar threedimensional structure for each rRNA in all organisms. Evidence that such stem-loops occur in rRNA was obtained by treating rRNA with chemical agents that cross-link paired bases; the samples then were digested with enzymes that destroy single-stranded rRNA, but not any cross-linked (base-paired) regions. Finally, the intact, cross-linked rRNA that remained was collected and sequenced, thus identifying the stem-loops in the original rRNA. Experiments of this type have located about 45 stem-loops at similar positions in small rRNAs from many different prokaryotes and eukaryotes (Figure 4-33). An even larger number of regularly positioned stem-loops have been demonstrated in large rRNAs. All the ribosomal proteins have been identified and their sequences determined, and many have been shown to bind specific regions of rRNA. It seems clear that the fundamental protein-synthesizing machinery in all present-day cells arose only once and has been modified about a common plan during evolution.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is permission.jpg.
An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is permission.jpg.

Figure 4-34

.

   Overall structure of the E. coli ribosome at 25-Å resolution inferred from cryoelectron microscopy and three-dimensional reconstruction based on the analysis of 4300 individual projections

(a) This model shows the shapes of the large (blue) and small (yellow) subunits of the ribosome with three aminoacyl-tRNAs (pink, green, yellow) superimposed at the A, P, and E sites. The roles of these tRNA-binding sites during protein synthesis are discussed later. Chemical cross-linking experiments have demonstrated that the mRNA (orange beads) passes close to the anticodon loops of the tRNAs and that the nascent polypeptide chain is buried in the tunnel in the large ribosomal subunit that begins within 10 – 15 Å of the 3′ aminoacylated end of the tRNAs. The tunnel termination site on the ribosome surface has also been accurately mapped. (b) Large panel shows a field of 70S ribosomes. Small panels (left) show cryoelectron microscopy images of a single 70S ribosome, small (30S) subunit, and large (50S) subunit. Small panels (right) show computer-derived averages of many dozens of images in the same orientation. Cryoelectron microscopy is carried out on unstained samples of ribosomes or subunits flash frozen as “vitreous ices” (without ice crystals) in a very thin layer of water (Chapter 5). Individual images are analyzed by computer projections. [See R. K. Agrawal et al., Cell, in press; J. Frank, 1995, Nature 356:441; J. Frank et al., 1995, Biochem. Cell Biol. 73:757. Courtesy of J. Frank.]

During protein synthesis, a ribosome moves along an mRNA chain, interacting with various protein factors and tRNA and very likely even undergoes shape changes. Despite the complexity of the ribosome, great progress has been made in determining both the overall structure of bacterial ribosomes and in identifying reactive sites that bind specific proteins, mRNA, and tRNA and that participate in important steps in protein synthesis. Quite detailed models of the large and small ribosomal subunits from E. coli have been constructed based on cryoelectron microscope and neutron-scattering studies (Figure 4-34). These studies not only have determined the dimensions and overall shape of the ribosomal subunits, but also have localized the positions of tRNAs bound to the ribosome during protein chain elongation. Powerful chemical experiments have also helped unravel the complex interactions between proteins and RNAs. In a technique called footprinting, for example, ribosomes are treated with chemical reagents that modify single-stranded RNA unprotected by binding either to protein or to another RNA. If the total sequence of the RNA is known, then the location of the modified nucleotides can be located within the molecule. (This technique, which is also useful for locating protein-binding sites in DNA, is described in Chapter 10.) Thus the overall structure and function of ribosomes during protein synthesis is finally, after 40 years, yielding to successful experiments. How these results aid in understanding the specific steps in protein synthesis is described in the next section.

SUMMARY

Help ǀ Contact Bookshelf