The genome is a store of biological information but on its own it is unable to release that information to the cell. Utilization of the biological information requires the coordinated activity of enzymes and other proteins, which participate in a complex series of biochemical reactions referred to as genome expression. The details of genome expression are described in Part 3. Before reaching this detailed discussion, an overview of the key events involved in genome expression will be valuable, in order to establish a foundation of knowledge onto which the more comprehensive understanding can subsequently be built. This chapter provides that overview.
The initial product of genome expression is the transcriptome, a collection of RNA molecules derived from those protein-coding genes whose biological information is required by the cell at a particular time (Figure 3.1
(A) shows the old-fashioned depiction of gene expression, summarized as ‘DNA makes RNA makes protein’, the two steps being called transcription and translation. (B) gives a more accurate outline of the events involved in genome expression, especially in higher organisms. Note that these schemes apply only to protein-coding genes. Those genes that give rise to non-coding RNAs are transcribed and processed as shown but the RNAs are not translated.
Accessing the genome. This involves various processes that influence chromatin structure and nucleosome positioning in the parts of the genome that contain active genes, ensuring that these genes are accessible and are not buried deep within highly packaged parts of the chromosomes.
Assembly of the transcription initiation complex, which comprises the set of proteins that work together to copy genes into RNA. Assembly of initiation complexes is a highly targeted process because these complexes must be constructed at precise positions in the genome, adjacent to active genes, and nowhere else.
Synthesis of RNA, during which the gene is transcribed into an RNA copy.
Processing of RNA involves a series of alterations that are made to the sequence of the RNA molecule and to its chemical structure, and which must occur before the RNA molecules can be translated into protein or, in the case of non-coding RNA, before they can carry out their other functions in the cell.
RNA degradation is the controlled turnover of RNA molecules. Degradation is not simply a means of getting rid of unwanted RNAs: it plays an active role in determining the make up of the transcriptome and hence is an integral step in genome expression.
Assembly of the translation initiation complex occurs near the 5′ termini of coding RNA molecules, and is a prerequisite for translation of these molecules.
Protein synthesis is the synthesis of a protein by translation of an RNA molecule.
Protein folding and protein processing may occur together. Folding results in the protein taking up its correct three-dimensional configuration. Processing involves modification of the protein by addition of chemical groups and, for some proteins, removal of one or more segments of the protein.
Protein degradation has an important influence on the composition of the proteome and, like RNA degradation, is an integral component of genome expression.
A typical bacterium contains 0.05–0.10 pg of RNA, making up about 6% of its total weight. A mammalian cell, being much larger, contains more RNA, 20–30 pg in all, but this represents only 1% of the cell as a whole (Alberts et al., 1994). It is important to appreciate that not all of this RNA constitutes the transcriptome. The latter is just the coding RNA - those molecules that have been transcribed from protein-coding genes and which are therefore capable of being translated into protein. Most of the cellular RNA does not fall into this category because it is non-coding. An understanding of the distinctive features of coding and non-coding RNA is therefore essential before we continue with our overview of genome expression.
This scheme shows the types of RNA present in all organisms (eukaryotes, bacteria and archaea) and those categories found only in eukaryotic or bacterial cells. The non-coding RNAs of archaea have not yet been fully characterized and it is not clear which types are present in addition to rRNA and tRNA. For abbreviations, see the text.
Messenger RNAs (mRNAs), which are transcripts of protein-coding genes and hence are translated into protein in the latter stages of genome expression.
Messenger RNAs rarely make up more than 4% of the total RNA and are short-lived, being degraded soon after synthesis. Bacterial mRNAs have half-lives of no more than a few minutes and in eukaryotes most mRNAs are degraded a few hours after synthesis. This rapid turnover means that the composition of the transcriptome is not fixed and can quickly be restructured by changing the rate of synthesis of individual mRNAs.
The second type of RNA is non-coding. This is more diverse than the coding RNA and comprises transcripts with a number of different functions, all of which are performed by the RNA molecules themselves. In both prokaryotes and eukaryotes the two main types of non-coding RNA are:
Ribosomal RNAs (rRNAs), which are the most abundant RNAs in the cell, making up over 80% of the total in actively dividing bacteria. These molecules are components of ribosomes, the structures on which protein synthesis takes place (Section 11.2).
Transfer RNAs (tRNAs) are small molecules that are also involved in protein synthesis, carrying amino acids to the ribosome and ensuring that these are linked together in the order specified by the nucleotide sequence of the mRNA that is being translated (Section 11.1).
Small nuclear RNA (snRNA; also called U-RNA because these molecules are rich in uridine nucleotides), which is involved in mRNA processing (Section 10.1.3);
Small nucleolar RNA (snoRNA), which plays a central role in the processing of rRNA molecules (Section 10.3.1);
Small cytoplasmic RNA (scRNA), a diverse group including molecules with a range of functions, some understood and others still mysterious.
Bacteria and archaea also contain non-coding RNAs other than rRNA and tRNA but these molecules do not make up a substantial fraction of the total RNA. In bacteria they include one interesting RNA type, apparently present in most if not all species, called transfer-messenger RNA (tmRNA), which looks like a tRNA attached to an mRNA, and which adds short peptide tags onto proteins that have been synthesized incorrectly, labeling them for immediate degradation (Muto et al., 1998).
The RNA transcript is synthesized in the 5′→3′ direction, reading the DNA in the 3′→5′ direction, with the sequence of the transcript determined by base-pairing to the DNA template.
Compare this reaction with polymerization of DNA, as illustrated in Figure 1.8.
(A) shows the direct recognition of the promoter by the RNA polymerase, as occurs in bacteria. (B) shows recognition of the promoter by a DNA-binding protein which forms a platform onto which the RNA polymerase binds. This indirect mechanism occurs with eukaryotic and archaeal RNA polymerases.
Not all events occur in all organisms - see the text for details.
End-modifications occur during the synthesis of eukaryotic and archaeal mRNAs, most of which have a single, unusual nucleotide called a cap attached at the 5′ end and a poly(A) tail attached to the 3′ end. The cap and poly(A) tails are involved in assembly of the translation initiation complex on these mRNAs (Section 10.2.2).
Splicing is the removal of introns from a precursor RNA. Many eukaryotic protein-coding genes contain introns and these are copied when the gene is transcribed. The introns are removed from the pre-mRNA by cutting and joining reactions. Unspliced pre-mRNA forms the nuclear RNA fraction called heterogenous nuclear RNA (hnRNA). Some eukaryotic pre-rRNAs and pre-tRNAs also contain introns, as do some archaeal transcripts, but they are extremely rare in bacteria.
Cutting events are particularly important in the processing of rRNA and tRNA, many of which are initially synthesized from transcription units that specify more than one molecule. The pre-rRNAs and pre-tRNAs must therefore be cut into pieces to produce the mature RNAs. This type of processing occurs in both prokaryotes and eukaryotes.
Chemical modifications are made to rRNAs, tRNAs and mRNAs. The rRNAs and tRNAs of all organisms are modified by addition of new chemical groups, these groups being added to specific nucleotides within each RNA. Chemical modification of mRNA, called RNA editing, is seen in a diverse group of eukaryotes.
(A) RNA editing can change the sequence of an mRNA, resulting in synthesis of a different protein. An example occurs with the human mRNA for apolipoprotein B, as shown in Figure 10.29. (B) Alternative splicing results in different combinations of exons becoming linked together, again resulting in different proteins being synthesized from the same pre-mRNA. Figure 10.20 shows how alternative splicing underlies sex determination in Drosophila.
Although the transcriptome makes up less than 4% of the total cell RNA, it is the most significant component because it contains the coding RNAs that specify the composition of the proteome and hence determine the biochemical capacity of the cell. One important point to note is that the transcriptome is never synthesized de novo. Every cell receives part of its parent's transcriptome when it is first brought into existence by cell division, and maintains a transcriptome throughout its lifetime. Even quiescent cells in bacterial spores or in the seeds of plants have a transcriptome, although expression of that transcriptome into protein may be completely switched off. Transcription does not therefore result in synthesis of the transcriptome but instead maintains the transcriptome by replacing mRNAs that have been degraded, and brings about changes to the composition of the transcriptome via the switching on and off of different sets of genes.
Even in the simplest organisms such as bacteria and yeast, many genes are active at any one time. Transcriptomes are therefore complex, containing copies of hundreds, if not thousands, of different mRNAs. Usually, each mRNA makes up only a small fraction of the whole, with the most common type rarely contributing more than 1% of the total mRNA. Exceptions are those cells that have highly specialized biochemistries, which are reflected by transcriptomes in which one or a few mRNAs predominate. Developing wheat seeds are an example: these synthesize large amounts of the gliadin proteins, which accumulate in the dormant grain and provide a source of amino acids for the germinating seedling. Within the developing seeds, the gliadin mRNAs can make up as much as 30% of the transcriptomes of certain cells.
Perhaps surprisingly, it is relatively easy to determine the composition of a transcriptome, and to make comparisons between different transcriptomes, using the microarray and DNA chip technologies described in Section 7.3.1. To illustrate the types of analysis that are possible we will examine some of the recent research on the yeast and human transcriptomes.
With less than 6000 genes, the yeast Saccharomyces cerevisiae is ideally suited for transcriptome studies, and many of the pioneering projects have been carried out with this organism. One of the first discoveries was that, although mRNAs are being degraded and re-synthesized all the time, the composition of the yeast transcriptome undergoes very little change if the biochemical features of the environment remain constant (DeRisi et al., 1997). When yeast is grown in a glucose-rich medium, which allows the cells to divide at their maximum rate, the transcriptome is almost completely stable, only 19 mRNAs displaying a greater than two-fold change in abundance over a period of 2 hours (Figure 3.9
The middle three drawings show the nuclear divisions that occur during sporulation. See Figure 5.15 for details of the events involved in meiosis I and meiosis II.
This work on yeast sporulation is important for two reasons. First, by describing the changes in genome expression that occur during sporulation, the transcriptome analyses open the way to studies of the interactions between the genome and the environmental signals that trigger sporulation. Studies of this type, in a relatively simple organism such as yeast, act as an important model for the more complex developmental processes that operate in higher eukaryotes, including humans. Secondly, several of the mRNAs whose levels change significantly during sporulation are transcripts of genes whose functions were previously unknown. Transcriptome studies therefore help to annotate a genome sequence, aiding identification of genes whose roles in the genome have not been determined by other methods. We will return to this issue in Chapter 7.
The diagram shows human chromosome 11 aligned vertically. The bar charts indicate the expression levels in different cell types of the genes on this chromosome. The lengths of the blue bars are proportional to the extent of gene expression, and the red bars indicate genes whose expression levels are higher than can be illustrated on this scale. The box highlights significant differences between the transcriptomes of normal and cancerous breast tissue cells. Reprinted with permission from Caron et al. Science, 291, 1289-1292. Copyright 2001 American Association for the Advancement of Science.
Transcriptome studies also have applications in cancer diagnosis. The initial breakthrough in this respect came in 1999 when it was shown that the transcriptome of acute lymphoblastic leukemia cells is different from that of acute myeloid leukemia cells (Golub et al., 1999). Twenty-seven lymphoblastic and eleven myeloid cancers were studied and, although all the transcriptomes were slightly different, the distinctions between the two types were sufficient for unambiguous identifications to be made. The significance of this work lies with the improved remission rates that are achievable if a cancer is identified accurately at an early stage, before clear morphological indicators are seen. This is not relevant with these two types of leukemia because these can be distinguished by non-genetic means, but it is important with other cancers such as non-Hodgkin lymphoma. The commonest version of this disease is called diffuse large B-cell lymphoma, and for many years it was thought that all tumors of this type were the same. Transcriptome studies changed this view and showed that B-cell lymphoma can be divided into two distinct subtypes (Alizadeh et al., 2000). The distinctions between the transcriptomes of the two subtypes enable each one to be related to a different class of B cells, stimulating and directing the search for specific treatments that are tailored for each lymphoma.
The proteome is the final product of genome expression and comprises all the proteins present in a cell at a particular time. A ‘typical’ mammalian cell, for example a liver hepatocyte, is thought to contain 10 000–20 000 different proteins, about 8 × 109 individual molecules in all, representing approximately 0.5 ng of protein or 18–20% of the total cell weight (Alberts et al., 1994; Lodish et al., 2000). The copy numbers of individual proteins vary enormously, from less than 20 000 molecules per cell for the rarest types to 100 million copies for the commonest ones. Any protein that is present at a copy number of greater than 50 000 per cell is considered to be relatively abundant, and in the average mammalian cell some 2000 proteins fall into this category. When the proteomes of different types of mammalian cell are examined, very few differences are seen among these abundant proteins, suggesting that most of them are housekeeping proteins which perform general biochemical activities that occur in all cells. The proteins that provide the cell with its specialized function are often quite rare, although there are exceptions such as the vast amounts of hemoglobin that are present only in red blood cells (Alberts et al., 1994).
The proteome can be looked upon as the central link between the genome and the cell: it is, on the one hand, the culmination of genome expression and, on the other hand, the starting point for the biochemical activities that constitute cellular life (Figure 3.12
Proteins are traditionally looked upon as having four distinct levels of structure. These levels are hierarchical, the protein being built up stage by stage, with each level of structure depending on the one below it:
The drawing shows the chemical reaction that results in two amino acids becoming linked together by a peptide bond. The reaction is called a condensation because it results in elimination of water.
The polypeptide chains are shown in outline with the positions of the α-carbons indicated by small dots. The R groups have been omitted for clarity. Each structure is stabilized by hydrogen bonds between the C=O and N-H groups of different peptide bonds. The β-sheet conformation that is shown is anti-parallel, the two chains running in opposite directions. Parallel β-sheets also occur.
This imaginary protein structure comprises three α-helices, shown as coils, and a four-stranded β-sheet, indicated by the arrows. Redrawn from Turner et al. (1997).
The quaternary structure involves the association of two or more polypeptides, each folded into its tertiary structure, into a multi-subunit protein. Not all proteins form quaternary structures, but it is a feature of many proteins with complex functions, including several involved in genome expression. Some quaternary structures are held together by disulfide bridges between different polypeptides, but many proteins comprise looser associations of subunits stabilized by hydrogen bonding and hydrophobic effects, and can revert to their component polypeptides, or change their subunit composition, according to the functional requirements.
Proteins are functionally diverse because the amino acids from which proteins are made are themselves chemically diverse. Different sequences of amino acids therefore result in different combinations of chemical reactivities, these combinations dictating not only the overall structure of the resulting protein but also the positioning on the surface of the structure of reactive groups that determine the chemical properties of the protein.
These 20 amino acids are the ones that are conventionally looked upon as being specified by the genetic code (Section 3.3.2). The classification into non-polar, polar etc. is as described in Lehninger (1970).
| Abbreviation | ||
|---|---|---|
| Amino acid | Three-letter | One-letter |
| Alanine | Ala | A |
| Arginine | Arg | R |
| Asparagine | Asn | N |
| Aspartic acid | Asp | D |
| Cysteine | Cys | C |
| Glutamic acid | Glu | E |
| Glutamine | Gln | Q |
| Glycine | Gly | G |
| Histidine | His | H |
| Isoleucine | Ile | I |
| Leucine | Leu | L |
| Lysine | Lys | K |
| Methionine | Met | M |
| Phenylalanine | Phe | F |
| Proline | Pro | P |
| Serine | Ser | S |
| Threonine | Thr | T |
| Tryptophan | Trp | W |
| Tyrosine | Tyr | Y |
| Valine | Val | V |
Selenocysteine is the same as cysteine but with the sulfur replaced with a selenium atom.
During protein processing, some amino acids are modified by the addition of new chemical groups, for example by acetylation or phosphorylation, or by attachment of large side chains made up of sugar units (Section 11.3.3).
Proteins therefore have an immense amount of chemical variability, some of this directly specified by the genome, the remainder arising by protein processing.
The flow of information from DNA to RNA by transcription does not provide any conceptual difficulty. DNA and RNA polynucleotides have very similar structures and we can easily understand how an RNA copy of a gene can be made by template-dependent synthesis using the base-pairing rules with which we are familiar. The second phase of genome expression, during which the mRNA molecules of the transcriptome direct synthesis of proteins, is less easy to understand simply by considering the structures of the molecules that are involved. In the early 1950s, shortly after the double helix structure of DNA had been discovered, several molecular biologists attempted to devise ways in which amino acids could attach to mRNAs in an ordered fashion, but in all of these schemes at least some of the bonds had to be shorter or longer than was possible according to the laws of physical chemistry, and each idea was quietly dropped. Eventually, in 1957, Francis Crick cut a way through the confusion by predicting the existence of an adaptor molecule (Crick, 1990) that would form a bridge between the mRNA and the polypeptide being synthesized. Soon afterwards it was realized that the non-coding tRNAs were these adaptor molecules, and, once this fact had been established, a detailed understanding of the mechanism by which proteins are synthesized was quickly built up. We will examine this process in Section 11.1.
The other aspect of protein synthesis that interested molecular biologists in the 1950s and 1960s was the informational problem. This refers to the second important component of the link between the transcriptome and proteome: the genetic code which specifies how the nucleotide sequence of an mRNA is translated into the amino acid sequence of a protein.
It was assumed that the genetic code is triplet, that codons do not overlap, and that there is a colinear relationship between the sequences of an mRNA and the polypeptide it encodes.
| Organism | Codon | Should code for | Actually codes for |
|---|---|---|---|
| Mitochondrial genomes | |||
| Mammals | UGA | Stop | Trp |
| AGA, AGG | Arg | Stop | |
| AUA | Ile | Met | |
| Drosophila | UGA | Stop | Trp |
| AGA | Arg | Ser | |
| AUA | Ile | Met | |
| Saccharomyces cerevisiae | UGA | Stop | Trp |
| CUN | Leu | Thr | |
| AUA | Ile | Met | |
| Fungi | UGA | Stop | Trp |
| Maize | CGG | Arg | Trp |
| Nuclear and prokaryotic genomes | |||
| Several protozoa | UAA, UAG | Stop | Gln |
| Candida cylindracea | CUG | Leu | Ser |
| Micrococcus sp. | AGA | Arg | Stop |
| AUA | Ile | Stop | |
| Euplotes sp. | UGA | Stop | Cys |
| Mycoplasma sp. | UGA | Stop | Trp |
| CGG | Arg | Stop | |
| Context-dependent codon reassignments | |||
| Various | UGA | Stop | Selenocysteine |
Abbreviation: N, any nucleotide.
| Protein | Organism |
|---|---|
| Prokaryotic enzymes | |
| Formate dehydrogenase | Clostridium thermoaceticum, Clostridium thermoautotrophicum, Enterobacter aerogenes, Escherichia coli, Methanococcus vaniellii |
| Glycine reductase | Clostridium purinolyticum, Clostridium sticklandii |
| NiFeSe hydrogenase | Desulfomicrobium baculatum, Methanococcus voltae |
| Eukaryotic enzymes | |
| Glutathione peroxidase | Human, cow, rat, mouse |
| Selenoprotein P | Human, cow, rat |
| Selenoprotein W | Rat |
| Type 1 deiodinase | Human, rat, mouse, dog |
| Type 2 deiodinase | Frog |
| Type 3 deiodinase | Human, rat, frog |
See Low and Berry (1996).
How does the proteome convert the biological information that it has received from the genome into the biochemical capabilities of the cell? Two fundamental aspects of protein chemistry enable this final step in genome expression to be achieved. The first of these is the hierarchical nature of the four levels of protein structure, which provides a direct link between the amino acid sequence of a protein and its chemical properties. The second aspect is the multiplicity of the chemical properties that can be displayed by different proteins, this variability enabling proteins to carry out a huge range of different biochemical activities.
An α-helix is shown nucleating at a position containing amino acids that favor helix formation, and extending in either direction until groups of amino acids that block helix formation are reached.
By identifying which amino acids are most frequently located in which secondary structures, and by studying the structures taken up by small polypeptides of known sequence, biochemists have been able to deduce rules for this level of protein folding, and, to a certain extent, can predict which secondary structures will be adopted by a polypeptide simply by examining its primary sequence (Barton, 1995). It is less easy to predict the outcomes of the next two stages of protein folding, which result in the secondary structural units becoming arranged into the tertiary structure, and tertiary units associating to form quaternary multi-subunit structures. The tertiary structures of most proteins are made up of two or more structural domains, possibly with little interaction between them; these domains are thought to fold independently of one another. Understanding how this occurs is complicated by the fact that many domains include secondary structural units from quite different regions of a polypeptide. But the difficulty in identifying rules for predicting tertiary and quaternary structures does not detract from the fact that these higher levels of structure are determined by the amino acid sequence of the polypeptide. This is illustrated by the ability of proteins that have been unfolded in the test tube, for example by treatment with urea, to refold spontaneously into their correct structures when the treatment is reversed (Section 11.3.1).
Spontaneous refolding does not occur with all proteins and is particularly difficult to achieve with larger ones, the problem appearing to be that the protein can adopt alternative partially folded structures at various stages of the folding process, only one of which leads to the correctly folded tertiary configuration. If the protein makes the wrong ‘choice’ it can end up at a dead end in which it is partially folded in an incorrect manner but from which it cannot escape. In cells, proteins called molecular chaperones aid the folding of other proteins, probably by reducing the likelihood that the protein being folded adopts the wrong intermediate structure (Section 11.3.1). The existence of alternative folding pathways, and of proteins that aid the folding of other proteins, complicates the premise that a protein's folded structure is dictated by its amino acid sequence, but the premise still holds. The role of molecular chaperones is not to impose a new structure on a protein, but merely to increase the efficiency of the protein's natural, sequence-directed, folding pathway.
The biological information encoded by the genome finds its final expression in a protein whose biological properties are determined by the spatial arrangement of chemical groups on its surface and within its folded structure. By specifying proteins of different types, the genome is able to construct and maintain a proteome whose overall biological properties form the underlying basis of life. The proteome can play this role because of the huge diversity of protein structures that can be formed, the diversity enabling proteins to carry out a variety of biological functions. These functions include the following:
Biochemical catalysis is the role of the special type of proteins called enzymes. The central metabolic pathways, which provide the cell with energy, are catalyzed by enzymes, as are the biosynthetic processes that result in construction of nucleic acids, proteins, carbohydrates and lipids. Biochemical catalysis also drives genome expression through the activities of enzymes such as RNA polymerase.
Structure, which at the cellular level is determined by the proteins that make up the cytoskeleton, is also the primary function of some extracellular proteins. An example is collagen, which is an important component of bones and tendons.
Movement is conferred by contractile proteins, of which actin and myosin in cytoskeletal fibers are the best known examples.
Transport of materials around the body is an important protein activity: for example, hemoglobin transports oxygen in the bloodstream, and serum albumin transports fatty acids.
Regulation of cellular processes is mediated by signaling proteins such as STATs (Section 12.1.2) and by proteins such as activators that bind to the genome and influence the expression levels of individual genes and groups of genes (Section 9.3.2). The activities of groups of cells are regulated and coordinated by extracellular hormones and cytokines, many of which are proteins (e.g. insulin, the hormone that controls blood sugar levels, and the interleukins, a group of cytokines that regulate cell division and differentiation).
Protection of the body and of individual cells is the function of a range of proteins, including the antibodies, and those proteins involved in the blood clotting response.
Storage functions are performed by proteins such as ferritin, which acts as an iron store in the liver, and the gliadins, which store amino acids in dormant wheat seeds.
This multiplicity of protein function provides the proteome with its ability to convert the blueprint contained in the genome into the essential features of the life process.
Give short definitions of the following terms:
α-helix
β-sheet
Heterogenous nuclear RNA
Hydrophobic effect
Messenger RNA
Poly(A) polymerase
Poly(A) tail
Ribosomal RNA
Small cytoplasmic RNA
Small nuclear RNA
Small nucleolar RNA
Transfer RNA
Transfer-messenger RNA
Draw a diagram illustrating the steps involved in genome expression. Indicate the points at which the flow of information from genome to proteome can be regulated.
Distinguish between ‘coding’ and ‘non-coding’ RNA. List the types of coding and non-coding RNA found in prokaryotic and eukaryotic cells and briefly describe the functions of each type.
Outline how RNA is synthesized in living cells.
Briefly describe the types of RNA processing event that are important in genome expression.
Describe, with examples, how transcriptomes are studied.
Describe the four levels of protein structure. Your answer should include an indication of the types of chemical interaction that are important at each structural level.
Explain how the diversity of amino acid structure is related to the functional diversity of proteins.
Outline the experiments that enabled the meaning of each codon in the genetic code to be elucidated.
List the key features of the genetic code. What variations of the standard genetic code are known?
Explain how protein folding enables the amino acid sequence of a protein to determine the function of that protein.
Give examples of the various roles that proteins play in living organisms.
The transcriptome and proteome are looked on as, respectively, an intermediate and the end-product of genome expression. Evaluate the strengths and limitations of these terms for our understanding of genome expression.
To what extent can transcriptome studies identify the functions of genes?
What experiments were carried out in order to establish that the genetic code is a triplet and non-overlapping code and that proteins are colinear with their genes?
‘The role of molecular chaperones is not to impose a new structure on a protein, but merely to increase the efficiency of the protein's natural, sequence-directed, folding pathway’ (page 89). Defend this statement.