This chapter provides historical background to the field of glycobiology and an overview
of this book. General terms found throughout the volume are also considered. The common
monosaccharide units of glycoconjugates are mentioned and a uniform symbol nomenclature
used for structural depictions throughout the book is presented. The major glycan
classes to be discussed in the book are described, and an overview of the general
pathways for their biosynthesis is provided. Topological issues relevant to biosynthesis
and functions of glycoconjugates are also briefly considered, and the growing role of
these molecules in medicine and biotechnology is briefly surveyed.
WHAT IS GLYCOBIOLOGY?
The central paradigm driving research in molecular biology has been that biological
information flows from DNA to RNA to protein. The power of this concept lies in its
template-based precision, the ability to manipulate one class of molecules based on
knowledge of another, and the patterns of sequence homology and relatedness that
predict function and reveal evolutionary relationships. A variety of additional
roles for RNA have also recently emerged. With the sequencing of human genomes and
many other commonly studied organisms, even more spectacular gains in understanding
the biology of nucleic acids and proteins are anticipated.
However, the tendency is to assume that a conventional molecular biology approach
encompassing just these molecules will explain the makeup of cells, tissues, organs,
physiological systems, and intact organisms. In fact, making a cell requires two
other major classes of molecules: lipids and carbohydrates. These molecules can
serve as intermediates in generating energy and as signaling effectors, recognition
markers, and structural components. Taken together with the fact that they encompass
some of the major posttranslational modifications of proteins themselves, lipids and
carbohydrates help to explain how the relatively small number of genes in the
typical genome can generate the enormous biological complexities inherent in the
development, growth, and functioning of intact organisms.
The biological roles of carbohydrates are particularly important in the assembly of
complex multicellular organs and organisms, which requires interactions between
cells and the surrounding matrix. All cells and numerous macromolecules in nature
carry an array of covalently attached sugars (monosaccharides) or sugar chains
(oligosaccharides), which are generically referred to as
“glycans” in this book. Sometimes, these glycans can also be
freestanding entities. Because many glycans are on the outer surface of cellular and
secreted macromolecules, they are in a position to modulate or mediate a wide
variety of events in cell–cell, cell–matrix, and
cell–molecule interactions critical to the development and function of a
complex multicellular organism. They can also act as mediators in the interactions
between different organisms (e.g., between host and a parasite or a symbiont). In
addition, simple, rapidly turning over, protein-bound glycans are abundant within
the nucleus and cytoplasm, where they can serve as regulatory switches. A more
complete paradigm of molecular biology must therefore include glycans, often in
covalent combination with other macromolecules, that is, glycoconjugates, such as
glycoproteins and glycolipids.
The chemistry and metabolism of carbohydrates were prominent matters of interest in
the first part of the 20th century. Although these topics engendered much scientific
attention, carbohydrates were primarily considered as a source of energy or as
structural materials and were believed to lack other biological activities.
Furthermore, during the initial phase of the molecular biology revolution of the
1960s and 1970s, studies of glycans lagged far behind those of other major classes
of molecules. This was in large part due to their inherent structural complexity,
the great difficulty in determining their sequences, and the fact that their
biosynthesis could not be directly predicted from a DNA template. The development of
many new technologies for exploring the structures and functions of glycans has
since opened a new frontier of molecular biology that has been called
“glycobiology”—a word first coined in the late
1980s to recognize the coming together of the traditional disciplines of
carbohydrate chemistry and biochemistry with a modern understanding of the cell and
molecular biology of glycans and, in particular, their conjugates with proteins and
lipids.
From a strictly technical point of view, glycobiology can almost be viewed as an
anomaly in the history of the biological sciences. If the development of
methodologies for glycan analysis had kept pace with those of other macromolecules
in the 1960s and 1970s, glycans would have been an integral part of the initial
phase of the molecular and cell biology revolution, and there might have been no
need to single them out later for study as a distinct discipline. Regardless, the
term glycobiology has gained wide acceptance, with a number of textbooks, a major
biomedical journal, a growing scientific society, and many research conferences now
bearing this name. Defined in the broadest sense, glycobiology is the study of the
structure, biosynthesis, biology, and evolution of saccharides (sugar chains or
glycans) that are widely distributed in nature, and the proteins that recognize
them. (The Oxford English Dictionary definition is “the
branch of science concerned with the role of sugars in biological
systems.”) Glycobiology is one of the more rapidly growing fields in the
natural sciences, with broad relevance to many areas of basic research, biomedicine,
and biotechnology. The field includes the chemistry of carbohydrates, the enzymology
of glycan formation and degradation, the recognition of glycans by specific proteins
(lectins and glycosaminoglycan-binding proteins), glycan roles in complex biological
systems, and their analysis or manipulation by a variety of techniques. Research in
glycobiology thus requires a foundation not only in the nomenclature, biosynthesis,
structure, chemical synthesis, and functions of glycans, but also in the general
disciplines of molecular genetics, protein chemistry, cell biology, developmental
biology, physiology, and medicine. This volume provides an overview of the field,
with some emphasis on the glycans of animal systems. It is assumed that the reader
has a basic background in advanced undergraduate-level chemistry, biochemistry, and
cell biology.
HISTORICAL ORIGINS OF GLYCOBIOLOGY
FIGURE 1.1
.
Nobel laureates in fields related to the early history of glycobiology.
Listed are the Laureates and their original Nobel citations: Hermann Emil
Fischer (Chemistry, 1902), “in recognition of the
extraordinary services he has rendered by his work on sugar and purine
syntheses”; Karl Landsteiner (Physiology or
Medicine, 1930), “for his discovery of human blood
groups”; Walter Norman Haworth (Chemistry, 1937),
“for his investigations on carbohydrates and vitamin
C”; Carl and Gerty Cori (Physiology or Medicine,
1947), “for their discovery of the course of the
catalytic conversion of glycogen”; Luis F. Leloir
(Chemistry, 1970), “for his discovery of sugar
nucleotides and their role in the biosynthesis of
carbohydrates”; and George E. Palade (Physiology or
Medicine, 1974) “for discoveries concerning the
structural and functional organization of the
cell.” (Reprinted, with permission, ©The
Nobel Foundation.)
TABLE 1.1
| 1876 | J.L.W. Thudichum | glycosphingolipids (cerebrosides),
sphingomyelin and sphingosine | 10 |
| 1888 | H. Stillmark | lectins as hemagglutinins | 26, 28 |
| 1891 | H.E. Fischer | stereoisomeric structure of glucose and
other monosaccharides | 2 |
| 1900 | K. Landsteiner | human ABO blood groups as transfusion
barriers | 5, 13 |
| 1909 | P.A. Levene | structure of ribose in RNA | 1 |
| 1916 | J. MacLean | isolation of heparin as an anticoagulant | 16 |
| 1925 | P.A. Levene | characterization of chondroitin sulfate
and “mucoitin sulfate” (later, hyaluronan) | 15, 16 |
| 1929 | P.A. Levene | structure of 2-deoxyribose in DNA | 1 |
| 1929 | W.N. Haworth | pyranose and furanose ring structures of
monosaccharides | 2 |
| 1934 | K. Meyer | hyaluronan and hyaluronidase | 15 |
| 1934–1938 | G. Blix, E. Klenk | sialic acids | 14 |
| 1936 | C.F. Cori, G.T. Cori | glucose-1-phosphate as an intermediate in
glycogen biosynthesis | 17 |
| 1942–1946 | G.K. Hirst, F.M. Burnet | hemagglutination of influenza virus and
“receptor-destroying enzyme” | 14 |
| 1942 | E. Klenk, G. Blix | gangliosides in brain | 10, 14 |
| 1946 | Z. Dische | colorimetric determination of
deoxypentoses and other carbohydrates | 2 |
| 1948–1950 | E. Jorpes, S. Gardell | occurrence of N-sulfates in heparin and
identification of heparan sulfate | 16 |
| 1949 | L.F. Leloir | nucleotide sugars and their role in the
biosynthesis of glycans | 4 |
| 1950 | Karl Schmid | isolation of α1-acid
glycoprotein (orosomucoid), a major serum glycoprotein | |
| 1952 | W.T. Morgan, W.M. Watkins | carbohydrate determinants of ABO blood
group types | 13 |
| 1952 | E.A. Kabat | relationship of ABO to Lewis blood groups
and secretor vs. nonsecretor status | 13 |
| 1952 | A. Gottschalk | sialic acid as the receptor for influenza
virus | 14 |
| 1952 | T. Yamakawa | globoside, the major glycosphingolipid of
the erythrocyte membrane | 10 |
| 1956–1963 | M.R.J. Salton, J.M. Ghuysen, R.W. Jeanloz,
N. Sharon, H.M. Flowers | bacterial peptidoglycan backbone
structure
major structural polysaccharides in nature (chitin,
cellulose, and peptidoglycan) are β1-4-linked throughout | 20 |
| 1957 | P.W. Robbins, F. Lipmann | biosynthesis and characterization of PAPS,
the donor for glycan sulfation | 4, 16 |
| 1957 | H. Faillard, E. Klenk | crystallization of
N-acetylneuraminic acid as product of influenza
virus receptor-destroying enzyme (RDE)
(“neuraminidase”) | 14 |
| 1957–1963 | J. Strominger, J.T. Park, H.R. Perkins,
H.J. Rogers | mechanism of peptidoglycan biosynthesis
and site of penicillin action | 20 |
| 1958 | H. Muir | “mucopolysaccharides” are covalently
attached to proteins via serine | 16 |
| 1960 | D.C. Comb, S. Roseman | structure and enzymatic synthesis of
CMP-N-acetylneuraminic acid | 4, 14 |
| 1960–1965 | O. Westphal, O. Lüderitz, H.
Nikaido, P.W. Robbins | structure of lipopolysaccharides and
endotoxin glycans | 20 |
| 1960–1970 | R. Jeanloz, K. Meyer, A. Dorfman | structural studies of glycosaminoglycans | 15, 16 |
| 1961 | S. Roseman, L. Warren | biosynthesis of sialic acid | 4, 14 |
| 1961–1965 | G.E. Palade | ER-Golgi pathway for glycoprotein
biosynthesis and secretion | 3 |
| 1962 | A. Neuberger, R. Marshall, I. Yamashina,
L.W. Cunningham | GlcNAc-Asn as the first defined
carbohydrate-peptide linkage | 8 |
| 1962 | W.M. Watkins, W.Z. Hassid | enzymatic synthesis of lactose from
UDP-galactose and glucose | 4 |
| 1962 | J.A. Cifonelli, J. Ludowieg, A. Dorfman | iduronic acid as a constituent of heparin | 16 |
| 1962–1966 | L. Roden, U. Lindahl | identification of tetrasaccharide linking
glycosaminoglycans to protein core of proteoglycans | 16 |
| 1962 | E.H. Eylar, R.W. Jeanloz | demonstration of the presence of
N-acetyllactosamine in α1-acid
glycoprotein | 13 |
| 1963 | L. Svennerholm | analysis and nomenclature of gangliosides | 10 |
| 1963 | D. Hamerman, J. Sandson | covalent cross-linkage between hyaluronan
and inter-α-trypsin inhibitor | 15 |
| 1963–1964 | B. Anderson, K. Meyer, V.P. Bhavanandan,
A. Gottschalk | β-elimination of
Ser/Thr-O-linked glycans | 9 |
| 1963–1965 | R. Kuhn, H. Wiegandt | structure of GM1 and other brain
gangliosides | 10 |
| 1963–1967 | B.L. Horecker, P.W. Robbins, H. Nakaido,
M.J. Osborn | lipid-linked intermediates in bacterial
lipopolysaccharide and peptidoglycan biosynthesis | 20 |
| 1964 | V. Ginsburg | GDP-fucose and its biosynthesis from
GDP-mannose | 4 |
| 1964 | B. Gesner, V. Ginsburg | glycans control the migration of
leukocytes to target organs | 26 |
| 1965 | L.W. Cunningham | microheterogeneity of glycoprotein glycans | 2, 8, 9 |
| 1965–1966 | R.O. Brady | glucocerebrosidase is the enzyme deficient
in Gaucher’s disease | 41 |
| 1965–1975 | J.E. Silbert, U. Lindahl | cell-free biosynthesis of heparin and
chondroitin sulfate | 16 |
| 1965–1975 | W. Pigman | tandem repeat amino acid sequences with
Ser or Thr as O-glycosylation sites in mucins | 9 |
| 1966 | M. Neutra, C. Leblond | role of Golgi apparatus in protein
glycosylation | 3 |
| 1966–1969 | B. Lindberg, S. Hakomori | refinement of methylation analysis for
determination of glycan linkages | 47 |
| 1966–1976 | R. Schauer | multiple modifications of sialic acids in
nature, their biosynthesis, and degradation | 14 |
| 1967 | L. Rodén, L.-Å.
Fransson | semonstration of a copolymeric structure
for dermatan sulfate | 16 |
| 1967 | R.D. Marshall | N-glycosylation occurs only at asparagine
residues in the sequence motif Asn-X-Ser/Thr | 8 |
| 1968 | J.A. Cifonelli | description of the domain structure of
heparan sulfate | 16 |
| 1968 | R.L. Hill, K. Brew | α-lactalbumin as a modifier of
galactosyltransferase specificity | 5 |
| 1969 | L. Warren, M.C. Glick, P.W. Robbins | increased size of N-glycans in malignantly
transformed cells | 8, 44 |
| 1969 | R.J. Winzler | structures of O-glycans from erythrocyte
membranes | 9 |
| 1969–1974 | V.C. Hascall, S.W. Sajdera, H. Muir, D.
Heinegård, T. Hardingham | hyaluronan-proteoglycan interactions in
cartilage | 16, 17 |
| 1969 | H. Tuppy, P. Meindl | synthesis of 2-deoxy-2,3-didehydro-Neu5Ac
as viral sialidase inhibitor | 14 |
| 1968–1970 | E. Neufeld | identification of lysosomal enzyme
deficiencies in the mucopolysaccharidoses | 41 |
| 1969 | G. Ashwell, A. Morell | glycans can control the lifetime of
glycoproteins in blood circulation | 26 |
| 1970 | K.O. Lloyd, J. Porath, I.J. Goldstein | use of lectins for affinity purification
of glycoproteins | 45 |
| 1971–1973 | L.F. Leloir | dolichylphosphosugars are intermediates in
protein N-glycosylation | 4, 8 |
| 1971–1975 | P. Kraemer, J.E. Silbert | heparan sulfate as a common constituent of
vertebrate cell surfaces | 16 |
| 1971–1980 | B. Toole | hyaluronan in differentiation,
morphogenesis, and development | 15 |
| 1972–1982 | S. Hakomori | lacto- and globo-series glycosphingolipids
as developmentally regulated and tumor-associated antigens | 10, 44 |
| 1972 | J.F.G. Vliegenthart | high-field proton NMR spectroscopy for
structural analysis of glycans | 2 |
| 1973 | W.E. van Heyningen | glycosphingolipids are receptors for
bacterial toxins | 39 |
| 1973 | J. Montreuil, R.G. Spiro, R. Kornfeld | a common pentasaccharide core structure of
all N-glycans | 8 |
| 1974 | C.E. Ballou | structure of yeast mannans and generation
of yeast mannan mutants | 8, 46 |
| 1975 | V.I. Teichberg | the first galectin | 33 |
| 1975 | V.T. Marchesi | primary structure of glycophorin, the
first known transmembrane glycoprotein | 3, 8, 9 |
| 1975–1980 | A. Kobata | N- and O-glycan structural elucidation
using multiple convergent techniques | 2, 8, 9 |
| 1975–1980 | P. Stanley, S. Kornfeld, R.C. Hughes | lectin-resistant cell lines with
glycosylation defects | 46 |
| 1977 | W.J. Lennarz | Asn-X-Ser/Thr necessary and sufficient for
lipid-mediated N-glycosylation | 8 |
| 1977 | I. Ofek, D. Mirelman, N. Sharon | cell-surface glycans as attachment sites
for infectious bacteria | 39 |
| 1977–1978 | S. Kornfeld, P.W. Robbins | biosynthesis and processing of
intermediates of N-glycans in protein glycosylation | 8 |
| 1977 | R.L. Hill, R. Barker | first purification of a
glycosyltransferase involved in protein glycosylation | 5, 8 |
| 1978 | C. Svanborg | glycosphingolipids as receptors for
bacterial adhesion | 10, 39 |
| 1979–1982 | E. Neufeld, S. Kornfeld, K. Von Figura, W.
Sly | the mannose-6-phosphate pathway for
lysosomal enzyme trafficking | 30 |
| 1980–1983 | F.A. Troy, J. Finne, S. Inoue, Y. Inoue | structure of polysialic acids in bacteria
and vertebrates | 14 |
| 1980 | H. Schachter | role of glycosyltransferases in N- and
O-glycan branching | 5, 8 |
| 1980–1982 | V.N. Reinhold, A. Dell, A.L. Burlingame | mass spectrometry for structural analysis
of glycans | 47, 48 |
| 1980–1985 | S. Hakomori, Y. Nagai | glycosphingolipids as modulators of
transmembrane signaling | 10 |
| 1981–1985 | M.J. Ferguson, I. Silman, M. Low | structural definition of
glycosylphosphatidylinositol (GPI) anchors | 11 |
| 1982 | U. Lindahl, R.D. Rosenberg | specific sulfated heparin pentasaccharide
sequence recognized by antithrombin | 16, 35 |
| 1982 | C. Hirschberg, R. Fleischer | transport of sugar nucleotides into the
Golgi apparatus | 3, 14 |
| 1984 | G. Hart | intracellular protein glycosylation by
O-GlcNAc | 18 |
| 1984 | J. Jaeken | description of
“carbohydrate-deficient glycoprotein
syndromes” | 42 |
| 1985 | M. Klagsbrun, D. Gospodarowicz | discovery of heparin–FGF
interactions | 35 |
| 1986 | W.J. Whelan | glycogen is a glycoprotein synthesized on
a glycogenin primer | 17 |
| 1986 | J.U. Baenziger | structures of sulfated N-glycans of
pituitary hormones | 13, 28 |
| 1986 | Y. Inoue, S. Inoue | discovery of 2-keto-3-deoxynononic acid
(Kdn) in rainbow trout eggs | 14 |
| 1986 | P.K. Qasba, J. Shaper, N. Shaper | cloning of first animal
glycosyltransferase | 5 |
| 1987 | Y-C. Lee | high-performance anion-exchange
chromatography of oligosaccharides with pulsed amperometric
detection (HPAEC-PAD) | 47 |
TABLE 1.2
| Occurrence |
| All cells in nature are covered with a
dense and complex array of sugar chains (glycans). |
| The cell walls of bacteria and archea are
composed of several classes of glycans and glycoconjugates. |
| Most secreted proteins of eukaryotes carry
large amounts of covalently attached glycans. |
| In eukaryotes, these cell-surface and
secreted glycans are mostly assembled via the ER-Golgi pathway. |
| The extracellular matrix of eukaryotes is
also rich in such secreted glycans. |
| Cytosolic and nuclear glycans are common
in eukaryotes. |
| For topological, evolutionary, and
biophysical reasons, there is little similarity between
cell-surface/secreted and nuclear/cytosolic glycans. |
| Chemistry and structure |
| Glycosidic linkages can be in
α- or β-linkage forms, which are
biologically recognized as completely distinct. |
| Glycan chains can be linear or
branched. |
| Glycans can be modified by a variety of
different substituents, such as acetylation and sulfation. |
| Complete sequencing of glycans is feasible
but usually requires combinatorial or iterative methods. |
| Modern methods allow in vitro
chemoenzymatic synthesis of both simple and complex glycans. |
| Biosynthesis |
| The final products of the genome are
posttranslationally modified proteins, with glycosylation being the
most common and versatile of these modifications. |
| The primary units of glycans
(monosaccharides) can be synthesized within a cell or salvaged from
the environment. |
| Monosaccharides are activated into
nucleotide sugars or lipid-linked sugars before they are used as
donors for glycan synthesis. |
| Whereas lipid-linked sugar donors can be
flipped across membranes, nucleotide sugars must be transported into
the lumen of the ER-Golgi pathway. |
| Each linkage unit of a glycan or
glycoconjugate is assembled by one or more unique
glycosyltransferases. |
| Many glycosyltransferases are members of
multigene families with related functions. |
| Most glycosyltransferases recognize only
the underlying glycan of their acceptor, but some are protein or
lipid specific. |
| Many biosynthetic enzymes
(glycosyltransferases, glycosidases, sulfotransferases, etc.) are
expressed in a tissue-specific, temporally regulated manner. |
| Diversity |
| Monosaccharides generate much greater
combinatorial diversity than nucleotides or amino acids. |
| Further diversity arises from covalent
modifications of glycans. |
| Glycosylation introduces a marked
diversity in proteins. |
| Only a limited subset of the potential
diversity is found in a given organism or cell type. |
| Intrinsic diversity (microheterogeniety)
of glycoprotein glycans within a cell type or even a single
glycosylation site. |
| The total expressed glycan repertoire
(glycome) of a given cell type or organism is thus much more complex
than the genome or proteome. |
| The glycome of a given cell type or
organism is also dynamic, changing in response to intrinsic and
extrinsic signals. |
| Glycome differences in cell type, space,
and time generate biological diversity and can help to explain why
only a limited number of genes are expressed from the typical
genome. |
| Recognition |
| Glycans are recognized by specific
glycan-binding proteins that are intrinsic to an organism. |
| Glycans are also recognized by many
extrinsic glycan-binding proteins of pathogens and symbionts. |
| Glycan-binding proteins fall in two
general categories: those that can usually be grouped by shared
evolutionary origins and/or similarity in structural folds (lectins)
and those that emerged by convergent evolution from different
ancestors (e.g., GAG-binding proteins). |
| Lectins often show a high degree of
specificity for binding to specific glycan structures, but they
typically have relatively low affinities for single-site
binding. |
| Thus, biologically relevant lectin
recognition often requires multivalency of both the glycan and
glycan-binding protein, to generate high avidity of binding. |
| Genetics |
| Naturally occurring genetic defects in
glycans seem to be relatively rare in intact organisms. However,
this apparent rarity may be due to a failure of detection, caused by
unpredictable or pleiotropic phenotypes. |
| Genetic defects in cell-surface/secreted
glycans are easily obtained in cultured cells but have somewhat
limited biological consequences. |
| The same mutations typically have major
phenotypic consequences in intact multicellular organisms. |
| Thus, many of the major roles of glycans
likely involve cell–cell or extracellular
interactions. |
| Nuclear/cytosolic glycans may have more
cell-intrinsic roles, e.g., in signaling. |
| Complete elimination of major glycan
classes generally causes early developmental lethality. |
| Organisms bearing tissue-specific
alteration of glycans often survive, but they exhibit both
cell-autonomous and distal biological effects. |
| Biological roles |
| Biological roles for glycans span the
spectrum from nonessential activities to those that are crucial for
the development, function, and survival of an organism. |
| Many theories regarding the biological
roles of glycans appear to be correct, but exceptions occur. |
| Glycans can have different roles in
different tissues or at different times in development. |
| Terminal sequences, unusual glycans, and
modifications are more likely to mediate specific biological
roles. |
| However, terminal sequences, unusual
glycans, or modifications may also reflect evolutionary interactions
with microorganisms and other noxious agents. |
| Thus, a priori prediction of the functions
of a specific glycan or its relative importance to the organism is
difficult. |
| Evolution |
| Relatively little is known about glycan
evolution. |
| Interspecies and intraspecies variations
in glycan structure are relatively common, suggesting rapid
evolution. |
| The dominant mechanism for such evolution
is likely the ongoing selection pressure by pathogens that recognize
glycans. |
| However, glycan evolution must also
preserve and/or elaborate critical intrinsic functions. |
| Interplay between pathogen selection
pressure and preservation of intrinsic roles could result in the
formation of “junk” glycans. |
| Such “junk”
glycans could be the substrate from which new intrinsic functions
arise during evolution. |
As mentioned above, glycobiology had its early origins in the fields of carbohydrate
chemistry and biochemistry, and has only recently emerged as a major aspect of
molecular and cellular biology and physiology. Some of the major investigators and
important discoveries that influenced the development of the field are presented in
and
Table 1.1. As with any such attempt, a comprehensive
list is impossible, but details regarding some of these discoveries can be found in
other chapters of this volume. A summary of the general principles gained from this
research is presented in
Table 1.2.
MONOSACCHARIDES ARE THE BASIC STRUCTURAL UNITS OF GLYCANS
FIGURE 1.2
.
Open-chain and ring forms of glucose. Changes in the orientation of hydroxyl
groups around specific carbon atoms generate new molecules that have a
distinct biology and biochemistry (e.g., galactose is the C-4 epimer of
glucose).
Carbohydrates are defined as polyhydroxyaldehydes, polyhydroxyketones and their
simple derivatives, or larger compounds that can be hydrolyzed into such units. A
monosaccharide is a carbohydrate that cannot be hydrolyzed into a simpler form. It
has a potential carbonyl group at the end of the carbon chain (an aldehyde group) or
at an inner carbon (a ketone group). These two types of monosaccharides are
therefore named aldoses and ketoses, respectively (for examples, see below and for
more details, see
Chapter 2). Free
monosaccharides can exist in open-chain or ring forms (). Ring forms of the monosaccharides are the
rule in oligosaccharides, which are linear or branched chains of monosaccharides
attached to one another via glycosidic linkages (the term
“polysaccharide” is typically reserved for large glycans
composed of repeating oligosaccharide motifs). The generic term
“glycan” is used throughout this book to refer to any form
of mono-, oligo-, or polysaccharide, either free or covalently attached to another
molecule.
The ring form of a monosaccharide generates a chiral anomeric center at C-1 for aldo
sugars or at C-2 for keto sugars (for details, see Chapter 2). A glycosidic linkage involves the attachment of a
monosaccharide to another residue, typically via the hydroxyl group of this anomeric
center, generating α linkages or β linkages that are defined
based on the relationship of the glycosidic oxygen to the anomeric carbon and ring
(see Chapter 2). It is important to
realize that these two linkage types confer very different structural properties and
biological functions upon sequences that are otherwise identical in composition, as
classically illustrated by the marked differences between starch and cellulose (both
homopolymers of glucose), the former largely α1-4 linked and the latter
β1-4 linked throughout. A glycoconjugate is a compound in which one or
more monosaccharide or oligosaccharide units (the glycone) are covalently linked to
a noncarbohydrate moiety (the aglycone). An oligosaccharide that is not attached to
an aglycone possesses the reducing power of the aldehyde or ketone in its terminal
monosaccharide component. This end of a sugar chain is therefore often called the
reducing terminus or reducing end, terms that tend to be used even when the sugar
chain is attached to an aglycone and has thus lost its reducing power.
Correspondingly, the outer end of the chain tends to be called the nonreducing end
(note the analogy to the 5′ and 3′ ends of nucleotide chains
or the amino and carboxyl termini of polypeptides).
GLYCANS CAN CONSTITUTE A MAJOR PORTION OF THE MASS OF A GLYCOCONJUGATE
FIGURE 1.3
.
Schematic representation of the Thy-1 glycoprotein including the three
N-glycans (blue) and a glycosylphosphatidylinositol
(GPI-glycan, green) lipid anchor whose acyl chains
(yellow) would normally be embedded in the membrane
bilayer. Note that the polypeptide (purple) represents only
a relatively small portion of the total mass of the protein. (Original art
courtesy of Mark Wormald and Raymond Dwek, Oxford Glycobiology
Institute.)
FIGURE 1.4
.
Historical electron micrograph of endothelial cells from a blood capillary in
the diaphragm muscle of a rat, showing the lumenal cell membrane of the
cells (facing the blood) decorated with particles of cationized ferritin
(arrowheads). These particles are binding to acidic
residues (sialic-acid-containing glycans and sulfated glycosaminoglycans)
contained in the cell-surface glycocalyx. Note that the particles are
several layers deep, indicating the remarkable thickness of this layer of
glycoconjugates. (Courtesy of George E. Palade.)
In naturally occurring glycoconjugates, the portion of the molecule comprising the
glycans can vary greatly in its contribution to the overall size, from being very
minor in amount to being the dominant component or even almost the exclusive one. In
many cases, the glycans comprise a substantial portion of the mass of
glycoconjugates (for a typical example, see ). For this reason, the surfaces of all types of cells in nature
(which are heavily decorated with different kinds of glycoconjugates) are
effectively covered with a dense array of sugars, giving rise to the so-called
glycocalyx. This cell-surface structure was first observed many years ago by
electron microscopists as an anionic layer external to the cell surface membrane,
which could be decorated with polycationic reagents such as cationized ferritin (for
a historical example, see ).
MONOSACCHARIDES CAN BE LINKED TOGETHER IN MANY MORE WAYS THAN AMINO ACIDS OR
NUCLEOTIDES
Nucleotides and proteins are linear polymers that can each contain only one basic
type of linkage between monomers. In contrast, each monosaccharide can theoretically
generate either an α or a β linkage to any one of several
positions on another monosaccharide in a chain or to another type of molecule. Thus,
it has been pointed out that although three different nucleotides or amino acids can
only generate six trimers, three different hexoses could produce (depending on which
of their forms are considered) anywhere from 1,056 to 27,648 unique trisaccharides.
This difference in complexity becomes even greater as the number of monosaccharide
units in the glycan increases. For example, a hexasaccharide with six different
hexoses could have more than 1 trillion possible combinations. Thus, an almost
unimaginable number of possible saccharide units could be theoretically present in
biological systems. Fortunately for the student of glycobiology, naturally occurring
biological macromolecules are so far known to contain relatively few of the possible
monosaccharide units, in a limited number of combinations. However, the great
majority of glycans in nature have yet to be discovered and structurally
defined.
COMMON MONOSACCHARIDE UNITS OF GLYCOCONJUGATES
Although several hundred distinct monosaccharides are known to occur in nature, only
a small number of these are commonly found in animal glycans. They are listed below,
along with their standard abbreviations (for details regarding their structures, see
Chapter 2).
-
Pentoses: Five-carbon neutral sugars, e.g., D-xylose
(Xyl)
-
Hexoses: Six-carbon neutral sugars, e.g., D-glucose (Glc),
D-galactose (Gal), and D-mannose (Man).
-
Hexosamines: Hexoses with an amino group at the 2-position,
which can be either free or, more commonly, N-acetylated,
e.g., N-acetyl-D-glucosamine (GlcNAc) and
N-acetyl-D-galactosamine (GalNAc).
-
Deoxyhexoses: Six-carbon neutral sugars without the
hydroxyl group at the 6-position (e.g., L-fucose
[Fuc]).
-
Uronic acids: Hexoses with a negatively charged carboxylate
at the 6-position, e.g., D-glucuronic acid (GlcA) and L-iduronic acid
(IdoA).
-
Sialic acids: Family of nine-carbon acidic sugars (generic
abbreviation is Sia), of which the most common is
N-acetylneuraminic acid (Neu5Ac, also sometimes called
NeuAc or historically, NANA) (for details, see Chapter 14).
For the sake of simplicity, the symbols D- and L- are omitted from the full names of
monosaccharides from here on, and only the symbol L- will be used when appropriate
(e.g., L-fucose or L-iduronic acid).
This limited set of monosaccharides dominates the glycobiology of more recently
evolved (so-called “higher”) animals, but several others
have been found in “lower” animals (e.g., tyvelose; see
Chapters 23 and 24), bacteria (e.g.,
keto-deoxyoctulosonic acid, rhamnose, L-arabinose, and muramic acid; see Chapter 20), and plants (e.g., arabinose,
apiose, and galacturonic acid; see Chapter
22). A variety of modifications of glycans enhance their diversity in nature
and often serve to mediate specific biological functions. Thus, the hydroxyl groups
of different monosaccharides can be subject to phosphorylation, sulfation,
methylation, O-acetylation, or fatty acylation. Although amino groups are commonly
N-acetylated, they can be N-sulfated or remain unsubstituted. Carboxyl groups are
occasionally subject to lactonization to nearby hydroxyl groups or even
lactamization to nearby amino groups.
FIGURE 1.5
.
Recommended symbols and conventions for drawing glycan structures.
(Top panel) The monosaccharide symbol set from the
first edition of Essentials of Glycobiology is modified to
avoid using the same shape or color, but with different orientation to
represent different sugars. Each monosaccharide class (e.g., hexose) now has
the same shape, and isomers are differentiated by
color/black/white/shading. The same shading/color is
used for different monosaccharides of the same stereochemical designation,
e.g., Gal, GalNAc, and GalA. To minimize variations, sialic acids and uronic
acids are in the same shape, and only the major uronic and sialic acid types
are represented. When the type of sialic acid is uncertain, the abbreviation
Sia can be used instead. Only common monosaccharides in vertebrate systems
are assigned specific symbols. All other monosaccharides are represented by
an open hexagon or defined in the figure legend. If there is more than one
type of undesignated monosaccharide in a figure, a letter designation can be
included to differentiate between them. Unless otherwise indicated, all of
these vertebrate monosaccharides are assumed to be in the D configuration
(except for fucose and iduronic acid, which are in the L configuration), all
glycosidically linked monosaccharides are assumed to be in the pyranose
form, and all glycosidic linkages are assumed to originate from the
1-position (except for the sialic acids, which are linked from the
2-position). Anomeric notation and destination linkages can be indicated
without spacing/dashes. Although color is useful, these representations will
survive black-and-white printing or photocopying with the colors represented
in different shades (the color values in the figure are the RGB triplet
color settings)*. Modifications of monosaccharides are indicated by
lowercase letters, with numbers indicating linkage positions, if known
(e.g., 9Ac for the 9-O-acetyl group, 3S for the 3-O-sulfate group, 6P for a
6-O-phosphate group, 8Me for the 8-O-methyl group, 9Acy for the 9-O-acyl
group, and 9Lt for the 9-O-lactyl group). Esters and ethers are shown
attached to the symbol with a number. For N-substituted groups, it is
assumed that only one amino group is on the monosaccharide with an already
known position (e.g., NS for an N-sulfate group on glucosamine, assumed to
be at the 2-position). (Middle panel) Typical branched
“biantennary” N-glycan with two types of outer
termini, depicted at different levels of structural details. (Bottom
panel) Some typical glycosaminoglycan (GAG) chains.
*Note: To reproduce precise colors in RGB format, use the following triplet
values: Galactose stereochemistry: Yellow (255,255,0); Glucose
stereochemistry: BLUE (0,0,250); Mannose stereochemistry: GREEN (0,200,50);
Fucose: RED (250,0,0); Xylose: ORANGE (250,100,0); Neu5Ac: PURPLE
(125,0,125); Neu5Gc: LIGHT BLUE (200,250,250); KDN: GREEN (0,200,50); GlcA:
BLUE (0,0,250); IdoA: TAN (150,100,50); GalA: Yellow (255,255,0); ManA:
GREEN (0,200,50).
Details regarding the structural depiction of monosaccharides, linkages, and
oligosaccharides are discussed in
Chapter
2. Many figures in this volume use a simplified style of depiction of sugar
chains (see ). This figure (also
reproduced on the inside front cover) uses a monosaccharide symbol set modified from
the first edition of
Essentials of Glycobiology, which has also
been adopted by several other groups interested in presenting databases of
structures (e.g., the Consortium for Functional Glycomics).
MAJOR CLASSES OF GLYCOCONJUGATES AND GLYCANS
FIGURE 1.6
.
Common classes of animal glycans. (Modified from Varki A. 1997. FASEB
J.
11: 248–255; Fuster M. and Esko J.D. 2005.
Nat. Rev. Can.
7: 526–542.)
FIGURE 1.7
.
Glycan-protein linkages reported in nature. (Updated and redrawn, with
permission of Oxford University Press, from Spiro R.G. 2002.
Glycobiology.
12: 43R–56R.)
Diagrammatic representation of the five distinct types of sugar-peptide
bonds that have been identified in nature, to date. Monosaccharide
abbreviations are as in .
In addition, (Ara) arabinose; (Rha) rhamnose; (FucNAc)
N-acetylfucosamine (2-acetamido-2,6-dideoxy-D-galactose);
(Bac) bacillosamine (2,4-diamino-2,4,6-trideoxy-D-glucose); (Pse)
pseudaminic acid
(5,7-diacetamido-3,5,7,9-tetradeoxy-L-glycero-L-manno-nonulosonic acid);
(Hyl) hydroxylysine; (Hyp) hydroxyproline; (
C-term)
carboxy-terminal amino acid residue. Glypiation is the process by which a
glycosylphosphatidylinositol (GPI) anchor is added to a protein. For other
details, including anomeric linkages, please see the Spiro (2002) review
cited above.
The common classes of glycans found in or on eukaryotic cells are primarily defined
according to the nature of the linkage to the aglycone (protein or lipid) (see and ). A glycoprotein is a glycoconjugate in which a
protein carries one or more glycans covalently attached to a polypeptide backbone,
usually via N or O linkages. An N-glycan (N-linked oligosaccharide, N-(Asn)-linked
oligosaccharide) is a sugar chain covalently linked to an asparagine residue of a
polypeptide chain, commonly involving a GlcNAc residue and the consensus peptide
sequence: Asn-X-Ser/Thr. N-Glycans share a common pentasaccharide core region and
can be generally divided into three main classes: oligomannose (or high-mannose)
type, complex type, and hybrid type (see
Chapter 8). An O-glycan (O-linked oligosaccharide) is frequently linked
to the polypeptide via
N-acetylgalactosamine (GalNAc) to a hydroxyl
group of a serine or threonine residue and can be extended into a variety of
different structural core classes (see
Chapter
9). A mucin is a large glycoprotein that carries many O-glycans that are
clustered (closely spaced). Several other types of O-glycans also exist (e.g.,
attached to proteins via O-linked mannose). A proteoglycan is a glycoconjugate that
has one or more glycosaminoglycan (GAG) chains (see definition below) attached to a
“core protein” through a typical core region ending in a
xylose residue that is linked to the hydroxyl group of a serine residue. The
distinction between a proteoglycan and a glycoprotein is otherwise arbitrary,
because some proteoglycan polypeptides can carry both glycosaminoglycan chains and
different O- and N-glycans (see
Chapter
16). provides a listing
of known glycan-protein linkages in nature.
A glycophosphatidylinositol anchor is a glycan bridge between phosphatidylinositol
and a phosphoethanolamine that is in amide linkage to the carboxyl terminus of a
protein. This structure typically constitutes the only anchor to the lipid bilayer
membrane for such proteins (see Chapter
11). A glycosphingolipid (often called a glycolipid) consists of a glycan
usually attached via glucose or galactose to the terminal primary hydroxyl group of
the lipid moiety ceramide, which is composed of a long chain base (sphingosine) and
a fatty acid (see Chapter 10).
Glycolipids can be neutral or anionic. A ganglioside is an anionic glycolipid
containing one or more residues of sialic acid. It should be noted that these
represent only the most common classes of glycans reported in eukaryotic cells.
There are several other less common types found on one or the other side of the cell
membrane in animal cells (see Chapters
12 and 17).
Although different glycan classes have unique core regions by which they are
distinguished, certain outer structural sequences are often shared among different
classes of glycans. For example, N- and O-glycans and glycosphingolipids frequently
carry the subterminal disaccharide Galβ1-4GlcNAcβ1-
(N-acetyllactosamine or LacNAc) or, less commonly,
GalNAcβ1-4GlcNAcβ1- (LacdiNAc) units. The LacNAc units can
sometimes be repeated, giving extended poly-N-acetyllactosamines
(sometimes incorrectly called “poly-lactosamines”). Less
commonly, the LacdiNAc motif can also be repeated (termed polyLacdiNAc). Outer
LacNAc units can be modified by fucosylation or by branching and are typically
capped by sialic acids or, less commonly, by sulfate, Fuc, α-Gal,
β-GalNAc, or β-GlcA units (see Chapters 13 and 14). In contrast, glycosaminoglycans are linear copolymers of acidic
disaccharide repeating units, each containing a hexosamine (GlcN or GalN) and a
hexose (Gal) or hexuronic acid (GlcA or IdoA) (see Chapter 16). The type of disaccharide unit defines the
glycosaminoglycan as chondroitin or dermatan sulfate
(GalNAcβ1-4GlcA/IdoA), heparin or heparan sulfate
(GlcNAcα1-4GlcA/IdoA), or keratan sulfate (Galβ1-4GlcNAc).
Keratan sulfate is actually a 6-O-sulfated form of
poly-N-acetyllactosamine attached to an N- or O-glycan core, rather
than to a typical Xyl-Ser-containing proteoglycan linkage region. Another type of
glycosaminoglycan, hyaluronan (a polymer of GlcNAcβ1-4GlcA), appears to
exist primarily as a free sugar chain unattached to any aglycone (Chapter 15). The glycosaminoglycans
(except for hyaluronan) also typically have sulfate esters substituting either amino
or hydroyxl groups (i.e., N- or O-sulfate groups). Another anionic polysaccharide
that can be extended from LacNAc units is polysialic acid, a homopolymer of sialic
acid that is selectively expressed only on a few proteins in vertebrates. Polysialic
acids are also found as the capsular polysaccharides of certain pathogenic bacteria
(Chapter 14).
GLYCAN STRUCTURES ARE NOT ENCODED DIRECTLY IN THE GENOME
It is important to reemphasize that unlike protein sequences, which are primary gene
products, glycan chain structures are not encoded directly in the genome and are
secondary gene products. A few percent of known genes in the human genome are
dedicated to producing the enzymes and transporters responsible for the biosynthesis
and assembly of glycan chains (see Chapter
7), typically as posttranslational modifications of proteins or by
glycosylation of core lipids. The glycan chains themselves represent numerous
combinatorial possibilities, generated by a variety of competing and sequentially
acting glycosidases and glycosyltransferases (see Chapter 5) and the subcompartmentalized
“assembly-line” mechanisms of glycan biosynthesis in the
Golgi apparatus (see Chapter 3). Thus,
even with full knowledge of the expression levels of all relevant gene products, we
do not understand enough about the structures and pathways to predict the precise
structures of glycans elaborated by a given cell type. Furthermore, small changes in
environmental cues can cause dramatic changes in glycans produced by a given cell.
It is this variable and dynamic nature of glycosylation that makes it a powerful way
to generate biological diversity and complexity. Of course, it also makes glycans
more difficult to study than nucleic acids and proteins.
SITE-SPECIFIC STRUCTURAL DIVERSITY IN PROTEIN GLYCOSYLATION
One of the most fascinating and yet frustrating aspects of protein glycosylation is
the phenomenon of microheterogeneity. This term indicates that at any given glycan
attachment site on a given protein synthesized by a particular cell type, a range of
variations can be found in the structures of the attached glycan chain. The extent
of this microheterogeneity can vary considerably from one glycosylation site to
another, from glycoprotein to glycoprotein, and from cell type to cell type. Thus, a
given protein originally encoded by a single gene can exist in numerous
“glycoforms,” each effectively a distinct molecular species.
Mechanistically, microheterogeneity might be explained by the rapidity with which
multiple, sequential, partially competitive glycosylation and deglycosylation
reactions must take place in the endoplasmic reticulum (ER) and Golgi apparatus,
through which a newly synthesized glycoprotein passes (see Chapter 3). An alternate possibility is
that each individual cell or cell type is in fact exquisitely specific in the
details of the glycosylation that it produces, but that intercellular variations
result in the observed microheterogeneity of samples from natural multicellular
sources. Whatever the origin of microheterogeneity, it accounts for the anomalous
behavior of glycoproteins in various analytical/separation techniques (such as
sodium dodecyl sulfate–polyacrylamide gel electrophoresis
[SDS-PAGE], in which multiple or diffuse bands are observed)
and makes complete structural analysis of a glycoprotein a difficult task. From a
functional point of view, the biological significance of microheterogeneity remains
unclear. It is possible that this is a type of diversity generator, intended for
diversifying endogenous recognition functions and/or for evading microbes and
parasites, each of which can bind with high specificity only to certain glycan
structures (see Chapters 34 and 39).
CELL BIOLOGY OF GLYCOSYLATION
Most well-characterized pathways for the biosynthesis of major classes of glycans are
confined within the ER and Golgi compartments (see Chapter 3). Thus, for example, newly synthesized proteins
originating from the ER are either cotranslationally or posttranslationally modified
with sugar chains at various stages in their itinerary toward their final
destinations. The glycosylation reactions usually use activated forms of
monosaccharides (nucleotide sugars; see Chapter
4) as donors for reactions that are catalyzed by glycosyltransferases
(for details about their biochemistry, molecular genetics, and cell biology, see
Chapters 3, 5, and 7). In almost all cases, these nucleotide donors are
synthesized within the cytosolic or nuclear compartment from monosaccharide
precursors of endogenous or exogenous origin (see Chapter 4). To be available to perform the glycosylation
reactions, the donors must be actively transported across a membrane bilayer into
the lumen of the ER and Golgi compartments. Much effort has gone into understanding
the mechanisms of glycosylation within the ER and the Golgi apparatus, and it is
clear that a variety of factors determine the final outcome of glycosylation
reactions. Some bulky sugar chains are made on the cytoplasmic face of these
intracellular organelles and are flipped across their membranes to the other side,
but most are synthesized by adding one monosaccharide at a time to the growing
glycan chain on the inside of the ER or the Golgi. Regardless, the portion of a
glycoconjugate that faces the inside of these compartments will ultimately face the
inside of a secretory granule or lysosome and will be topologically unexposed to the
cytosol. The biosynthetic enzymes (glycosyltransferases, sulfotransferases, etc.)
responsible for catalyzing these reactions are well studied (see Chapter 5), and their location has helped
to define various functional compartments of the ER-Golgi pathway. A classical model
envisioned that these enzymes are physically lined up along this pathway in the
precise sequence in which they actually work. This appears to be an oversimplified
view, because there is considerable overlap in the distribution of these enzymes,
and the actual distribution of a given enzyme seems to depend on the cell type.
All of the topological considerations mentioned above are reversed with regard to
nuclear and cytoplasmic glycosylation, because the active sites of the relevant
glycosyltransferases face the cytosol, which is in direct communication with the
interior of the nucleus. Until the mid-1980s, the accepted dogma was that
glycoconjugates, such as glycoproteins and glycolipids, occurred exclusively on the
outer surface of cells, on the internal (luminal) surface of intracellular
organelles, and on secreted molecules. As discussed above, this was consistent with
knowledge of the topology of the biosynthesis of the classes of glycans known at the
time, which took place within the lumen of the ER-Golgi pathway. Thus, despite some
clues to the contrary, the cytosol and nucleus were assumed to be devoid of
glycosylation capacity. However, it is now clear that certain distinct types of
glycoconjugates are synthesized and reside within the cytosol and nucleus (see Chapter 17). Indeed, one of them, called
O-linked GlcNAc (see Chapter 18), may
well be numerically the most common type of glycoconjugate in many cell types. The
fact that this major form of glycosylation was missed by so many investigators for
so long serves to emphasize the relatively unexplored state of the whole field of
glycobiology.
FIGURE 1.8
.
Biosynthesis, use, and turnover of a common monosaccharide. This schematic
shows the biosynthesis, fate, and turnover of galactose, a common
monosaccharide constituent of animal glycans. Although small amounts of
galactose can be taken up from the outside of the cell, most cellular
galactose is either synthesized de novo from glucose or recycled from
degradation of glycoconjugates in the lysosome. The figure is a simplified
view of the generation of the UDP nucleotide sugar UDP-Gal, its equilibrium
state with UDP-glucose, and its uptake and use in the Golgi apparatus for
synthesis of new glycans. (Solid lines) Biochemical
pathways; (dashed lines) pathways for the trafficking of
membranes and glycans.
Like all components of living cells, glycans are constantly being degraded and the
enzymes that catalyze this process cleave sugar chains either at the outer
(nonreducing) terminal end (exoglycosidases) or internally (endoglycosidases) (see
Chapters 3 and
41). Some terminal monosaccharide units
such as sialic acids are sometimes removed and new units reattached during endosomal
recycling, without degradation of the underlying chain. The final complete
degradation of most glycans is generally performed by multiple glycosidases in the
lysosome. Once broken down, their individual unit monosaccharides are then typically
exported from the lysosome into the cytosol so that they can be reused (see ). In contrast to the relatively
slow turnover of glycans derived from the ER-Golgi pathway, the O-GlcNAc
monosaccharide modifications of the nucleus and cytoplasm may be more dynamic and
rapidly turned over (see
Chapter 18).
TOOLS USED TO STUDY GLYCOSYLATION
Unlike oligonucleotides and proteins, glycans are not commonly expressed in a linear,
unbranched fashion. Even when they are found as linear macromolecules (e.g., GAGs),
they often contain a variety of substituents, such as sulfate groups. Thus, the
complete sequencing of glycans is practically impossible to accomplish by a single
method and requires iterative combinations of physical, chemical, and enzymatic
approaches that together yield the details of the structure under study (for a
discussion of the various forms of low- and high-resolution separation and analysis,
including mass spectrometry and NMR, see Chapter 47). Less detailed information on structure may be sufficient to
explore the biology of some glycans and can be obtained by simple techniques, such
as the use of enzymes (endoglycosidases and exoglycosidases), lectins, and other
glycan-binding proteins (see Chapters
45 and 47), chemical
modification or cleavage, metabolic radioactive labeling, antibodies, or cloned
glycosyltransferases (Chapter 49).
Glycosylation can also be perturbed in a variety of ways, for example, by
glycosylation inhibitors and primers (Chapter
50) and by genetic manipulation of glycosylation in intact cells and
organisms (Chapter 46). The directed in
vitro synthesis of glycans by chemical and enzymatic methods has also taken great
strides in recent years, providing many new tools for exploring glycobiology (Chapters 49 and 51). The generation of complex glycan
libraries by a variety of routes has further enhanced this interface of chemistry
and biology (Chapter 49).
GLYCOMICS
Analogous to genomics and proteomics, glycomics represents the systematic
methodological elucidation of the “glycome” (the totality of
glycan structures) of a given cell type or organism (see Chapter 48). In reality, the glycome is far more complex than
the genome or proteome. In addition to the vastly greater structural diversity in
glycans, one is faced with the complexities of glycosylation microheterogeneity (see
above) and the dynamic changes that occur in the course of development,
differentiation, metabolic changes, malignancy, inflammation, or infection. Added
diversity arises from intraspecies and interspecies variations in glycosylation.
Thus, a given cell type in a given species can manifest a large number of possible
glycome states. Glycomic analysis today generally consists of extracting entire cell
types, organs, or organisms; releasing all the glycan chains from their linkages;
and cataloging them via approaches such as mass spectrometry. In a variation called
glycoproteomics, the glycans are analyzed while still attached to protease-generated
fragments of glycoproteins. The results obtained represent a spectacular improvement
over what was possible a few decades ago, but they still constitute an effort
analogous to cutting down all the trees in a forest and cataloging them, without
attention to the layout of the forest and the landscape. This type of glycomic
analysis needs to be complemented by classical methods such as tissue-section
staining or flow cytometry, using lectins or glycan-specific antibodies that aid in
understanding the glycome by taking into account the heterogeneity of glycosylation
at the level of the different cell types and subcellular domains in the tissue under
study. This is even more important because of the common observation that removing
cells from their normal milieu and placing them into tissue culture can result in
major changes in the glycosylation machinery of the cell. However, such classical
approaches suffer from poor quantitation and relative insensitivity to structural
details. A combination of the two approaches is now potentially feasible via
laser-capture microdissection of specific cell types directly from tissue sections,
with the resulting samples being studied by mass spectrometry.
Because most of the genes involved in glycan biosynthetic pathways have been cloned
from multiple organisms, it is possible today to obtain an indirect genomic and
transcriptomic view of the glycome in a specific cell type (see Chapter 7). However, given the relatively
poor correlation between mRNA and protein levels, and the complex assembly line and
competitive nature of the cellular Golgi glycosylation pathways, even complete
knowledge of the mRNA expression patterns of all relevant genes in a given cell
cannot allow accurate prediction of the distribution and structures of glycans in
that cell type. In other words, there is as yet no reliable indirect route toward
elucidating the glycome, other than by actual structural analysis using an array of
methods.
GLYCOSYLATION DEFECTS IN ORGANISMS AND CULTURED CELLS
Many mutant variants of cultured cell lines with altered glycan structures and
specific glycan biosynthetic defects have been described, the most common of which
are those that are lectin resistant (see Chapter 46). Indeed, with few exceptions, mutants with specific defects
at most steps of the major pathways of glycan biosynthesis have been found in
cultured animal cells. The use of such cell lines has been of great value in
elucidating the details of glycan biosynthetic pathways. Their existence implies
that many types of glycans are not crucial to the optimal growth of single cells
living in the sheltered and relatively unchanging environment of the culture dish.
Rather, most glycan structures must be more important in mediating
cell–cell and cell–matrix interactions in intact
multicellular organisms and/or interactions between organisms. In keeping with this
supposition, genetic defects completely eliminating major glycan classes in intact
animals all cause embryonic lethality (see Chapter 42 and Table 6.1). As
might be expected, naturally occurring viable animal mutants of this type tend to
have disease phenotypes of intermediate severity and show complex phenotypes
involving multiple systems. Less severe genetic alterations of outer chain
components of glycans tend to give viable organisms with more specific phenotypes
(see Chapter 42 and Table 6.1 ). Overall, there is much to be learned by
studying the consequences of natural or induced genetic defects in intact
multicellular organisms (see Chapter
42). It is interesting to note that, in the short time since the first
edition of this book, we have gone from asking “What is it that glycans
do anyway?” to having to explain a large number of complex and sometimes
nonviable glycosylation-modified phenotypes in humans, mice, flies, and other
organisms.
THE BIOLOGICAL ROLES OF GLYCANS ARE DIVERSE
A major theme of this volume is the exploration and elucidation of the biological
roles of glycans. Like any biological system, the optimal approach carefully
considers the relationship of structure and biosynthesis to function (see Chapter 6). As might be imagined from
their ubiquitous and complex nature, the biological roles of glycans are quite
varied. Indeed, asking what these roles are is akin to asking the same question
about proteins. Thus, all of the proposed theories regarding glycan function turn
out to be partly correct, and exceptions to each can also be found. Not surprisingly
for such a diverse group of molecules, the biological roles of glycans span the
spectrum from those that are subtle to those that are crucial for the development,
growth, function, or survival of an organism (for further discussion, see Chapter 6). The diverse functions ascribed
to glycans can be more simply divided into two general categories: (i) structural
and modulatory functions (involving the glycans themselves or their modulation of
the molecules to which they are attached) and (ii) specific recognition of glycans
by glycan-binding proteins. Of course, any given glycan can mediate one or both
types of functions. The binding proteins in turn fall into two broad groups: lectins
and sulfated GAG-binding proteins (see Chapter
26). Such molecules can be either intrinsic to the organism that
synthesized the cognate glycans (e.g., see Chapters 28, 29, 30, 31, 32, 33, and 35) or extrinsic (see Chapter 34 and 39 for
information concerning microbial proteins that bind to specific glycans on host
cells). The atomic details of these glycan-protein interactions have been elucidated
in many instances (see Chapter 27).
Although there are exceptions to this notion, the following general theme has
emerged regarding lectins: Monovalent binding tends to be of relatively low
affinity, although there are exceptions to this notion, and such systems typically
achieve their specificity and function by achieving high avidity, via interactions
of multivalent arrays of glycans with cognate lectin-binding sites (see Chapters 30 and 40).
GLYCOSYLATION CHANGES IN DEVELOPMENT, DIFFERENTIATION, AND MALIGNANCY
Whenever a new tool (e.g., an antibody or lectin) specific for a particular glycan is
developed and used to probe its expression in intact organisms, it is common to find
exquisitely specific temporal and spatial patterns of expression of that glycan in
relation to cellular activation, embryonic development, organogenesis, and
differentiation (see Chapter 38).
Certain relatively specific changes in expression of glycans are also often found in
the course of transformation and progression to malignancy (see Chapter 44), as well as other
pathological situations such as inflammation. These spatially and temporally
controlled patterns of glycan expression imply the involvement of glycans in many
normal and pathological processes, the precise mechanisms of which are understood in
only a few cases.
EVOLUTIONARY CONSIDERATIONS IN GLYCOBIOLOGY
Remarkably little is known about the evolution of glycosylation. There are clearly
shared and unique features of glycosylation in different kingdoms and taxa. Among
animals, there may be a trend toward increasing complexity of N- and O-glycans in
more recently evolved (“higher”) taxa. Intraspecies and
interspecies variations in glycosylation are also relatively common. It has been
suggested that the more specific biological roles of glycans are often mediated by
uncommon structures, unusual presentations of common structures, or further
modifications of the commonly occurring saccharides themselves. Such unusual
structures likely result from such unique expression patterns of the relevant
glycosyltransferases or other glycan-modifying enzymes. On the other hand, such
uncommon glycans can be targets for specific recognition by infectious
microorganisms and various toxins. Thus, at least a portion of the diversity in
glycan expression in nature must be related to the evolutionary selection pressures
generated by interspecies interactions (e.g., of host with pathogen or symbiont). In
other words, the two different classes of glycan recognition mentioned above
(mediated by intrinsic and extrinsic glycan-binding proteins) are in constant
competition with each other, with regard to a particular glycan target. The
specialized glycans expressed by parasites and microbes that are of great interest
from the biomedical point of view (see Chapters 20, 21, and 40) are themselves presumably subject to
evolutionary selection pressures. The evolutionary issues presented above are
further considered in Chapter 19, which
also discusses the limited information concerning how various glycan biosynthetic
pathways appear to have evolved and diverged in different life forms.
GLYCANS IN MEDICINE AND BIOTECHNOLOGY
Numerous natural bioactive molecules are glycoconjugates, and the attached glycans
can have dramatic effects on the biosynthesis, stability, action, and turnover of
these molecules in intact organisms. For example, heparin, a sulfated
glycosaminoglycan, and its derivatives are among the most commonly used drugs in the
world. For this and many other reasons, glycobiology and carbohydrate chemistry have
become increasingly important in modern biotechnology. Patenting a glycoprotein
drug, obtaining an FDA approval for its use, and monitoring its production all
require knowledge of the structure of its glycans. Moreover, glycoproteins, which
include monoclonal antibodies, enzymes, and hormones, are by now the major products
of the biotechnology industry, with sales in the tens of billions of dollars
annually, which continues to grow at an increasing rate. In addition, several human
disease states are characterized by changes in glycan biosynthesis that can be of
diagnostic and/or therapeutic significance. The emerging importance of glycobiology
in medicine and biotechnology is further considered in Chapters 43 and 51.
FURTHER READING
Sharon N. Carbohydrates.
Sci. Am.
1980; 243[5]: 90–116.
Rademacher TW, Parekh RB, Dwek RA.
Glycobiology.
Annu. Rev. Biochem.
1988; 57: 785–838.
[PubMed]
Sharon N, Lis H.
Carbohydrates in cell recognition.
Sci. Am.
1993; 268: 82–89.
[PubMed]
Varki A.
Biological roles of oligosaccharides: All of the theories are
correct.
Glycobiology.
1993; 3: 97–130.
[PubMed]
Cabezas JA. The origins of glycobiology.
Biochem. Edu.
1994; 22: 3–7.
Drickamer K, Taylor ME.
Evolving views of protein glycosylation.
Trends Biochem. Sci.
1998; 23: 321–324.
[PubMed]
Etzler ME. Oligosaccharide signaling of plant cells.
J. Cell Biochem. (suppl.).
1998; 30–31: 123–128.
Gagneux P, Varki A.
Evolutionary considerations in relating oligosaccharide
diversity to biological function.
Glycobiology.
1999; 9: 747–755.
[PubMed]
Esko JD, Lindahl U.
Molecular diversity of heparan sulfate.
J. Clin. Invest.
2001; 108: 169–173.
[PubMed]
Roseman S.
Reflections on glycobiology.
J. Biol. Chem.
2001; 276: 41527–41542.
[PubMed]
Hakomori S.
The glycosynapse. Inaugural article.
Proc. Natl. Acad. Sci.
2002; 99: 225–232.
[PubMed]
Spiro RG. Protein glycosylation: Nature, distribution, enzymatic
formation, and disease implications of glycopeptide bonds.
Glycobiology.
2002; 12: 43R–56R.
Haltiwanger RS, Lowe JB.
Role of glycosylation in development.
Annu. Rev. Biochem.
2004; 73: 491–537.
[PubMed]
Sharon N, Lis H. History of lectins: From hemagglutinins to biological
recognition molecules.
Glycobiology.
2004; 14: 53R–62R.
Toole BP. Hyaluronan: From extracellular glue to pericellular cue.
Nat. Rev. Can.
2004; 4: 528–539.
Drickamer K, Taylor ME.
Introduction to glycobiology. 2. Oxford University Press; Oxford: 2006.
Freeze HH.
Genetic defects in the human glycome.
Nat. Rev. Genet.
2006; 7: 537–551.
[PubMed]
Lutteke T, Bohne-Lang A, Loss A, Goetz T, Frank M, von der Lieth CW. GLYCO-SCIENCES.de: An Internet portal to support glycomics
and glycobiology research.
Glycobiology.
2006; 16: 71R–81R.
Ohtsubo K, Marth JD.
Glycosylation in cellular mechanisms of health and disease.
Cell.
2006; 126: 855–867.
[PubMed]
Patnaik SK, Stanley P.
Lectin-resistant CHO glycosylation mutants.
Methods Enzymol.
2006; 416: 159–182.
[PubMed]
Prescher JA, Bertozzi CR.
Chemical technologies for probing glycans.
Cell.
2006; 126: 851–854.
[PubMed]
van Die I, Cummings RD.
Glycans modulate immune responses in helminth infections and
allergy.
Chem Immunol Allergy.
2006; 90: 91–112.
[PubMed]
Varki A.
Nothing in glycobiology makes sense, except in the light of
evolution.
Cell.
2006; 126: 841–845.
[PubMed]
Bishop JR, Schuksz M, Esko JD.
Heparan sulphate proteoglycans fine-tune mammalian physiology.
Nature.
2007; 446: 1030–1037.
[PubMed]
Hart GW, Housley MP, Slawson C.
Cycling of O-linked
β-N-acetylglucosamine on nucleo-cytoplasmic
proteins.
Nature.
2007; 446: 1017–1022.
[PubMed]
Kamerling J, Boons G-J, Lee Y, Suzuki A, Taniguchi N, Voragen AGJ.
Comprehensive glycoscience. 1–4. Elsevier Science; London: 2007.
Copyright ©2009 by The Consortium of Glycobiology Editors, La Jolla,
California
ǀ