![]() | ![]() |
Formats:
|
||||||||||||||||||||||
Copyright © 2008 Yanay et al.; licensee BioMed Central Ltd. Evolution of insect proteomes: insights into synapse organization and synaptic vesicle life cycle 1Department of Biological Chemistry, Institute of Life Sciences, Givat Ram Campus, Hebrew University of Jerusalem, Jerusalem 91904, Israel Corresponding author.#Contributed equally. Chava Yanay: zchava/at/cc.huji.ac.il; Noa Morpurgo: noa76/at/pob.huji.ac.il; Michal Linial: michall/at/cc.huji.ac.il Received September 27, 2007; Revised December 6, 2007; Accepted February 7, 2008. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background The molecular components in synapses that are essential to the life cycle of synaptic vesicles are well characterized. Nonetheless, many aspects of synaptic processes, in particular how they relate to complex behaviour, remain elusive. The genomes of flies, mosquitoes, the honeybee and the beetle are now fully sequenced and span an evolutionary breadth of about 350 million years; this provides a unique opportunity to conduct a comparative genomics study of the synapse. Results We compiled a list of 120 gene prototypes that comprise the core of presynaptic structures in insects. Insects lack several scaffolding proteins in the active zone, such as bassoon and piccollo, and the most abundant protein in the mammalian synaptic vesicle, namely synaptophysin. The pattern of evolution of synaptic protein complexes is analyzed. According to this analysis, the components of presynaptic complexes as well as proteins that take part in organelle biogenesis are tightly coordinated. Most synaptic proteins are involved in rich protein interaction networks. Overall, the number of interacting proteins and the degrees of sequence conservation between human and insects are closely correlated. Such a correlation holds for exocytotic but not for endocytotic proteins. Conclusion This comparative study of human with insects sheds light on the composition and assembly of protein complexes in the synapse. Specifically, the nature of the protein interaction graphs differentiate exocytotic from endocytotic proteins and suggest unique evolutionary constraints for each set. General principles in the design of proteins of the presynaptic site can be inferred from a comparative study of human and insect genomes. Background The completion of the Drosophila malengaster genome in the year 2000 provided the first glimpse at the make-up of animals with a complex nervous system [1,2]. The availability of several genomes from insects, representing an evolutionary distance of 250 to 300 million years, provided a unique opportunity to evaluate the foundation of a functional synapse [3]. With many additional animal genomes now available, including those of primates, marsupials, fish and birds, a molecular correlation between genes and brain complexity is being actively sought [4,5]. Drosophila has been used for decades as a model in which to study synapse formation, embryogenesis, development, and neurogenesis [6]. A combination of biochemical, cell biologic, genetic, morphologic, and electrophysiologic studies have unravelled the molecular mechanisms of synaptic vesicle exocytosis and endocytosis in the fly [7,8] and compared these with the corresponding mechanisms in vertebrates [9]. In all neurons, communication across the synapse is mediated by neurotransmitter release from synaptic vesicles. Because the entire process may take only a fraction of a millisecond (in fast releasing synapses), additional processes ensure the priming, targeting, and docking of synaptic vesicles at the active zone [10]. Only the basic mechanism of vesicle fusion is shared between yeast and human [11]. Specifically, the minimal set of SNARE (Soluble NSF Attachment protein [SNAP] REceptor) functions is a unified mode of vesicle trafficking. The proper targeting and docking of synaptic vesicles is mediated by a cognate interaction between vesicular SNAREs (v-SNAREs) and target membrane SNAREs (t-SNAREs). The genuine synaptic vesicle protein associated membrane protein (VAMP; also called synaptobrevin) acts as v-SNARE, whereas the presynaptic membrane proteins syntaxin and SNAP-25 (SNAP of 25 kDa) are t-SNAREs. The multimeric ATPase NSF (N-ethylmaleimide sensitive fusion ATPase) is later recruited to the SNARE complex by SNAPs [12] and acts to break the extremely stable SNARE complex, thus reactivating the individual SNAREs for future fusion events. Unlike yeast secretion and vesicle trafficking, synaptic vesicle fusion in the presynaptic structure requires a large body of regulators to ensure the spatial and temporal resolution of neurotransmitter release [13]. Regulators of the SNAREs are numerous, and many of them are conserved throughout evolution. Examples are the Rabs and their direct regulators [14]. Specifically, Rab3, Rab5, Rab27, and Rab11 regulate vesicle transport, docking, and exocytosis of synaptic vesicles [15]. Many of the other Rabs function in membrane trafficking in general and are strongly conserved [16,17]. Recently, the composition and the stoichiometry of proteins and lipids of synaptic and transport vesicles from rat brain were presented [18]. Based on Mass spectrometry (MS) proteomics technology, about 80 proteins were identified. The synaptic role of many of these proteins was already established, mainly based on the genetics of model organisms such as Drosophila melanogaster and Caenorhabtidis elegans [2]. Schematically, the proteins of the synaptic vesicles are associated with the following functional groups: organizers and cytoskeletal scaffold proteins; transporters and channels; sensors and signal transduction proteins; priming, docking, and fusion apparatus [19,20]; endocytotic and recycling machinery [7,21-23]; and linkers between the presynaptic and postsynaptic membranes [2]. In addition, scaffolding proteins are critically important during the development and shaping of new synapses [24]. These proteins are a combination of adhesion, cytoskeleton, and signaling proteins. The specificity of neurons in the central nervous system (CNS) is primarily defined by the composition of receptors, transporters, and ion channels in the presynaptic and postsynaptic density (PSD) structures [25]. In addition to their role in neuronal transmission through ion channels, PSD proteins are essential in establishing a protein network that bridges the cytoskeleton to the extracellular matrix [2]. Herein, we focus on the basic function of the synapse, and specifically the trafficking, exocytosis, and endocytosis of synaptic vesicles, and analyze it in molecular terms. We compiled a list of 120 gene prototypes, called 'PS120', which comprises the core set of proteins associated with synaptic vesicles and presynaptic structures. This list includes components of the SNARE complex and their regulators, as well as components of the trafficking and organization apparatus of the active zone. In comparison with humans, there are many fewer paralogous genes in the four insects whose genome sequence has been completed (namely fly, mosquito, honeybee, and beetle). This comparative view is instrumental for in silico genome annotations but it also exposes instances in which a specific gene or a regulation network is lost. We show that the number of protein-protein interactions in which a protein participates and the degree of sequence conservation from insects to human are positively correlated. The architectures of proteins responsible for processes in the synapse such as exocytosis and endocytosis differ markedly. We show that a systematic comparative genomics view of the fly, honeybee, mosquito, and beetle proteomes reveals general principles in the design of presynaptic structures. Results Evolutionary relationships among insects Insects are an ancient group of animals, the first of which probably appeared 360 to 400 million years ago. Analyses of insect genomes and proteomes provide a unique opportunity to compare evolution between the model organism D. melanogaster and numerous additional insect genomes. The insects whose genomes were sequenced ensure coverage of a valuable phylogenetic breadth, spanning the fruit fly (D. melanogaster(, the honey bee (Apis mellifera), the red flour beetle (Tribolium castaneum), the mosquitoes (Anopheles gambiae and Aedes aegypti), the silk worm (Bombyx mori) and the wasp (Nasonia vitripennis). All together, about 330,000 protein sequences from insects are currently available in public protein databases, which already include 12 additional Drosophila genomes. A current list of insect genome projects is accessible in Additional data file 1. In the present study we refer only to representative genomes that are substantially divergent and include the beetle, honeybee, mosquito, and fly (with D. melanogaster being the reference). We focus on establishing a functional synapse whose molecular assembly governs learning and memory as well as the complex behavior of the organism. A catalog of presynaptic gene representatives from human and insects We compiled an extended catalog of mammalian presynaptic proteins based on the detailed anatomy of the synaptic vesicle [18], data from functional annotations by Gene Ontology (GO) [26], and a manual collection of genes of presynaptic function [27]. This collection is compared with insect proteomes. A summary of the sequence conservation of each gene (a total of 120 representative genes) with the insect proteome is shown in Table 1. Analyzing this catalog (PS120 - presynaptic 120 genes) revealed that 50% are well conserved and have a sequence similarity in excess of 65% for most of the sequence. Among them, 60% are at a similarity level in excess of 75% for most of the sequence. Thus, the majority of proteins that participate in human presynaptic structures are extremely well conserved.
Most of the PS120 proteins belong to gene families, with some of the families being very large. For example, synaptotagmins and Rabs have numerous alternative spliced variants in addition to their large number of genes (17 and 60, respectively). For most instances, the size of the gene family in insects is smaller and on average is only 40% when compared with human. To exemplify this observation, we investigate the syntaxin family. There are 12 genes in human (and additional variants) that can be divided into subfamilies. The human subfamily of syntaxin 1, which functions as the t-SNARE in synaptic vesicle fusion (including Stx1, Stx2, Stx3, Stx4, and Stx11), is represented by only two genes in the fly (namely dStx1 and dStx4) [1] and in the other insects. However, in general, there are more gene variants that result from alternatively splicing events in the fly genome relative to the other insects. A search of insect homologs for the PS120 clearly shows that even within the most conserved set between human and insects (60 genes), there are 12 genes for which there is no clear homolog in the current protein databases in at least one of the insect representatives (honeybee, beetle, mosquito, and fly). The same applies to about 30 additional proteins from the remainder of the PS120 gene list. Additional information on protein partners and protein length, and detailed information on the levels of sequence conservation is provided in Additional data files 2. Recovering missed annotation genes by comparative genomics The completion of genomes for at least four insect representatives and the additional information from partially assembled genomes (Additional data file 1) makes it possible to revisit some of the apparently missed genes (Table 1 and Additional data file 2). Evidently, comparing related genomes enhances the quality of in silico genome annotations [28]. A search in the public non-redundant database revealed that about one-third of the PS120 homologous sequences were missing in at least one of the insect representatives (Table 1). Moreover, for a small number of genes, no homologs were detected in any of the insects. In cases in which significant sequence similarity in all four insect representatives is absent, we strongly argue that these genes are genuinely absent in insects. This is supported by a lack of significant similarity in additional fly genomes, and in the silkworm and the wasp genomes (Additional data file 3). Additional data file 3 provides information on apparently missing genes that are not apparent from protein databases (see Materials and methods, below). For 70% significant similarity in the genome-assembled sequences was identified. This high similarity is often supported by the existence of an expressed mRNA. For a few genes, only limited evidence on transcription levels exists. More importantly, for 11 genes no homologs were detected in insects by searching protein data against translated insect genomes. Among these genes are growth-associated protein (GAP)-43, which is implicated in cytoskeleton and protein kinase C signaling during synapse establishment [29], and two large proteins that shape the cytoskeletal mesh at the active zone: bassoon (about 3,900 amino acids) and piccolo (about 5,100 amino acids) [30]. In addition, the SNARE regulator complexin 4, the syntaxin-tubulin binding protein syntabulin (FLJ20366), and SNAP-25-interacting protein are not detected in insects. Although most proteins of the synaptic vesicle membranes are strongly conserved, we were unable to detect SV31 (also called TMEM163; a genuine protein of the synaptic vesicle (SV) membranes) [31] or synaptophysin (one of the most abundant proteins in mammalian synaptic vesicles). Furthermore, no sequence similarity was noted for the syntaxin regulators amisyn (STXBP6) [32] and syntaphilin [33]. Syntaphilin, which has been implicated in regulation of exocytosis and endocytosis [34], is conserved from human to pufferfish and zebrafish but was lost in the branch of the frogs and insects. A borderline similarity to dynactin and α-liprin suggests that the function in cytoskeletal remodeling and in cell-matrix interactions may be taken over by other proteins. Interestingly, many of the genes that are not conserved from human to insects are functionally related to active zone architecture and specifically to the underlying cytoskeleton mesh of the synapse. Insights into the most conserved proteins of the exocytosis core complex In the PS120 gene list, rather close conservation is evident between insect and human genes (measured by a similarity >75% throughout the sequence) for 16 genes. This small set includes the v-SNARE VAMP2, the t-SNARE syntaxin 1A, and a few small GTP proteins (Ral, Rab3A, ARF1, and ARF6). In addition, this set includes essential components of the endocytic machinery (dynamin 1, AP2, AP3, EHD1, and clathrin) and proteins that activate transduction pathways (calmodulin and 14-3-3). That the function of these gene products is indispensable was expected, but proteins that coordinate synaptic vesicles with the active zone are also included in this selected list, namely cytohesin-1 [35] and Mals-1 [36]. Both of these proteins share a function in determining the size of the readily releasable pool of synaptic vesicles and are critical for replenishing this pool. In an attempt to gain new information on the structure and function of presynaptic proteins, we applied a comparative view and conducted multiple sequence alignment (MSA) analysis of human and insects for representatives of the exocytotic machinery, VAMP-2, and synaptotagmin 1 (Figure (Figure1).1
MSA of highly conserved sequences from human to insects was also performed for synaptotagmin (Figure (Figure1b).1b Because endocytosis and membrane recycling are integral processes in presynaptic function, we compared stoned B (STNB) between human and insects [46] (Additional data file 4). Stoned genes (in insects StnA and StnB) are part of the protein lattice network that is involved in clathrin-mediated endocytosis at synapses. The conservation level of human stoned B (called SALF) is rather low (<50% sequence similarity). Several short signatures along the proteins act in the binding of AP2 subunits (for example, AP50 for StnB and α-adaptin for StnA). The number and the positions of these short signatures are not conserved in vertebrates and insects (Additional data file 4). In addition to the binding of AP2 by StnB, it binds to synaptotagmin 1 within the 300 amino acids in the carboxyl-terminal in the fly and human homologs. Stoned proteins may support synaptotagmin 1 recycling by mediating the association with the AP2 complex. Based on the MSA analysis, additional strongly conserved sequences are suggested (syntaxin; Additional data file 5). These sequences are probably essential in interactions between yet undefined partners that are common to mammals and insects. Most MSAs of PS120 show that the level of conservation is much higher among the insect sequences as compared with human or other organisms. We emphasize that MSA from insects to human for strongly conserved proteins (synaptotagmin, syntaxin 1A, and VAMP2) and for much less conserved genes (stoned B, SCAMP1, and synapsin 1) is instrumental in detecting overlooked sequences that may be important for protein interactions, protein modifications, and regulatory functions. The MSA for syntaxin 1 and synapsin 1 is included in Additional data file 5. Sequence conservation among the subunits of the exocyst complex We tested whether the comparative genomics perspective is informative in studying the evolution of physical and functional complexes in exocytosis and trafficking. To this end, we tested the conservation levels for the various components of the exocyst. The exocyst is a large complex that was initially identified at the tip of the yeast bud. It participates in tethering vesicles to the plasma membrane. It coordinates exocytosis with small G-protein signalling molecules such as Ral-A, Arf6, and Rab11 [47]. The exocyst is composed of eight subunits that are denoted EXOC1 to EXOC8 (Figure (Figure2a)2a
Evolutionary constrains on the subunits of the COP complex and the lysosome biogenesis complex Coatomer protein (COP)-1 vesicles are principally involved in transport of cargo between the endoplasmic reticulum (ER) and early Golgi [50,51]. Specifically, they mediate both the anterograde flow of cargo through the Golgi to the cell surface and the retrograde retrieval of recycling proteins from late to early Golgi compartments. COP-1 is composed of 7 genes (α, β, β ', γ, δ, ε, and ζ subunits, additional genes resulting from duplication events, γ2 and ζ2) that are different in sequence and length. For example, whereas COPA (human homolog of α) is composed of 1,200 amino acid, COPZ (human homolog of ζ) consists of only 200 amino acids. Figure Figure3a3a
The synapse is a compact structure with multiple organelles, including transport vesicles, early and late endosomes, lysosomes, and peroxisomes. Indeed, many of the PS120 representatives function in vesicle trafficking and sorting. Snapin (SNAPAP) is among the genes that are missing in some but not all insects. Snapin was initially identified as a SNAP-25 binding protein and a regulator of the interaction of synaptotagmin with the SNAREs [53]. The relevance of snapin in neurotransmitter release regulation was questioned [54], and instead it was postulated to be part of the biogenesis of lysosome-related organelles complex (BLOC) [55,56]. We compared the conservation of the subunits of BLOC-1 in human and insect (Figure (Figure3b).3b A human homolog of snapin (SNAPAP; Table 1) is detected in honeybee, fly, and mosquito, but cannot be detected in the beetle genome (see Additional data files 2 and 3). BLOC1S2 and cappuccino are also missing in the beetle proteome, whereas BLOC1S3 is missing in all insects (Figure (Figure3b).3b Coordination in sequence conservation in biogenesis and trafficking protein complexes along the phylogenetic tree The analysis of BLOC-1, COP-1, and the exocyst complexes (Figures (Figures22
In general, for the four sets of presynaptic sites, all tested species maintain a conservation index in a rather tight range, in which each complex exhibits a unique profile along the evolutionary tree. Specifically, the conservation of fly to human is in accordance with a high degree of coordination among these four complexes. The exocyst and COP-1 are the least diverged whereas the organelle biogenesis complexes (PEX and BLOC-1) exhibit a more active evolutionary divergence for at least some of their components (Figure (Figure4).4 To explore whether the coordination within complexes and functional groups along the evolutionary tree holds for other physical or functional complexes, we examined the DGC [59] and the active signaling complex of the mGC from the postsynaptic membrane [62]. Among the various proteins of these postsynaptic complexes, each species exhibits a different level of conservation relative to human. For example, DGC, the frog DMD (dystrophin), and UTRN (utrophin) [63] are almost identical to human, whereas SGCB and SGCD (β-sarcoglycan and δ-sarcoglycan) are poorly conserved. On the other hand, in zebrafish utrophin is poorly conserved whereas β-sarcoglycan and δ-sarcoglycan are more similar to human. A similar uncoordinated profile for conservation was shown for the mGC. We propose that for biogenesis, exocytosis, and trafficking complexes of the presynaptic sites (but not for postsynaptic signaling complexes), evolutionary constraints have led to co-evolution of the components. Presynaptic proteins participate in interconnected protein interaction graphs Sequential protein interactions are fundamental to the lifecycle of the synaptic vesicle and to trafficking and organelle biogenesis in synapses. This leads to proteins that are engaged in multiple protein interactions. For example, more than 60 different interactions have been reported for syntaxin 1 and tens of interactions for synaptotagmin. Although some findings may result from spurious interactions, many have been experimentally confirmed and others are yet to be discovered. We illustrate this via VAMP2 as a prototype for an extremely conserved protein from human to insects. Figure Figure55
The fraction of connecting edges in the graph relative to the maximal possible edges is a measure of the connectivity among interacting proteins. Density values for the interacting proteins of VAMP2 and RAB3A are 16.8% and 6.4%, respectively. Figure Figure55 We illustrate the properties of protein-protein interaction graphs for several representative proteins from the PS120 set (Figure (Figure6;6
The protein valence (defined as the number of direct edges from the vertex representing the protein) ranges from seven for ERC2 to 23 for DNM1. The graphs of RIMS, ERC2, and NRXN1 have relatively low connectivity. Specifically, in the NRXN1 graph there are only 14 edges, and for RIMS (with a valence of 15) just 29 connecting edges are observed. The density of the different graphs and their network conservation scores are marked (Figure (Figure6).6 The interaction graphs for VAMP2 (Figure (Figure5),5 Valence of proteins in the interaction graphs and sequence conservation levels are positively correlated Almost all proteins in the PS120 gene list are engaged in multiple protein interactions (Additional data file 2). Interactions between proteins in the synapse occur throughout the processes of endocytosis, membrane fusion, protein recruitment, transport of vesicles, and organelle biogenesis. We tested whether conservation (as reflected by percentage sequence identity from insects to human) and the valence of all proteins are correlated. We considered only interactions with high confidence (see Materials and methods, below) and tested the two quantitative measures. Figure Figure7a7a
The next question that was addressed is whether the subsets of proteins that function primarily in endocytosis and in exocytosis exhibit different properties of sequence conservation and protein valence. To this end, we compiled two nonoverlapping subsets for endocytosis and exocytosis (24 proteins each; see Materials and methods, below). These two lists excluded proteins that are bona fide participants of both processes (for instance, DOC2, UNC13, PSCD1, and RIMS) or are connected to cytoskeleton modulation (such as ADD2, bassoon, CADPS, DLG1, KIF1A, and MYRIP). In the set of exocytotic and endocytotic proteins, the average protein valences are 10 and 7.9, respectively (P value for being identical = 0.423; Additional data file 8). However, for the exocytotic set the average valence for proteins with low conservation (<50% sequence identity) is 4.5, and 13.9 for those that share greater than 50% sequence identity. This result is significant (P = 0.034). In the endocytotic set, the valences of low and highly conserved proteins (<50% and >50% identity) were not significantly different (P = 0.605) and measured to be 7.2 and 8.6, respectively (Additional data file 7). Endocytotic proteins are long and composed of several repeated domains We observed that the protein interaction graphs differ substantially with regard to their properties between essential proteins of the exocytotic and endocytotic sets (Additional data file 8). Specifically, several of the endocytotic proteins interact with proteins that are less conserved between insects and human (Figure (Figure6).6
Additional features that were tested include the fraction of low complexity and coiled coil regions. It was shown that in both protein sets approximately 10% of the sequences are occupied by low complexity regions. However, coiled coil domains are detected mostly in endocytotic proteins. The most obvious difference is in the architecture of the exocytotic and endocytotic proteins, in that most endocytotic proteins are multi-domain proteins, some of them repeating several times within a protein and across the protein set. Examples are clathrin (CLTC; clathrin domains repeat seven times) and intersectin (ITSN2; with multiple SH3 domains). (For consistency, we applied the domain assignment according to Pfam [64].) Such features are rare among the exocytotic protein set (Figure (Figure8).8 The abundance of short domains with a broad specificity is rare in the exocytotic context. The domains marked as v-SNARE and t-SNARE, Synaptobrevin and Sec1 (Figure (Figure8)8 Discussion Comparative analysis of insect genomes focuses on evolutionary events on a scale of hundreds of millions of years. The numerous fly-related genomes (as in the case of Drosophila pseudoobscura) provide a snapshot of the evolutionary processes that occurred over tens of millions of years [3]. In this study we show that the rich source of data from insect proteomes is instrumental in deriving insights into the design principles of presynaptic function. It is demonstrated that comparative proteome analysis of insects to mammals provides new information on individual proteins (Figure (Figure1);1 Membranous synaptic vesicle protein composition from human and insects is different Mammalian synaptic vesicles were characterized by three major proteins [71]: SV2, synaptotagmin, and synaptophysin. Many of the synaptic vesicle membranous proteins appeared to be regulators of SNAREs and thus control neurotransmitter release. Examples are the transporter-like SV2, which controls synaptoagmin 1 [72], and synaptophysin, which controls VAMP-2 [73]. Based on a recent study on the composition of a generic mammalian synaptic vesicle [18], it was calculated that VAMP2 and synaptophysin are at a 2:1 molar ratio and both are the most abundant proteins of the synaptic vesicle membrane. In synaptophysin knockout mice, no changes in synapse function and brain morphology have been detected, but VAMP2 concentration on the synaptic vesicle membrane was shown to be markedly altered [74]. Along this line, the absence of synaptophysin gene in insects is intriguing (Table 1). Synaptophysin is part of a small family of four-transmembrane proteins that includs synaptogyrin, pantophysin, and synaptoporin, all of which are missing in insects. A genomic search revealed a weak similarity to synaptogyrin in honeybee supported by a transcript (XR_015081.1; hypothetical LOC552402 having mRNA of 704 nucleotides) and a weak homology in beetle for Synaptoporin (XM_967892.1; similar to synaptoporin, LOC661749). Despite its abundance in mammalian synaptic vesicles, Synaptophysin and the other members of the four-transmembrane protein set must be dispensable for the functionality of synaptic vesicles. This is in agreement with the findings of a recent study on C. elegans [75], in which complete removal of the synaptophysin gene family resulted in normal synaptic properties, synaptogenesis, and neuronal architecture. It is plausible that in insects VAMP accessibility is regulated by an alternative regulator. The extension of VAMP's luminal domain in insects is consistent with the possibility that such a domain is used for regulation (Figure (Figure1).1 Synaptotagmin (Syt), the calcium sensor for neurotransmitter release in synapses, functions in the coupling of synaptic vesicle fusion to recycling. In mammals, synaptotagmin belongs to a large gene family of 16 genes. Among them, synaptotagmins 1 and 2 are exclusively functional in the synaptic vesicle lifecycle. In the fly genome, there are six representatives (close homologs of Syt1, Syt4, Syt5, Syt7, Syt9, and Syt12), four in mosquito (A. aegypti and A. gambiae), and four in the honeybee (close homologs of mammalian Syt1, Syt4, Syt5, and Syt7). Interestingly, several synaptotagmin-like proteins and alternatively spliced transcripts are detected in the fly. Among these are close homologs of Syt1 (CG3139, isoform D) and Syt7 (CG2381, isoform F); both variants lack the amino-terminal domain, including the transmembrane. These variants, although soluble, fully maintain their potential to bind endogenous synaptic regulators and to act as calcium sensors. Although insects have only six genes, as opposed to 17 synaptotagmin genes in human, this is not the case for SV2, which is a transporter-like gene family of the synaptic vesicle membrane [76,77] that was implicated in modulating synaptotogmin [72]. The SV2 family is actually expanded in insects. At least three genes and three variants are evident in each of the insect representatives, suggesting the importance of multiple genes (Additional data file 9). Many expressed sequence tags and mRNA for SV2 homologs were detected from the fly head (not shown). This apparent high expression supports the notion that regulation of synaptotagmin might be executed by the rich collection of SV2 gene products. It is not known whether the different SV2 variants are all expressed in the same synaptic vesicle or whether they describe several subtypes. In general, the sequences of the variants and genes among insects are quite divergent. The expansion of the SV2 family in insects is shown in Additional data file 9. Among the proteins that are missing in some of the tested insects is the Rab3 and its regulators. Rab function is controlled by a large group of regulators that affect interaction with the cytoskeleton, activation and inactivation of the GTP state, interaction with the membrane, and more [78]. For example, RAB3IL1, which is the GEF (guanine nucleotide exchange factor) for Rab3A, is missing in both flies and mosquitoes, but a close homolog is found in honeybee and beetle genomes. RAB3IL1 associates with inositol 6-phosphate kinase and provides another tier of Rab3 regulation in synaptic vesicle exocytosis. On the other hand, Rab3 GTPase-activating protein (Rab3GAP), which is responsible for switching between the active and inactive form [79,80], is missing from the honeybee genome. There is support for the indispensability of RAB3GAP in mammalian synapses in the literature; a mutation in the gene causes a severe brain developmental defect [81]. Surprisingly, many of the genes that are missing in one or more of the insects are linked to Rab3 regulation (Figure (Figure5).5 Comparative genomics perspective on functional complexes Comparative genomics approaches are powerful tools for gaining insights into evolution [82]. We suggest that these approaches be expanded to investigate functional groups and protein complexes (Figures (Figures22 We showed that within the COP-1 complex, only the ε-COP in insects exhibits a specialized evolutionary profile (Figure (Figure3).3 The co-evolution in components of the same complex is not a general trend in the synapse. The DGC is part of the overall architecture of the nerve-muscle postsynaptic site. Indeed, the DGC in the skeletal muscle membranes shows that the components evolve in an uncoordinated mode across individual species. More than ten proteins strongly interact to form the DGC functional complex. The importance of this complex in mammals is evident because mutations in many of the genes account for the pathologies of muscular dystrophies [59,84]. There are numerous complexes of the postsynaptic sites that in the CNS are localized to the PSD. These complexes show a similar trend to DGC, including mGC (Figure (Figure4),4 Architectural design principles in exocytosis and endocytosis The PS120 represents the core of the presynaptic gene list. The two subsets could (somewhat artificially) be divided into the exocytotic and endocytotic proteins. It is clear that the two processes are tightly connected in time and space [23,87]. Nevertheless, fundamental differences could be assigned to each of the subsets in terms of protein length, conservation level and protein valence, abundance of coiled coil regions, membranous interactions, and so on. We argue that most of these features underlie the design principles for the functional distinction between these two processes. Based on protein-protein interaction graphs, a positive correlation between the presynaptic protein conservation score and their valence was demonstrated (Figure (Figure7).7 In the animal kingdom, the exocytic event is rapid (ranging from milliseconds to seconds), accurate, and triggered by a combination of depolarization and calcium signals that lead to synaptic vesicle fusion. On the other hand, the endocytotic machinery is slower and more diverged. For example, after a massive synaptic vesicle exocytosis from the Torpedo electric organ, the synaptic vesicles are recovered within 18 hours [88], and after repeated stimuli in the mouse brain the replenishment of synaptic vesicle pools is completed within 10 to 20 minutes and the recovery of a single synaptic vesicle is over within a few seconds [89]. We suggest that the modes for synaptic vesicle recovery are subject to large variations, and in evolutionary terms the high connectivity for the involved proteins has remained despite substantial sequence divergence. This observation is not restricted to insect endocytotic proteins. A recent study on the properties of endocytotic proteins [90] indicated that the high connectivity in protein-protein interaction reflects the spatial and temporal constraints of the process. Furthermore, endocytosis is not restricted to presynaptic function and the proteins often act in many cellular contexts, often by interacting with nonsynaptic protein partners. Many of the endocytotic proteins have pleiotropic functions that are of varying importance in different species. This property differs from most exocytotic proteins. For example, in the mouse, amphiphysin (AMPH; Figure Figure8)8 In addition to the protein valence and conservation argument, the protein structure and domain composition of endocytotic and exocytotic proteins are substantially different (Figure (Figure8).8 On a functional rather than structural level, the processes of endocytosis and exocytosis rely on multiple preparatory steps [23]. We attribute the difference between the processes mainly to these steps. In exocytosis, a sequential exchange of protein-protein interactions leads to priming and SNARE complex formation [19]. During the preparatory steps for endocytosis, an intimate interaction with the lipid properties must occur. The properties that are needed for recognizing lipids dominate the domain architecture of most endocytotic proteins. These proteins include the adaptors, sensors for phosphatidylinositol 4,5-bisphosphate, domains that induce membrane curvature and others (for details, see Additional data file 2). Among the domains that are enriched in endocytotic proteins (Figure (Figure88 Many of the exocytotic proteins but not the endocytotic proteins contain one or more TMDs (Additional data file 8). This leads to two important outcomes for the exocytotic proteins: the location of membranous proteins to the appropriate site in the synapse is already dictated through the secretory pathway by the sorting machinery; and most of these proteins carry post-translational modifications that were acquired during their maturation in the endoplasmic reticulum and Golgi. Exceptions are proteins such as Rab3A and SNAP-25 that, although lacking a TMD, are modified to ensure their covalent membrane association. Clearly, in cases in which a modification is associated with a protein function, this leads to additional evolutionary constraints on the modification sites. For example, synaptotagmin undergoes both N-glycosylation and O-glycosylation. The N-glycosylation was shown to be essential for redirecting synaptotagmin to the synaptic vesicle membranes [94], whereas the O-glycosylation is enhanced and triggered by VAMP-dependent interaction. Indeed, for synaptotagmin both sites are fully conserved along a broad evolutionary distance [67]. At present, experimental data on the functional importance of conservation of post-translational modifications from insect to human and along the evolutionary tree are lacking for large number of presynaptic proteins. Several endocytotic proteins (but notably not the exocytotic proteins) include regions that are defined as coiled-coil domains (Additional data file 2 [part b]). Surprisingly, many of the architectural principles of endocytotic proteins are shared with the complexes of the PSD. This unified principle ensures multiple and rich connections to the cytoskeleton (in the case of PSD) and the lipids (in the case of endocytosis). As a result of their structural architecture, the PSD core proteins are subjected to rapid divergence, leading to a faded signal in the sequences from human to insects. We propose that this same principle is valid for the lack of bassoon and piccolo in insects genomes. Conclusion Analysis of key presynaptic proteins from numerous insect proteomes yields insights into evolutionary processes that occurred more than 350 million years ago. We demonstrated how an approach based on comparative genomics reveals erroneous and missing annotations. This methodology also reveals instances of gene gain and loss in insect genomes. We conclude that strong evolutionary constraints have led to the co-evolution of protein complexes that are involved in trafficking and organelle biogenesis. Furthermore, presynaptic proteins that participate in multiple protein interactions in exocytosis exhibit high sequence conservation. The analogous statement does not apply, however, to endocytosis. Finally, we discuss the relationship between connectivity and sequence conservation in these two protein sets. We explain these differences in terms of the particular architectural design of these proteins, which is adapted for specific phases in the synaptic vesicle's lifecycle. Materials and methods Compiling the presynaptic PS120 collection The list of proteins known as PS120 is a compilation of human genes that are related to the function of the synaptic vesicle life cycle. The proteins are indexed by their human official gene symbol. The conversion of the official gene symbol to protein accessions (from UniProt or National Center for Biotechnology Information [NCBI] protein collection) is performed using Protein Information Resource retrieval system [95]. This protein list provides a partial but overlapping union of collections from several resources: Mass spectrometry (MS) proteomics analysis from biochemical purified synaptic vesicles [18]; biochemical and genetic studies and literature [9,96]; SynDB collection, which includes synaptic process genes and their orthologs in multiple species, including human and fruit fly [97]; GO assignments [26] for the terms 'asymmetric synapse', 'exocytosis', 'synaptic vesicle', 'endocytosis', 'pre-synapse'; and the presynaptic gene index of mammalian genes [27]. The list was compiled manually to avoid redundancy. The PS120 is an apparently complete collection of representatives of proteins that participate in vesicle trafficking, synaptic vesicle lifecycle, and presynaptic organization. Note that the 150 set of presynaptic genes accounts for 45 genes that are included in our PS120, but the additional 75 proteins were not represented in the report by Hadley and coworkers [27]. For example, there the 17 synaptotagmins and 15 syntaxins [27] that are represented by three synaptotagmins (synaptotagmin 1, 5, and 9) and one syntaxin (syntaxin 1A) in the PS120. From the approximately 60 Rab proteins known in human, only a small number of representatives that are directly involved in synaptic vesicle function is listed (Rab27 and Rab3). In addition, protein variants are not listed; for instance, only Rab3A (but not Rab3B, Rab3C, and Rab3D) is listed. We excluded proteins that function in signal transduction, including kinases and phosphatases. Specifically, the ten calcium calmodulin protein kinases that are included in the 150 gene collection [27] are excluded. We also excluded proteins that function in cell adhesion [98,99]; synapse maintenance and development, including growth factors and extracellular matrix; presynaptic channels and G-protein-coupled receptors [100]; and neurotransmitter transporters and multiple uptake mechanisms [101]. Identifying orthologs in insects Orthologs for each of the human PS120 list with insects were defined by the top reciprocal hit using BLAST2 [102]. BLAST searches were restricted to the nonredundant insect database from NCBI [103] and to the UniProt protein database [104]. The insect representatives that were used throughout are the fruit fly D. melanogaster, the honey bee (A. mellifera), red flour beetle (T. castaneum), and one of the two mosquitoes (A. gambiae and A. aegypti). The identification was compared with the preprocessed list of Ensembl database [105]. Detecting homologs in a genomic scale followed the procedure described in the ProtoBee database [106], in which annotations are assigned through hierarchical classification. Sporadic poor alignments and large deviation in protein length were used as evidence of false homology assignments. In instances in which an orthologous relation could not be validated, a direct search in Genome Sequences Centers and in specialized insect database was performed. The tBLASTn [103] was applied using a protein sequence against the translated nucleotide database in all six frames. Top hits were manually tested to separate spurious similarity for missed annotated genes. We applied the following to detect remote homologs and to resolve borderline similarity level: VectorBase, which focuses on A. gambiae, A. aegypti, Ixodes scapularis, Culex pipiens, and Pediculus humanus [107]; FlyBase, which focuses on a comparative view for a dozen Drosophila species. [108]; Baylor Genome Center (for Honeybee and Beetle genomes) and other genome centers (see list in Additional data file 1); and position-specific iterative BLAST [102]. The percentage identity and similarity were used as comparative measures. Conservation score was determined based on the fraction of proteins in the graph with at least 65% sequence identity between a human protein and its closest insect protein, and the total numbers of proteins in the graph. BLAST identity and similarity levels were recorded and both measures are strongly correlated (R2 = 0.9315). For simplicity we used the identity measure (expressed as a percentage) unless stated otherwise. ClustalW [109] was used to produce MSAs. ClustalW MSA was used for neighbor-joining phylogeny reconstruction. Protein-protein interaction graphs Networks of protein-protein interactions are based on the STRING web tool [110]. The database of STRING unifies most types of protein-protein associations, including direct and indirect associations. Most high quality experimental data covers model organisms. In some instances inference of known interactions from model organisms to other species based on orthology of the respective protein was considered. The numerical confidence score used was >0.9. The score was based on homology, experimental data, and text mining. The high confidence (>0.9) ensured that many of the interacting partners were supported by independent evidence types. We used a threshold for up to 50 interacting proteins and a score of >0.9 throughout. The maximal number of interacting was limited to 50 (only for calmodulin and 14-3-3 was this number below the reported interactions). Density value was calculated from the ratio of the high-quality edges in a graph (following removal of the central protein and its direct edges) and the maximal edges possible. In a graph with n vertices, the maximal number of edges is n (n - 1)/2. Functional assignments The nonoverlapping sets of endocytosis and endocytosis sets (24 proteins each) are based on GO annotations [26] and assignments in SynDB [97]. Because coverage of GO annotations for the PS120 is limited, we added proteins from the manual assignment of SynDB [97]. The endocytotic set is mostly based on 'endocytosis' and 'recycling', and for the exocytotic set we merged 'trafficking' and 'exocytosis'. We deleted proteins that are genuine linkers of the two processes from these lists. Statistical significance (P value) of the properties of nonoverlapping sets of endocytosis and endocytosis sets was calculated by a standard Student's t-test, with the null hypothesis that the means of two independent sets are equal. Sequence features and domains Pfam (version 22, 9300 protein families) [64] was used for domain assignment. TMHMM2 was used to predict the location and topology of transmembrane helices [111]. Coiled coils are detected using the method of COILS [112]. Low complexity regions are predicted using the SEG program and activated by the ProteinPredict meta-server [113]. Abbreviations BAR (Bin-Amphiphysin-Rvs); BLOC, biogenesis of lysosome-related organelles complex; CNS, central nervous system; COP, coatomer protein; DGC, dystrophin glycoprotein complex; GO, Gene Ontology; mGC, metabotropic glutamate receptor; MSA, multiple alignment sequence; PEX, peroxin biogenesis; PS120, presynaptic 120 genes; PSD, postsynaptic density; SNAP, Soluble NSF attachment protein; SNARE, SNAP-receptor; SV, synaptic vesicle; TMD, transmembrane domain; t-SNARE, target membrane SNARE; v-SNARE, vesicle SNARE; VAMP, vesicle-associated membrane protein. Authors' contributions CY and NM contributed equality to the data acquisition, analysis, and interpretation of data. ML contributed to the design of the analyses and to the drafting of the manuscript. All authors have read and approved the manuscript. Additional data files The following additional data are available with the online version of this paper. Additional data file 1 lists the proteomes from multiple insect species and their evolutionary relatedness. Additional data file 2 lists the set of 120 presynaptic protein prototypes in human relative to their insect homologs. Part a provides a detailed information on gene names, synonyms, major partner proteins, and conservation levels from human to insect homologs. Part b is a complementary table based on the UniProtKB database [104]. Additional data file 3 lists the 44 genes that had no clear homologs in certain insects. Additional data file 4 is a ClustalW based MSA of stoned B and SCAMP. Additional data file 5 is a ClustalW-based MSA of the SNARE syntaxin 1 and synapsin. Additional data file 6 shows representative graphs of interactions for six proteins, based on the STRING tool [110]. Additional data file 7 provides a detailed table of the PS120 gene set, including the level of sequence identity and similarity (as a percentage between human and insects), and the protein valence according to the STRING tool. Additional data file 8 presents a detailed table of gene functions in endocytosis and exocytosis (24 genes each). Additional data file 9 is the ClustalW-based MSA of the family of SV2 in insects and a tree-like representation based on distances from the MSA. Additional data file 1 Provided is a summary that lists the proteomes from multiple insect species and their evolutionary relatedness. A list of insect genome centers is compiled from NCBI Genome resources [114]. Click here for file(401K, eps) Additional data file 2 Part a provides a detailed table of the PS120 gene set and associated related information on gene names, synonyms, major partner proteins, and conservation levels from human to insect homologs. Part b is a complementary table based on the UniProtKB database [104]. Click here for file(507K, doc) Additional data file 3 Listed are the 44 genes that had no clear homologs in certain insects. After homology search in the nucleotide databases, the majority could be accounted for as missed annotated genes. Click here for file(92K, doc) Additional data file 4 Provided is a ClustalW-based MSA of stoned B and SCAMP. Click here for file(2.5M, eps) Additional data file 5 Provided is a ClustalW-based MSA of the SNARE syntaxin 1 and synapsin. Click here for file(2.6M, eps) Additional data file 6 Provided are representative graphs of interactions for six proteins based on the STRING tool [110]. Click here for file(685K, eps) Additional data file 7 Provided is a detailed table of the PS120 gene set with the level of sequence identity and similarity (expressed as percentage between human and insects) and the protein valence according to the STRING tool. Click here for file(158K, doc) Additional data file 8 Presented is a detailed table of gene functions in endocytosis and exocytosis (24 genes each). Information on the number of interacting proteins, level of sequence identity between human and insect, and additional structural properties is included. Click here for file(118K, doc) Additional data file 9 Presented is the ClustalW-based MSA of the family of SV2 in insects and a tree-like representation based on distances from the MSA. Click here for file(848K, eps) Acknowledgements We are grateful to Menachem Fromer for critical reading, advice, and support. NM received a fellowship from the SCCB, the Sudarsky Center for Computational Biology. This study was supported by the EU DIAMONDS consortium of the Framework 6 and the Israeli Horvitz Foundation. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||
J Cell Biol. 2000 Jul 24; 150(2):F77-82.
[J Cell Biol. 2000]Curr Opin Neurobiol. 2002 Oct; 12(5):491-8.
[Curr Opin Neurobiol. 2002]Genome Biol. 2005; 6(1):201.
[Genome Biol. 2005]Curr Biol. 2007 Jan 23; 17(2):R51-3.
[Curr Biol. 2007]BMC Neurosci. 2006 Oct 30; 7 Suppl 1():S4.
[BMC Neurosci. 2006]Funct Integr Genomics. 2001 Mar; 1(4):235-40.
[Funct Integr Genomics. 2001]Curr Opin Neurobiol. 2002 Oct; 12(5):499-507.
[Curr Opin Neurobiol. 2002]J Neurocytol. 2003 Jun-Sep; 32(5-8):567-89.
[J Neurocytol. 2003]Biochim Biophys Acta. 1996 Jun 10; 1286(2):117-52.
[Biochim Biophys Acta. 1996]Ann N Y Acad Sci. 2004 Apr; 1014():170-8.
[Ann N Y Acad Sci. 2004]Nature. 1994 Jul 21; 370(6486):191-3.
[Nature. 1994]Trends Biochem Sci. 2005 Jul; 30(7):367-72.
[Trends Biochem Sci. 2005]Curr Opin Neurobiol. 1994 Oct; 4(5):633-9.
[Curr Opin Neurobiol. 1994]Trends Biochem Sci. 1994 Apr; 19(4):164-8.
[Trends Biochem Sci. 1994]Cell Struct Funct. 2003 Oct; 28(5):465-74.
[Cell Struct Funct. 2003]Proc Natl Acad Sci U S A. 2007 Jul 24; 104(30):12294-9.
[Proc Natl Acad Sci U S A. 2007]Genome Biol. 2001; 2(5):REVIEWS3007.
[Genome Biol. 2001]Cell. 2006 Nov 17; 127(4):831-46.
[Cell. 2006]Curr Opin Neurobiol. 2002 Oct; 12(5):491-8.
[Curr Opin Neurobiol. 2002]Neuron. 2002 Mar 28; 34(1):9-12.
[Neuron. 2002]Curr Opin Neurobiol. 2006 Jun; 16(3):298-304.
[Curr Opin Neurobiol. 2006]Curr Opin Neurobiol. 2002 Oct; 12(5):499-507.
[Curr Opin Neurobiol. 2002]Curr Opin Neurobiol. 2006 Feb; 16(1):83-9.
[Curr Opin Neurobiol. 2006]Int Rev Neurobiol. 2006; 75():181-216.
[Int Rev Neurobiol. 2006]Curr Opin Neurobiol. 2002 Oct; 12(5):491-8.
[Curr Opin Neurobiol. 2002]Cell. 2006 Nov 17; 127(4):831-46.
[Cell. 2006]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D262-6.
[Nucleic Acids Res. 2004]Genome Biol. 2006; 7(11):R105.
[Genome Biol. 2006]J Cell Biol. 2000 Jul 24; 150(2):F77-82.
[J Cell Biol. 2000]BMC Bioinformatics. 2004 Sep 9; 5():129.
[BMC Bioinformatics. 2004]EMBO J. 2003 Jun 16; 22(12):3004-14.
[EMBO J. 2003]J Neurosci. 2004 Feb 11; 24(6):1507-20.
[J Neurosci. 2004]J Neurochem. 2007 Oct; 103(1):276-87.
[J Neurochem. 2007]J Biol Chem. 2005 Sep 9; 280(36):31615-23.
[J Biol Chem. 2005]Neuron. 2000 Jan; 25(1):191-201.
[Neuron. 2000]J Biol Chem. 2000 Feb 4; 275(5):3221-30.
[J Biol Chem. 2000]Curr Opin Cell Biol. 2006 Apr; 18(2):223-7.
[Curr Opin Cell Biol. 2006]Mol Microbiol. 1994 Jul; 13(1):1-8.
[Mol Microbiol. 1994]Cell. 1995 May 19; 81(4):581-9.
[Cell. 1995]Biochem J. 2007 Jun 15; 404(3):525-34.
[Biochem J. 2007]Int Rev Cytol. 2002; 214():103-59.
[Int Rev Cytol. 2002]Nature. 2006 Dec 21; 444(7122):1092-5.
[Nature. 2006]BMC Neurosci. 2006 Oct 30; 7 Suppl 1():S4.
[BMC Neurosci. 2006]Trends Biochem Sci. 2004 Mar; 29(3):143-51.
[Trends Biochem Sci. 2004]Neuron. 2003 May 22; 38(4):521-4.
[Neuron. 2003]Trends Neurosci. 1995 Apr; 18(4):177-83.
[Trends Neurosci. 1995]J Neurosci. 2001 Feb 15; 21(4):1218-27.
[J Neurosci. 2001]J Neurosci. 2001 Feb 15; 21(4):1218-27.
[J Neurosci. 2001]J Biol Chem. 2004 May 7; 279(19):19875-81.
[J Biol Chem. 2004]Curr Biol. 2002 Mar 19; 12(6):R212-4.
[Curr Biol. 2002]Mol Biol Cell. 2005 Oct; 16(10):4867-81.
[Mol Biol Cell. 2005]Curr Opin Cell Biol. 2004 Aug; 16(4):379-91.
[Curr Opin Cell Biol. 2004]Mol Membr Biol. 2003 Jul-Sep; 20(3):197-207.
[Mol Membr Biol. 2003]EMBO J. 1998 Feb 16; 17(4):985-95.
[EMBO J. 1998]Nat Neurosci. 1999 Feb; 2(2):119-24.
[Nat Neurosci. 1999]J Biol Chem. 2004 Jun 18; 279(25):26251-6.
[J Biol Chem. 2004]Mol Biol Cell. 2007 Mar; 18(3):768-80.
[Mol Biol Cell. 2007]J Biol Chem. 2002 Aug 2; 277(31):28191-9.
[J Biol Chem. 2002]Curr Opin Cell Biol. 2004 Aug; 16(4):458-64.
[Curr Opin Cell Biol. 2004]Curr Opin Cell Biol. 2004 Aug; 16(4):458-64.
[Curr Opin Cell Biol. 2004]Am J Psychiatry. 2006 Mar; 163(3):532-4.
[Am J Psychiatry. 2006]Science. 1995 Nov 3; 270(5237):755-6.
[Science. 1995]Biochim Biophys Acta. 2006 Dec; 1763(12):1678-87.
[Biochim Biophys Acta. 2006]Mol Microbiol. 1992 Dec; 6(23):3455-60.
[Mol Microbiol. 1992]Science. 1995 Nov 3; 270(5237):755-6.
[Science. 1995]Cell Tissue Res. 2006 Nov; 326(2):409-22.
[Cell Tissue Res. 2006]Microsc Res Tech. 2000 Feb 1-15; 48(3-4):155-66.
[Microsc Res Tech. 2000]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D247-51.
[Nucleic Acids Res. 2006]Traffic. 2006 Feb; 7(2):182-90.
[Traffic. 2006]Neuron. 2006 Jul 20; 51(2):149-51.
[Neuron. 2006]J Neurosci. 2001 Feb 15; 21(4):1218-27.
[J Neurosci. 2001]J Biol Chem. 2002 Nov 8; 277(45):43096-103.
[J Biol Chem. 2002]Genome Biol. 2005; 6(1):201.
[Genome Biol. 2005]Proc Natl Acad Sci U S A. 2003 Jun 24; 100(13):7666-71.
[Proc Natl Acad Sci U S A. 2003]Cell Mol Life Sci. 2003 May; 60(5):942-60.
[Cell Mol Life Sci. 2003]Science. 2000 Mar 24; 287(5461):2204-15.
[Science. 2000]J Neurochem. 1989 May; 52(5):1433-7.
[J Neurochem. 1989]J Biol Chem. 2000 Jun 2; 275(22):17195-200.
[J Biol Chem. 2000]Mol Biol Cell. 2003 Dec; 14(12):4909-19.
[Mol Biol Cell. 2003]Cell. 2006 Nov 17; 127(4):831-46.
[Cell. 2006]Proc Natl Acad Sci U S A. 1996 May 14; 93(10):4760-4.
[Proc Natl Acad Sci U S A. 1996]Nat Cell Biol. 2001 Aug; 3(8):691-8.
[Nat Cell Biol. 2001]Cell. 1992 Sep 4; 70(5):861-7.
[Cell. 1992]J Biol Chem. 2000 Jun 2; 275(22):17195-200.
[J Biol Chem. 2000]Annu Rev Cell Dev Biol. 2000; 16():19-49.
[Annu Rev Cell Dev Biol. 2000]EMBO J. 2000 Sep 1; 19(17):4806-16.
[EMBO J. 2000]Proc Natl Acad Sci U S A. 2006 Jun 27; 103(26):10029-34.
[Proc Natl Acad Sci U S A. 2006]Am J Hum Genet. 2006 Apr; 78(4):702-7.
[Am J Hum Genet. 2006]PLoS Comput Biol. 2007 Jul; 3(7):e126.
[PLoS Comput Biol. 2007]J Biol Chem. 1996 Nov 29; 271(48):30725-30.
[J Biol Chem. 1996]EMBO J. 1998 Feb 16; 17(4):985-95.
[EMBO J. 1998]Science. 1995 Nov 3; 270(5237):755-6.
[Science. 1995]Biochim Biophys Acta. 1996 Sep 4; 1283(2):215-22.
[Biochim Biophys Acta. 1996]Cell Tissue Res. 2006 Nov; 326(2):409-22.
[Cell Tissue Res. 2006]Nat Rev Neurosci. 2004 Oct; 5(10):771-81.
[Nat Rev Neurosci. 2004]Nat Cell Biol. 2007 Aug; 9(8):858-67.
[Nat Cell Biol. 2007]Curr Opin Cell Biol. 2006 Aug; 18(4):416-21.
[Curr Opin Cell Biol. 2006]Neuroscientist. 2005 Apr; 11(2):138-47.
[Neuroscientist. 2005]J Neurochem. 1990 Oct; 55(4):1266-73.
[J Neurochem. 1990]Proc Natl Acad Sci U S A. 2005 Apr 26; 102(17):6131-6.
[Proc Natl Acad Sci U S A. 2005]Nature. 2007 Aug 23; 448(7156):883-8.
[Nature. 2007]Traffic. 2002 Jul; 3(7):452-60.
[Traffic. 2002]Nature. 2007 Aug 23; 448(7156):883-8.
[Nature. 2007]Traffic. 2000 Jun; 1(6):474-9.
[Traffic. 2000]Curr Opin Cell Biol. 2006 Aug; 18(4):416-21.
[Curr Opin Cell Biol. 2006]Neuron. 2002 Mar 28; 34(1):9-12.
[Neuron. 2002]Traffic. 2006 Feb; 7(2):182-90.
[Traffic. 2006]J Cell Sci. 2001 Dec; 114(Pt 24):4397-405.
[J Cell Sci. 2001]Neuroscientist. 2004 Oct; 10(5):404-8.
[Neuroscientist. 2004]J Biol Chem. 2002 Nov 8; 277(45):43096-103.
[J Biol Chem. 2002]Nucleic Acids Res. 2000 Jan 1; 28(1):41-4.
[Nucleic Acids Res. 2000]Cell. 2006 Nov 17; 127(4):831-46.
[Cell. 2006]Biochim Biophys Acta. 1996 Jun 10; 1286(2):117-52.
[Biochim Biophys Acta. 1996]Nature. 1995 Jun 22; 375(6533):645-53.
[Nature. 1995]Nucleic Acids Res. 2007 Jan; 35(Database issue):D737-41.
[Nucleic Acids Res. 2007]Genome Biol. 2006; 7(11):R105.
[Genome Biol. 2006]Curr Opin Cell Biol. 1999 Oct; 11(5):549-53.
[Curr Opin Cell Biol. 1999]Naturwissenschaften. 1999 Nov; 86(11):516-24.
[Naturwissenschaften. 1999]Biochem J. 2002 Jul 15; 365(Pt 2):329-36.
[Biochem J. 2002]Neurochem Res. 1996 Sep; 21(9):1053-60.
[Neurochem Res. 1996]Nucleic Acids Res. 1997 Sep 1; 25(17):3389-402.
[Nucleic Acids Res. 1997]Nucleic Acids Res. 2003 Jan 1; 31(1):28-33.
[Nucleic Acids Res. 2003]Bioinformatics. 2006 May 15; 22(10):1284-5.
[Bioinformatics. 2006]Nucleic Acids Res. 2007 Jan; 35(Database issue):D610-7.
[Nucleic Acids Res. 2007]Genome Res. 2006 Nov; 16(11):1431-8.
[Genome Res. 2006]Nucleic Acids Res. 2003 Jul 1; 31(13):3497-500.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D262-6.
[Nucleic Acids Res. 2004]Nucleic Acids Res. 2007 Jan; 35(Database issue):D737-41.
[Nucleic Acids Res. 2007]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D247-51.
[Nucleic Acids Res. 2006]J Mol Biol. 2004 May 14; 338(5):1027-36.
[J Mol Biol. 2004]Curr Opin Struct Biol. 1997 Jun; 7(3):388-93.
[Curr Opin Struct Biol. 1997]Cell Mol Life Sci. 2003 Dec; 60(12):2637-50.
[Cell Mol Life Sci. 2003]Bioinformatics. 2006 May 15; 22(10):1284-5.
[Bioinformatics. 2006]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]FEMS Microbiol Lett. 2002 Nov 5; 216(2):133-8.
[FEMS Microbiol Lett. 2002]Bioinformatics. 2006 May 15; 22(10):1284-5.
[Bioinformatics. 2006]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]