Copyright © 2009 Gazave et al; licensee BioMed Central Ltd.

Origin and evolution of the Notch signalling pathway: an overview from eukaryotic genomes

1 Aix-Marseille Universités, Centre d'Océanologie de Marseille, Station marine d'Endoume - CNRS UMR 6540-DIMAR, rue de la Batterie des Lions, 13007 Marseille, France
2 School of Biological Sciences, University of Queensland, Brisbane, QLD 4072, Australia
3 Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, CNRS UMR 5242, INRA, IFR128 BioSciences Lyon-Gerland, Ecole Normale Supérieure de Lyon, 46, Allée d'Italie, 69007 Lyon, France
4 Department of Embryology, Faculty of Biology and Soils, Saint-Petersburg State University, Universitetskaja nab. 7/9, St Petersburg, Russia
5 Institut Jacques Monod, UMR 7592 CNRS/Université Paris Diderot - Paris 7, 15 rue Hélène Brion, 75205 Paris Cedex 13, France
6 UFR de Biologie et Sciences de la Nature, Université Paris 7 - Denis Diderot, 2 place Jussieu, 75251 Paris Cedex 05, France

Corresponding author.
Eve Gazave: eve.gazave/at/univmed.fr; Pascal Lapébie: pascal.lapebie/at/univmed.fr; Gemma S Richards: s355446/at/student.uq.edu.au; Frédéric Brunet: Frederic.Brunet/at/ens-lyon.fr; Alexander V Ereskovsky: aereskovsky/at/hotmail.com; Bernard M Degnan: b.degnan/at/uq.edu.au; Carole Borchiellini: carole.borchiellini/at/univmed.fr; Michel Vervoort: vervoort/at/cgm.cnrs-gif.fr; Emmanuelle Renard: emmanuelle.renard/at/univmed.fr

Received July 1, 2009; Accepted October 13, 2009.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Of the 20 or so signal transduction pathways that orchestrate cell-cell interactions in metazoans, seven are involved during development.
One of these is the Notch signalling pathway, which regulates cellular identity, proliferation, differentiation and apoptosis via the developmental processes of lateral inhibition and boundary induction. In light of this essential role played in metazoan development, we surveyed a wide range of eukaryotic genomes to determine the origin and evolution of the components and auxiliary factors that compose and modulate this pathway.

Results

We searched for 22 components of the Notch pathway in 35 different species that represent 8 major clades of eukaryotes, performed phylogenetic analyses and compared the domain compositions of the two fundamental molecules: the receptor Notch and its ligands Delta/Jagged. We confirm that a Notch pathway, with true receptors and ligands, is specific to the Metazoa. This study also sheds light on the deep ancestry of a number of genes involved in this pathway, while other members are revealed to have a more recent origin. The origin of several components can be accounted for by the shuffling of pre-existing protein domains, or via lateral gene transfer. In addition, certain domains have appeared de novo more recently, and can be considered metazoan synapomorphies.

Conclusion

The Notch signalling pathway emerged in Metazoa via a diversity of molecular mechanisms, incorporating both novel and ancient protein domains during eukaryote evolution. Thus, a functional Notch signalling pathway was probably present in Urmetazoa.

Background

The emergence of multicellularity, considered to be one of the major evolutionary events concerning life on Earth, occurred several times independently during the evolution of Eukaryota in the Proterozoic geological period [1]. Multicellular organisms are not only a superimposition of the fundamental unit of life, namely the cell; the emergence of multicellularity further implies that cells must communicate, coordinate and organise.
In Embryophyta and Metazoa, higher levels of differentiation and organization of cells resulted in the emergence of organs and their organisation into complex body plans. Reaching this critical step required the elaboration of sophisticated intercellular communication mechanisms [2,3]. Cell-cell interactions through signal transduction pathways are therefore crucial for the development and the evolution of multicellular organisms. The modifications of these signal transduction pathways explain the macroevolution process observed. In metazoans, fewer than 20 different signal transduction pathways are required to generate the observed high diversity of cell types, patterns and tissues [4]. Among them, only seven control most of the cell communications that occur during animal development: Wnt; Transforming Growth Factor β (TGF-β); Hedgehog; Receptor Tyrosine Kinase (RTK); Jak/STAT; nuclear hormone receptor; and Notch [5,6]. These pathways are used throughout development in many and various metazoans to establish polarity and body axes, coordinate pattern formation and choreograph morphogenesis [4]. The common outcome to all of these pathways is that they act, at least in part, through the regulation of the transcription of specific target genes by signal-dependent transcription factors [6]. The Notch signalling pathway is a major direct paracrine signalling system and is involved in the control of cell identity, proliferation, differentiation and apoptosis in various animals (reviewed in [7-12]). Notch signalling is used iteratively in many developmental events and its diverse functions can be categorized into two main modalities "lateral inhibition" and "boundaries/inductive mechanisms" [8,13]. 
During lateral inhibition, Notch signalling has mainly a permissive function and contributes to binary cell fate choices in populations of developmentally equivalent cells, by inhibiting one of the fates in some cells and therefore allowing them to later adopt an alternative one. Lateral inhibition is a key patterning process that often results in the regular spacing of different cell types within a field. The Notch pathway may also have more instructive roles, whereby signalling between neighbouring populations of different cells induces the adoption of a third cell fate at their border, establishing a developmental boundary [14,15]. A large number of studies, mainly conducted on Drosophila, Caenorhabditis and vertebrates, have characterized the molecular properties and functions of the main components and auxiliary factors of the Notch pathway. These are strongly conserved in bilaterians (Figure 1).
In addition to these core components of the Notch pathway, several other proteins are used to regulate Notch signalling in some cellular contexts, and act either on the receptor Notch or on the ligand DSL (Figure 1). Most of what we know about the Notch signalling pathway comes from studies conducted on a few bilaterian species. Recently, studies have shown the existence of a Notch signalling pathway in non-bilaterian species, such as the cnidarian Hydra and the sponge Amphimedon, and its putative functions in the former species [29,30]. However, the ancestral structure, functionality and emergence of this complex multi-component signalling system are still open questions. Few studies have been initiated to understand how signalling pathways appeared and evolved beyond the Bilateria [4-6], but the recent sequencing of the first sponge genome, that of Amphimedon queenslandica, has opened new perspectives for studying the origin and evolution of signalling pathways in the Metazoa [31-35]. With the goal of illuminating the early evolution of the Notch pathway, we have therefore undertaken a comparative genomic study of the components of this pathway across the Eukaryota. Our study encompasses 35 species (31 with fully sequenced genomes) covering the 8 major clades of eukaryotes [36] (Figure 2).
This wide genomic comparison reveals that most of the Notch components are present in all the metazoan species studied, including putative basal metazoans such as sponges and the placozoan, suggesting that a functional Notch pathway was already present in the last common ancestor of present-day metazoans and was subsequently strongly conserved during metazoan evolution. While many of the Notch pathway components are also shared with non-metazoan eukaryote lineages, thus suggesting a more ancient origin, nine of the components are metazoan-specific, including the Notch receptor and the DSL ligands. This indicates that while the Notch pathway is a metazoan synapomorphy, it has been assembled through the co-option of pre-metazoan proteins, and their integration with novel metazoan-specific molecules acquired by various evolutionary mechanisms.

Results

Genome-wide identification of the main Notch signalling pathway components in eukaryotes

To understand more precisely the evolution of the Notch pathway at the scale of the eukaryotes, we systematically searched for all the main Notch pathway elements in completely sequenced genomes and Expressed Sequence Tag (EST) data of 35 different eukaryote species (Figure 2). We performed BLAST searches [38] to assess the presence or absence of Notch pathway genes in the sampled species, as described in the methods section. In most cases, the Notch pathway elements are multidomain proteins and share some of their domains with other proteins. For each target protein, only the combined occurrence of all requisite domains was considered diagnostic for identification. We systematically defined a diagnostic domain organization for each target protein (Table 2) and identified genes as detailed in the methods section. We also constructed multiple alignments for each protein and performed phylogenetic analyses to confirm the orthology relationships (Additional files 1 and 2).
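The identification rule described above (a candidate counts only if all requisite domains co-occur) can be illustrated with a minimal Python sketch. The domain requirements in `DIAGNOSTIC` are illustrative assumptions, not the contents of Table 2, and the function names are hypothetical; the input is assumed to be a list of domain labels already predicted for one protein by some domain-annotation tool.

```python
from collections import Counter

# Hypothetical diagnostic architectures: minimum copy number per domain type.
# These values are placeholders for illustration, NOT the paper's Table 2.
DIAGNOSTIC = {
    "Notch": {"EGF": 1, "LNR": 1, "ANK": 1},
    "Delta": {"DSL": 1, "EGF": 1},
}


def has_diagnostic_domains(predicted, required):
    """Return True only if every required domain occurs often enough."""
    counts = Counter(predicted)
    return all(counts[domain] >= n for domain, n in required.items())


def classify(predicted):
    """List the target proteins whose full diagnostic set is present."""
    return [
        name
        for name, required in DIAGNOSTIC.items()
        if has_diagnostic_domains(predicted, required)
    ]


# A protein sharing only EGF repeats with Notch is not called a Notch.
print(classify(["EGF", "EGF", "EGF"]))
print(classify(["SP", "EGF", "LNR", "LNR", "TM", "ANK", "ANK", "ANK"]))
```

A check of this kind only screens domain content; as the text notes, orthology still has to be confirmed by alignment and phylogenetic analysis.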
Our data confirm the strong evolutionary conservation of the Notch pathway in bilaterians, as all components are present in almost all the analysed bilaterian species (Figure 3). Four genes were not found complete outside bilaterians (SMRT, Furin, Numb and Neuralized), suggesting that these genes are specific to bilaterians (Figure 3). The absence of some components in some non-bilaterian species may represent a progressive elaboration of the pathway during early metazoan evolution, or else may correspond to secondary losses in some lineages. However, these data can be difficult to interpret in terms of the evolution of the Notch pathway, as the phylogenetic relationships of the aforementioned non-bilaterian species are still controversial [43-45]. Nevertheless, we decided to base our discussion on the metazoan relationships hypothesised in the most recent phylogenomic study [46] as we believe it to be the most robust and complete analysis to date. Our data so far indicate that most of the Notch pathway components were already present in Urmetazoa. Interestingly, among the 22 targeted genes, only nine are specific to metazoans (Notch, Delta, Furin, Mastermind, Numb, Neuralized, Mindbomb, HES and SMRT). Strikingly, these nine include the genes encoding the ligand and the receptor, suggesting that the canonical Notch pathway only exists in metazoans. Indeed, in the genome of the choanoflagellate Monosiga, no Notch gene has been found; only cassettes of some protein domains encoded on separate genes have been reported [47]. Of note, we also found another gene in this species that possesses the domain arrangement of a Notch gene (1 signal peptide, 1 EGF domain, 2 LNR domains, a transmembrane domain and 3 ANK domains, Additional file 4). While this gene contains the minimum set of diagnostic Notch domains, it has very weak sequence similarity to Notch genes, and in the absence of further evidence we choose here to name it "Notch-like".
Nevertheless, we cannot exclude that a "protoNotch" receptor might have been already present in Holozoa. Thirteen components are found in various other eukaryote taxa; some are likely to have appeared during early eukaryote evolution and may even have been present in the last common ancestor of present-day eukaryotes (LECA). Others seem to have specifically appeared in the opisthokont lineage.
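The reasoning used here, placing the origin of a component at the last common ancestor of every lineage in which it is found, can be sketched in a few lines of Python. This is a deliberately simplified illustration: the `LINEAGE` table is a hypothetical, coarse taxonomy for five example genera rather than the 35-species tree used in the study, and the approach assumes a single origin followed by losses, so it ignores lateral gene transfer and convergent acquisition.

```python
# Hypothetical, simplified root-to-tip lineages for a few example genera.
LINEAGE = {
    "Homo": ["Eukaryota", "Opisthokonta", "Holozoa", "Metazoa", "Bilateria"],
    "Amphimedon": ["Eukaryota", "Opisthokonta", "Holozoa", "Metazoa"],
    "Monosiga": ["Eukaryota", "Opisthokonta", "Holozoa"],
    "Saccharomyces": ["Eukaryota", "Opisthokonta", "Fungi"],
    "Arabidopsis": ["Eukaryota", "Plantae"],
}


def inferred_origin(species_with_gene):
    """Return the deepest clade shared by all species carrying the gene.

    Assumes one origin followed only by losses (no lateral transfer).
    Returns None when no species is given.
    """
    lineages = [LINEAGE[s] for s in species_with_gene]
    shared = []
    # Walk down from the root while every lineage agrees on the clade name.
    for ranks in zip(*lineages):
        if len(set(ranks)) != 1:
            break
        shared.append(ranks[0])
    return shared[-1] if shared else None


# A gene seen in a bilaterian and a sponge maps to the metazoan ancestor;
# one also seen in a plant maps back to the eukaryote ancestor.
print(inferred_origin(["Homo", "Amphimedon"]))
print(inferred_origin(["Homo", "Amphimedon", "Arabidopsis"]))
```

Because absences in between (for example in Fungi) are simply treated as losses, this sketch reproduces only the "ancestral presence plus secondary loss" interpretation; weighing that against independent gains requires an explicit parsimony count.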
For the ctenophore species (Mnemiopsis and Pleurobrachia) as well as the homoscleromorph sponge Oscarella, only a few target genes were identified in the available non-exhaustive EST databases and we were unable to conclude whether or not the remaining genes are present in those taxa.

Focus on Notch and DSL protein evolution in metazoans: phylogenetic analyses and domain composition arrangement

We chose to focus our further analyses on the two main molecules of the pathway, Notch and DSL, and study their evolution in metazoans. We first performed phylogenetic analyses using both Maximum Likelihood (ML) and Bayesian Inference (BI) approaches and then investigated the domain organizations of each. Topologies obtained in our phylogenetic analyses are not fully resolved, as previously noticed for the Notch ligands [50-52]. Long branch attraction bias (LBA) may be suspected in some cases and, as previously reported by different authors [53,54], ML appears more sensitive to LBA than BI. Concerning domain composition and organization, generally all the diagnostic domains in Delta or Notch genes are present in bilaterian sequences, but some domains seem to be lacking in a few species. In these cases, we cannot state whether this is due to prediction errors, sequencing gaps in the available genome sequences or secondary losses. When the available software prediction is equivocal, important conserved residues can often be identified in the regions where domains would be expected, suggesting functional conservation. Despite these technical limits, our analysis presents several features of interest. First, a single Notch gene is found in most species (Figure 5).
Second, in the DSL family (Figure 7), ...
To support our phylogenetic analyses, we also systematically investigated the domain arrangements of the DSL family proteins (Figures 8, ...). Third, in our ligand domain analyses (Figures 8, ...) ... Surprisingly, in cnidarian genomes, in addition to the Delta and Jagged sequences, we found genes composed only of DSL domains (from 1 to 11 repeats) and one gene composed of a MNLL domain associated with 3 DSL domains (Additional file 7). It remains to be seen whether these are true genes with unique functionalities, or represent misassembled regions of the genome.

Origin and evolution of protein domains involved in the pathway

We focused on five genes that encode multidomain proteins in the pathway: DSL, Notch, Mindbomb, Su(H), Furin. We mapped the possible acquisition(s) and loss events of the different domains during eukaryote evolution according to the phylogenetic hypothesis of Baldauf (2003) [36] (Figures 10, ...).
On the one hand, it appears that various domains have an ancient origin; they are shared by several eukaryote lineages, so we can hypothesize their presence in the LECA (or in the ancestor of eukaryotes bearing mitochondria: all eukaryotes except discicristates and excavates). This is the case for: EGF repeats of Notch and DSL (only present in eukaryotes [59]); ANK repeats of Mindbomb and Notch (present in eukaryotes, Archaea and Bacteria); the LNR domain of Notch; both ZZ and RING-type Zn finger domains of Mindbomb; the IPT RBP-JKappa domain of Su(H); the Subtilisin domain and the Furin domain. In all of these cases, a hypothesis of ancestrality followed by one or more secondary losses is most parsimonious. On the other hand, several domains appear to have originated more recently since they are specific to opisthokonts or even to metazoans: the MNLL, DSL and VWC domains involved in DSL composition; the NOD and NODP domains of Notch; the Mib/Herc2 domain of Mindbomb; the Lag1 and Beta-trefoil domains of Su(H). Thus, a total of six domains may represent synapomorphies of the Metazoa. In the case of the P-proprotein domain of Furin, the most parsimonious inference is that it may have appeared convergently three times, in Excavata, Heterokonta and Opisthokonta (with a secondary loss in Microsporidia).

Discussion

A functional Notch pathway seems to have been present in the Urmetazoa and comprised at least 17 components [30]. The later addition of five other components (in Eumetazoa or Bilateria) can thus be considered as facultative and responsible for additional regulation properties of the pathway. Our study indicates that the presence of the Notch pathway is a synapomorphy of metazoans as this is the only kingdom to possess all the key components of the pathway, most importantly the receptor and ligands. Our analysis also sheds light on the molecular mechanisms that may have been involved in the formation of this pathway.
Indeed, as we discuss hereafter, our study shows that Notch signalling has originated by cooption of pan-eukaryotic ancestral genes; modification of ancestral functions by new protein-protein interactions (mediated by novel metazoan domains); lateral gene transfer; and formation of new proteins by both exon shuffling and duplication followed by divergence.

Cooption of pre-existing genes and ancestral functions

This study, at the scale of the Eukaryota super-kingdom, reveals the presence of Notch components in diverse eukaryotic organisms, and thus their ancient origin. Certain highly conserved genes, despite their ancestrality, seem to be absent in Fungi and Microsporidia. This is consistent with previous genomic analyses that have documented massive gene losses in the LCA of Fungi + Microsporidia, and a further round of losses in Microsporidia in relation to their parasitic life style [60,61].

The origin of Presenilin and of the γ-secretase complex

One of the most striking features uncovered by our study is the evolutionary conservation of the γ-secretase complex [22,62]: the four proteins composing this large transmembrane complex (Nicastrin, APH1, PEN2 and Presenilin [63,64]) are present in both plants and unikonts (except in Fungi + Microsporidia). While the entire γ-secretase complex does not seem to be pan-eukaryotic, our analysis nonetheless supports a different evolutionary scenario from that formerly proposed for its main player, Presenilin. Previously, authors have hypothesized (based on an early view of the tree of life) a convergent acquisition of this gene in the metazoan and the plant lineages [65]. Our study reveals instead that Presenilin was present in the LECA, and then lost independently twice (in the LCA of Fungi + Microsporidia and in Alveolata). The APH1 and Nicastrin proteins may also be ancestral to Eukaryota, but our data are inconclusive for PEN2 on this point (found only in Unikonta and Plantae).
To date, functional analyses of this complex have been available only in Embryophyta [66] and Metazoa, where it is known to be involved in the cleavage of Notch and other proteins such as ErbB4 [67] and APP (amyloid precursor protein, implicated in Alzheimer's disease [68]). However, the lack of evidence for a complete γ-secretase complex in the LECA (because of the possible later emergence of PEN2) parallels recent functional data indicating that in both mammals [69] and a bryophyte (Physcomitrella patens [66]), Presenilin is also involved in various γ-secretase-independent functions such as protein degradation and trafficking. The association of PEN2 (present either in the LCA of unikonts and plants or acquired independently in these two lineages) is considered to be necessary to acquire the proteolytic activity of Presenilin via conformational changes [70]. These changes may result in the accessibility of the two catalytic motifs Y/WD and GXGD, which are conserved at the eukaryotic scale [71]. This suggests that proteolysis might not have been the ancestral function of Presenilin (alone or in association with Nicastrin [66]), but might have been acquired secondarily by its co-option into the four-protein γ-secretase complex (including PEN2). This challenging evolutionary scenario requires further investigations to be tested.

The origin of the Notchless inhibitor

Notchless encodes a protein containing a NLE domain and WD40-repeats [72]. In Eumetazoa, this member of the WD-repeat (WDR) protein superfamily [73,74] modulates the Notch pathway by binding the NICD [75] but also by interacting with Deltex and Su(H) [72]. Our analysis shows that Notchless was probably present in LECA. Nevertheless, in some of the studied species, the NLE domain is missing, and we cannot determine whether this is due to secondary loss or to a high level of sequence divergence obscuring domain prediction.
The high conservation of NLE sequences seems to be compatible with functional conservation as shown by transgenic experiments between a plant, Solanum chacoense, and an animal, Drosophila [76,77]. However, while both plant and yeast Notchless proteins share an involvement in ribosome biogenesis, until now no such role has been reported in animals [78]. These observations have led authors to propose that either Notchless was primarily involved in ribosome biogenesis in eukaryotes and was secondarily recruited in the metazoans for a new function (regulator of Notch pathway), or that this role may still exist in animals despite the lack of experimental evidence [79].

Ancestrality or lateral gene transfer?

Two other members of the Notch pathway show an ambiguous history, in which the possibility of lateral gene transfer (LGT) cannot be excluded. This is the case for both Fringe [80] and Strawberry Notch (Sno). Our analyses reveal that Fringe is present in Metazoa, but also in plants and Parabasalia (Trichomonas). A Fringe domain alone was also identified in the studied Ascomycota species; however, no complete Fringe or Fringe-like gene seems to be present in this taxon (data not shown). We could hypothesize that the Fringe gene was present in the LECA and then lost several times; nevertheless, the most parsimonious scenario suggests three independent acquisitions. We can speculate that LGT might have occurred, favoured by either the tight association existing between Parabasalia and Metazoa lineages or via bacterial transfers [81]. However, we found no specific relationships or signatures (Additional file 2) between the Fringe genes of Homo and those of its parasite Trichomonas, and we did not detect Fringe outside Eukaryota, so we cannot strongly argue for an LGT hypothesis. In our analysis, Sno is shown to be present in Holozoa and Plantae.
Unexpectedly, Sno has been reported recently in a nucleocytoplasmic large DNA virus (NCLDV) of the haptophyte (taxon related to Heterokonta [36]) Emiliania huxleyi [82]. As our analysis of the genomes of the two chosen heterokonts (Phytophthora sojae and P. ramorum) failed to reveal the presence of Sno, we chose to extend our search to other Heterokonta-related species. Interestingly, Sno is present not only in the genome of the haptophyte Emiliania, suggesting an LGT between this species and its virus EHV (Emiliania huxleyi virus), but also in two other heterokonts, Aureococcus anophagefferens and Thalassiosira pseudonana (Additional file 8). Another interesting feature is that Sno has been shown to be derived from the SNF2/SWI2 ATPase-encoding gene of α-proteobacteria [83]. The presence of Sno or Sno-related genes in both NCLDV and α-proteobacteria may suggest LGT events in the history of these genes because i) α-proteobacteria are often found in tight associations with various eukaryote taxa (e.g. Wolbachia/Metazoa; root nodules of Fabaceae plants) and ii) NCLDVs have been reported from Amoebozoa, Haptophyta, Discicristata and Viridiplantae; however, their ecological distribution and importance are still largely unknown, and newly described virophages of NCLDVs may also be involved in LGT [82-84]. In the two cases (Fringe and Sno), further analyses (on more species) are needed to shed light on the origin and history of these genes and to state whether they were acquired by LGT or not.

The Notch pathway is specific to Metazoa

The cooption or acquisition (by LGT) of "old" genes is not sufficient to explain the formation of the canonical Notch pathway. One of the pivotal steps in the evolutionary history of the Notch pathway seems to be the transition between the choanoflagellates and the animals [85]. Indeed, this study reveals that the majority of Notch components appeared in the LCA of the Holozoa.
Nevertheless, several molecular components critical for signal transduction are lacking in choanoflagellates, in particular, the ligand Delta and the receptor Notch (although we found a gene that possesses a domain arrangement similar to that of the metazoan Notch genes, it has very weak sequence similarity to these genes), thus we consider the Notch pathway as a synapomorphy of the Metazoa (this study, [47]). An increase in the complexity of this pathway has also occurred after the divergence between sponges and other metazoans. Several Notch components are absent from the demosponge Amphimedon (Furin, Mastermind, SMRT, Numb and Neuralized), yet the pathway may still be functional in this species [30]. This suggests that these components were not critical for the function of the pathway and may constitute additional regulatory elements that were subsequently added to the pathway in eumetazoans. Nevertheless, the possible pan-metazoan ancestry of these genes (and their subsequent loss in Amphimedon) cannot be excluded; data from other sponges may help to resolve this issue. The absence of Furin in Amphimedon is not really unexpected; although Furin has a critical role for the maturation of the receptor Notch in vertebrates, it has been shown in Drosophila that Furin is not essential for Notch signalling. Indeed, the Notch receptor can still be trafficked to the membrane without this initial cleavage [86]. Furin belongs to the PCSK superfamily which contains diverse families of proteases. Several PCSK proteins are present in Amphimedon although none seem to be bona fide Furins (as they do not group with bilaterian Furin in the phylogenetic tree; additional file 2). Nevertheless, we cannot exclude the possibility that one of these PCSKs may perform the S1 cleavage in the Amphimedon Notch pathway instead of Furin. 
Indeed, all PCSKs share the same canonical cleavage site R-X-R/K-R and (presently scarce) available functional data suggest that some of them may play similar roles in different cellular lineages [87]. The absence of a complete Neuralized in non-bilaterians is not incompatible with a functional pathway, due to the functional redundancy of Neuralized and Mindbomb [88], the latter being present in non-bilaterians. Indeed, these two components are both E3 ubiquitin ligases involved in ligand endocytosis and regulation [27,28,89] via ubiquitylation [90], and were shown to be able to rescue each other in Drosophila [91,92]. A functional study of the Notch pathway in Placozoa, which lacks both a complete Neuralized and Mindbomb, would allow a better understanding of the effects of the absence of E3 ubiquitin ligase regulation. Regarding the inhibitor Numb, it inhibits Notch via endocytosis and regulates cell fate acquisition by asymmetric cell division or by lineage decision processes [93-95]. Functional studies in sponges would be necessary to state whether another protein replaces Numb function. Nonetheless, it appears that the mechanism of Numb-mediated asymmetric cell fate acquisition is a synapomorphy of Notch pathway activity in Bilateria. The co-activator Mastermind (MAM) is classically considered an integral part of the co-activation complex. Its non-critical nature is highly unexpected and its absence from the demosponge species, as well as several bilaterian species, is puzzling. The high sequence divergence of the MAM proteins in bilaterians (MAM proteins share little sequence similarity apart from the N-terminal region [23], the region which interacts with Su(H) and NICD [96]) could make searching for them by sequence similarity alone inconclusive. Alternatively, these proteins may have been secondarily lost in several species, indicating that MAM proteins may be facultative for pathway function or replaceable by other proteins.
In the absence of functional data on species that apparently lack MAM, we cannot distinguish between these two hypotheses.

Recent acquisition of new functions: intervention of domain shuffling

It is clear from our data that novelty arose either in the LCA of Holozoa or in the metazoan stem lineage, which resulted in the assembly of disparate components into the functional Notch signal transduction pathway in animals. Our study further enables us to partly understand the molecular evolutionary mechanisms that may have facilitated these events. Hereafter, we focus on the origin of the two main players, the receptor Notch and the ligands Delta-Jagged, all of which are metazoan-specific multidomain proteins.

The origin and evolution of Notch

In the light of the recent data concerning sponges [30] and the present study, we can infer that Notch is a synapomorphy of Metazoa and consists of 3 core protein domains: EGF, ANK and LNR. Interestingly, these 3 domains exist in all eukaryotes. Proteins composed of EGF domains, LNR domains or ANK domains have been reported on separate chromosomes in M. brevicollis [47]. It has been proposed that the presence of these domains in separate Monosiga proteins suggests that Notch is the result of a new recombination of existing domains, known as exon or domain shuffling [97,98]. Data concerning the role of the LNR domains, which are also found in the pregnancy-associated plasma protein A (PAPP-A), are too scarce to infer the ancestral function of this domain [99]. The only common feature that we can note between the LNR domains of Notch and PAPP-A is a calcium binding capability [100]. In contrast, EGF and ANK are modular protein subunits that are very common in eukaryote proteins and that are known to be involved in protein-protein interactions [101]. The ANK repeat is one of the most common protein-protein interaction motifs in living beings [102,103].
It has been primarily reported in eukaryotes, although examples from prokaryotes and viruses are also known and may be the result of lateral gene transfer [104]. ANK domains are not only part of the composition of Notch (3 to 5 ANK repeats) but also of Mindbomb (1 to 6). The ANK repeat is a relatively well conserved motif with strongly conserved residues (a Thr-Pro-Leu-His tetrapeptide motif and a Val/Ile-Val-X-Leu/Val-Leu-Leu motif) and 2 α-helices [103]. We note that the Mindbomb ANK motifs are less well conserved than the Notch ANKs, suggesting that the structural integrity of the ANK motifs of Mindbomb is less constrained than in Notch. ANK motifs in Notch have a crucial role; they are involved in the assembly and stability of the complex with Su(H) and Mastermind [105,106]. When ANKs are deleted, the Notch signalling pathway is not functional in mice [105]. Mindbomb ANK repeats are important for the Delta internalization process but are not necessary for Delta ubiquitination [107]. As already mentioned, Mindbomb can be functionally replaced by Neuralized; this flexibility may have led to weaker evolutionary constraints on the Mindbomb ANK repeats than on those of Notch. The two enigmatic domains NOD and NODP, the roles of which are still unknown, seem to be an innovation of Eumetazoa. Our analysis does not allow us to infer the process by which they appeared.

The origin of DSL proteins: Delta and Jagged

Notch has two possible ligands encoded by the two paralogous genes Delta and Jagged. Our analyses show that Delta was ancestrally present in Metazoa, while a complete Jagged is absent from Placozoa and Porifera. Phylogenetic analyses of the ligands do not provide conclusive results. As already mentioned, we can envisage that the ligands are evolving in a rapid and divergent way in each lineage, and this could cause the loss of ancient phylogenetic signals.
Experimental data suggest that Delta and Jagged may be complementary, functionally interchangeable or antagonistic [108,109]. They share two protein domains, MNLL and DSL, associated with the EGF repeats that they have in common with Notch, and are directly involved in receptor-ligand interactions. While EGF repeats represent an ancient domain, as previously discussed, MNLL and DSL domains are absent outside Metazoa. Their origin cannot be clarified by our study. Nevertheless, we may speculate that the DSL domain shares ancestry with the LNR domains of the Notch receptor. Indeed, comparison of cysteine patterns from these two domains revealed that, for 4 of the 6 cysteines, positions and spacing are conserved. Despite the common characteristics of Delta and Jagged, they differ by two main features: i) a VWC domain is present in Jagged and absent in Delta; its function is not clear, but it may be involved in protein complex formation; ii) the number and spacing of EGF repeats differ between Delta and Jagged (an average of 7 and 14 respectively). Nevertheless, no correlation between the number of EGF repeats in the ligand and the affinity to the Notch receptor has been reported. Instead, Notch ligand choice is modulated by other proteins such as Fringe and O-fucosyltransferase that modify Notch EGF residues [110]. It is worth noting that the sponges and the placozoan possess complete Delta genes (with or without MNLL domains) but Jagged genes seem to be absent. Nevertheless, in the case of the sponges (Amphimedon and Oscarella) and of Trichoplax, the VWC domain (the specific domain of Jagged) is indeed present in the genomes, but it is never found in association with a DSL domain (data not shown). Intriguingly though, in Trichoplax, the VWC domain is found in association with EGF domains (7). These observations lead us to propose two possible evolutionary scenarios for the Notch ligands (Figure 13):
- An ancestral Delta gene duplicated before the radiation of the Eumetazoa, followed by an association of the VWC domain with one of these Delta copies. The number of EGFs increased either by tandem duplications within a gene (where a segment is duplicated and the copy inserted next to its origin), exon shuffling (which may be responsible for internal duplications of repeats) or DNA slippage (due to the formation of DNA hairpins) [98,111].

- An ancestral Delta gene duplicated before the radiation of the Eumetazoa, at which time EGF repeats were already independently associated with a VWC domain (the state observed in Placozoa). One copy of the ancestral Delta joined the EGF+VWC motif to create Jagged. This second hypothesis could explain the higher number of EGFs in Jagged compared to Delta (as the result of the addition of two series of EGF repeats). The EGF motifs of Jagged seem to be physically separated into two groups, as shown in Figure 9.

As we failed to find any specific signature in EGF repeats that could allow us to favour one of these two scenarios, the sequencing of additional non-bilaterian genomes may help to resolve this question. Nevertheless, we have to keep in mind that the phylogenetic position of the placozoans is currently still controversial [44,45,112,113].

Conclusion

This study focusing on the Notch signalling pathway provides for the first time a complete description of Notch components and auxiliary factors across the Eukaryota. These investigations have enabled us to re-assess the ancient origin of some components such as the γ-secretase complex and Notchless. Fringe and Sno are probably old genes that were convergently acquired by lateral gene transfer. Several new functions of the Notch pathway likely originated in the last common ancestor of Holozoa, which already possessed 12 genes of the pathway.
Nevertheless, the core genes needed for a functional pathway are only present in metazoans, and it is apparent that the two main players, Notch and Delta, emerged via both the shuffling of old domains (EGF, ANK, LNR) and the invention of new ones (MNLL, DSL). At present, functional data on non-bilaterian models are scarce, but such studies need to be carried out in order to understand the emergence of functionality in the Notch pathway. More broadly, this will contribute to an understanding of the emergence of signal transduction pathways during the acquisition of multicellularity in the Metazoa.

Methods

Data sources and sequence retrieving

Genomic data (including 31 complete genomes) were used when available. If not, EST trace files were scanned instead, as was the case for four species: Oscarella carmela (Porifera), Pleurobrachia pileus (Ctenophora), Mnemiopsis leidyi (Ctenophora) and Bigelowiella natans (Rhizaria). As the Amphimedon queenslandica genome is still not annotated, sequences were identified and concatenated following the previously published procedure [85,114]. Regardless of the origin of the sequence data, TBLASTN or BLASTP searches [38] were carried out with a cut-off E-value threshold of e-25 or less. When BLASTs against genome data gave results, the sequences obtained were systematically reciprocally BLASTed against the NCBI database. In this way, we could confirm the validity of the sequences retrieved with the initial BLAST searches (reciprocal best hits [115]).
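The reciprocal best hit criterion described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names (`best_hit`, `reciprocal_best_hits`), the in-memory hit tables and the sequence identifiers are hypothetical, and only the E-value cut-off of e-25 is taken from the text. In practice the hit lists would be parsed from BLAST output.

```python
from typing import Dict, List, Optional, Tuple

Hit = Tuple[str, float]  # (subject identifier, E-value)

E_VALUE_CUTOFF = 1e-25  # cut-off quoted in the Methods text


def best_hit(hits: List[Hit], cutoff: float = E_VALUE_CUTOFF) -> Optional[str]:
    """Return the subject with the lowest E-value at or below the cut-off."""
    passing = [hit for hit in hits if hit[1] <= cutoff]
    if not passing:
        return None
    return min(passing, key=lambda hit: hit[1])[0]


def reciprocal_best_hits(
    forward: Dict[str, List[Hit]],
    reverse: Dict[str, List[Hit]],
    cutoff: float = E_VALUE_CUTOFF,
) -> List[Tuple[str, str]]:
    """Keep (query, subject) pairs that are each other's best hit."""
    pairs = []
    for query, hits in forward.items():
        subject = best_hit(hits, cutoff)
        if subject is None:
            continue  # no forward hit passes the cut-off
        if best_hit(reverse.get(subject, []), cutoff) == query:
            pairs.append((query, subject))
    return pairs


# Toy data with made-up identifiers and E-values (illustration only).
forward_hits = {
    "ref_Notch": [("sp_seq1", 1e-80), ("sp_seq2", 1e-30)],
    "ref_Delta": [("sp_seq3", 1e-10)],  # fails the cut-off
}
reverse_hits = {
    "sp_seq1": [("ref_Notch", 1e-75)],
    "sp_seq2": [("ref_other", 1e-40)],
}

confirmed = reciprocal_best_hits(forward_hits, reverse_hits)
print(confirmed)
```

With these toy tables only the `ref_Notch`/`sp_seq1` pair survives: `ref_Delta` has no hit under the cut-off, and `sp_seq2` points back to a different reference sequence.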
Sequence analyses

Genes were scored "present", "absent" or "incomplete" (Figure 3). For phylogenetic analyses, 18 alignments (one alignment for each gene except for O-fut, SMRT and HES, the latter having been recently reported in [116]) were performed using the online software Muscle (http://www.ebi.ac.uk/Tools/muscle/index.html) [117,118] and subsequently corrected by eye in Bioedit Sequence Alignment Editor 5.09 [119] (additional file 1). The alignments were then treated with the program GBLOCKS with the least-stringent settings to remove positions of uncertain homology [120]. For Notch and DSL proteins, the number of EGF domains is variable, so the EGF domains were excluded from the phylogenetic analyses. For the analysis of Notch, the alignment used includes only a part of the sequence, from the LNR domain to the end (749 bp). The DSL alignment used likewise includes only part of the DSL protein sequence, from the beginning to the end of the DSL domain (169 bp). Five sequences were incomplete in the DSL alignment (Delta: O. carmela; S. purpuratus 3; Jagged: L. gigantea 2; H. robusta; B. floridae). For ligand nomenclature, all genes that contain the VWC domain were named Jagged (prefixed with J-). Phylogenetic trees were constructed from the protein alignments using the maximum likelihood method (ML) with the PHYML program under a WAG model of amino acid substitution [121]. To take into account rate variation among sites, we computed likelihood values by using an estimated gamma law with four substitution rate categories and we let the program evaluate the proportion of invariant sites (WAG+I+Γ4). Node robustness was tested by bootstrap (BP) analysis [122] with 1,000 replicates. In addition, for DSL and Notch phylogenetic analyses, Bayesian analysis was performed with MrBayes 3.1, using the WAG fixed model [123]. Two sets of six independent simultaneous Metropolis-coupled Markov chain Monte Carlo chains were run for five million generations and sampled every hundredth generation.
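As a rough check of what this sampling scheme implies, the following Python sketch counts the trees retained for the posterior summary. The run length, sampling interval and number of runs come from the text; the burn-in fraction of exactly 0.25 is an assumption standing in for the "above 25%" reported for these runs, and the helper name `retained_samples` is hypothetical. The count ignores the starting state, which some programs also record.

```python
GENERATIONS = 5_000_000   # generations per run (from the text)
SAMPLE_EVERY = 100        # sampling interval in generations (from the text)
N_RUNS = 2                # two independent sets of chains (from the text)
BURN_IN_FRACTION = 0.25   # assumption: lower bound of "above 25%"


def retained_samples(generations: int, sample_every: int,
                     burn_in_fraction: float, n_runs: int = 1) -> int:
    """Number of sampled states kept after discarding the burn-in."""
    per_run = generations // sample_every
    discarded = int(per_run * burn_in_fraction)
    return (per_run - discarded) * n_runs


samples_per_run = GENERATIONS // SAMPLE_EVERY
kept = retained_samples(GENERATIONS, SAMPLE_EVERY, BURN_IN_FRACTION, N_RUNS)
print(samples_per_run, kept)
```

Under these assumptions each run yields 50,000 samples and at most 75,000 samples in total contribute to the posterior probabilities; a larger burn-in lowers that figure accordingly.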
The runs were monitored for convergence and an adequate burn-in was removed (above 25% of trees and parameters). Bayesian posterior probabilities (PP) were used for assessing the confidence value of each node [124].

Domain arrangements and composition

For multidomain protein coding genes, the presence of specific protein domains and the domain arrangements were checked by scanning sequences with the online software Prosite (http://www.expasy.org/prosite/) [125], CDD (http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml) [126] and InterProScan (http://www.ebi.ac.uk/Tools/InterProScan/) [127]. In addition, for Notch, Delta and Jagged genes we used PSORTII [128] and PESTfind [129] software for identifying the nuclear localisation signal (NLS) and the PEST region respectively. For other regions and/or domains characteristic of the Notch receptors and Delta ligand that cannot be detected by the previous software (C-C linker, RAM motif, cleavage sites), conserved regions were identified "by eye" on the basis of sequence alignments and previous works [19,62,96,130]. It is worth noting that the prediction of cleavage sites was confounded by sequence divergence, such that these sites cannot always be stated with full confidence. The Notch, Delta and Jagged domain compositions were drawn with MyDomains Image Creator from Prosite. Five major genes of the Notch pathway were selected for more detailed domain composition analyses: the receptor Notch, the ligand DSL, Suppressor of Hairless, the ligand regulator Mindbomb and the enzyme responsible for the S1 cleavage, Furin. We used multiple software platforms for gene domain prediction (Prosite, InterProScan, SMART [131,132], Pfam [133], Superfamily (supfam.org/SUPERFAMILY/) [134]). Evolution of these five genes among eukaryotes was discussed according to two previously proposed rooting hypotheses [36,135].
Two conflicting hypotheses for the position of the root of the eukaryote tree are currently recognized [36,136]: subdivision of eukaryotes between opisthokonts + amoebozoans and bikonts (all remaining eukaryotes, on the left of Figures 10, [...]

List of abbreviations

ADAM: A Disintegrin and Metalloprotease; APH1: Anterior PHarynx defective 1; APP: Amyloid Precursor Protein; ANK: Ankyrin; BI: Bayesian Inference; BLAST: Basic Local Alignment Search Tool; BP: Bootstrap; CA: Common Ancestor; CBF1: C-Repeat/Dre Binding Factor 1; CDD: Conserved Domain Database; CSL: CBF1, Su(H), Lag-1; DSL: Delta Serrate Lag-2; EGF: Epidermal Growth Factor; EHV: Emiliania huxleyi Virus; ErbB4: Erythroblastic leukemia viral oncogene homolog 4; E(Spl): Enhancer of Split; EST: Expressed Sequence Tag; HAC: Histone ACetylase; HDAC: Histone DeACetylase; HECT: Homologous to the E6-Ap Carboxyl Terminus; Herc2: Hect domain and RLD 2; HES: Hairy/Enhancer of Split; HEY: Hairy/Enhancer of split related with YRPW motif 1; IPT RBP-J Kappa: Recombination signal Binding Protein for Immunoglobulin kappa J region; LBA: Long Branch Artefact; LCA: Last Common Ancestor; LECA: Last Eukaryote Common Ancestor; LGT: Lateral Gene Transfer; LNR: Lin12/Notch repeats; LUCA: Last Universal Common Ancestor; MAM: Mastermind; Mib: Mindbomb; ML: Maximum Likelihood; NCBI: National Center for Biotechnology Information; Ncor: Nuclear receptor corepressor; NCLDVs: Nuclear and Cytoplasmic DNA virus; NECD: Notch Extracellular Domain; NEDD4: Neuronal precursor cell-Expressed Developmentally, Downregulated 4; NICD: Notch Intracellular Domain; Nle: Notchless; NLS: Nuclear Localization Signal; O-fut: O-fucosyltransferase; PEN2: Presenilin Enhancer 2; PHYML: Phylogenies by Maximum Likelihood; PP: Posterior Probabilities; RTK: Receptor Tyrosine Kinase; SMART: Simple Modular Architecture Research Tool; SMRT: Silencing Mediator of Retinoid and Thyroid receptors; Sno: Strawberry notch; Su(dx): Suppressor of Deltex; Su(H): Suppressor
of Hairless; TGF-β: Transforming Growth Factor β; VWC: Von Willebrand Factor type C; WAG: Whelan and Goldman.

Authors' contributions

EG, FB, GR, PL, and BDM retrieved the sequences used in the study. EG, GR, and PL made the sequence alignments and performed the phylogenetic analyses. EG and GR performed domain analyses. EG, ER, AEV and CB conceived the study. EG, ER and MV designed and coordinated the study. EG, ER, CB and MV drafted the manuscript and all authors participated in the editing of the manuscript. All authors read and approved the final manuscript.

Additional file 1
Alignments used for the phylogenetic analyses. The data provided represent the alignments used for the 18 phylogenetic analyses. (415K, DOC)

Additional file 2
Phylogenetic analyses. In this file we provide the phylogenetic trees constructed from the protein alignments using the maximum likelihood method (ML) with the PHYML program for 16 Notch components (all except Notch and Delta/Jagged). (212K, PDF)

Additional file 3
Diagnostic domains table. In this table we report the presence or absence of the domains that compose each protein in all species. (319K, DOC)

Additional file 4
"Notch-like" in Monosiga brevicollis. The sequence of Monosiga brevicollis presenting a domain arrangement of a Notch gene is provided. (21K, DOC)

Additional file 5
Notch phylogenetic tree. Notch phylogenetic tree constructed from the protein alignments using the maximum likelihood method (ML) with the PHYML program. (167K, PDF)

Additional file 6
DSL phylogenetic tree. DSL proteins phylogenetic tree constructed from the protein alignments using the maximum likelihood method (ML) with the PHYML program. (242K, PDF)

Additional file 7
Unusual arrangement of DSL domains in Nematostella vectensis. In this file we report the sequences of Nematostella vectensis presenting an unusual arrangement of DSL domains. (25K, DOC)

Additional file 8
Strawberry notch sequences. In this file we report the Strawberry notch sequences identified in two heterokonts. (39K, DOC)

Acknowledgements

We are extremely grateful to the Department of Energy (DoE) Joint Genome Institute for sequencing the genomes of the different species used in this study and for making these sequences publicly available. We are also very grateful to Pr M. Manuel for providing us with Pleurobrachia pileus sequences, Ajna Rivera and David Weisblat for some Helobdella robusta sequences, Romain Derelle for his help and advice, Pr Jean-Nicolas Volff for hosting E.G. in his lab and Dr. Jean Vacelet for his advice. Our work has been supported by the Marine genomics Europe network through the GAP Fellowship (to E.G.; Project no. GOCE-CT-2004-505403). E.G. and P.L. hold a fellowship from the Ministère Français de la Recherche. B.M.D. is supported by grants from the Australian Research Council.

References