![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright © 2008, American Society of Plant Biologists Arabidopsis Reactome: A Foundation Knowledgebase for Plant Systems Biology[W] aDepartment of Computational and Systems Biology, John Innes Centre, Norwich NR4 7UH, United Kingdom bDepartment of Metabolic Biology, John Innes Centre, Norwich NR4 7UH, United Kingdom cDepartment of Cell and Developmental Biology, John Innes Centre, Norwich NR4 7UH, United Kingdom dEuropean Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, United Kingdom eCold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724 fNew York University, School of Medicine, New York, New York 10016 1Address correspondence to michael.bevan/at/bbsrc.ac.uk or sean.walsh/at/bbsrc.ac.uk. This article has been cited by other articles in PMC.New ways of capturing and representing biological knowledge are needed to enable individual researchers to remain abreast of relevant discoveries and to permit computational approaches for interpreting the large volumes of diverse data generated by modern biological research. Here, we describe a promising approach that expands the term “reaction” to represent biological processes. We show how users can represent a wide variety of biological processes in plants in terms of the concept of a reaction and assemble the information obtained from the model plant Arabidopsis thaliana into an online knowledgebase called Arabidopsis Reactome. Its curated and imported pathways currently cover ~8% of the Arabidopsis proteome. Arabidopsis Reactome events have also been electronically projected onto five other predicted plant proteomes. Such a system allows the visualization and interpretation of high-throughput data, hypothesis formulation in systems biology, and is a useful learning resource. The Arabidopsis Reactome project (www.arabidopsisreactome.org) is open access, open source, and open to contributions. OVERVIEW Currently, the genome sequences of six higher plants and a moss species have been assembled, annotated, and published (Arabidopsis Genome Initiative, 2000; International Rice Genome Sequencing Project, 2005; Tuskan et al., 2006; Jaillon et al., 2007; Velasco et al., 2007; Ming et al., 2008; Rensing et al., 2008). The availability of large populations of sequence-tagged insertion mutations for nearly all Arabidopsis genes (Alonso and Ecker, 2006), surveys of polymorphisms in many Arabidopsis ecotypes (Clark et al., 2007), and the free availability of microarray data and data-mining tools (Craigon et al., 2004; Zimmermann et al., 2004) have all greatly accelerated the scale and scope of plant research (Somerville and Koornneef, 2002), reflected by more than 240 publications per month citing Arabidopsis. Given the acceleration in the number of plant genome sequencing projects, it is increasingly difficult for individual researchers to stay abreast of relevant literature and to make connections between different sets of information. This difficulty is compounded by the general inaccessibility of information contained in the literature to computer-based analysis, which severely limits its value as a source of biological knowledge (Jensen et al., 2006). Therefore, developing computational methods for capturing and representing biological knowledge is a high priority, particularly for model organisms that are the focus of most experimental work. Several bioinformatics resources and software packages have been developed to manage and exploit the wealth of data generated by plant genome projects, functional genomics resources, and high-throughput transcriptomics experiments. The predicted proteins in the Arabidopsis genome have been systematically described by The Arabidopsis Information Resource (TAIR; http://www.arabidopsis.org/portals/genAnnotation/) using Gene Ontology (GO)–controlled descriptions of gene functions according to the biological process, molecular function, and cellular component of individual genes (Ashburner et al., 2000). This has enabled much more rapid, accurate, and consistent assignment of predicted functions to genes and permits the development of more accurate relationships between genes in different organisms. Several databases and software applications relate gene entities to each other in networks in Arabidopsis. The AraCyc database (Mueller et al., 2003) displays computationally predicted Arabidopsis metabolic pathways that are largely manually curated. MAPMAN uses a hierarchical ontology different from GO terms that can be used for visualizing large data sets onto metabolic pathways and other biological processes (Thimm et al., 2004). The VirtualPlant (Gutierrez et al., 2007) and ONDEX (Kohler et al., 2006) systems have created graph-based integrations of knowledge and gene functional inferences that may be queried, filtered, and appended using tools like Cytoscape (Suderman and Hallett, 2007) to generate new functional insights. GENEVESTIGATOR (Zimmermann et al., 2004, 2005) provides web-based analytical services that relate gene expression data to a wide variety of gene-related entities, such as GO terms, mutant phenotypes, pathways, and developmental processes. Reactome is an extensively curated pathway knowledgebase that focuses on human processes (Joshi-Tope et al., 2003, 2005; de Bono et al., 2007; Vastrik et al., 2007). A key feature of Reactome is its elegant data model that extends the notion of a biochemical reaction, where substrates go in, products come out, and a catalyst is frequently required to lower the free energy of the transformation. This concept also can be used to represent the binding of a ligand to a membrane receptor, the formation of a complex, the binding of a transcription factor in a promoter region, or the translocation of a molecule between subcellular compartments. In this way, the data model expresses molecular processes in the same way that scientists understand them and allows connected reactions to represent biological processes (e.g., transcription and cell cycle) in terms of their underlying molecular transformations, associations, and translocations. Based on this extended definition of reaction, reactants and products can be proteins, lipids, nucleotides, small molecules, or complexes of these. The data model further distinguishes among the topologically or functionally different forms of each molecule. For instance, this allows the distinction between chloroplastic maltose and cytosolic maltose or between the various posttranslational modifications of a protein. Here, we describe Arabidopsis Reactome, a knowledgebase of biological processes from the model plant Arabidopsis. Release 2 (www.arabidopsisreactome.org) comprises seven curated and 311 imported superpathways that together represent 8% of the Arabidopsis proteome. We show, using examples based on the mitotic cell cycle, that the knowledgebase has wide applicability for exchanging structured data with other databases, for comparative network analysis, data integration, and for visualization and protein interaction analysis. The straightforward authoring tool and realistic detailed descriptions of biological processes inherent in Reactome's data model provide an excellent foundation for representing, exchanging, and integrating biological information, suggesting that it will find wide application in the Arabidopsis community as a gold standard for pathway knowledge and key foundation for systems biology research. PATHWAY CURATION The information in Arabidopsis Reactome was generated from curated pathways that have been manually entered and reviewed by experts and imported pathways from several third-party pathway databases. Knowledge acquisition for the curated pathways followed the process established for the human Reactome system. Essentially, pathways were authored, curated, and peer reviewed by expert biologists (PhD level and above) and bioinformaticians. The curatorial process used a set of applications, namely, Reactome Author Tool and Reactome Curator Tool, developed specifically for the purpose of collecting and validating pathway models (Joshi-Tope et al., 2005). Every protein, gene, or small molecule in Arabidopsis Reactome has a reference identifier that points into a public reference database. In the case of protein sequences, the primary source of identifiers is UniProt. Entities such as chemical compounds are referenced by the ChEBI database. For the imported KEGG and AraCyc enzymes and chemical compounds, referencing was performed in an automated manner that is also applicable to newer additions. In addition, entries were automatically cross-referenced to external databases, such as UniProt (Schneider et al., 2005), TAIR (Rhee et al., 2003), Munich Information Center for Protein Sequences (MIPS) (Schoof et al., 2004), National Center for Biotechnology Information (NCBI) Entrez Gene (Maglott et al., 2005), KEGG COMPOUND (Kanehisa and Goto, 2000), and ChEBI (Degtyarenko et al., 2007). CURATED PATHWAYS The long-term goal in Arabidopsis Reactome is to establish a detailed set of curated pathways representing all major biological processes in Arabidopsis. Initially, metabolic pathways and the mitotic cell cycle were selected as contrasting processes that would challenge the plasticity of the data model and provide a foundation for data integration and modeling. An essential piece of information for a reaction to be incorporated into the curated area of Arabidopsis Reactome was the existence of experimental evidence, usually a reference to a published article. In addition to reactions and literature references, the data model contains fields for species, GO molecular function, subcellular location, and other relevant information that were filled out during the curatorial process. In some instances, Arabidopsis reactions imported from KEGG and AraCyc databases were used as structured reference material to start the curatorial process. When there was insufficient experimental evidence of a particular reaction in Arabidopsis but its existence could be inferred (for example, gene functions inferred from sequence similarity), then the equivalent reaction could be manually inferred from a different organism from which there was sufficient experimental evidence. For example, we deduced the binding of the five-subunit Replication factor C (RFC) onto DNA that results in the displacement of the polymerase α (POLA) on the basis of the experimentally confirmed activity in human and by the identification in Arabidopsis of all five RFC subunits containing the characteristic sequence motifs of other eukaryotic RFCs, as described by Shultz and Furukawa (Furukawa et al., 2003; Shultz et al., 2007). In the Arabidopsis Reactome user interface, manually inferred reactions are flagged in magenta to distinguish them clearly from the curated ones, which appear in blue. Once new experimental evidence becomes available in Arabidopsis for any of the inferred reactions, these reactions can be replaced by the experimentally determined examples during a scheduled pathway review and then will appear as blue arrows in the next public release. Inferred reactions are used sparingly so as not to jeopardize the integrity of a pathway. All curated and manually inferred reactions occupy the central part of the reaction map on the Arabidopsis Reactome home page (Figure 1
IMPORTED PATHWAYS We imported Arabidopsis metabolic pathways from KEGG (release 38.0) and AraCyc (release 2.5) databases into Arabidopsis Reactome as text files from their ftp servers. The files were parsed, and the data were stored in a MySQL relational database using custom database schemata developed to represent each source. Using the Perl XML Generator module (http://www.cpan.org/), these data were used to construct documents in the Reactome Author Tool native, XML-based, GKB format. The documents were then opened with the Author Tool, and reactions were joined manually to form pathways according to the pathway diagrams found at their source websites. These files were then imported into the Reactome Curation Tool that was used to deposit the data in the central Arabidopsis Reactome database. Once in the database, the arrows representing the reactions were manually laid out in the reaction map using the Reactome pathway visualization tool. Pathways involved in related or similar processes were laid out in close proximity to each other within either the KEGG or AraCyc areas of the reaction map. Importation software has been written to allow Arabidopsis Reactome to be updated from KEGG and AraCyc sources as new versions become available.Our analyses identified important differences between pathways represented in the KEGG and AraCyc databases that allowed us to improve data quality. Originally, these pathways were computationally predicted for the sequenced Arabidopsis genome using inference methods from GENES and MetaCyc databases, respectively (Kanehisa et al., 2002; Mueller et al., 2003). Since then, many AraCyc pathways have been manually curated. One of the consequences of the absence of manual curation of the Arabidopsis data in the KEGG database is that key enzymes, such as sucrose-6-phosphate phosphatase (SPP), do not appear on KEGG reaction diagrams. Similarly, although AraCyc describes the four SPP isoforms, it relies on computational inference and not literature references. Finally, SPPs are also absent from the Arabidopsis inferred part of the human Reactome since humans do not synthesize sucrose. SUBA (Heazlewood et al., 2007) and UniProt subcellular localization information was used to assign the subcellular location of the imported reactions. For example, the enzyme peroxisomal 2,4-dienoyl-CoA reductase was used to assign its catalyzing reaction “monovinyl protochlorophyllide a + NADPH <=> NADP+ + chlorophyllide a [AT3G12800]” and its components to the peroxisome. This substantially enriches the knowledge associated with metabolic pathway data. ELECTRONIC INFERENCE OF ARABIDOPSIS PATHWAYS ONTO OTHER PLANTS The predicted proteomes from the published genomes of rice (Oryza sativa; International Rice Genome Sequencing Project, 2005), poplar (Populas trichocarpa; Tuskan et al., 2006), the moss Physcomitrella patens (Rensing et al., 2008), and the two grape varieties (Vitis vinifera and V. vinifera var Pinot Noir; Jaillon et al., 2007; Velasco et al., 2007) were downloaded from their websites. Using the NCBI BLASTP algorithm, we matched the Arabidopsis proteome downloaded from the TAIR ftp site to the predicted proteomes of these species and then fed the results to the OrthoMCL algorithm (Li et al., 2003) to identify and cluster the orthologous proteins into groups. OrthoMCL results were then appropriately formatted so they could be used by the scripts included in the Reactome system to produce the equivalent organism-specific reactions and pathways. A total of 8269 reactions and 2196 proteins from Arabidopsis were projected onto the five plant species (Table 1) to identify evolutionary conserved pathways. These projected pathways can be used for overlaying “-omics” data, cross-species comparisons of biological processes, rapid proteome annotation of a newly sequenced plant genome, and to provide a template for the establishment of other species-specific Reactome knowledgebases.
Comparison of the Arabidopsis cell cycle with the electronically inferred cell cycle in rice, poplar, P. patens, and the two grape varieties showed that almost 60% of the reactions were conserved between Arabidopsis and poplar, compared with 45% in rice and 33% in moss (Table 2). In the case of grape, the two genomes showed different levels of conserved cell cycle reactions (51 and 39%). This may be attributable to different gene finding methods or problems in genome assembly where the genome is highly heterozygous. However, both S and M phases were more conserved in all species compared with the G1, G0, and G2 phases and their transitions. These data can be seen on the Arabidopsis Reactome website when the cross-species comparison box is ticked. By revealing apparent similarities and differences across species, Arabidopsis Reactome identifies gaps in pathways that can be used to either confirm a different molecular basis for equivalent phenomena or to improve existing gene models in light of comparative evidence.
THE USER INTERFACE The Arabidopsis Reactome home page is divided into two main panels: the reaction map and the table of contents (Figure 1 Entries in Arabidopsis Reactome can be searched using simple or advanced search facilities located at the top of the home page. By selecting a reaction, the web interface of Arabidopsis Reactome returns an increasing level of detailed information. This includes a description, the reaction components (compounds, enzymes, etc.), GO annotation (subcellular location and molecular function), preceding and following events, organism name, equivalent events in other organisms, reference in the literature, and, where applicable, links to external databases, such as UniProt, TAIR, MIPS, NCBI Entrez Gene, KEGG COMPOUND, and ChEBI (Figure 2
DATA INTEGRATION AND ANALYSIS A key feature of a curated knowledgebase is its utility for integrating diverse data sets into a comprehensive description of biological processes. SkyPainter is a useful feature of the Reactome system that allows researchers to visualize and analyze their own data sets in relation to the reaction maps. It can be found on the top menu bar of the Arabidopsis Reactome home page. Researchers can upload a list of genes or other identifiers to color the reaction map in a number of ways. The SkyPainter module recognizes a range of gene identifiers, such as Arabidopsis Genome Initiative code, Affymetrix probe set, and UniProt ID. Plants with electronically inferred pathways can also be searched by selecting the plant species on the SkyPainter page and using the appropriate gene or sequence identifier. We demonstrate the use of SkyPainter using two sets of published experimental data: for the overrepresentation analysis of glucose upregulated genes (Li et al., 2006) and for the visualization of the progression of gene expression during the cell cycle (Menges et al., 2003). In the first example, SkyPainter was used for the overrepresentation analysis of genes maximally induced at 6 h in response to glucose treatment (Figure 3
In the second example, SkyPainter was used to represent dynamic changes in expression of cell cycle genes throughout the cell cycle. Using publicly available microarray data available from NASCARRAYS (Craigon et al., 2004), we selected experiment NASCARRAYS-360 on genome-wide cell cycle studies. The experiment was designed to follow the expression of genes throughout the cell cycle in a synchronized Arabidopsis cell suspension. The cells were synchronized using aphidocolin, which upon removal allowed the resumption of the S phase and progression through the cell cycle (Menges et al., 2003). Samples were taken at 10 time points between 0 and 19 h after the removal of aphidocolin and analyzed using the Affymetrix ATH1 array (Menges et al., 2003). The expression data (normalized using Affymetrix MAS 5.0 scaling protocol) was downloaded, and the expression values of the 100 cell cycle genes that correspond to the genes curated within the cell cycle module of the Arabidopsis Reactome were extracted. These expression values were then used with Skypainter to create a movie that follows the quantitative expression levels of mitotic genes through the various phases of the cell cycle and reaching its peak at the 10-h time point (Figure 4
The large increase in protein–protein interaction data (Cui et al., 2007; Geisler-Lee et al., 2007; Van Leene et al., 2007) provides an exceptionally rich source for extending known pathways and complexes. We integrated protein interaction data with the cell cycle pathway in Arabidopsis Reactome to identify new protein clusters associated with the cell cycle (see Supplemental Methods Online). The cell cycle data from Arabidopsis Reactome was imported into Cytoscape in SBML format, and its proteins were joined with proteins from the experimental cell cycle interactome determined by protein mass spectrometry (Van Leene et al., 2007) to form a cell cycle network (see Supplemental Figures 2 and 3 online). We then enriched this network by importing the comprehensive set of predicted protein interactions from AtPID, which contains 11,708 proteins and 24,419 interactions (Cui et al., 2007) into Cytoscape and merged this data with the cell cycle network (Figure 5
DATA AND SOFTWARE AVAILABILITY The data contained in Arabidopsis Reactome can be viewed in Cytoscape (Shannon et al., 2003) and Protégé (http://protege.stanford.edu/) or can be exported in SBML level 2 (Finney and Hucka, 2003), BioPax (http://www.biopax.org/), SVG, and PDF format. The entire content of the Arabidopsis Reactome and tools for curating biological pathways can be downloaded from the Arabidopsis Reactome website by following the “Download” link. COMPARISON AND INTEGRATION WITH OTHER PATHWAY RESOURCES There has been a rapid growth in the availability of pathway tools since the focus of biological research has moved toward a systems understanding of biological processes. The important features of a pathway tool are ease of use, quality of data model and experimental evidence, wide coverage, standard format, facilities to integrate data sets, and that it is open source and open access. AraCyc and KEGG are both comprehensive and standardized pathway resources from which we have already incorporated the data into Arabidopsis Reactome. AraCyc has the widest coverage, mainly of metabolic pathways, many of which have been manually curated. The standard BioPAX exchange format means that these pathways are widely used in other pathway resources, such as MetNet (Wurtele et al., 2007) and Virtual Plant (Gutierrez et al., 2007). KEGG Pathway is limited by the data being mainly homology based. BioPathAt (Lange and Ghassemian, 2005) and MetNet are both sources of curated pathway information that can be used to aid the manual curation of pathways in the future; however, the lack of standardized exchange formats means that these pathways cannot be automatically incorporated into Arabidopsis Reactome. MAPMAN (Thimm et al., 2004) is a user-driven tool for displaying large genomics data sets onto diagrams of metabolic pathways and other processes. Its wide coverage of metabolic and other biological processes, expert curation, and useful hierarchical ontologies with gene mappings make it a popular choice for integrating datasets. However, it lacks a standard format for exchanging pathway information with other pathway resources, and it lacks the level of detail of reactions compared with Reactome due to its ontology-based structure. Currently, we are developing a common exchange format between Arabidopsis Reactome and MAPMAN. SUMMARY AND FUTURE PROSPECTS The human Reactome team has established a data model and software tools that allow scientists to capture a wide variety of biological knowledge in a form allowing computational manipulation, and these have been developed as a knowledgebase for human biological processes (Joshi-Tope et al., 2005). Apart from the human Reactome, there are currently four other Reactomes under development for Drosophila melanogaster, Caenorhabditis elegans, Archea, and chicken. In this essay, we have demonstrated the application of the Reactome system to the reference plant species Arabidopsis and shown how the data model accurately captures many aspects of different biological processes. Although these models currently encompass only a small fraction of the molecular entities in Arabidopsis, this is an important first step toward a quality-assured whole-genome network reconstruction. The current release of Arabidopsis Reactome (version 2) describes the functions of 276 proteins (233 reactions) curated from 262 primary literature sources. These proteins function in a number of pathways, including the mitotic cell cycle, DNA synthesis, cell cycle checkpoints, alternative cell cycles (including endoreduplication), and several metabolic pathways. Additionally, 1919 proteins have been imported from KEGG (Kanehisa and Goto, 2000) and AraCyc (Mueller et al., 2003) pathway databases. Together with the curated proteins, they represent 8% of the estimated 27,238 Arabidopsis protein-coding genes (http://www.arabidopsis.org/portals/genAnnotation/). The curated and imported proteins in Arabidopsis Reactome appear in a total of 8269 reactions from 318 superpathways. The Arabidopsis proteome was electronically projected in Arabidopsis Reactome onto five other published plant and moss predicted proteomes via orthologous transfer. This allows the visualization of the equivalent reactions and pathways on other sequenced plant species and assesses pathway conservation between species. Researchers can interact with the web-based interface to browse pathways and visualize “-omics” data on the reaction map. Arabidopsis Reactome pathways allow users to formulate hypotheses based on a much wider range of data and knowledge than hitherto possible. These include studies of network topology (Siegal et al., 2007), network properties such as instability (Wilhelm, 2007) the determination of system attractors (Bornholdt, 2005; Davidich and Bornholdt, 2008), metabolic flux analysis (Morgan and Rhodes, 2002), and the ability to validate algorithms used for the reverse engineering of networks from data (Bansal et al., 2007). The addition of kinetic functions and their parameters will enable approaches to dynamical modeling and metabolic control analysis (Morgan and Rhodes, 2002). We have demonstrated three examples of this functionality. First, orthologous transfer from Arabidopsis leads to the identification of candidate genes that are potentially involved in the same pathway in other species and identified potential gaps in gene function between plant species. Such predictions can be tested using functional genomics approaches and reannotation. Second, overlaying gene expression data on pathways using SkyPainter identified coordinately regulated genes within pathways and visualizes gene expression data in a clear and intuitive way. Finally, by merging curated Arabidopsis Reactome modules, such as the cell cycle with experimental (Van Leene et al., 2007) and predicted protein–protein interactions (Cui et al., 2007), we identified clusters of genes that can be tested for cell cycle–related phenotypes. It is conceivable that any other pairwise inference (e.g., genetic interaction or protein domain) could also be used as a connection (edge or weighted edge) in the Cytoscape graph and analyzed with a network query application, such as MCODE, for example, to establish genetic interaction networks. Most of the knowledge of biological processes in plants is described as free text in the published literature and other resources. The Reactome approach is to extract this distributed knowledge using authors and curators who understand the scientific field and can make expert judgments on what constitutes good evidence and can resolve ambiguities in the use of natural language. This process requires a high degree of knowledge and commitment, but once achieved for reference species such as Arabidopsis, it will potentially benefit the wider plant research community since such pathway knowledge can be projected electronically onto other plant genomes using Reactome. Currently, knowledge from Arabidopsis has been electronically applied to poplar, rice, grape, and P. patens and is available from the Arabidopsis Reactome website. With increased content, it should be possible to annotate newly sequenced plant genomes by orthologous transfer from Arabidopsis Reactome using algorithms such as OrthoMCL (Li et al., 2003) and InParanoid (Remm et al., 2001). As 35 plant genome-sequencing projects are currently underway (http://www.ncbi.nlm.nih.gov/genomes/leuks.cgi?p3=11:Plants), this function of Arabidopsis Reactome will be highly relevant. Current methods of knowledge generation involve the production of text and images that are largely intractable to electronic manipulation. Text mining is currently not capable of reconstructing this knowledge in a computational form (Jensen et al., 2006); therefore, it is currently merely an aid to the curation process rather than a comprehensive solution. One way to facilitate the growth and uptake of Arabidopsis Reactome among users is through a collaborative model involving the editorial process and publishers. Such an approach has recently been proposed that will require authors to deposit data in the TAIR database as part of the publication process (http://www.arabidopsis.org/news/plant_phys_partnership.txt). Arabidopsis Reactome is ideally suited for this type of activity as it captures a wide variety of knowledge about biological entities, such as genes, transcripts, proteins, modified proteins, and metabolites. This and other examples would promote its use, and the benefits of high-quality electronically available knowledge would be realized more widely. The standardization of pathway knowledge representation is critically important in pathway curation as it aids the integration of data in systems biology to generate new hypotheses (Draghici et al., 2007). Arabidopsis Reactome provides a powerful foundation, based on careful and comprehensive curation of the literature, which will facilitate many areas of biological research in plants and help to integrate biological knowledge. Supplemental Data The following materials are available in the online version of this article.
[Supplemental Data]
Acknowledgments We thank the Reactome development team for their support and useful criticism and Yunhai Li for the glucose microarray data and analysis. This work was supported by funds from the Biotechnology and Biological Sciences Research Council (BBSRC) under the BBSRC Bioinformatics and e-Science initiative (BBS/B/13829) and by the European Commission Project Arabidopsis GROwth Network integrating OMICS technologies (Contract 037704). E.E.S., I.V., E.B., G.W., P.D., and L.D.S. are supported by a grant from the U.S. National Institutes of Health, a grant from the European Union Sixth Framework Programme, and subcontracts from the National Institutes of Health Cell Migration Consortium and the European Bioinformatics Institute Industry Programme. Notes [W]Online version contains Web-only data. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Nature. 2000 Dec 14; 408(6814):796-815.
[Nature. 2000]Nature. 2005 Aug 11; 436(7052):793-800.
[Nature. 2005]Science. 2006 Sep 15; 313(5793):1596-604.
[Science. 2006]Genome Res. 2003 Nov; 13(11):2498-504.
[Genome Res. 2003]PLoS One. 2007 Dec 19; 2(12):e1326.
[PLoS One. 2007]Nat Genet. 2000 May; 25(1):25-9.
[Nat Genet. 2000]Plant Physiol. 2003 Jun; 132(2):453-60.
[Plant Physiol. 2003]Plant J. 2004 Mar; 37(6):914-39.
[Plant J. 2004]Genome Biol. 2007; 8(1):R7.
[Genome Biol. 2007]Bioinformatics. 2006 Jun 1; 22(11):1383-90.
[Bioinformatics. 2006]Cold Spring Harb Symp Quant Biol. 2003; 68():237-43.
[Cold Spring Harb Symp Quant Biol. 2003]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D428-32.
[Nucleic Acids Res. 2005]Genome Biol. 2007; 8(3):R39.
[Genome Biol. 2007]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D428-32.
[Nucleic Acids Res. 2005]Plant Physiol. 2005 May; 138(1):59-66.
[Plant Physiol. 2005]Nucleic Acids Res. 2003 Jan 1; 31(1):224-8.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D373-6.
[Nucleic Acids Res. 2004]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D54-8.
[Nucleic Acids Res. 2005]Plant Mol Biol. 2003 Sep; 53(1-2):15-25.
[Plant Mol Biol. 2003]Plant Physiol. 2007 Aug; 144(4):1697-714.
[Plant Physiol. 2007]Nucleic Acids Res. 2002 Jan 1; 30(1):42-6.
[Nucleic Acids Res. 2002]Plant Physiol. 2003 Jun; 132(2):453-60.
[Plant Physiol. 2003]Nucleic Acids Res. 2007 Jan; 35(Database issue):D213-8.
[Nucleic Acids Res. 2007]Nature. 2005 Aug 11; 436(7052):793-800.
[Nature. 2005]Science. 2006 Sep 15; 313(5793):1596-604.
[Science. 2006]Science. 2008 Jan 4; 319(5859):64-9.
[Science. 2008]Nature. 2007 Sep 27; 449(7161):463-7.
[Nature. 2007]PLoS One. 2007 Dec 19; 2(12):e1326.
[PLoS One. 2007]Nucleic Acids Res. 2002 Jan 1; 30(1):42-6.
[Nucleic Acids Res. 2002]Plant Physiol. 2003 Jun; 132(2):453-60.
[Plant Physiol. 2003]Genome Res. 2006 Mar; 16(3):414-27.
[Genome Res. 2006]Plant Mol Biol. 2003 Nov; 53(4):423-42.
[Plant Mol Biol. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D575-7.
[Nucleic Acids Res. 2004]Plant Mol Biol. 2003 Nov; 53(4):423-42.
[Plant Mol Biol. 2003]Nucleic Acids Res. 2008 Jan; 36(Database issue):D999-1008.
[Nucleic Acids Res. 2008]Plant Physiol. 2007 Oct; 145(2):317-29.
[Plant Physiol. 2007]Mol Cell Proteomics. 2007 Jul; 6(7):1226-38.
[Mol Cell Proteomics. 2007]BMC Bioinformatics. 2003 Jan 13; 4():2.
[BMC Bioinformatics. 2003]J Cell Sci. 2006 Aug 1; 119(Pt 15):3227-37.
[J Cell Sci. 2006]Genome Res. 2003 Nov; 13(11):2498-504.
[Genome Res. 2003]Biochem Soc Trans. 2003 Dec; 31(Pt 6):1472-3.
[Biochem Soc Trans. 2003]Genome Biol. 2007; 8(1):R7.
[Genome Biol. 2007]Phytochemistry. 2005 Feb; 66(4):413-51.
[Phytochemistry. 2005]Plant J. 2004 Mar; 37(6):914-39.
[Plant J. 2004]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D428-32.
[Nucleic Acids Res. 2005]Nucleic Acids Res. 2000 Jan 1; 28(1):27-30.
[Nucleic Acids Res. 2000]Plant Physiol. 2003 Jun; 132(2):453-60.
[Plant Physiol. 2003]Genetica. 2007 Jan; 129(1):83-103.
[Genetica. 2007]Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Jul; 76(1 Pt 1):011911.
[Phys Rev E Stat Nonlin Soft Matter Phys. 2007]Science. 2005 Oct 21; 310(5747):449-51.
[Science. 2005]PLoS One. 2008 Feb 27; 3(2):e1672.
[PLoS One. 2008]Metab Eng. 2002 Jan; 4(1):80-9.
[Metab Eng. 2002]Genome Res. 2003 Sep; 13(9):2178-89.
[Genome Res. 2003]J Mol Biol. 2001 Dec 14; 314(5):1041-52.
[J Mol Biol. 2001]Nat Rev Genet. 2006 Feb; 7(2):119-29.
[Nat Rev Genet. 2006]Genome Res. 2007 Oct; 17(10):1537-45.
[Genome Res. 2007]