![]() | ![]() |
Formats:
|
||||||||
Copyright © 2006 The Author(s) PATRIC: The VBI PathoSystems Resource Integration Center 1Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA 2Department of Microbiology and Immunology, Loyola University Medical Center, Maywood, IL 60153, USA 3Center for Human–Computer Interaction, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA 4Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA 5VA-MD Regional College of Veterinary Medicine, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA 6Centers for Disease Control and Prevention, Atlanta, GA 30333, USA *To whom correspondence should be addressed. Tel: +1 540 231 5428; Fax: +1 540 231 2606; Email: eesnyder/at/vbi.vt.edu Present addresses: Y. Guo, NIAID Office of Technology Information Systems (OTIS), Contractor, MSD Inc., Bethesda, MD 20892, USA J. D. Eckart, Advanced Technology Division, Aflac, Columbus, GA 31999, USA Received August 15, 2006; Revised September 14, 2006; Accepted October 1, 2006. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. This article has been cited by other articles in PMC.Abstract The PathoSystems Resource Integration Center (PATRIC) is one of eight Bioinformatics Resource Centers (BRCs) funded by the National Institute of Allergy and Infection Diseases (NIAID) to create a data and analysis resource for selected NIAID priority pathogens, specifically proteobacteria of the genera Brucella, Rickettsia and Coxiella, and corona-, calici- and lyssaviruses and viruses associated with hepatitis A and E. The goal of the project is to provide a comprehensive bioinformatics resource for these pathogens, including consistently annotated genome, proteome and metabolic pathway data to facilitate research into counter-measures, including drugs, vaccines and diagnostics. The project's curation strategy has three prongs: ‘breadth first’ beginning with whole-genome and proteome curation using standardized protocols, a ‘targeted’ approach addressing the specific needs of researchers and an integrative strategy to leverage high-throughput experimental data (e.g. microarrays, proteomics) and literature. The PATRIC infrastructure consists of a relational database, analytical pipelines and a website which supports browsing, querying, data visualization and the ability to download raw and curated data in standard formats. At present, the site warehouses complete sequences for 17 bacterial and 332 viral genomes. The PATRIC website (https://patric.vbi.vt.edu) will continually grow with the addition of data, analysis and functionality over the course of the project. INTRODUCTION Bioterrorism became an important national security issue (1) following the deliberate release of anthrax spores into the US postal system in October 2001 (2). Meanwhile, emerging and reemerging infectious diseases (3) have had profound effects on public health in many parts of the world. Recognizing the pathogens responsible for these diseases as threats to homeland security, the National Institute of Allergy and Infectious Diseases (NIAID) of the US National Institutes of Health has embarked upon a series of initiatives aimed at developing a comprehensive understanding of the organisms identified as NIAID category A, B and C priority pathogens (for a complete list, see http://www3.niaid.nih.gov/biodefense/bandc_priority.htm). The Virginia Bioinformatics Institute's PathoSystems Resource Integration Center (PATRIC) is one of eight Bioinformatics Resource Centers (BRCs) established to study the NIAID priority pathogens and develop these information resources for the research community. While database resources for bacterial ((4) and those cited in (5)) and viral (6,7) genomics have been available for number of years, this project seeks to integrate genomics with comparative genomics and pathway analysis and ultimately proteomics, transcriptomics, immune epitope mapping, host-response and other downstream technologies. The goal is to help researchers and clinicians better detect and respond to biothreat agents (and infectious diseases in general) by facilitating the development of diagnostics, vaccines and therapeutics. This requires access to comprehensive information on the molecular biology, physiology and pathogenicity of these organisms. THE PATHOSYSTEMS RESOURCE INTEGRATION CENTER PATRIC is responsible for the eight organism categories listed in Table 1. The three genera of proteobacteria are all intracellular pathogens that are known or potential biowarfare agents. In the 1950s, Brucella suis was the first infectious agent developed for use as a biowarfare agent by the United States. Brucellosis, caused by Brucella sp., is an important agricultural disease infecting cattle, sheep, goats and swine as well as humans. It is highly contagious and readily dispersed as an aerosol (8). Coxiella burnetii, the causative agent of Q fever, is a highly infectious agent of relatively low lethality. Its interest as a biowarfare agent stems from its high infectivity, stability to heat and desiccation and potential for aerosol dispersal. The genus Rickettsia contains the organisms responsible for numerous types of typhus and arthropod-borne spotted fevers (9,10). Rickettsia prowazekii was developed as a bioweapon by the USSR in the 1930s and was used by the Japanese in Manchuria during World War II (11).
The five categories of viruses studied by PATRIC are all positive-strand ssRNA viruses, with the exception of Lyssaviruses, which have negative-strand ssRNA genomes. While there are no reports of any of these viruses being weaponized, they represent the causative agents for a number of emerging and reemerging diseases including Severe Acute Respiratory Syndrome (SARS), rabies and transmissible gastroenteritis. Recombinant vaccines for these viruses are either still in development or unavailable in areas where these infections are endemic or epidemic, compounding the public health risk. The pace of research on these organisms has increased significantly since the turn of the millennium, with outbreaks, such as that of SARS in 2003 (12,13), spawning a flurry of scientific activity. The widespread use of automated DNA sequencing, microarray gene expression analysis and other high-throughput laboratory technologies has increased the volume of data produced, but not necessarily its accessibility. Currently, significant genomics and bioinformatics expertise is required to extract, process and interpret this wealth of data. To address these problems, PATRIC has created an interdisciplinary team of bioinformaticians, software engineers, computational biologists and organism experts to build a publicly accessible resource aimed at providing high quality, analyzed and curated data to the infectious disease community working on these pathogens. To date, we have achieved the following objectives:
Longer-term goals include integration of data from gene expression and proteomics experiments (including host-response), predicted protein and RNA secondary and tertiary structures, and well-cataloged literature compilations. Ultimately, we hope our website will become an essential tool for researchers working on these pathogens and provide networking opportunities within the pathogen research communities. DATABASE DESCRIPTION PATRIC is implemented on Oracle 9i RDBMS using the Genomics Unified Schema (GUS) version 3.5, developed at the Computational Biology and Informatics Laboratory at the University of Pennsylvania (see http://www.gusdb.org). GUS is used to store all sequence data and associated annotation with the exception of metabolic pathway data, which is stored in a separate schema developed for use with the Pathway Tools software (15) (see section 4.3). To minimize modifications to GUS and ensure compatibility with future releases, the Curation/Annotation Schema was developed to support PATRIC-specific curation activities. The database is populated with all known full-length or nearly full-length genomic sequences for the eight organism categories listed in Table 1. Automated scripts query GenBank (16) daily to identify new or updated records. The corresponding sequences, annotation and associated literature are retrieved from NCBI and loaded following curatorial review to remove redundancies and assign unique names to each genome. RefSeq (17) records are used when available to take advantage of their more thorough and consistent annotation. Draft genome sequences from Joint Genome Institute (JGI)/Los Alamos National Labs (LANL) and the NIAID-funded Microbial Sequencing Centers will also be part of the PATRIC dataset. In addition to genome sequences and primary annotation from the original GenBank or RefSeq entry, the database stores the results of all automated and manual analyses described in the following section. DATA ANALYSIS AND CURATION Our motivation to invest resources in sequence-level annotation is to maintain a high standard of quality over time. Even when good reference annotation is available, there are many reasons to re-annotate microbial genomes (18). GenBank data are of variable quality and there is a trend towards depositing draft genome sequences with no annotation at all. In-house annotation also allows us to present supporting evidence and keep the annotation up to date. This is of particular importance for alignment-based annotation since databases such as GenBank (16) and UniProt (19) continue to grow at a prodigious rate. Genome curation Due to the large number of closely related genomes in each organism category, we have adopted an annotation strategy in which automated methods are applied to all genomes while detailed manual curation is applied to a limited number of reference genomes. The species B.suis 1330, C.burnetii RSA 493 and R.prowazekii str. Madrid E were chosen as reference genomes for their respective categories. Each viral category has (or will have) multiple reference genomes, representing phylogenetically diverse strains. Automated nucleic acid and protein sequence annotation is accomplished using a Java-based genome annotation pipeline (unpublished), which reads an XML script containing the names and parameters of the analytical applications. The bacterial pipeline executes the gene prediction programs Glimmer (20) and GeneMark (21,22) followed by start site correction programs RBSfinder (23) and TICO (24). BLASTX (25) searches the non-redundant protein database, complementing the ab initio gene prediction methods. RNA genes are identified by tRNAscan-SE (26) and BLASTN searching against a ribosomal RNA database (27,28). The annotation protocol containing the full list of applications and parameters is available online at https://patric.vbi.vt.edu/documents/ under ‘standard operating procedures’. Results of the genome analysis pipeline are merged with original GenBank or RefSeq features for automated interpretation. A decision tree is used to classify genes into categories based on the level of agreement between the various prediction methods. Genes that are unambiguously predicted by multiple methods are automatically ‘finalized’, creating new ‘gene’, ‘CDS’ and/or ‘[t/r]RNA’ features. The remaining genes are marked for manual curation. For viral genomes, an abbreviated pipeline is executed that emphasizes sequence alignment for gene identification and employs GeneMarkHMM optimized for mammalian (host) genomes. Proteome curation After curatorial review, finalized protein-coding (CDS) features are translated and subjected to another pipeline executing InterProScan and structure prediction methods such as MEMSAT 2 (29). Currently, each protein is associated with GO terms (30), TIGRroles, Enzyme Commission numbers (31) based on Pfam (32) and TIGRfam alignments (for a description of TIGRfam and TIGRroles, see: http://www.tigr.org/TIGRFAMs/). The protocol for automated proteome annotation is also available online. Manually curated protein sequences will be available in early 2007. Once protein sequences are inferred from each genome in an organism category, putative ortholog groups are generated using BLASTP for all pairwise genome combinations and applying the conventional bidirectional-best-hit (BBH) criterion (33). While putative ortholog groups within the bacterial categories are generally well defined, many viral proteins cannot be readily clustered using the stringent BBH criterion. This is an active area of curation. Using the ortholog groups as a starting point, a reference protein list is created for each bacterial category consisting of the proteins of the reference genome (each representing one ortholog group) plus a representative protein from each ortholog group identified in the associated genomes. A gene occurring in only a single genome constitutes a ‘group’ of one and would be included in the reference list. The reference protein lists will be manually curated and include, whenever possible, detailed functional descriptions, gene symbols, GO terms and EC numbers. Thus, every protein in the database will either be manually curated or be linked to an ortholog group member that has been manually curated. The ortholog groups are further processed to create multiple sequence alignments (MSAs) using MUSCLE (34) with default parameters. Phylogenetic estimations using the neighbor-joining method (35) were created based on trimmed alignments using PHYLIP (36). Trees were validated by bootstrapping (37) using a minimum of 100 replicates. Metabolic pathway curation To help users understand the function of the bacterial proteins in context, we have adopted the Pathway Tools system (15) to derive pathways from genome annotation and to fill potential gaps in annotation known as pathway holes. The system takes a list of protein names, descriptions and EC numbers as input. Proteins with EC numbers can be assigned roles directly; the roles of other proteins are suggested by lexicographic analysis of descriptive information and/or analysis of gene order from homologous regions of related genomes and confirmed or rejected by the curation staff. The output is a database with integrated web server that allows users to browse and query the organism's metabolic pathways. This system has been integrated with the PATRIC web site, allowing users to access pathway information for all bacterial reference genomes. The current analysis was based on preexisting RefSeq or GenBank annotations; later releases will incorporate data curated in house, unifying the genomic and pathway versions of the data. The analysis of pathways can facilitate the identification of metabolic choke points, critical enzymes that could be targeted by drugs that may have valuable antimicrobial properties. Pathway analysis can also yield clues to pathogenesis by comparing virulent and avirulent strains and examining the roles of genes not present in both strains. Website The PATRIC website is hosted on a Sun Microsystems v20z server running SuSE Linux using the Apache web server. Applications are written in PHP and Perl, accessing data from an Oracle 9i server hosted on a Sun Microsystems E15000 running Sun OS. The conceptual organization of the website is described in Figure 1
The organism category page also contains links to a pathogen summary, ortholog group table and a phylogenetic tree based on 16S rRNAs for bacteria or a selected protein family for viruses. For bacterial genomes, detailed pathosystem information is available, provided by the VBI PathInfo documents (14). The ortholog group table shows the presence or absence of reference gene list proteins for each organism in the organism category and provides links to an MSA and tree viewer and the Base-By-Base MSA editor (38) for every ortholog group. Base-By-Base allows users to add sequences to the MSA, recalculate it using Clustal (39), T-Coffee (40) or MUSCLE and generate the corresponding tree using neighbor-joining or a number of clustering algorithms. The PATRIC website also supports analytical and query tools. A database search page allows user-supplied sequences to be BLASTed against reference and curated sequences from PATRIC organisms. The page also supports MUMMER (41) comparisons between genomes in the database or with a user-supplied sequence. A query tool is available throughout the site by which users can retrieve genes by name, ID, description, as well as GO and EC identifiers and descriptions. Questions, comments and suggestions concerning the website and its contents may be submitted via the ‘feedback’ page, accessible from the menu bar. AVAILABILITY The PATRIC database is hosted at the Virginia Bioinformatics Institute at Virginia Tech and can be accessed via web browser at https://patric.vbi.vt.edu. Sequences and annotation in GFF3 format (see http://song.sourceforge.net/gff3.shtml) can be downloaded by following the ‘downloads’ link on the main menu bar. GFF3 files are also available through BRC-Central at: http://brc-central.org. FUTURE DIRECTIONS This paper presents the first detailed description of the PATRIC website. Future development will advance on several fronts. Genome and proteome curation will continue, complemented by improved tools for query, analysis and visualization. For viruses, we will transition to the more widely accepted ICTV taxonomy (42). The website's user interface is being enhanced to integrate organism-, tool/task- and data-centric approaches to data access, allowing users more efficient and effective access to PATRIC resources. This will be followed up by prioritized curation targeted at potential drug and vaccine targets, virulence factors and genes with differential representation or polymorphisms associated with clinically significant phenotypes. Leveraging another NIAID-funded VBI project, the Administrative Resource for Biodefense Proteomics Research (http://www.proteomicsresource.org/), we plan to integrate expression profiling and proteomics data from pathogen and host to better understand the pathosystem's biology and help the community identify targets for counter-measures. The integration of these disparate data types into a single, easy-to-use system is a goal that we anticipate will enable pathogen researchers to make full use of available data to develop diagnostics, vaccines and therapeutics. Acknowledgments We would like to thank Chris Upton for making the application Base-By-Base (38) available for incorporation into our website and to Peter Karp for making a similar contribution with his Pathway Tools software (15). This work is funded through NIAID contract HHSN266200400035C to Bruno Sobral. Funding to pay the Open Access publication charges for this article was provided by NIAID contract HHSN266200400035C to Bruno Sobral. Conflict of interest statement. None declared. REFERENCES 1. Fauci A.S. Biodefence on the research agenda. Nature. 2003;421:787. [PubMed] 2. Jernigan D.B., Raghunathan P.L., Bell B.P., Brechner R., Bresnitz E.A., Butler J.C., Cetron M., Cohen M., Doyle T., Fischer M., et al. Investigation of bioterrorism-related anthrax, United States, 2001: epidemiologic findings. Emerg. Infect. Dis. 2002;8:1019–1028. [PubMed] 3. Morens D.M., Folkers G.K., Fauci A.S. The challenge of emerging and re-emerging infectious diseases. Nature. 2004;430:242–249. [PubMed] 4. Peterson J.D., Umayam L.A., Dickinson T., Hickey E.K., White O. The comprehensive microbial resource. Nucleic Acids Res. 2001;29:123–125. [PubMed] 5. Chaudhuri R.R., Pallen M.J. xBASE, a collection of online databases for bacterial comparative genomics. Nucleic Acids Res. 2006;34:D335–D337. [PubMed] 6. Esteban D.J., Da Silva M., Upton C. New bioinformatics tools for viral genome analyses at Viral Bioinformatics—Canada. Pharmacogenomics. 2005;6:271–280. [PubMed] 7. Kulkarni-Kale U., Bhosle S., Manjari G.S., Kolaskar A.S. VirGen: a comprehensive viral genome resource. Nucleic Acids Res. 2004;32:D289–D292. [PubMed] 8. Bossi P., Tegnell A., Baka A., Van Loock F., Hendriks J., Werner A., Maidhof H., Gouvras G. Bichat guidelines for the clinical management of brucellosis and bioterrorism-related brucellosis. Euro Surveill. 2004;9:E15–E16. [PubMed] 9. Kelly D.J., Richards A.L., Temenak J., Strickman D., Dasch G.A. The past and present threat of rickettsial diseases to military medicine and international public health. Clin. Infect. Dis. 2002;34:S145–S169. [PubMed] 10. Azad A.F., Beard C.B. Rickettsial pathogens and their arthropod vectors. Emerg. Infect. Dis. 1998;4:179–186. [PubMed] 11. Walker D.H. Principles of the malicious use of infectious agents to create terror: reasons for concern for organisms of the genus Rickettsia. Ann. N Y Acad Sci. 2003;990:739–742. [PubMed] 12. Centers for Disease Control and Prevention (CDC). Outbreak of severe acute respiratory syndrome—worldwide, 2003. MMWR Morb. Mortal. Wkly Rep. 2003;52:226–228. [PubMed] 13. Fouchier R.A., Kuiken T., Schutten M., van Amerongen G., van Doornum G.J., van den Hoogen B.G., Peiris M., Lim W., Stohr K., Osterhaus A.D. Aetiology: Koch's postulates fulfilled for SARS virus. Nature. 2003;423:240. [PubMed] 14. He Y., Vines R.R., Wattam A.R., Abramochkin G.V., Dickerman A.W., Eckart J.D., Sobral B.W. PIML: the Pathogen Information Markup Language. Bioinformatics. 2005;21:116–121. [PubMed] 15. Karp P.D., Paley S., Romero P. The Pathway Tools software. Bioinformatics. 2002;18(Suppl. 1):S225–S232. [PubMed] 16. Benson D.A., Karsch-Mizrachi I., Lipman D.J., Ostell J., Wheeler D.L. GenBank. Nucleic Acids Res. 2006;34:D16–D20. [PubMed] 17. Pruitt K.D., Tatusova T., Maglott D.R. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2005;33:D501–D504. [PubMed] 18. Ouzounis C.A., Karp P.D. The past, present and future of genome-wide re-annotation. Genome Biol. 2002;3:COMMENT2001. [PubMed] 19. Wu C.H., Apweiler R., Bairoch A., Natale D.A., Barker W.C., Boeckmann B., Ferro S., Gasteiger E., Huang H., Lopez R., et al. The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Res. 2006;34:D187–D191. [PubMed] 20. Delcher A.L., Harmon D., Kasif S., White O., Salzberg S.L. Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 1999;27:4636–4641. [PubMed] 21. Lukashin A.V., Borodovsky M. GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res. 1998;26:1107–1115. [PubMed] 22. Borodovsky M., McIninch J.D. GenMark: parallel gene recognition for both DNA strands. Comput. Chem. 1993;17:123–133. 23. Suzek B.E., Ermolaeva M.D., Schreiber M., Salzberg S.L. A probabilistic method for identifying start codons in bacterial genomes. Bioinformatics. 2001;17:1123–1130. [PubMed] 24. Tech M., Pfeifer N., Morgenstern B., Meinicke P. TICO: a tool for improving predictions of prokaryotic translation initiation sites. Bioinformatics. 2005;21:3568–3569. [PubMed] 25. Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. [PubMed] 26. Lowe T.M., Eddy S.R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–964. [PubMed] 27. Wuyts J., Perriere G., Van De Peer Y. The European ribosomal RNA database. Nucleic Acids Res. 2004;32:D101–D103. [PubMed] 28. Cannone J.J., Subramanian S., Schnare M.N., Collett J.R., D'Souza L.M., Du Y., Feng B., Lin N., Madabusi L.V., Muller K.M., et al. The comparative RNA web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics. 2002;3:2. [PubMed] 29. Jones D.T., Taylor W.R., Thornton J.M. A model recognition approach to the prediction of all-helical membrane protein structure and topology. Biochemistry. 1994;33:3038–3049. [PubMed] 30. Harris M.A., Clark J., Ireland A., Lomax J., Ashburner M., Foulger R., Eilbeck K., Lewis S., Marshall B., Mungall C., et al. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2004;32:D258–D261. [PubMed] 31. Bairoch A. The ENZYME database in 2000. Nucleic Acids Res. 2000;28:304–305. [PubMed] 32. Bateman A., Birney E., Cerruti L., Durbin R., Etwiller L., Eddy S.R., Griffiths-Jones S., Howe K.L., Marshall M., Sonnhammer E.L. The Pfam protein families database. Nucleic Acids Res. 2002;30:276–280. [PubMed] 33. Tatusov R.L., Koonin E.V., Lipman D.J. A genomic perspective on protein families. Science. 1997;278:631–637. [PubMed] 34. Edgar R.C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004;5:113. [PubMed] 35. Saitou N., Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 1987;4:406–425. [PubMed] 36. Retief J.D. Phylogenetic analysis using PHYLIP. Methods Mol. Biol. 2000;132:243–258. [PubMed] 37. Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39:783–791. 38. Brodie R., Smith A.J., Roper R.L., Tcherepanov V., Upton C. Base-By-Base: single nucleotide-level analysis of whole viral genome alignments. BMC Bioinformatics. 2004;5:96. [PubMed] 39. Chenna R., Sugawara H., Koike T., Lopez R., Gibson T.J., Higgins D.G., Thompson J.D. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res. 2003;31:3497–3500. [PubMed] 40. Notredame C., Higgins D.G., Heringa J. T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 2000;302:205–217. [PubMed] 41. Kurtz S., Phillippy A., Delcher A.L., Smoot M., Shumway M., Antonescu C., Salzberg S.L. Versatile and open software for comparing large genomes. Genome Biol. 2004;5:R12. [PubMed] 42. Fauquet C.M., Fargette D. International Committee on Taxonomy of Viruses and the 3,142 unassigned species. Virol. J. 2005;2:64. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||
Nature. 2003 Feb 20; 421(6925):787.
[Nature. 2003]Emerg Infect Dis. 2002 Oct; 8(10):1019-28.
[Emerg Infect Dis. 2002]Nature. 2004 Jul 8; 430(6996):242-9.
[Nature. 2004]Nucleic Acids Res. 2001 Jan 1; 29(1):123-5.
[Nucleic Acids Res. 2001]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D335-7.
[Nucleic Acids Res. 2006]Euro Surveill. 2004 Dec 15; 9(12):E15-6.
[Euro Surveill. 2004]Clin Infect Dis. 2002 Jun 15; 34(Suppl 4):S145-69.
[Clin Infect Dis. 2002]Emerg Infect Dis. 1998 Apr-Jun; 4(2):179-86.
[Emerg Infect Dis. 1998]Ann N Y Acad Sci. 2003 Jun; 990():739-42.
[Ann N Y Acad Sci. 2003]MMWR Morb Mortal Wkly Rep. 2003 Mar 21; 52(11):226-8.
[MMWR Morb Mortal Wkly Rep. 2003]Nature. 2003 May 15; 423(6937):240.
[Nature. 2003]Bioinformatics. 2005 Jan 1; 21(1):116-21.
[Bioinformatics. 2005]Bioinformatics. 2005 Jan 1; 21(1):116-21.
[Bioinformatics. 2005]Bioinformatics. 2002; 18 Suppl 1():S225-32.
[Bioinformatics. 2002]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D16-20.
[Nucleic Acids Res. 2006]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D501-4.
[Nucleic Acids Res. 2005]Genome Biol. 2002; 3(2):COMMENT2001.
[Genome Biol. 2002]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D16-20.
[Nucleic Acids Res. 2006]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D187-91.
[Nucleic Acids Res. 2006]Nucleic Acids Res. 1999 Dec 1; 27(23):4636-41.
[Nucleic Acids Res. 1999]Nucleic Acids Res. 1998 Feb 15; 26(4):1107-15.
[Nucleic Acids Res. 1998]Bioinformatics. 2001 Dec; 17(12):1123-30.
[Bioinformatics. 2001]Bioinformatics. 2005 Sep 1; 21(17):3568-9.
[Bioinformatics. 2005]Nucleic Acids Res. 1997 Sep 1; 25(17):3389-402.
[Nucleic Acids Res. 1997]Biochemistry. 1994 Mar 15; 33(10):3038-49.
[Biochemistry. 1994]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D258-61.
[Nucleic Acids Res. 2004]Nucleic Acids Res. 2000 Jan 1; 28(1):304-5.
[Nucleic Acids Res. 2000]Nucleic Acids Res. 2002 Jan 1; 30(1):276-80.
[Nucleic Acids Res. 2002]Science. 1997 Oct 24; 278(5338):631-7.
[Science. 1997]BMC Bioinformatics. 2004 Aug 19; 5():113.
[BMC Bioinformatics. 2004]Mol Biol Evol. 1987 Jul; 4(4):406-25.
[Mol Biol Evol. 1987]Methods Mol Biol. 2000; 132():243-58.
[Methods Mol Biol. 2000]Bioinformatics. 2002; 18 Suppl 1():S225-32.
[Bioinformatics. 2002]Bioinformatics. 2005 Jan 1; 21(1):116-21.
[Bioinformatics. 2005]BMC Bioinformatics. 2004 Jul 14; 5():96.
[BMC Bioinformatics. 2004]Nucleic Acids Res. 2003 Jul 1; 31(13):3497-500.
[Nucleic Acids Res. 2003]J Mol Biol. 2000 Sep 8; 302(1):205-17.
[J Mol Biol. 2000]Genome Biol. 2004; 5(2):R12.
[Genome Biol. 2004]