![]() | ![]() |
Formats:
|
||||||||||
Copyright © 2004 Oxford University Press VISTA: computational tools for comparative genomics Perlegen Sciences, Inc., 2021 Stierlin Court, Mountain View, CA 94043, USA, 1Department of Mathematics, University of California—Berkeley, Berkeley, CA, 94720, USA, 2Genomics Division, Lawrence Berkeley National Laboratory, MS 84-171, Berkeley, CA 94720, USA and 3Department of Energy Joint Genome Institute, 2800 Mitchell Avenue, Walnut Creek, CA 94598, USA *To whom correspondence should be addressed. Tel: +1 510 495 2419; Fax: +1 510 486 5614; Email: ildubchak/at/lbl.gov aThe online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. a© 2004, the authors Received February 16, 2004; Revised April 26, 2004; Accepted April 26, 2004. This article has been cited by other articles in PMC.Abstract Comparison of DNA sequences from different species is a fundamental method for identifying functional elements in genomes. Here, we describe the VISTA family of tools created to assist biologists in carrying out this task. Our first VISTA server at http://www-gsd.lbl.gov/vista/ was launched in the summer of 2000 and was designed to align long genomic sequences and visualize these alignments with associated functional annotations. Currently the VISTA site includes multiple comparative genomics tools and provides users with rich capabilities to browse pre-computed whole-genome alignments of large vertebrate genomes and other groups of organisms with VISTA Browser, to submit their own sequences of interest to several VISTA servers for various types of comparative analysis and to obtain detailed comparative analysis results for a set of cardiovascular genes. We illustrate capabilities of the VISTA site by the analysis of a 180 kb interval on human chromosome 5 that encodes for the kinesin family member 3A (KIF3A) protein. INTRODUCTION In light of the increasing number of available DNA sequences from multiple species, the need for comparative genomics tools to functionally annotate these sequences is also growing. These tools require efficient alignment algorithms as well as easy-to-interpret visualization strategies for investigating megabases of genomic intervals and whole-genome assemblies. Although many individual programs have been developed separately for alignment and visualization, few services have attempted to integrate the two, and with the exception of the VISTA site and the PipMaker suite of tools (1–3) there have been few examples of continuous extensive development of web-accessible software packages. Our VISTA family of tools (4–6) is based on global alignment strategies and a curve-based visualization technique for rapid identification of conserved sequences in long alignments. Unlike other existing tools at the time of starting the VISTA project, the AVID alignment program allowed for real-time global alignment of megabase-long sequences and the accompanying visualization program provided an easy method for the visual and computational analysis of conservation. This approach was extended to the pairwise and three-way alignment of whole-genome assemblies by adding a mapping component as a first step before global alignment of putative orthologous regions of two species (7,8). This method is also used for aligning individual sequences against whole-genome assemblies of several species. Improved prediction of functional signals such as transcription factor binding sites was obtained by taking into consideration conservation among species, and this feature also became available as a part of the VISTA family of tools (9). All VISTA tools use a standard platform of software for the analysis of conservation and visualization, making it easy to compare results from different applications. VISTA is a result of close collaboration among biologists, mathematicians and computer scientists and has been widely used by the biological community. A number of biological studies have utilized VISTA to answer various questions, from comparing genes from the same gene families (10,11), to discovering functional non-coding elements (12,13) and finding patterns of conservation on a whole-genome scale (14,15). As we have mentioned, the VISTA system is fundamentally based on global alignments, and this should be contrasted with the PipMaker tools, which are based on local alignment strategies. A comparative review of the alignment and visualization features of PipMaker and VISTA has recently been published (16). In addition, a recent paper (17) attempts to carefully analyze the benefits and drawbacks of different alignment methods and programs. It is important to note that as alignment algorithms become more sophisticated, it is becoming harder to distinguish between local and global alignment tools. For example, a chaining option for BLASTZ (3) allows for the extraction of global alignments from BLASTZ local alignments, and similarly Shuffle-LAGAN (18) and MAVID (19), which are global aligners, explicitly deal with rearrangements between sequences. VISTA SUITE OF TOOLS The web page http://www-gsd.lbl.gov/vista/ serves as a portal for access to the suite of VISTA tools. One of them is VISTA Browser, which allows the user to view pre-computed whole-genome alignments of many species. There are three VISTA servers, GenomeVISTA, mVISTA and rVISTA, that allow the user to submit DNA sequences for analysis. For GenomeVISTA (7) the user submits a single sequence (draft or finished) which is compared with publicly available completed whole-genome assemblies. mVISTA (4,6) is the original program, designed for comparison of orthologous sequences of different species. rVISTA (9) combines a transcription factor binding sites database search (20) with a comparative sequence analysis. The Phylo-VISTA program, a new member of the VISTA family of tools, allows a user to visualize submitted multiple sequence alignment data while taking the phylogenetic relationships between sequences into account (21). The VISTA web site also provides access to the comparative analyses of the set of cardiovascular genes, studied by the Berkeley Program for Genomic Applications (PGA). VISTA pages provide extensive help on selecting a type of analysis, finding optimal parameters for a particular project and navigating the web site. VISTA Browser for pre-computed pairwise and multiple whole-genome alignments We have developed an automatic computational scheme for the alignment and analysis of conservation between large vertebrate genomes, which was originally applied to the comparative study of the human and mouse genomes (7,22). Our method uses the BLAT (23) local alignment program to find anchors on the base genome to identify regions of possible homology for a query sequence. These regions are post-processed to find the best candidates, which are then globally aligned using the AVID (6) or LAGAN (24) global alignment programs. When the rat genome assembly became available, the method was expanded to the comparison of three genomes, for which the global alignment stage was accomplished using the MLAGAN multiple alignment program (8,15). Details on the strategy, as well as validation of our alignments and comparison to other methods have been published recently elsewhere (7,8). The resulting whole-genome alignments exhibit a high degree of sensitivity, covering more than 90% of known coding exons in the human genome. It is important to note that whole-genome alignment is an ongoing area of active research (3,19,25,26 and references therein) and the alignment tools used in the VISTA servers are undergoing constant development and testing. Although VISTA Browser is mostly used for biological application described in this paper, it has also proved to be extremely efficient as a tool for comparing and contrasting alignments. When new genome assemblies become available they are aligned to previously available genomes in a timely manner. Currently our site provides access to multiple human–mouse–rat alignment, pairwise alignments of the human genome with the chicken and chimpanzee assemblies, Drosophila melanogaster with Drosophila pseudoobscura, Caenorhabditis elegans with Caenorhabditis briggsae, and alignments of several plant genomes. Visualization of aligned genome sequences There are two schemes of visual data presentation on the whole-genome scale available for the user—VISTA Browser, and VISTA track on the mirrored UCSC genome browser. VISTA Browser is a Java applet, very efficient for interactively visualizing results of comparative sequence analysis in the VISTA format on the scale of whole chromosomes along with annotations. The user may select any genome as the reference or base and display the level of conservation between this reference and the sequences of another species in a particular interval. Conserved segments with percentage identity X and length Y are defined to be regions of the alignment in which every contiguous subsegment of length Y in the base sequence is at least X% identical to its paired sequence. The user can apply default values for conservation cutoffs (X% over Y bp) or specify them. These regions are highlighted under the curve, with different colors used for coding and non-coding sequences. The browser has a number of options, such as zoom, extraction of a region to be displayed, user-defined parameters for conservation level and options for selecting sequence elements to study. VISTA track, accessible through the VISTA Browser, displays results of our comparative analysis in the context of the whole human genome annotation on the mirrored UCSC Human Genome Browser (27). VISTA track dynamically creates VISTA plots for each defined region and unlike VISTA Browser displays multiple individual plots if there is an overlap in alignments. VISTA Browser and VISTA track are linked to the Text Browser, which allows a user to examine detailed information about each sequence aligned to the selected region on the base genome. For each region, information such as exact locations of alignments on both genomes, the sequences, alignments and coordinates of conserved regions are easily retrieved. Text Browser also gives access to rVISTA to obtain a prediction of potential transcription factor binding sites for any region of a base genome (see detailed description of rVISTA below). In addition to alignments of whole-genome assemblies, VISTA Browser provides access to multiple alignments of orthologous sequences for different species of genomic intervals containing cardiovascular genes currently under investigation in the Berkeley PGA program (28). VISTA Browser annotation of the KIF3A on human chromosome 5q31 Kinesin family member 3A (KIF3A) is expressed in the kidney and photoreceptor cells, where it is required for the proper formation and maintenance of cilia. Tissue-specific inactivation of KIF3A in the kidneys of mice causes polycystic kidney disease (29) and inactivation in photoreceptor cells leads to cell death, as found in retinitis pigmentosa (30). Here we use VISTA Browser to analyze the 180 kb interval on human chromosome 5 (5q31) surrounding KIF3A to identify conserved non-coding sequences which may potentially regulate its expression. In Figure Figure1a1
VISTA servers for comparative analysis of user-submitted sequences GenomeVISTA Genome VISTA is an automatic server that allows the user to find candidate orthologous regions for a draft or finished DNA sequence from one species on the base genome of a second species, and provides detailed comparative analysis. The user can currently align a sequence to the following base genomes: human, mouse, rat, D.melanogaster, C.elegans, Arabidopsis thaliana, rice. We are constantly adding new base genomes to the server when their assemblies become available. Genome VISTA uses the same computational strategy as used for the alignment of whole-genome assemblies, where query sequence contigs are anchored on the base genome by local alignment matches (23) and then globally aligned to candidate regions with the AVID program (6,7). A sequence up to 300 kb long can be submitted by pasting it into a window in plain FASTA format, by uploading a FASTA file from the user's computer or by providing a GenBank accession number to the server. After submitting the sequence, the user immediately receives a link to the computation results. The resulting alignments of the query sequence against the base genome and detailed comparative analysis of conservation can be viewed using VISTA Browser and Text Browser. When two or more high-scoring alignments are obtained for the query sequences and the base genome sequence, the results for all alignments are provided to the user in Text Browser. For each alignment a link to rVISTA is also provided. Use of GenomeVISTA to annotate the KIF3A interval It is well established that the human and dog genomes have a higher level of sequence similarity to each other than either one has to the mouse genome (5,31). Thus, the landscape of conservation observed in the pairwise comparison of orthologous human and dog DNA sequences can be quite different from that observed in the pairwise comparison of orthologous human and mouse DNA sequences. Here, we used GenomeVISTA to align the orthologous dog sequence to the human 5q31 interval containing KIF3A (Figure (Figure2).2
mVISTA mVISTA is designed to perform pairwise alignments of DNA sequences up to megabases long from two or more species and to visualize these alignments together with annotations. AVID is the alignment engine behind mVISTA, and it allows the global alignment of DNA sequences of arbitrary length (6). The key features of the algorithm are speed, accuracy, the ability to detect weak homologies and to align with one of the sequences in draft (by ordering and orienting the contigs automatically). The mVISTA visualization module is designed to display global sequence alignments of genomic sequences from different species (4). To use mVISTA for comparative sequence analysis, two or more sequences in FASTA format (plain text only) or GenBank accession numbers together with a gene annotation file are submitted to the Web server. One of the two sequences is selected as the base or reference sequence. The server automatically uses RepeatMasker to mask repetitive elements in the reference sequence. The x-axis of the generated plot represents the base sequence and the y-axis represents the percentage identity in the predefined window of an alignment. If a user provides an annotation of the base sequence, the genes will be shown above the plot as dark gray arrows and the exons and untranslated regions will be marked by colored rectangles. mVISTA can also display the positions and orientation of draft sequences, indicate gaps in the alignment, display locations and types of repeats and show SNPs on the base sequence. Advanced mVISTA options include: utilizing an algorithm that simultaneously compares all pairwise sequence alignments of three or more species to evaluate percentage identity and length cutoffs for identifying a level of active non-coding conservation in all of them (5), and displaying a level of sequence difference rather than conservation (used for evolutionarily close species). In the latter case the y-scale is calculated automatically to allow for optimal visual analysis of a plot. rVISTA rVISTA (regulatory VISTA) combines searching the major transcription factor binding site database TRANSFAC™ Professional from Biobase with a comparative sequence analysis. It can be used directly or through links in mVISTA, GenomeVISTA and VISTA Browser. Identifying candidate transcriptional regulatory elements in non-coding genomic sequences is a challenging problem. Analyzing non-coding sequences for the presence of known transcription factor binding sites produces a huge number of false positive predictions that are randomly and uniformly distributed. Combining database searches with comparative sequence analysis reduces the number of predicted transcription factor binding sites by several orders of magnitude (9). rVISTA makes predictions based on using the Match™ program (33) and the TRANSFAC Professional library or user-submitted matrices to identify potential transcription factor binding sites in each of two aligned sequences, and determines which of the predicted sites are aligned and conserved between the species in the alignment. Predictions can also be based on user-submitted position weight matrices or a consensus sequence. TRANSFAC searches are performed using the default core and matrix similarity values or parameters submitted by the user. The visualization program for rVISTA allows the user to look at binding sites for a single transcription factor and/or various combinations of transcription factor binding sites, which allows the user easily to examine the clustering of binding sites for factors that are believed to interact with one another. Both global (AVID) and local (BLASTZ) alignment algorithms are incorporated into rVISTA. Use of rVISTA to annotate the candidate regulatory region of KIF3A A question usually asked immediately about a candidate regulatory region is whether transcription factor binding sites can be computationally identified in the interval. Here we use rVISTA to address this question about the candidate regulatory region which is located downstream of KIF3A and contains several conserved non-coding elements (Figure (Figure1a).1
Phylo-VISTA for visualization and analysis of multiple sequence alignments The Phylo-VISTA program with its associated web server presents a novel method for the visualization and analysis of conservation in multiple sequence alignments by providing several significant extensions to VISTA tools (21). It displays the similarity of DNA sequences from multiple species while considering an associated phylogenetic tree. Features include a broad spectrum of resolution parameters for examining the alignment and the ability easily to compare any sub-tree of sequences within a complete alignment dataset. Phylo-VISTA uses not an individual sequence, but the entire multiple alignment as a base in the x-axis, which is similar to the Synplot method for pairwise alignments (34). As a result, the tool is capable of displaying location and length of gaps in all sequences as well as providing annotations beyond a single base sequence. The Phylo-VISTA server requires submission of a multiple alignment file in the multi-FASTA format, the phylogenetic tree used in the alignment program or produced by it, and annotation files associated with individual sequences if available. FUTURE DIRECTIONS The VISTA family of tools has proven to be useful for biologists carrying out various comparative genomics studies. The VISTA web site with all of its associated programs has been actively maintained and improved for the past four years. Since the introduction of our first online VISTA server mVISTA in 2000, this tool alone has processed more than 50 000 comparative analysis queries. In addition, we have distributed close to 2000 copies of the standalone version of the mVISTA software to academic and commercial institutions in 53 countries.We are planning to work on more efficient algorithms and software implementation to be able to efficiently compare the DNA sequences of a wide range of species at varying evolutionary distances. As more whole-genome sequences become available we will incorporate them as base genomes into VISTA Browser. Additionally, we plan to link VISTA Browser to a number of external databases of relevant genomic information. ACKNOWLEDGEMENTS We are grateful to Simon Minovitsky, Shyam Prabhakar, Michael Teplitsky and Dmitriy Ryaboy for ongoing development, maintenance and support of all our tools. The VISTA project is a collaborative effort of a large group of scientists and engineers. Among contributors are Nicolas Bray (the author of AVID and contributor to the design of whole-genome alignment strategies), Michael Brudno (contributor to MLAGAN and multiple whole-genome alignment), Nameeta Shah (the author of Phylo-VISTA), Olivier Couronne (contributor to whole-genome alignment), Gabriela Loots (rVISTA), Ivan Ovcharenko (contributor to rVISTA and whole-genome alignment), Chris Mayor (contributor to mVISTA) and Brian Klock and Lila Tretikov (VISTA Browser). Our special thanks to the biologists of the Genomics Division at LBNL (Dario Boffelli, Jim Bristow, Jan-Fang Cheng, Marcelo Nobrega, Len Pennacchio, James Priest and many others) for their help, support and critical comments. This project was partially supported by the Programs for Genomic Applications grant from the NHLBI/NIH and the Office of Biological and Environmental Research, Office of Science, US Department of Energy. REFERENCES 1. Schwartz S., Zhang,Z., Frazer,K.A., Smit,A., Riemer,C., Bouck,J., Gibbs,R., Hardison,R. and Miller,W. (2000) PipMaker—a web server for aligning two genomic DNA sequences. Genome Res., 10, 577–586. [PubMed] 2. Schwartz S., Elnitski,L., Li,M., Weirauch,M., Riemer,C., Smit,A., Green,E.D., Hardison,R.C., Miller,W. and NISC Comparative Sequencing Program (2003) MultiPipMaker and supporting tools: alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res., 31, 3518–3524. [PubMed] 3. Schwartz S., Kent,W.J., Smit,A., Zhang,Z., Baertsch,R., Hardison,R.C., Haussler,D. and Miller,W. (2003) Human–mouse alignments with BLASTZ. Genome Res., 13, 103–107. [PubMed] 4. Mayor C., Brudno,M., Schwartz,J.R., Poliakov,A., Rubin,E.M., Frazer,K.A., Pachter,L.S. and Dubchak,I. (2000) VISTA: visualizing global DNA sequence alignments of arbitrary length. Bioinformatics, 16, 1046–1047. [PubMed] 5. Dubchak I., Brudno,M., Pachter,L.S., Loots,G.G., Mayor,C., Rubin,E.M. and Frazer,K.A. (2000) Active conservation of noncoding sequences revealed by 3-way species comparisons. Genome Res., 10, 1304–1306. [PubMed] 6. Bray N., Dubchak,I. and Pachter,L. (2003) AVID: a global alignment program. Genome Res., 13, 97–102. [PubMed] 7. Couronne O., Poliakov,A., Bray,N., Ishkhanov,T., Ryaboy,D., Rubin,E., Pachter,L. and Dubchak,I. (2002) Strategies and tools for whole-genome alignments. Genome Res., 13, 73–80. [PubMed] 8. Brudno M., Poliakov,A., Salamov,A., Cooper,G.M., Sidow,A., Rubin,E.M., Solovyev,V., Batzoglou,S. and Dubchak,I. (2004) Automated whole-genome multiple alignment of rat, mouse, and human. Genome Res., 14, 685–92. [PubMed] 9. Loots G., Ovcharenko,I., Pachter,L., Dubchak,I. and Rubin,E. (2002) rVISTA for comparative sequence-based discovery of functional transcription factor binding sites. Genome. Res., 12, 832–839. [PubMed] 10. Parent S.A., Zhang,T., Chrebet,G., Clemas,J.A., Figueroa,D.J., Ky,B., Blevins,R.A., Austin,C.P. and Rosen,H. (2002) Molecular characterization of the murine SIGNR1 gene encoding a C-type lectin homologous to human DC-SIGN and DC-SIGNR. Gene, 293, 33–46. [PubMed] 11. Chen J., Kitchen,C.M., Streb,J.W. and Miano,J.M. (2002) Myocardin: a component of a molecular switch for smooth muscle differentiation. J. Mol. Cell Cardiol., 34, 1345–1356. [PubMed] 12. Anguita E., Sharpe,J.A., Sloane-Stanley,J.A., Tufarelli,C., Higgs,D.R. and Wood,W.G. (2002) Deletion of the mouse α-globin regulatory element (HS-26) has an unexpectedly mild phenotype. Blood, 100, 3450–3456. [PubMed] 13. Touchman J.W., Dehejia,A., Chiba-Falek,O., Cabin,D.E., Schwartz,J.R., Orrison,B.M., Polymeropoulos,M.H. and Nussbaum,R.L. (2001) Human and mouse alpha-synuclein genes: comparative genomic sequence analysis and identification of a novel gene regulatory element. Genome Res., 11, 78–86. [PubMed] 14. Cooper G.M., Brudno,M., Stone,E.A, Dubchak,I., Batzoglou,S. and Sidow,A. (2004) Characterization of evolutionary rates and constraints in three mammalian genomes. Genome Res., 14, 539–548. [PubMed] 15. Rat Genome Sequencing Project Consortium (2004) Genome sequence of the brown norway rat yields insights into mammalian evolution. Nature, 428, 493–521. [PubMed] 16. Frazer K.A., Elnitski,L., Church,D.M., Dubchak,I. and Hardison,R.C. (2003) Cross-species sequence comparisons: a review of methods and available resources. Genome Res., 13, 1–12. [PubMed] 17. Pollard D.A., Bergman,C.M., Stoye,J., Celniker,S.E. and Eisen,M.B. (2004) Benchmarking tools for the alignment of functional noncoding DNA. BMC Bioinformatics, 5, 6. [PubMed] 18. Brudno M., Malde,S., Poliakov,A., Do,C.B., Couronne,O., Dubchak,I. and Batzoglou,S. (2003) Glocal alignment: finding rearrangements during alignment. Bioinformatics, 1, I54–I62. [PubMed] 19. Bray N. and Pachter,L. (2004) MAVID: constrained ancestral alignment of multiple sequences. Genome Res., 14, 693–699. [PubMed] 20. Heinemeyer T., Wingender,E., Reuter,I., Hermjakob,H., Kel,A.E., Kel,O.V., Ignatieva,E.V., Ananko,E.A., Podkolodnaya,O.A., Kolpakov,F.A., Podkolodny,N.L. and Kolchanov,N.A. (1998) Databases on transcriptional regulation: TRANSFAC, TRRD and COMPEL. Nucleic Acids Res., 26, 362–367. [PubMed] 21. Shah N., Couronne,O., Pennacchio,L.A., Brudno,M., Batzoglou,S., Bethel,E.W., Rubin,E.M., Hamann,B. and Dubchak,I. (2004) Phylo-VISTA: an interactive visualization tool for multiple DNA sequence alignments. Bioinformatics, 20, 636–643. [PubMed] 22. Waterston R.H., Lindblad-Toh,K., Birney,E., Rogers,J., Abril,J.F., Agarwal,P., Agarwala,R., Ainscough,R., Alexandersson,M., An,P. et al. (2002) Initial sequencing and comparative analysis of the mouse genome. Nature, 420, 520–562. [PubMed] 23. Kent W.J. (2002) BLAT—the BLAST-like alignment tool. Genome Res., 12, 656–664. [PubMed] 24. Brudno M., Do,C.B., Cooper,G.M., Kim,M.F., Davydov,E., Green,E.D., Sidow,A., Batzoglou,S. and NISC Comparative Sequencing Program (2003) LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res., 13, 721–731. [PubMed] 25. Blanchette M., Kent,W.J., Riemer,C., Elnitski,L., Smit,A.F., Roskin,K.M., Baertsch,R., Rosenbloom,K., Clawson,H., Green,E.D., Haussler,D. and Miller,W. (2004) Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res., 14, 708–715. [PubMed] 26. Kurtz S., Phillippy,A., Delcher,A.L., Smoot,M., Shumway,M., Antonescu,C. and Salzberg,S.L. (2004) Versatile and open software for comparing large genomes. Genome Biol., 5, R12. [PubMed] 27. Kent W.J., Sugnet,C.W., Furey,T.S., Roskin,K.M., Pringle,T.H., Zahler,A.M. and Haussler,D. (2002) The human genome browser at UCSC. Genome Res., 12, 996–1006. [PubMed] 28. Cheng J.F., and Pennacchio,L.A. (2003) Comparative and functional analysis of cardiovascular-related genes. Pharmacogenomics, 4, 571–582. [PubMed] 29. Lin F., Hiesberger,T., Cordes,K., Sinclair,A.M., Goldstein,L.S.B., Somlo,S. and Igarashi,P. (2003) Kidney-specific inactivation of the KIF3A subunit of kinesin-II inhibits renal ciliogenesis and produces polycystic kidney disease. Proc. Natl Acad. Sci. USA, 100, 5286–5291. [PubMed] 30. Marszalek J.R., Liu,X., Roberts,E.A., Chui,D., Marth,J.D., Williams,D.S. and Goldstein,L.S.B. (2000) Genetic evidence for selective transport of opsin and arrestin by kinesin-II in mammalian photoreceptors. Cell, 102, 175–187. [PubMed] 31. Kirkness E.F., Bafna,V., Halpern,A.L., Levy,S., Remington,K., Rusch,D.B., Delcher,A.L., Pop,M., Wang,W., Fraser,C.M. and Venter,J.C. (2003) The dog genome: survey sequencing and comparative analysis. Science, 301, 1898–1903. [PubMed] 32. Frazer K.A., Tao,H., Osoegawa,K., de Jong,P.J., Chen,X., Doherty,M.F. and Cox,D.R (2004). Non-coding sequences conserved in a limited number of mammals in the SIM2 interval are frequently functional. Genome Res., 14, 367–372. [PubMed] 33. Kel A.E., Gossling,E., Reuter,I., Cheremushkin,E., Kel-Margoulis,O.V. and Wingender,E. (2003) MATCH: a tool for searching transcription factor binding sites in DNA sequences. Nucleic Acids Res., 31, 3576–3579. [PubMed] 34. Göttgens B., Gilbert,J.G., Barton,L.M., Grafham,D., Rogers,J., Bentley,D.R. and Green,A.R. (2001) Long-range comparison of human and mouse SCL loci: localized regions of sensitivity to restriction endonucleases correspond precisely with peaks of conserved noncoding sequences. Genome Res., 11, 87–97. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||
Genome Res. 2000 Apr; 10(4):577-86.
[Genome Res. 2000]Genome Res. 2003 Jan; 13(1):103-7.
[Genome Res. 2003]Bioinformatics. 2000 Nov; 16(11):1046-7.
[Bioinformatics. 2000]Genome Res. 2003 Jan; 13(1):97-102.
[Genome Res. 2003]Genome Res. 2003 Jan; 13(1):73-80.
[Genome Res. 2003]Genome Res. 2004 Apr; 14(4):685-92.
[Genome Res. 2004]Genome Res. 2002 May; 12(5):832-9.
[Genome Res. 2002]Gene. 2002 Jun 26; 293(1-2):33-46.
[Gene. 2002]J Mol Cell Cardiol. 2002 Oct; 34(10):1345-56.
[J Mol Cell Cardiol. 2002]Blood. 2002 Nov 15; 100(10):3450-6.
[Blood. 2002]Genome Res. 2001 Jan; 11(1):78-86.
[Genome Res. 2001]Genome Res. 2004 Apr; 14(4):539-48.
[Genome Res. 2004]Genome Res. 2003 Jan; 13(1):1-12.
[Genome Res. 2003]BMC Bioinformatics. 2004 Jan 21; 5():6.
[BMC Bioinformatics. 2004]Genome Res. 2003 Jan; 13(1):103-7.
[Genome Res. 2003]Bioinformatics. 2003; 19 Suppl 1():i54-62.
[Bioinformatics. 2003]Genome Res. 2004 Apr; 14(4):693-9.
[Genome Res. 2004]Genome Res. 2003 Jan; 13(1):73-80.
[Genome Res. 2003]Bioinformatics. 2000 Nov; 16(11):1046-7.
[Bioinformatics. 2000]Genome Res. 2003 Jan; 13(1):97-102.
[Genome Res. 2003]Genome Res. 2002 May; 12(5):832-9.
[Genome Res. 2002]Nucleic Acids Res. 1998 Jan 1; 26(1):362-7.
[Nucleic Acids Res. 1998]Genome Res. 2003 Jan; 13(1):73-80.
[Genome Res. 2003]Nature. 2002 Dec 5; 420(6915):520-62.
[Nature. 2002]Genome Res. 2002 Apr; 12(4):656-64.
[Genome Res. 2002]Genome Res. 2003 Jan; 13(1):97-102.
[Genome Res. 2003]Genome Res. 2003 Apr; 13(4):721-31.
[Genome Res. 2003]Genome Res. 2002 Jun; 12(6):996-1006.
[Genome Res. 2002]Pharmacogenomics. 2003 Sep; 4(5):571-82.
[Pharmacogenomics. 2003]Proc Natl Acad Sci U S A. 2003 Apr 29; 100(9):5286-91.
[Proc Natl Acad Sci U S A. 2003]Cell. 2000 Jul 21; 102(2):175-87.
[Cell. 2000]Genome Res. 2002 Apr; 12(4):656-64.
[Genome Res. 2002]Genome Res. 2003 Jan; 13(1):97-102.
[Genome Res. 2003]Genome Res. 2003 Jan; 13(1):73-80.
[Genome Res. 2003]Genome Res. 2000 Sep; 10(9):1304-6.
[Genome Res. 2000]Science. 2003 Sep 26; 301(5641):1898-903.
[Science. 2003]Genome Res. 2004 Mar; 14(3):367-72.
[Genome Res. 2004]Genome Res. 2003 Jan; 13(1):97-102.
[Genome Res. 2003]Bioinformatics. 2000 Nov; 16(11):1046-7.
[Bioinformatics. 2000]Genome Res. 2000 Sep; 10(9):1304-6.
[Genome Res. 2000]Genome Res. 2002 May; 12(5):832-9.
[Genome Res. 2002]Nucleic Acids Res. 2003 Jul 1; 31(13):3576-9.
[Nucleic Acids Res. 2003]Bioinformatics. 2004 Mar 22; 20(5):636-43.
[Bioinformatics. 2004]Genome Res. 2001 Jan; 11(1):87-97.
[Genome Res. 2001]