| PubMed | Nucleotide | Protein | Genome | Structure | Taxonomy |
| Danio rerio genome data and search tips | Revised March 1, 2007 |
|
The Map Viewer help document describes how to use the Map Viewer software. This page describes the data available for Danio rerio (zebrafish) and the search tips specific to that organism. You can also return to the Danio rerio genome view search page. The Map Viewer home page allows you to search the genome data of any organism represented in Map Viewer.
|
|
|
| Zebrafish Genomic Sequence Data |
|
|
The current zebrafish genome build (2.1) is based on the Zv6 assembly provided by the Wellcome Trust Sanger Institute in March 2006. The sequencing strategy produced a mixed assembly of 6.5-7X coverage consisting of whole genome shotgun sequence and BAC sequence. The mitochondrial sequence presented, NC_002333, is derived from a single individual of strain ABC. |
| BLAST of Zebrafish Genomic Sequence |
|
|
The complete set of zebrafish sequence databases available for BLAST searching is shown in the pop-up menu on the zebrafish BLAST page, which includes a link to the database descriptions. |
| Additional Zebrafish Genome Resources |
|
| In addition to the zebrafish data available in the Map Viewer and through BLAST, links to NCBI resources and external sites are available from the Zebrafish Genome Resources Guide. |
|
|
| The available maps for zebrafish include: |
| Sequence Maps |
|
| Ab initio |
Models generated by Gnomon. mRNA alignments were used to segment the genomic sequence by putative gene boundaries, and Gnomon was executed on these segments to predict genes. Gnomon uses protein alignments in addition to transcript alignments and, in order to capture as much coding information in the genome as possible in this assembly, Gnomon models may represent partial as well as complete coding sequences. Models built using alignments are blue, the models with frameshifts or premature stops are green, and the pure ab initio predictions are brown. Pure ab initio status indicates that the model was built without the support of mRNA or protein alignments, either through a failure to align the sequence to the genome or an alignment ignored by Gnomon due to a score falling below a pre-determined threshold. |
| Component | Provides the tiling path of GenBank accessions used to build each of the "NW_xxxxxxxxx" contigs, described below. |
| Contig | Shows the chromosomal placement of NW_xxxxxxxxx contigs on the assembled genomic sequence. The individual GenBank records used to assemble the contigs are shown on the Component map, described above. |
| CpG Island | Shows regions of high G + C content on the assembled genome sequence. Two sets of criteria were used for finding CpG islands: "strict" and "relaxed," described below. The algorithm (and cutoffs) were taken from Takai and Jones, 2002.
|
| Dr RNA | Shows the alignment of individual zebrafish RNAs to the assembled genomic sequence. The corresponding alignment of EST clusters is shown in the Dr UniG map, described below. |
| Act RNA | Shows the alignment of individual Actinopterygii (ray-finned fishes) RNAs to the assembled genomic sequence. The corresponding alignment of EST clusters is shown in the Act UniG map, described below. |
| GenBank DNA | Shows the alignment of zebrafish genomic DNA sequences from GenBank that were not used in the assembly of contigs. It includes zebrafish genomic sequences longer than 500 bp that have at least 97% identity to the components for at least 98 base pairs. If a sequence extends beyond a contig, that portion of sequence is not shown. The length of a line represents the upper and lower-most points on the genome assembly to which sequence fragments from a single GenBank record were aligned. When the GenBank DNA map is displayed as the master map, in the default verbose mode, the descriptive text includes several columns: Total Bases, which shows the total number of bases in the GenBank record; Aligned Bases, which shows the total number of bases from that record that were aligned to the genome; % identity for the alignment; % coverage, which shows how much of the Genbank record aligned to the genome as a percentage; Alignment-length ratio, which is the ratio of the alignment length in the genome to the alignment length of the Genbank record; and Strain from which the Genbank record was derived, when available. |
| Gene | Genes that have been annotated on the genomic contigs. This includes known and putative genes placed as a result of alignments of mRNAs to the contigs. If multiple models exist for a single gene, corresponding to splicing variants, the Gene map presents a flattened view of all the exons that can be spliced together in various ways. For example, if one splice variant uses exons 1, 3, 4, and another splice variant uses exons 2, 3, 4, the Gene map shows exons 1, 2, 3, 4. Genes shown on the left of the grey line are transcribed in the - orientation (from the bottom up) and those on the right in the + orientation (from the top down). Models with Interim GeneIDs (LOC######) may be paralogs, genes not yet curated, duplications because of assembly errors, or pseudogenes. The genome assembly and annotation pipeline assigns Interim GeneIDs when there is no unambiguous solution to what they should be. Interim GeneIDs are associated with RefSeq XM_* accessions (model mRNAs). |
| RefSeq Transcripts | Diagrams of the RNAs that are predicted on the genomic contigs. The RefSeq Transcript map and Gene map are built in the same way using the same types of evidence. The Gene map, however, shows a view of all the exons in a gene while the RefSeq Transcript map shows the combinations of exons (i.e., splice variants) that are valid if mRNA sequences indicate alternative splice variants. |
| STS | Placement of STSs from a variety of sources onto the assembled genomic sequence (the NW_xxxxxxxxx contigs, described above) using Electronic-PCR (e-PCR). |
| Dr UniG | Alignment of zebrafish EST clusters to the assembled genomic sequence. ESTs are clustered based on shared exon-intron boundaries and alignment to a common position on the genome. Those ESTs can come from one or more UniGene clusters, whose IDs are noted by the EST cluster. (UniGene clusters are made with a different build procedure, so there is not necessarily a one-to-one correspondence between EST clusters on the Dr UniG map and clusters in the UniGene resource.) |
| Act UniG | Alignment of Actinopterygii (ray-finned fishes) EST clusters to the assembled genomic sequence. ESTs are clustered based on shared exon-intron boundaries and alignment to a common position on the genome. Those ESTs can come from one or more UniGene clusters, whose IDs are noted by the EST cluster. (UniGene clusters are made with a different build procedure, so there is not necessarily a one-to-one correspondence between EST clusters on the Dr UniG map and clusters in the UniGene resource.) |
| Genetic Linkage (Meiotic) Maps |
|
| Gates et al. (GAT) | A Haploid Panel containing 389 markers is described by Gates et al. (1999) in "A genetic linkage map for zebrafish: comparative analysis and localization of genes and expressed sequences" (Genome Research 9:334-47). More: http://zebrafish.stanford.edu/genome/Gates99_GR/, Will Talbot (Stanford) |
| Heat Shock (HS) | A diploid panel produced by heat shock treatment of haploid embryos (one-cell stage) was described by Kelly et al. (2000) in "Genetic linkage mapping of zebrafish genes and ESTs" (Genome Research 10:558-567). More: http://zebrafish.stanford.edu/genome/HeatShock99, Will Talbot (Stanford) |
| Boston MGH Cross (MGH) | A microsatellite map covering 2350 cM of the zebrafish genome, with an average resolution of 3.3 cM, is described by Knapik et al. (1998) in "A microsatellite genetic linkage map for zebrafish" (Nature Genetics 18:338-343), and Shimoda et al. (1999) in "Zebrafish genetic map with 2000 microsatellite markers" (Genomics 58:219-232). More: http://zebrafish.mgh.harvard.edu/, Marc Fishman (Harvard) |
| Mother of Pearl (MOP) | A haploid panel constructed from a single female heterozygous for the mother-of-pearl (mop) coloration mutation. This panel contains 556 markers with a framework of 211 RAPD (random amplified polymorphic DNA) markers and 122 SSLP (simple sequence-length polymorphism) microsatellite markers. It is described by Postlethwait et al. (1998) in "Vertebrate genome evolution and the zebrafish gene map" (Nature Genetics 18:345-349). More: http://www.neuro.uoregon.edu/ionmain/htdocs/faculty/postleth.html John Postlethwait (University of Oregon) |
| Zebrafish Genome Integrated Map (ZMAP) | A composite genetic map of coding sequence markers from the T51 and LN54 RH maps, the HS map, and the MGH map. ZMAP was constructed by insertion of previously mapped T51, LN54, and HS markers into framework of the MGH map. The ZMAP was created by Allen Day, Tom Conlin, and John H. Postlethwait at the University of Oregon. As of May 2004, the map has 30,012 markers and is continuing to be updated. More information about ZMAP is available at ZFIN: http://zfin.org/cgi-bin/webdriver?MIval=aa-crossview.apg&OID=ZDB-REFCROSS-010114-1. |
| Radiation Hybrid Maps |
|
| Loeb/NIH/5000/4000 (LN54) | A radiation hybrid panel of 93 cell lines, constructed via fusion of zebrafish and mouse cells irradiated with 4000 rads (81 lines) or 5000 rads (12 lines). This panel is described by Hukriede et al. (1999) in "Radiation hybrid mapping of the zebrafish genome" (PNAS 96:9745-9750). The data are updated at http://mgchd1.nichd.nih.gov:8000/zfrh/rharchive/ZFIN.txt on a monthly basis. More: http://dir.nichd.nih.gov/lmg/lmgdev.htm, Neil Hukriede, Jonathan Epstein (NIH). Additional LN54 mapping data was provided by the WashU-Zebrafish Genome Resources Project
http://zfish.wustl.edu/.
|
| Goodfellow T51 (T51) | A whole-genome radiation hybrid panel of 94 cell lines, constructed by fusion of zebrafish and hamster cells irradiated with 3000 rads. The panel covers a total genetic distance of 27729 cR3000 with a resolution of 1cR = 61kb, and is described by Geisler et al. (1999) in "A radiation hybrid map of the zebrafish genome" (Nature Genetics 23:86-89). More: http://wwwmap.tuebingen.mpg.de/, Robert Geisler (Tuebingen). Additional T51 mapping data was provided by the Children's Hospital Zebrafish Genome Initiative, http://134.174.23.167/zonrhmapper/.
|
|
|
| Searchable Terms |
|
The Map Viewer supports searching on any term that describes an element on any map, including:
|
| Map Positions |
|
As noted in the Search By Position section of the Map Viewer help document,
there are three main ways to search by map position from the
Map View of a chromosome:
|
| General Tips |
|
|
As mentioned in the Searchable Terms section of the Map Viewer Help Document, any term entered in the query box will be treated as an independent entity to be joined by the 'AND' Boolean operator. It is also possible to construct more complex queries by using explicit Boolean operators (AND, OR, NOT), field restriction, or limiting retrieval to records that have certain properties. The Advanced Search page allows you to use a number of query options by simply checking boxes or radio buttons that represent various search fields, properties, and object types. It also allows you to limit your query to one or more chromosomes. The Advanced Search page is accessible from the header region of the genome view page. |
|
|
If you would like to create WWW links to the Map Viewer, the instructions for constructing URLs are given in the general Map Viewer Help document. You can construct URLs that either perform a search or display a specific mapped object or chromosomal region. For example:
|
|
Questions or Comments? Write to the NCBI Service Desk |