Sign in to NCBI

What is NCBI Remap?

Back to NCBI Remap Page

NCBI Remap is a tool that allows users to project annotation data from one coordinate system  to another. This remapping (sometimes called 'liftover') uses genomic alignments to project features from one sequence to the other. For each feature on the source sequence, we perform a base by base analysis of each feature on the source sequence in order to project the feature through the alignment to the new sequence.

We support three variations of Remap. Assembly-Assembly allows the remapping of features from one assembly to another. RefSeqGene allows for the remapping of features from assembly sequences to RefSeqGene sequences (including transcript and protein sequences annoted on the RefSeqGene) or from RefSeqGene sequences to an assembly. Alt loci remap allows for the mapping of features between the Primary assembly and the alternate loci and Patches available for GRC assemblies.

You can view a short video describing how to use remap here: http://www.youtube.com/watch?v=0lhcMGGReVQ

What's new

With the November 2012 update, we added the following features:

  • Alt locus remap: remap features between the primary assembly and the alternate loci/patches in GRC assemblies.
  • Clinical Remap: When you run this we will now make a call to the variation reporter and insert the results into Clincal Remap.
  • Added support for upload of compressed files. Currently GZip (.gz) and BZip2 (.bz) files are supported.
  • Improved HGVS nomenclature.

Specifying the data

Assembly-Assembly 

In order to use the NCBI Remap service, you must select the organism of interest, the assembly your features are on (Source Assembly) and the assembly on which you wish to project these features (Target Assembly). If you would like to request additional organisms or assemblies to be added to the list, please use the Write to the Help Desk to make this request.

List of supported assembly-assembly alignments in remap:

Organism Source Assembly Target Assembly Software version Last Updated
Mus musculus GRCm38.p1 GRCm38.p2 1.6 12/13/2013 13:51:46
Mus musculus MGSCv37 GRCm38.p2 1.6 12/30/2013 13:40:36
Mus musculus MGSCv36 GRCm38.p2 1.6 12/30/2013 13:42:26
Mus musculus MGSCv35 GRCm38.p2 1.6 12/30/2013 13:46:43
Mus musculus MGSCv34 GRCm38.p2 1.6 12/30/2013 13:56:34
Mus musculus MGSCv3 GRCm38.p2 1.6 12/30/2013 15:25:06
Mus musculus Mm_Celera GRCm38 1.6 12/30/2013 21:54:09
Mus musculus Mm_Celera GRCm38.p2 1.6 12/30/2013 22:03:45
Mus musculus Mm_Celera GRCm38.p2 1.6 12/30/2013 22:04:09
Mus musculus Mm_Celera GRCm38.p1 1.6 12/30/2013 22:10:18
Mus musculus Mm_Celera GRCm38 1.6 12/30/2013 22:15:26
Mus musculus Mm_Celera MGSCv37 1.6 12/30/2013 22:19:04
Mus musculus Mm_Celera GRCm38.p1 1.6 12/30/2013 22:19:57
Mus musculus Mm_Celera MGSCv37 1.6 12/30/2013 22:25:17
Mus musculus MGSCv37 GRCm38 1.6 01/01/2014 16:37:31
Mus musculus MGSCv36 GRCm38 1.6 01/01/2014 16:38:22
Mus musculus MGSCv35 GRCm38 1.6 01/01/2014 16:41:33
Mus musculus MGSCv34 GRCm38 1.6 01/01/2014 16:54:18
Mus musculus MGSCv3 GRCm38 1.6 01/01/2014 18:32:28
Mus musculus Mm_Celera Mm_Celera 1.6 01/03/2014 17:15:47
Mus musculus mm129svJae1.0 GRCm38.p2 1.6 01/03/2014 18:10:06
Mus musculus MmusALLPATHS2 GRCm38.p2 1.6 01/03/2014 19:11:59
Mus musculus GRCm38.p2 GRCm38 1.6 01/06/2014 08:25:33
Mus musculus MGSCv35 MGSCv37 1.6 01/16/2014 23:10:41
Mus musculus MGSCv36 MGSCv37 1.6 01/16/2014 23:12:46
Mus musculus MGSCv34 MGSCv37 1.6 01/16/2014 23:30:06
Mus musculus MGSCv3 MGSCv37 1.6 01/17/2014 01:07:50
Rattus norvegicus RGSC_v3.4 Rnor_5.0 1.6 12/30/2013 16:01:13
Rattus norvegicus Rn_Celera Rnor_5.0 1.6 12/30/2013 16:12:00
Rattus norvegicus Rn_Celera Rnor_5.0 1.6 12/30/2013 16:40:30
Rattus norvegicus Rn_Celera RGSC_v3.4 1.6 12/30/2013 22:53:31
Rattus norvegicus Rn_Celera RGSC_v3.4 1.6 01/01/2014 18:38:38
Rattus norvegicus Rn_Celera Rn_Celera 1.6 01/06/2014 09:35:04
Heterocephalus glaber HetGla_1.0 HetGla_female_1.0 1.6 12/30/2013 21:05:53
Vitis vinifera 8x_WGS 12X 1.6 01/22/2014 12:18:19
Cucumis sativus CSB10A_v1 CucSat_1.0 1.6 12/30/2013 20:17:53
Pan troglodytes verus CCYSCv1 Pan_troglodytes-2.1.4 1.6 01/03/2014 14:58:42
Arabidopsis thaliana TAIR8 TAIR10 1.6 12/30/2013 12:27:38
Arabidopsis thaliana TAIR7 TAIR10 1.6 12/30/2013 13:47:02
Arabidopsis thaliana TAIR9 TAIR10 1.6 01/02/2014 22:47:01
Oryza sativa Japonica Group IRGSP_3.0 Build 4.0 1.6 12/30/2013 12:34:35
Hydra vulgaris h7 Hydra_RP_1.0 1.6 12/30/2013 21:01:02
Nomascus leucogenys Nleu1.0 Nleu_3.0 1.6 12/30/2013 13:59:19
Caenorhabditis elegans WS195 WBcel215 1.6 12/30/2013 12:25:26
Caenorhabditis elegans WS190 WBcel215 1.6 12/30/2013 12:25:28
Acyrthosiphon pisum Acyr_1.0 Acyr_2.0 1.6 01/03/2014 18:59:48
Nasonia vitripennis Nvit_1.0 Nvit_2.0 1.6 12/30/2013 12:53:06
Nasonia giraulti Ngir_1.0 Nvit_2.0 1.6 12/30/2013 14:07:18
Nasonia longicornis Nlon_1.0 Nvit_2.0 1.6 12/30/2013 13:58:28
Apis mellifera Amel_2.0 Amel_4.5 1.6 12/30/2013 12:49:22
Apis mellifera Amel_4.0 Amel_4.5 1.6 12/30/2013 12:49:34
Strongylocentrotus purpuratus Spur_v2.1 Spur_3.1 1.6 12/30/2013 19:04:20
Strongylocentrotus purpuratus Spur_0.5 Spur_3.1 1.6 12/30/2013 19:07:02
Ciona intestinalis v1.0 KH 1.6 12/30/2013 20:00:51
Danio rerio Zv7 Zv9 1.6 12/30/2013 13:23:04
Danio rerio Zv8 Zv9 1.6 12/30/2013 13:37:36
Oreochromis niloticus Orenil1.0 Orenil1.1 1.6 01/03/2014 15:46:37
Xenopus (Silurana) tropicalis v4.2 Xtropicalis_v7 1.6 01/03/2014 15:28:36
Gallus gallus Gallus_gallus-2.1 Gallus_gallus-4.0 1.6 12/30/2013 13:49:45
Macaca fascicularis CE_1.0 Macaca_fascicularis_5.0 1.6 01/04/2014 11:52:24
Macaca fascicularis MacFas_Jun2011 Macaca_fascicularis_5.0 1.6 01/25/2014 14:43:20
Macaca mulatta CR_1.0 Mmul_051212 1.6 01/07/2014 12:18:13
Pan troglodytes Pan_troglodytes-2.1.3 Pan_troglodytes-2.1.4 1.6 12/30/2013 17:45:08
Pan troglodytes Pan_troglodytes-2.1 Pan_troglodytes-2.1.4 1.6 12/30/2013 19:29:47
Homo sapiens NCBI35 GRCh38 1.6 02/26/2014 16:50:54
Homo sapiens NCBI36 GRCh38 1.6 02/26/2014 16:51:48
Homo sapiens NCBI33 GRCh38 1.6 02/26/2014 16:52:01
Homo sapiens GRCh37 GRCh38 1.6 02/26/2014 16:52:04
Homo sapiens GRCh37.p10 GRCh38 1.6 02/26/2014 16:52:41
Homo sapiens GRCh37.p5 GRCh38 1.6 02/26/2014 16:52:54
Homo sapiens GRCh37.p2 GRCh38 1.6 02/26/2014 16:53:33
Homo sapiens GRCh37.p9 GRCh38 1.6 02/26/2014 16:53:51
Homo sapiens GRCh37.p11 GRCh38 1.6 02/26/2014 16:53:56
Homo sapiens GRCh37.p13 GRCh38 1.6 02/26/2014 16:53:56
Homo sapiens NCBI34 GRCh38 1.6 02/26/2014 16:55:08
Homo sapiens GRCh37.p12 GRCh38 1.6 02/26/2014 16:56:33
Homo sapiens CRA_TCAGchr7v2 GRCh38 1.6 02/26/2014 17:03:02
Homo sapiens CHM1_1.1 GRCh38 1.6 02/26/2014 17:40:38
Homo sapiens CHM1_1.0 GRCh38 1.6 02/26/2014 17:42:38
Homo sapiens YH_2.0 GRCh38 1.6 02/26/2014 17:50:25
Homo sapiens HuRef GRCh38 1.6 02/26/2014 18:11:51
Homo sapiens NCBI35 GRCh37 1.6 12/27/2013 10:28:47
Homo sapiens NCBI34 GRCh37 1.6 12/27/2013 10:28:50
Homo sapiens NCBI34 GRCh37.p13 1.6 12/27/2013 10:29:02
Homo sapiens NCBI33 GRCh37 1.6 12/27/2013 10:29:46
Homo sapiens NCBI36 GRCh37.p13 1.6 12/27/2013 10:30:04
Homo sapiens NCBI36 GRCh37 1.6 12/27/2013 10:30:29
Homo sapiens NCBI33 GRCh37.p13 1.6 12/27/2013 10:30:39
Homo sapiens NCBI35 GRCh37.p13 1.6 12/27/2013 10:31:57
Homo sapiens HuRef GRCh37.p13 1.6 12/30/2013 15:00:46
Homo sapiens CHM1_1.1 GRCh37.p13 1.6 12/30/2013 16:01:42
Homo sapiens Hs_Celera GRCh37 1.6 12/30/2013 18:59:27
Homo sapiens CHM1_1.0 GRCh37.p13 1.6 12/30/2013 19:30:33
Homo sapiens CHM1_1.1 GRCh37 1.6 12/30/2013 19:55:58
Homo sapiens HuRef GRCh37 1.6 12/30/2013 20:04:21
Homo sapiens CHM1_1.1 HuRef 1.6 12/30/2013 20:44:35
Homo sapiens CHM1_1.0 GRCh37 1.6 12/30/2013 21:32:57
Homo sapiens CHM1_1.0 HuRef 1.6 12/30/2013 21:54:33
Homo sapiens NCBI34 NCBI35 1.6 01/01/2014 17:04:24
Homo sapiens NCBI33 NCBI35 1.6 01/01/2014 17:05:14
Homo sapiens NCBI35 NCBI36 1.6 01/01/2014 17:09:33
Homo sapiens NCBI33 NCBI36 1.6 01/01/2014 17:10:51
Homo sapiens NCBI34 NCBI36 1.6 01/01/2014 17:11:12
Homo sapiens NCBI34 NCBI34 1.6 01/03/2014 15:28:14
Homo sapiens CHM1_1.0 CHM1_1.1 1.6 01/03/2014 15:31:51
Homo sapiens CRA_TCAGchr7v2 GRCh37 1.6 01/03/2014 15:36:50
Homo sapiens CRA_TCAGchr7v2 NCBI36 1.6 01/03/2014 15:36:52
Homo sapiens CRA_TCAGchr7v2 GRCh37.p13 1.6 01/03/2014 15:37:35
Homo sapiens CRA_TCAGchr7v2 NCBI35 1.6 01/03/2014 15:40:05
Homo sapiens CRA_TCAGchr7v2 NCBI34 1.6 01/03/2014 15:54:12
Homo sapiens CRA_TCAGchr7v2 HuRef 1.6 01/03/2014 15:55:28
Homo sapiens CRA_TCAGchr7v2 CHM1_1.1 1.6 01/03/2014 16:00:41
Homo sapiens Hs_Celera GRCh37.p13 1.6 01/06/2014 08:37:36
Homo sapiens YH_2.0 GRCh37 1.6 01/06/2014 09:57:01
Homo sapiens YH_2.0 GRCh37.p13 1.6 01/06/2014 09:59:11
Homo sapiens NCBI33 NCBI34 1.6 01/17/2014 09:41:46
Homo sapiens CRA_TCAGchr7v2 GRCh37.p10 1.6 01/24/2014 14:05:23
Homo sapiens GRCh37.p9 GRCh37.p10 1.6 01/24/2014 14:19:30
Homo sapiens NCBI36 GRCh37.p10 1.6 01/24/2014 14:23:35
Homo sapiens NCBI34 GRCh37.p10 1.6 01/24/2014 14:28:05
Homo sapiens NCBI33 GRCh37.p10 1.6 01/24/2014 14:29:22
Homo sapiens NCBI35 GRCh37.p10 1.6 01/24/2014 14:31:46
Homo sapiens CHM1_1.0 GRCh37.p10 1.6 01/24/2014 14:42:45
Homo sapiens HuRef GRCh37.p10 1.6 01/24/2014 15:24:46
Canis lupus familiaris CanFam2.0 CanFam3.1 1.6 12/30/2013 13:34:06
Mustela putorius furo MusPutFurMale1.0 MusPutFur1.0 1.6 12/30/2013 20:46:52
Felis catus catChrV17e Felis_catus-6.2 1.6 01/04/2014 12:28:34
Sus scrofa Sscrofa5 Sscrofa10.2 1.6 01/03/2014 15:31:15
Sus scrofa Sscrofa5 Sscrofa10 1.6 01/03/2014 15:42:59
Sus scrofa Sscrofa10 Sscrofa10.2 1.6 01/03/2014 15:46:39
Sus scrofa Sscrofa9.2 Sscrofa10.2 1.6 01/03/2014 15:59:33
Sus scrofa Sscrofa9.2 Sscrofa10 1.6 01/03/2014 16:21:34
Bos taurus Btau_3.1 Btau_4.6.1 1.6 12/27/2013 10:22:52
Bos taurus Btau_4.0 Btau_4.6.1 1.6 12/27/2013 10:26:15
Bos taurus Btau_4.2 Btau_4.6.1 1.6 12/27/2013 10:26:53
Bos taurus Btau_3.1 Bos_taurus_UMD_3.1 1.6 12/30/2013 18:51:48
Bos taurus Btau_4.0 Bos_taurus_UMD_3.1 1.6 12/30/2013 18:57:09
Bos taurus Btau_4.2 Bos_taurus_UMD_3.1 1.6 12/30/2013 19:00:43
Bos taurus Btau_4.6.1 Bos_taurus_UMD_3.1 1.6 12/30/2013 19:03:33
Bos taurus Btau_3.1 Btau_4.2 1.6 01/01/2014 17:19:55
Bos taurus Btau_4.0 Btau_4.2 1.6 01/01/2014 17:33:46
Bos taurus UMD Bos_taurus 2.0 Bos_taurus_UMD_3.1 1.6 01/03/2014 15:43:58

Clinical Remap

Only human is supported for the RefSeqGene tab, so all that is needed is for you to select the sequence upon which your features are annotated (either an assembly or RefSeqGenes) and the sequences to which you want the features mapped (either RefSeqGenes or an assembly). 

Alt loci remap

Alt loci remap allows you to map data between the Primary Assembly and the Alternate Loci/Patches that may be available for an assembly. Only assemblies produced by the Genome Reference Consortium are supported on this page. All you need to select on this page is the organism and the assembly, the software will figure out the direction in which you want to map. 

NOTE: For both Clinical Remap and Alt loci remap if you map FROM an assembly to either the RefSeqGenes or the Alternate Loci/Patches, you may have a lot of failed features as both of these sequences only cover a fraction of the genome. To see genome coverage for Alternate Loci/Patches see the GRC pages for human and mouse.

Remapping Options

Some configuration options are available that will allow you to configure the stringency of remapping. This options are only configurable in the Assembly-Assembly tab.

  1. Minimum ratio of bases that must be remapped (default: 0.5): This option specifies the percentage of the interval that must be able to be remapped. Raising this value increases the stringency of the remapping process.
  2. Maximum ratio for difference between the source length and the target length (default 2.0): This feature allows the remapping algorithm to tolerate insertions and deletions in the alignment. This is calculated by taking the interval length on the target assembly (stop-start+1) and dividing it by the interval length on the source assembly (stop-start+1). An insertion or deletion in the target assembly will affect this ratio. Lowering this value will increase the stringency of the remapping process.
  3. Allow multiple locations to be returned (default: on): We perform alignments in two phases (see 'About our alignments'). Selecting this option will allow the 'Second Pass' alignments to be used and improve coordinate projection in regions of duplication. This can also lead to multiple features being remapped to the same location.
  4. Merge Fragments (default: on): An insertion in the target assembly will split a feature on the source assembly, selecting this option will merge these two locations into a single location in the annotation file. Turning this feature off will increase the stringency of the remapping process, specifically in cases where there is an insertion in the target sequence as each remapped interval will be compared to the original interval.

The merge function can help you remap features that cross an assembly gap, or have a large insertion that causes a gap in the alignment.

Example of a feature crossing a gap

Figure 1: A region with a feature that crosses an assembly gap. This feature was successfully remapped because the merge function was on.

However, in regions with message alignments, the merge function can cause a feature to be remapped to the same, or overlapping positions. This only happens when using the Second Pass alignments for reamapping as these alignments are not guaranteed to bee unique.

Region with complicated alignments in the second pass.

Figure 2: A region with nice First Pass alignments and many Second Pass alignments. 

Using the merge function, this feature remaps to six locations in GRCh37, one using the First Pass alignments and five using the Second Pass. These are easily distinguished using the remap report as the 'recip' column specifies whether the first pass or second pass alignments were used.

remap report for feature in region with complicated second pass alignments.

Figure 3: Remap report for feature with multiple locations returned due to complicated second pass alignments.

These features are relatively easy to identify in a post processing step, or you can turn the merge function off. This will, however, negatively affect features that cross a gap. You may need to review the alignments (which you can do using the Genome Workbench project files) to determine the best course of action.

Note: Alignments are processed in a strand specific manner. If a feature aligns to a region for which there are alignments on both strands, you may get a placement returned for the plus and the minus strand. Using the merge feature may increase the chances of this as merge helps to span alignment gaps. Turning merge off will cause a decrease in remapped features as gaps will not be crossed on either strand.

Providing Data

We accept file formats that are commonly used in the bioinformatics community. We currently accept:

Because the GTF/GFF/GFF3 formats are so similar we provide a single menu item for these formats.
The default behavior is to provide the remapped annotation file in the same format as the input file, but you can specify a different format for the output.
If you have a small amount of data, you can just copy and paste the data in the large text box labeled 'Paste data here'. Otherwise, you can just upload the data file.
Please note: the larger your file is, the longer it will take to perform the remapping process. If you find that the process is taking a very long time, or failing, you may want to split your files into smaller ones, perhaps based on chromosome assignment. There is also an absolute limit on the amount of RAM available to the system. If this is exceeded, Remap will fail. If this happens try again with a smaller file. 

You may also provide data in the text box provided. In addition to the formats described above, you can put a region into the text box. For example:

chr1:10349-25000

Clinical Remap tab only Data options

Mapping from a RefSeqGene(s) to an assembly: In this case, an additional option is provided (checked by default). This will allow the service to return features on both the genomic sequences as well as any transcripts (NMs) or proteins (NPs) available at that locus.

Mapping from an assembly to RefSeqGenes: In this case, you have the ability to map to any available RefSeqGene (default) or you can specify a list of RefSeqGenes as targets. If you select to map to any available RefSeqGene there are two additional options for providing locations on transcripts (NMs) or proteins. One is to provide the transcript (NM) and protein (NP) locations for features that map to RefSeqGenes and the other is to provide transcript (NM) and protein (NP) locations even if there isn't a RefSeqGene where your feature maps. Not all genes in the genome have a RefSeqGene. There is a link on the page that allows you to request the construction of a RefSeqGene if one is not available for your gene of interest.

Output files

Summary Data: This is a global report to provide an overview of remapping results. The format of the report is (by column):

  • ID: The sequence ID in the source assembly (often something like 'chr1' or NC_000001.9).
  • Source Features: The number of features on the ID in the source file.
  • Remapped Features: The number of features that could be projected onto the Target assembly.
  • Source Intervals: The number of intervals on the ID in the source file. This happens because some features will have more than one sequence interval, for example, mRNA features will often have multiple intervals (corresponding to exons).
  • Remapped Intervals: The number of intervals that could be projected onto the Target assembly.

The summary data appears on the web page and is available for download.

Mapping Report: This is a report that provides a feature by feature breakdown of the remapping status. The format of this report on the web page is (by column):

  • Feature: The name or ID of the feature (the source of this will depend on the format submitted, but it should be possible to robustly associate the information in this column with the data in the input file).
  • Src. Intervals: Number of intervals the feature has in the source file.
  • Remap Intervals: Number of intervals that were projected to the target assembly.
  • Src location: The feature location in the input file.
  • Src length: The length of the feature in the input file.
  • Map Location: Projected location (or reason that the remap failed) on the target assembly.
  • Map length: Length of the feature on the target assembly.
  • Coverage: Coverage of feature on the target assembly.

Only a few lines of this report are displayed on the web page, but the entire report is available for download in a tab separated file (tsv) that can be easily parsed, or loaded to spreadsheet program. The downloaded report has 18 columns as follows:

  1. #feat_name: user supplied feature name. If no feature name is supplied, a name is calculated using the line number in the file or the location.
  2. source_int: The number of intervals in the source file (useful for tracking features with multiple intervals, like genes).
  3. mapped_int: the number of intervals in the remapped file.
  4. source_id: sequence identifier the feature maps to in the source file.
  5. mapped_id: sequence identifier the features maps to on the target assembly.
  6. source_length: length of the feature on the source assembly.
  7. mapped_length: length of the feature on the target assembly.
  8. source_start: first base of the feature on the source assembly.
  9. source_stop: last base of the feature on the source assembly.
  10. source_strand: strand the feature is annotated on in the source assembly.
  11. source_sub_start: first base of sub interval on the source assembly (i.e. an exon feature).
  12. source_sub_start: last base of sub interval on the source assembly (i.e. an exon feature).
  13. mapped_start: first base of remapped interval.
  14. mapped_stop: last base of remapped interval.
  15. mapped_strand: strand of remapped base.
  16. coverage: This is calculated by taking the ratio of the mapped_length to the source_length. If coverage =1 the remapped and source interval are identical. A coverage score of less than 1 indicates a deletion in the target assembly and a score of greater than 1 indicates an insertion in the target assembly.
  17. recip: Two possible values are in this column. First Pass means the remapping is based on the 'First Pass' or reciprocal best hit alignments. 'Second Pass' means the remapping is based on the non-reciprocal best hit alignments.
  18. asm_unit: The assembly unit to which the mapped_id belongs. For more information on assembly units, see: http://www.ncbi.nlm.nih.gov/projects/genome/assembly/model.shtml

Features that don't remap will have the word 'NOMAP' in column 15 and the reason for not mapping in column 16. The reasons are:

  • NOALIGN: There was no alignment for this region.
  • LOWCOV: The percent of the interval covered in the alignment was below the coverage threshold specified in the 'Remapping Options' (Minimum ratio of bases that must be remapped).
  • EXPANDED: The ratio of the length on the target sequence versus the length on the source sequence is greater than specified in the remap options (default is 2).

Clinical Remap Only Output:

When you run Cliincal Remap, we will make a call to the Variation Reporter to provide an analysis of your variant data. We then inject the report produced by the Variation Reporter into the Remap output. For more information on the Variation Reporter report, so the help page. 

Annotation Data: This file contains only the remapped features, in the format specified on the input page. No sample data is shown on the web page, but the file is available for download and display in your favorite viewer.

Genome Workbench Files: These are files that can be loaded directly into our client side viewer called Genome Workbench. They contain the sequence information for both the source and target assemblies, the assembly-assembly alignments used in the remapping and feature annotations (both the source features and the remap features). These files are available for download and are very useful for understanding how the alignments influenced the feature remapping (see Figure 4).

Example of a GBench file produced by Remap

Figure 4: View of remapping in genome workbench. The sequence being shown in this view is the Target assembly. The tracks are (in order from the top):

  • Ruler: showing basepair coordinates.
  • Sequence: for some organisms this will be colored and for others it will be grey. This track will show you the actual base pairs if you zoom in enough.
  • Tiling Path: Shows the INSDC sequences used to construct the sequence.
  • Genes Track: Gene annotation from NCBI annotation process.
  • Alignments: Alignment to the Source assembly. This will have the 'First Pass' alignments and the 'Second Pass' alignments if the 'Allow duplications' option was checked. The alignments are zoomed to the base pair level. Mismatches are colored in red. Insertions are shown using a blue triangle (none in this view).
  • SNP features: Variation features defined by dbSNP.
  • Only the remapped features are shown here. In this example features from dbVar were mapped from NCBI36->GRCh37.p9. Only remapped features are shown on the target assembly. If you open a sequence that is part of the Source assembly you can see the orginal features. 
Write to the Help Desk