NCBI Homo sapiens Updated Annotation Release 109.20190607

The RefSeq genome records for Homo sapiens were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies.

Updated Annotation Release 109.20190607 is an update of NCBI Homo sapiens Annotation Release 109. The known RefSeq transcripts (with NM_ and NR_ prefixes) that were current on Jun 7 2019 were placed on the genome and used to update the annotated features. In addition, model RefSeq records predicted in the last full annotation (Annotation Release 109) that were still current on Jun 7 2019 were carried over into the updated annotation without being recalculated. For more information on the evidence used to generate the model RefSeq records, please consult the report for NCBI Homo sapiens Annotation Release 109.

The annotation products are available in the sequence databases and on the FTP site.

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.


Annotation Release information

This annotation should be referred to as NCBI Homo sapiens Updated Annotation Release 109.20190607.

Annotation release ID: 109.20190607
Date of Entrez queries for transcripts and proteins: Jun 7 2019
Date of submission of annotation to the public databases: Jun 14 2019
Software version: 8.2
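
The release ID above appears to combine the base annotation release (109) with the date of the Entrez queries (Jun 7 2019). The following is a minimal sketch of splitting such an ID, assuming the suffix is always a YYYYMMDD date stamp; the report does not state this convention explicitly, and the function name `parse_release_id` is hypothetical.

```python
from datetime import date, datetime
from typing import Tuple


def parse_release_id(release_id: str) -> Tuple[int, date]:
    """Split an ID such as '109.20190607' into (base release, date).

    Assumption: the part after the dot is a YYYYMMDD date stamp.
    """
    base, _, stamp = release_id.partition(".")
    return int(base), datetime.strptime(stamp, "%Y%m%d").date()


base_release, query_date = parse_release_id("109.20190607")
print(base_release, query_date.isoformat())  # prints: 109 2019-06-07
```

An ID with a malformed suffix raises `ValueError` rather than returning a guessed date.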

Assemblies

The following assemblies were included in this annotation run:
Assembly name  Assembly accession  Submitter                    Assembly date  Reference/Alternate  Assembly content
GRCh38.p13     GCF_000001405.39    Genome Reference Consortium  02-28-2019     Reference            25 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and lengths of annotated features are provided below for each assembly.

Feature counts

Feature                                         GRCh38.p13  GRCh38.p13        GRCh38.p13    GRCh38.p13
                                                            Primary Assembly  All Alt Loci  PATCHES
Genes and pseudogenes                           54,549      54,220            2,397         1,523
  protein-coding                                19,892      19,802            834           597
  non-coding                                    17,985      17,833            676           387
  Transcribed pseudogenes                       1,131       1,123             90            80
  Non-transcribed pseudogenes                   15,096      15,029            628           426
  genes with variants                           19,966      19,887            603           379
  Immunoglobulin/T-cell receptor gene segments  397         388               161           22
  other                                         48          45                8             11
  placed on multiple assembly-units             3,424       na                659           na
mRNAs                                           112,255     111,993           1,913         1,372
  fully-supported                               112,103     111,858           1,901         1,367
  with > 5% ab initio                           91          80                8             3
  partial2631266172
  with filled gap(s)                            0           0                 0             0
  placed on multiple assembly-units             2,886       na                634           na
  known RefSeq (NM_)                            53,538      53,450            1,776         1,335
  model RefSeq (XM_)                            58,717      58,543            137           37
non-coding RNAs                                 46,958      45,083            1,818         743
  fully-supported                               45,085      43,691            1,517         715
  with > 5% ab initio                           0           0                 0             0
  partial745635
  with filled gap(s)                            0           0                 0             0
  placed on multiple assembly-units             677         na                182           na
  known RefSeq (NR_)                            15,191      15,182            504           343
  model RefSeq (XR_)                            29,908      28,523            1,013         372
pseudo transcripts                              1,438       1,422             109           90
  fully-supported                               1,427       1,413             107           90
  with > 5% ab initio                           0           0                 0             0
  partial-2196
  with filled gap(s)                            0           0                 0             0
  placed on multiple assembly-units             na          na                na            na
  known RefSeq (NR_)                            1,330       1,325             101           87
  model RefSeq (XR_)                            108         97                8             3
CDSs                                            112,842     112,382           2,073         1,381
  fully-supported                               112,103     111,858           1,901         1,367
  with > 5% ab initio                           122         107               11            4
  partial                                       514         348               363           161
  with major correction(s) 67646156
  known RefSeq (NP_)                            53,538      53,450            1,773         1,322
  model RefSeq (XP_)                            58,730      58,543            137           37
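
As a consistency check on the feature counts, the sketch below sums the gene classes and the mRNA sources per column and compares them with the reported totals. The figures are copied from the table above. Treating "genes with variants" and "placed on multiple assembly-units" as attributes rather than classes, and so leaving them out of the sum, is an assumption; the report does not state how its totals are derived.

```python
# Columns: GRCh38.p13 (all), Primary Assembly, All Alt Loci, PATCHES.
GENES_TOTAL = (54549, 54220, 2397, 1523)
GENE_CLASSES = {
    "protein-coding": (19892, 19802, 834, 597),
    "non-coding": (17985, 17833, 676, 387),
    "transcribed pseudogenes": (1131, 1123, 90, 80),
    "non-transcribed pseudogenes": (15096, 15029, 628, 426),
    "Ig/TCR gene segments": (397, 388, 161, 22),
    "other": (48, 45, 8, 11),
}

MRNAS_TOTAL = (112255, 111993, 1913, 1372)
MRNA_SOURCES = {
    "known RefSeq (NM_)": (53538, 53450, 1776, 1335),
    "model RefSeq (XM_)": (58717, 58543, 137, 37),
}


def column_sums(rows):
    """Add up the rows column by column."""
    return tuple(sum(column) for column in zip(*rows.values()))


genes_ok = column_sums(GENE_CLASSES) == GENES_TOTAL
mrnas_ok = column_sums(MRNA_SOURCES) == MRNAS_TOTAL
print(genes_ok, mrnas_ok)  # prints: True True
```

Both checks hold in every column. The same approach does not carry over to the non-coding RNA rows, where the known and model RefSeq counts add up to less than the reported total.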

Detailed reports

The counts below do not include pseudogenes.
