NCBI Mirounga angustirostris Annotation Release GCF_029215605.1-RS_2024_04

The genome sequence records for Mirounga angustirostris RefSeq assembly GCF_029215605.1 (mMirAng1.0.hap1) were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
BUSCO results: Annotation completeness assessed with BUSCO
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: How well the current and previous assemblies align to each other
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as "GCF_029215605.1-RS_2024_04".

Date of Entrez queries for transcripts and proteins: Apr 19 2024
Date of submission of annotation to the public databases: Apr 25 2024
Software version: 10.2

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
mMirAng1.0.hap1	GCF_029215605.1	UCLA	03-14-2023	Reference	unplaced scaffolds

Gene and feature statistics

Counts and lengths of annotated features are provided below for each assembly.

Feature counts

Feature	mMirAng1.0.hap1
Genes and pseudogenes	31,900
protein-coding	19,977
non-coding	7,123
Transcribed pseudogenes	191
Non-transcribed pseudogenes	4,552
genes with variants	11,747
Immunoglobulin/T-cell receptor gene segments	27
other	30
mRNAs	60,392
fully-supported	58,607
with > 5% ab initio	852
partial	78
with filled gap(s)	0
known RefSeq (NM_)	0
model RefSeq (XM_)	60,392
non-coding RNAs	14,192
fully-supported	11,157
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	13,523
pseudo transcripts	191
fully-supported	158
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	191
CDSs	60,419
fully-supported	58,607
with > 5% ab initio	981
partial	78
with major correction(s)	941
known RefSeq (NP_)	0
model RefSeq (XP_)	60,392
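As a consistency check on the table above, the gene-level rows can be summed and compared with the reported "Genes and pseudogenes" total. This is a minimal sketch. It assumes the six rows listed are mutually exclusive categories and that "genes with variants" is a property of genes rather than a category of its own, which is an interpretation and not something the report states. The dictionary keys and function name are illustrative.

```python
# Gene-level counts copied from the Feature counts table.
# Assumption: these six rows partition the "Genes and pseudogenes" total.
gene_counts = {
    "protein-coding": 19_977,
    "non-coding": 7_123,
    "transcribed pseudogenes": 191,
    "non-transcribed pseudogenes": 4_552,
    "immunoglobulin/T-cell receptor gene segments": 27,
    "other": 30,
}
reported_total = 31_900  # "Genes and pseudogenes" row


def category_total(counts):
    """Sum the per-category gene counts."""
    return sum(counts.values())


computed_total = category_total(gene_counts)
print(computed_total, computed_total == reported_total)
```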

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	27,130	44,020	13,515	44	2,419,801
All transcripts	74,584	3,886	3,045	37	104,371
mRNA	60,392	4,078	3,273	102	104,371
misc_RNA	4,519	4,419	3,296	101	75,240
tRNA	669	74	73	71	87
lncRNA	6,650	3,468	1,975	37	99,165
snoRNA	711	114	124	44	329
snRNA	1,180	117	107	53	198
rRNA	433	555	119	119	4,660
Single-exon transcripts	2,279	1,553	960	102	24,311
coding transcripts (NM_/XM_)	2,279	1,553	960	102	24,311
CDSs	60,392	2,077	1,527	96	103,125
Exons	266,557	418	144	1	98,413
in coding transcripts (NM_/XM_)	242,057	373	141	1	48,774
in non-coding transcripts (NR_/XR_)	47,279	551	150	10	98,413
Introns	238,548	6,782	1,585	30	1,025,356
in coding transcripts (NM_/XM_)	220,113	6,490	1,534	30	1,025,356
in non-coding transcripts (NR_/XR_)	40,307	7,460	1,843	30	452,947

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.79	1	1	50
Number of exons per transcript	12.04	9	1	346

BUSCO analysis of gene annotation

BUSCO v4.1.4 was run in "protein" mode with the carnivora_odb10 lineage dataset on the annotated gene set, using the longest protein of each gene. Results are reported for the gene set from the primary assembly unit and are presented in BUSCO notation.
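The "one longest protein per gene" selection can be sketched as follows. This is an illustration only, not the pipeline's code: the function name, the tuple layout, and the identifiers in the example are hypothetical, and ties are broken by keeping the first protein seen, which is an assumption.

```python
def longest_protein_per_gene(proteins):
    """Pick the longest protein for each gene.

    proteins: iterable of (gene_id, protein_id, sequence) tuples.
    Returns a dict mapping gene_id -> (protein_id, sequence).
    On ties, the first protein encountered is kept (an assumption).
    """
    best = {}
    for gene_id, protein_id, sequence in proteins:
        current = best.get(gene_id)
        if current is None or len(sequence) > len(current[1]):
            best[gene_id] = (protein_id, sequence)
    return best


# Hypothetical identifiers and sequences, for illustration only.
example = [
    ("gene1", "XP_A", "MKT"),
    ("gene1", "XP_B", "MKTAYIAK"),
    ("gene2", "XP_C", "MSS"),
]
selected = longest_protein_per_gene(example)
print({gene: protein_id for gene, (protein_id, _) in selected.items()})
```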

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 19,977 coding genes, 19,754 genes had a protein with an alignment covering 50% or more of the query and 17,049 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.
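The two coverage definitions can be illustrated with a small calculation. This sketch assumes a single contiguous alignment span with 1-based inclusive coordinates; a real BLASTP hit may consist of several segments, which would need to be merged first. The function name and the example coordinates are hypothetical.

```python
def coverage_percent(aligned_start, aligned_end, sequence_length):
    """Percent of a sequence included in one alignment span.

    Coordinates are 1-based and inclusive (an assumption of this sketch).
    """
    if sequence_length <= 0:
        raise ValueError("sequence_length must be positive")
    aligned_length = aligned_end - aligned_start + 1
    return 100.0 * aligned_length / sequence_length


# Hypothetical alignment: residues 11-200 of a 200-residue annotated protein
# (query) aligned to residues 1-190 of a 380-residue curated protein (target).
query_coverage = coverage_percent(11, 200, 200)
target_coverage = coverage_percent(1, 190, 380)
print(query_coverage, target_coverage)
```

In this made-up case the alignment covers most of the query but only half of the target, which is why the report tracks the two percentages separately.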

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker (if calculated), for each assembly. RepeatMasker results are only calculated for organisms with complete Dfam HMM model collections.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with WindowMasker
mMirAng1.0.hap1	GCF_029215605.1	34.75%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez Nucleotide, Entrez Protein, and SRA, and aligned to the genome.

Transcript alignments

The alignments of the following transcripts with Splign were used for gene prediction:

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species GenBank	22	22 (100.00%)	21 (95.45%)	99.48%	99.36%
Carnivora known RefSeq (NM_/NR_)	3,493	3,133 (89.69%)	2,329 (66.68%)	93.50%	98.59%
Carnivora GenBank	7,175	5,846 (81.48%)	3,092 (43.09%)	92.87%	98.44%
Carnivora TSA	319,728	281,681 (88.10%)	130,743 (40.89%)	97.66%	99.09%
Carnivora EST	428,476	276,470 (64.52%)	207,374 (48.40%)	93.34%	97.48%
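The percentages in the table above appear to be computed relative to the number of sequences retrieved from Entrez, for both the "aligned" and the "passed to Gnomon" columns. The sketch below recomputes them for three rows under that assumption; the function name and the dictionary layout are illustrative.

```python
def percent(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100.0 * part / whole, 2)


# (retrieved, aligned by Splign, passed to Gnomon), copied from the table.
rows = {
    "Same-species GenBank": (22, 22, 21),
    "Carnivora known RefSeq (NM_/NR_)": (3_493, 3_133, 2_329),
    "Carnivora GenBank": (7_175, 5_846, 3_092),
}

for source, (retrieved, aligned, passed) in rows.items():
    print(
        f"{source}: {percent(aligned, retrieved):.2f}% aligned, "
        f"{percent(passed, retrieved):.2f}% passed to Gnomon"
    )
```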

RNA-Seq alignments

The alignments of the following RNA-Seq reads with STAR were also used for gene prediction:

Alignment statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	6,006,380,860	86%	23%	296,978
SAMN02991557	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991557)	29,423,884	72%	31%	142,657
SAMN02991558	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991558)	24,633,892	74%	32%	135,198
SAMN02991559	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991559)	23,273,452	78%	31%	138,187
SAMN02991560	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991560)	31,768,774	77%	32%	150,025
SAMN02991561	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991561)	29,314,468	79%	32%	147,309
SAMN02991562	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991562)	30,093,204	77%	32%	148,555
SAMN02991563	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991563)	29,023,110	80%	33%	142,658
SAMN02991564	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991564)	28,202,450	79%	32%	141,574
SAMN02991565	NA	muscle (Mirounga angustirostris, 0.8 year, female, SAMN02991565)	30,656,404	80%	30%	150,107
SAMN04595358	NA	blubber (Mirounga angustirostris, 0.8 year, female, SAMN04595358)	31,898,806	88%	23%	177,823
SAMN04595359	NA	blubber (Mirounga angustirostris, 0.8 year, female, SAMN04595359)	37,976,704	88%	23%	182,917
SAMN04595360	NA	blubber (Mirounga angustirostris, 0.8 year, female, SAMN04595360)	31,207,878	88%	24%	181,288
SAMN04595361	NA	blubber (Mirounga angustirostris, 0.8 year, female, SAMN04595361)	30,543,264	87%	25%	181,715
SAMN04595362	NA	blubber (Mirounga angustirostris, 0.8 year, female, SAMN04595362)	28,617,724	90%	23%	178,865
SAMN04595363	NA	blubber (Mirounga angustirostris, 0.8 year, female, SAMN04595363)	28,332,214	91%	25%	176,349
SAMN04595364	NA	blubber (Mirounga angustirostris, 0.8 year, male, SAMN04595364)	34,049,164	89%	22%	199,989
SAMN04595365	NA	blubber (Mirounga angustirostris, 0.8 year, male, SAMN04595365)	32,956,000	86%	25%	177,269
SAMN04595366	NA	blubber (Mirounga angustirostris, 0.8 year, female, SAMN04595366)	31,107,166	88%	24%	179,814
SAMN04595367	NA	blubber (Mirounga angustirostris, 0.8 year, female, SAMN04595367)	33,423,316	87%	23%	177,871
SAMN09792024	NA	juvenile, blubber, inner layer (Mirounga angustirostris, male, SAMN09792024)	78,069,798	88%	14%	75,794
SAMN09792025	NA	juvenile, blubber, inner layer (Mirounga angustirostris, male, SAMN09792025)	126,786,484	87%	8%	122,427
SAMN09792026	NA	juvenile, blubber, inner layer (Mirounga angustirostris, male, SAMN09792026)	102,153,120	87%	8%	133,280
SAMN09792027	NA	juvenile, blubber, inner layer (Mirounga angustirostris, male, SAMN09792027)	96,899,648	88%	9%	118,915
SAMN09792028	NA	juvenile, blubber, inner layer (Mirounga angustirostris, male, SAMN09792028)	84,004,560	88%	9%	106,787
SAMN09792029	NA	juvenile, blubber, inner layer (Mirounga angustirostris, male, SAMN09792029)	108,918,762	88%	9%	134,207
SAMN09792030	NA	juvenile, blubber, inner layer (Mirounga angustirostris, male, SAMN09792030)	108,264,816	88%	8%	114,340
SAMN09792031	NA	juvenile, blubber, inner layer (Mirounga angustirostris, female, SAMN09792031)	105,092,572	80%	9%	152,651
SAMN09792032	NA	juvenile, blubber, inner layer (Mirounga angustirostris, female, SAMN09792032)	98,992,204	87%	8%	112,481
SAMN09792033	NA	juvenile, blubber, inner layer (Mirounga angustirostris, female, SAMN09792033)	138,663,304	87%	9%	97,741
SAMN09792034	NA	juvenile, blubber, inner layer (Mirounga angustirostris, female, SAMN09792034)	123,259,606	87%	10%	134,229
SAMN09792035	NA	juvenile, blubber, inner layer (Mirounga angustirostris, female, SAMN09792035)	113,298,796	87%	8%	128,616
SAMN09792036	NA	juvenile, blubber, inner layer (Mirounga angustirostris, female, SAMN09792036)	99,899,354	88%	10%	129,154
SAMN09792037	NA	juvenile, blubber, inner layer (Mirounga angustirostris, female, SAMN09792037)	123,832,584	87%	9%	136,402
SAMN15065689	33123026	skeletal muscle (Mirounga angustirostris, female, SAMN15065689)	147,257,614	89%	10%	105,096
SAMN15065690	33123026	skeletal muscle (Mirounga angustirostris, female, SAMN15065690)	121,438,336	73%	10%	39,064
SAMN15065691	33123026	skeletal muscle (Mirounga angustirostris, female, SAMN15065691)	110,167,832	85%	11%	146,821
SAMN15065692	33123026	skeletal muscle (Mirounga angustirostris, female, SAMN15065692)	195,279,106	87%	10%	51,192
SAMN15065693	33123026	skeletal muscle (Mirounga angustirostris, female, SAMN15065693)	180,765,112	87%	10%	71,821
SAMN15065694	33123026	skeletal muscle (Mirounga angustirostris, female, SAMN15065694)	167,931,612	88%	10%	87,060
SAMN15065696	33123026	skeletal muscle (Mirounga angustirostris, female, SAMN15065696)	144,139,254	86%	11%	116,095
SAMN30526147	NA	blubber - deep (Mirounga angustirostris, juvenile, female, SAMN30526147)	54,365,234	83%	22%	140,131
SAMN30526148	NA	blubber - deep (Mirounga angustirostris, juvenile, female, SAMN30526148)	52,498,064	83%	20%	150,539
SAMN30526149	NA	blubber - deep (Mirounga angustirostris, juvenile, female, SAMN30526149)	57,529,304	82%	21%	160,192
SAMN30526150	NA	blubber - superficial (Mirounga angustirostris, juvenile, female, SAMN30526150)	45,371,768	83%	23%	138,782
SAMN30526151	NA	blubber - superficial (Mirounga angustirostris, juvenile, female, SAMN30526151)	62,767,428	84%	19%	160,330
SAMN30526152	NA	blubber - superficial (Mirounga angustirostris, juvenile, female, SAMN30526152)	59,950,236	84%	20%	144,017
SAMN36084795	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084795)	37,951,502	92%	36%	202,239
SAMN36084796	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084796)	47,509,782	92%	36%	207,662
SAMN36084797	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084797)	48,806,862	93%	36%	210,117
SAMN36084798	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084798)	52,625,212	93%	36%	208,478
SAMN36084799	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084799)	43,548,508	93%	36%	203,890
SAMN36084800	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084800)	39,607,550	93%	36%	199,597
SAMN36084801	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084801)	46,423,358	93%	37%	203,266
SAMN36084802	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084802)	40,643,680	93%	37%	198,224
SAMN36084803	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084803)	36,939,858	92%	35%	194,464
SAMN36084804	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084804)	38,923,858	92%	34%	199,318
SAMN36084805	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084805)	40,354,162	93%	36%	194,027
SAMN36084806	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084806)	38,272,546	92%	35%	192,920
SAMN36084807	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084807)	40,143,038	92%	36%	197,014
SAMN36084808	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084808)	42,401,688	92%	35%	198,217
SAMN36084809	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084809)	45,592,260	93%	35%	208,146
SAMN36084810	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084810)	41,144,634	92%	35%	195,819
SAMN36084811	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084811)	50,296,050	92%	35%	202,142
SAMN36084812	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084812)	49,274,516	93%	36%	197,459
SAMN36084813	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084813)	38,880,888	93%	36%	200,535
SAMN36084814	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084814)	44,616,356	92%	36%	206,630
SAMN36084815	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084815)	40,135,346	92%	36%	199,190
SAMN36084816	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084816)	36,683,816	92%	35%	196,165
SAMN36084817	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084817)	44,658,524	93%	36%	203,736
SAMN36084818	NA	precision-cut slices of blubber (whole core) (Mirounga angustirostris, 4-6 weeks, female, SAMN36084818)	44,926,720	93%	36%	204,071
SAMN37413899	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413899)	54,526,660	83%	40%	169,387
SAMN37413900	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413900)	69,900,448	85%	41%	173,975
SAMN37413901	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413901)	65,379,006	84%	39%	171,527
SAMN37413902	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413902)	84,472,366	86%	36%	170,195
SAMN37413903	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413903)	71,812,698	85%	37%	172,222
SAMN37413904	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413904)	69,394,530	85%	37%	172,326
SAMN37413905	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413905)	65,754,196	83%	37%	171,758
SAMN37413906	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413906)	67,213,746	82%	37%	174,131
SAMN37413907	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413907)	76,161,774	84%	35%	175,799
SAMN37413908	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413908)	59,726,010	85%	36%	168,539
SAMN37413909	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413909)	62,126,114	85%	36%	168,481
SAMN37413910	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413910)	69,263,092	88%	39%	175,060
SAMN37413911	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413911)	65,823,090	84%	39%	173,138
SAMN37413912	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413912)	74,612,454	85%	38%	176,190
SAMN37413913	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413913)	64,024,120	82%	38%	171,338
SAMN37413914	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413914)	74,521,244	83%	39%	176,880
SAMN37413915	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413915)	66,387,036	85%	40%	175,269
SAMN37413916	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413916)	77,996,816	85%	40%	178,478
SAMN37413917	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413917)	55,618,916	84%	39%	166,364
SAMN37413918	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413918)	83,437,826	85%	37%	177,976
SAMN37413919	NA	placental artery, endothelial cell, (Mirounga angustirostris, adult, female, SAMN37413919)	113,769,622	85%	37%	181,493
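When several samples are pooled, as in the "All" row, a per-sample percentage has to be weighted by the number of reads in that sample. The sketch below shows one way to do this for the first three muscle samples. It is an approximation under stated assumptions: the table reports rounded percentages, the report does not say how the aggregate row was derived, and the function name is hypothetical.

```python
def read_weighted_percent(samples):
    """Read-weighted mean of per-sample percentages.

    samples: iterable of (number_of_reads, percent_aligned) pairs.
    """
    samples = list(samples)
    total_reads = sum(reads for reads, _ in samples)
    if total_reads == 0:
        raise ValueError("no reads to aggregate")
    return sum(reads * pct for reads, pct in samples) / total_reads


# Values copied from the by-sample table (percentages are rounded there).
first_three_muscle_samples = [
    (29_423_884, 72),  # SAMN02991557
    (24_633_892, 74),  # SAMN02991558
    (23_273_452, 78),  # SAMN02991559
]
pooled = read_weighted_percent(first_three_muscle_samples)
print(f"{pooled:.1f}% of pooled reads aligned (approximate)")
```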

Alignment statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR1553317	SRX682029	SRP045540	SAMN02991557	29,423,884	72%	31%
SRR1553318	SRX682832	SRP045540	SAMN02991558	24,633,892	74%	32%
SRR1553319	SRX682833	SRP045540	SAMN02991559	23,273,452	78%	31%
SRR1553989	SRX683252	SRP045540	SAMN02991560	31,768,774	77%	32%
SRR1554279	SRX683568	SRP045540	SAMN02991561	29,314,468	79%	32%
SRR1554281	SRX683570	SRP045540	SAMN02991562	30,093,204	77%	32%
SRR1554278	SRX683566	SRP045540	SAMN02991563	29,023,110	80%	33%
SRR1554282	SRX683571	SRP045540	SAMN02991564	28,202,450	79%	32%
SRR1554284	SRX683572	SRP045540	SAMN02991565	30,656,404	80%	30%
SRR3402455	SRX1713227	SRP045540	SAMN04595358	31,898,806	88%	23%
SRR3402456	SRX1713228	SRP045540	SAMN04595359	37,976,704	88%	23%
SRR3402457	SRX1713229	SRP045540	SAMN04595360	31,207,878	88%	24%
SRR3402458	SRX1713230	SRP045540	SAMN04595361	30,543,264	87%	25%
SRR3402459	SRX1713231	SRP045540	SAMN04595362	28,617,724	90%	23%
SRR3402460	SRX1713232	SRP045540	SAMN04595363	28,332,214	91%	25%
SRR3402461	SRX1713233	SRP045540	SAMN04595364	34,049,164	89%	22%
SRR3402462	SRX1713234	SRP045540	SAMN04595365	32,956,000	86%	25%
SRR3402463	SRX1713235	SRP045540	SAMN04595366	31,107,166	88%	24%
SRR3402464	SRX1713236	SRP045540	SAMN04595367	33,423,316	87%	23%
SRR7668294	SRX4528817	SRP157071	SAMN09792024	78,069,798	88%	14%
SRR7668295	SRX4528816	SRP157071	SAMN09792025	126,786,484	87%	8%
SRR7668296	SRX4528815	SRP157071	SAMN09792026	102,153,120	87%	8%
SRR7668297	SRX4528814	SRP157071	SAMN09792027	96,899,648	88%	9%
SRR7668298	SRX4528813	SRP157071	SAMN09792028	84,004,560	88%	9%
SRR7668299	SRX4528812	SRP157071	SAMN09792029	108,918,762	88%	9%
SRR7668300	SRX4528811	SRP157071	SAMN09792030	108,264,816	88%	8%
SRR7668301	SRX4528810	SRP157071	SAMN09792031	105,092,572	80%	9%
SRR7668302	SRX4528809	SRP157071	SAMN09792032	98,992,204	87%	8%
SRR7668303	SRX4528808	SRP157071	SAMN09792033	138,663,304	87%	9%
SRR7668304	SRX4528807	SRP157071	SAMN09792034	123,259,606	87%	10%
SRR7668305	SRX4528806	SRP157071	SAMN09792035	113,298,796	87%	8%
SRR7668306	SRX4528805	SRP157071	SAMN09792036	99,899,354	88%	10%
SRR7668307	SRX4528804	SRP157071	SAMN09792037	123,832,584	87%	9%
SRR11884691	SRX8432408	SRP265387	SAMN15065689	147,257,614	89%	10%
SRR11884690	SRX8432407	SRP265387	SAMN15065690	121,438,336	73%	10%
SRR11884689	SRX8432406	SRP265387	SAMN15065691	110,167,832	85%	11%
SRR11884688	SRX8432405	SRP265387	SAMN15065692	195,279,106	87%	10%
SRR11884687	SRX8432404	SRP265387	SAMN15065693	180,765,112	87%	10%
SRR11884686	SRX8432403	SRP265387	SAMN15065694	167,931,612	88%	10%
SRR11884684	SRX8432401	SRP265387	SAMN15065696	144,139,254	86%	11%
SRR21224532	SRX17234991	SRP394188	SAMN30526147	54,365,234	83%	22%
SRR21224531	SRX17234992	SRP394188	SAMN30526148	52,498,064	83%	20%
SRR21224530	SRX17234993	SRP394188	SAMN30526149	57,529,304	82%	21%
SRR21224529	SRX17234994	SRP394188	SAMN30526150	45,371,768	83%	23%
SRR21224528	SRX17234995	SRP394188	SAMN30526151	62,767,428	84%	19%
SRR21224527	SRX17234996	SRP394188	SAMN30526152	59,950,236	84%	20%
SRR25079831	SRX20833342	SRP446677	SAMN36084795	37,951,502	92%	36%
SRR25079830	SRX20833343	SRP446677	SAMN36084796	47,509,782	92%	36%
SRR25079813	SRX20833360	SRP446677	SAMN36084797	48,806,862	93%	36%
SRR25079819	SRX20833354	SRP446677	SAMN36084798	52,625,212	93%	36%
SRR25079814	SRX20833359	SRP446677	SAMN36084799	43,548,508	93%	36%
SRR25079812	SRX20833361	SRP446677	SAMN36084800	39,607,550	93%	36%
SRR25079811	SRX20833362	SRP446677	SAMN36084801	46,423,358	93%	37%
SRR25079810	SRX20833363	SRP446677	SAMN36084802	40,643,680	93%	37%
SRR25079809	SRX20833364	SRP446677	SAMN36084803	36,939,858	92%	35%
SRR25079808	SRX20833365	SRP446677	SAMN36084804	38,923,858	92%	34%
SRR25079827	SRX20833346	SRP446677	SAMN36084805	40,354,162	93%	36%
SRR25079829	SRX20833344	SRP446677	SAMN36084806	38,272,546	92%	35%
SRR25079828	SRX20833345	SRP446677	SAMN36084807	40,143,038	92%	36%
SRR25079826	SRX20833347	SRP446677	SAMN36084808	42,401,688	92%	35%
SRR25079825	SRX20833348	SRP446677	SAMN36084809	45,592,260	93%	35%
SRR25079824	SRX20833349	SRP446677	SAMN36084810	41,144,634	92%	35%
SRR25079823	SRX20833350	SRP446677	SAMN36084811	50,296,050	92%	35%
SRR25079822	SRX20833351	SRP446677	SAMN36084812	49,274,516	93%	36%
SRR25079818	SRX20833355	SRP446677	SAMN36084813	38,880,888	93%	36%
SRR25079821	SRX20833352	SRP446677	SAMN36084814	44,616,356	92%	36%
SRR25079820	SRX20833353	SRP446677	SAMN36084815	40,135,346	92%	36%
SRR25079817	SRX20833356	SRP446677	SAMN36084816	36,683,816	92%	35%
SRR25079816	SRX20833357	SRP446677	SAMN36084817	44,658,524	93%	36%
SRR25079815	SRX20833358	SRP446677	SAMN36084818	44,926,720	93%	36%
SRR26073692	SRX21788631	SRP460809	SAMN37413899	54,526,660	83%	40%
SRR26073691	SRX21788632	SRP460809	SAMN37413900	69,900,448	85%	41%
SRR26073680	SRX21788643	SRP460809	SAMN37413901	65,379,006	84%	39%
SRR26073678	SRX21788645	SRP460809	SAMN37413902	84,472,366	86%	36%
SRR26073677	SRX21788646	SRP460809	SAMN37413903	71,812,698	85%	37%
SRR26073676	SRX21788647	SRP460809	SAMN37413904	69,394,530	85%	37%
SRR26073675	SRX21788648	SRP460809	SAMN37413905	65,754,196	83%	37%
SRR26073674	SRX21788649	SRP460809	SAMN37413906	67,213,746	82%	37%
SRR26073673	SRX21788650	SRP460809	SAMN37413907	76,161,774	84%	35%
SRR26073672	SRX21788651	SRP460809	SAMN37413908	59,726,010	85%	36%
SRR26073690	SRX21788633	SRP460809	SAMN37413909	62,126,114	85%	36%
SRR26073689	SRX21788634	SRP460809	SAMN37413910	69,263,092	88%	39%
SRR26073688	SRX21788635	SRP460809	SAMN37413911	65,823,090	84%	39%
SRR26073687	SRX21788636	SRP460809	SAMN37413912	74,612,454	85%	38%
SRR26073686	SRX21788637	SRP460809	SAMN37413913	64,024,120	82%	38%
SRR26073685	SRX21788638	SRP460809	SAMN37413914	74,521,244	83%	39%
SRR26073684	SRX21788639	SRP460809	SAMN37413915	66,387,036	85%	40%
SRR26073683	SRX21788640	SRP460809	SAMN37413916	77,996,816	85%	40%
SRR26073682	SRX21788641	SRP460809	SAMN37413917	55,618,916	84%	39%
SRR26073681	SRX21788642	SRP460809	SAMN37413918	83,437,826	85%	37%
SRR26073679	SRX21788644	SRP460809	SAMN37413919	113,769,622	85%	37%

Protein alignments

The alignments of the following proteins with ProSplign were used for gene prediction:

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Carnivora GenBank	5,671	5,498 (96.95%)	5,498 (96.95%)	78.70%	87.06%
Carnivora known RefSeq (NP_)	2,978	2,943 (98.82%)	2,943 (98.82%)	76.44%	91.70%
Homo sapiens known RefSeq (NP_)	67,661	65,047 (96.14%)	65,047 (96.14%)	79.36%	86.54%
Same-species GenBank	18	18 (100.00%)	18 (100.00%)	88.35%	93.06%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments.

	First Pass	Total
mMirAng1.0.hap1 (Current) Coverage	93.88%	94.32%
ASM2128878v3 (Previous) Coverage	96.43%	97.12%
Percent Identity	98.10%	98.12%
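The idea of reciprocal best hits behind the 'First pass' alignments can be sketched generically. This is not the pipeline's alignment code: the region names, the scores, and both function names are hypothetical, and real assembly-assembly alignments involve genomic intervals rather than named pairs.

```python
def best_hits(scores):
    """For each left-hand item, return its highest-scoring partner.

    scores: dict mapping (left, right) -> alignment score.
    """
    best = {}
    for (left, right), score in scores.items():
        if left not in best or score > best[left][1]:
            best[left] = (right, score)
    return {left: right for left, (right, _) in best.items()}


def reciprocal_best_hits(scores):
    """Return sorted pairs that are each other's best hit."""
    forward = best_hits(scores)
    reverse = best_hits({(b, a): s for (a, b), s in scores.items()})
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


# Hypothetical scores between current ("cur") and previous ("prev") regions.
scores = {
    ("cur1", "prev1"): 98.0,
    ("cur1", "prev2"): 90.0,
    ("cur2", "prev1"): 95.0,
    ("cur3", "prev2"): 97.0,
}
first_pass = reciprocal_best_hits(scores)
print(first_pass)
```

Here cur2 has a best hit (prev1), but prev1 prefers cur1, so the pair is not reciprocal. In the report's terms, such an alignment could only be counted in the 'Total' column.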

Comparison of the current and previous annotations

The annotations produced for this release were compared to the annotations in the previous release for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.
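The two ingredients named above, overlap in exon sequence and matches in exon boundaries, can be illustrated with a simple calculation. This is a sketch under stated assumptions and not the scoring formula used by the pipeline, which the report does not give. Coordinates are taken to be 1-based and inclusive, exons within one feature are assumed not to overlap each other, and the function names and example coordinates are hypothetical.

```python
def exon_overlap_bases(exons_a, exons_b):
    """Count bases shared between two sets of exons.

    Each exon is a (start, end) pair, 1-based inclusive.
    """
    total = 0
    for a_start, a_end in exons_a:
        for b_start, b_end in exons_b:
            overlap = min(a_end, b_end) - max(a_start, b_start) + 1
            if overlap > 0:
                total += overlap
    return total


def shared_boundaries(exons_a, exons_b):
    """Count exon start/end positions present in both features."""
    bounds_a = {pos for exon in exons_a for pos in exon}
    bounds_b = {pos for exon in exons_b for pos in exon}
    return len(bounds_a & bounds_b)


# Hypothetical current and previous transcript models on the same sequence.
current = [(100, 200), (300, 400)]
previous = [(100, 200), (300, 380)]
print(exon_overlap_bases(current, previous), shared_boundaries(current, previous))
```

In this made-up pair the first exon is identical and the second differs only at its 3' end, the kind of difference that a comparison might classify as a minor change.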

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides a link to the details of the comparison in tabular format.

	mMirAng1.0.hap1 (Current) to ASM2128878v3 (Previous)
Identical	21%
Minor changes	50%
Major changes	14%
New	13%
Deprecated	9%
Other	1%
Download the report	tabular

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
BUSCO: Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. Molecular Biology and Evolution 2021, 38(10):4647-4654
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 22(2):134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
STAR: Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. Bioinformatics 2013, 29(1):15-21
Minimap2: Li H. Bioinformatics 2018, 34(18):3094-3100
