Homo sapiens complex locus DGCR6, encoding DiGeorge syndrome critical region gene 6.
RefSeq summary
[DGCR6] DiGeorge syndrome, and more widely, the CATCH 22 syndrome, are associated with microdeletions in chromosomal region 22q11.2. The product of this gene shares homology with the Drosophila melanogaster gonadal protein, which participates in gonadal and germ cell development, and with the gamma-1 subunit of human laminin. This gene is a candidate for involvement in DiGeorge syndrome pathology and in schizophrenia. [provided by RefSeq].

RefSeq annotates one representative transcript (NM included in AceView variant.a), but Homo sapiens cDNA sequences in GenBank, dbEST, Trace and SRA, filtered against clone rearrangements, coaligned on the genome and clustered in a minimal non-redundant way by the manually supervised AceView program, support at least 12 spliced variants.

Note that this locus is complex: it appears to produce several proteins with no sequence overlap.
Expression: According to AceView, this gene is expressed at a high level, 1.4 times the average gene in this release. The sequence of this gene is defined by 164 GenBank accessions from 149 cDNA clones, some from brain (seen 17 times), cerebellum (15), pectoral muscle (after mastectomy) (13), hypothalamus (10), eye (8), skeletal muscle (7), prostate (6) and 55 other tissues. We annotate structural defects or features in 3 cDNA clones.
Alternative mRNA variants and regulation: The gene contains 13 distinct introns (12 gt-ag, 1 gc-ag). Transcription produces 15 different mRNAs: 12 alternatively spliced variants and 3 unspliced forms. There are 5 probable alternative promoters, 3 non-overlapping alternative last exons and 4 validated alternative polyadenylation sites (see the diagram). The mRNAs appear to differ by truncation of the 5' end, truncation of the 3' end, presence or absence of a cassette exon, and overlapping exons with different boundaries.
Two variants were isolated in vivo, despite the fact that they are predicted targets of nonsense-mediated mRNA decay (NMD).
Efficacy of translation may be reduced by the presence of a shorter translated product (uORF) initiating at an AUG upstream of the main open reading frame (in variant cAug10).
Function: There are 9 articles specifically referring to this gene in PubMed. Functionally, the gene has been tested for association with diseases (DiGeorge syndrome; schizophrenia) and proposed to participate in processes (cell adhesion, organ morphogenesis). Proteins are expected to localize in various compartments (proteinaceous extracellular matrix, nucleus).
Protein coding potential: 12 spliced and 2 unspliced mRNAs putatively encode good proteins, altogether 14 different isoforms (11 complete, 2 COOH complete, 1 partial), some containing the DiGeorge syndrome critical region 6 (DGCR6) protein domain [Pfam], an N-myristoylation domain, and a coiled coil stretch [Psort2]. The remaining mRNA variant (unspliced) appears not to encode a good protein.
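
The counts quoted in the two paragraphs above can be cross-checked with a few sums. The sketch below only re-adds the numbers stated in the text; the dictionary names are illustrative and not part of AceView.

```python
# Counts copied from the text; the grouping into dictionaries is illustrative only.
introns = {"gt-ag": 12, "gc-ag": 1}
mrnas = {"spliced": 12, "unspliced": 3}
coding_mrnas = {"spliced": 12, "unspliced": 2}
isoforms = {"complete": 11, "COOH complete": 2, "partial": 1}

total_introns = sum(introns.values())        # stated as 13 distinct introns
total_mrnas = sum(mrnas.values())            # stated as 15 different mRNAs
total_coding = sum(coding_mrnas.values())    # stated as 14 isoforms
total_isoforms = sum(isoforms.values())      # should equal total_coding
noncoding = total_mrnas - total_coding       # the one remaining unspliced variant

print(total_introns, total_mrnas, total_coding, total_isoforms, noncoding)
```

Under these figures the totals agree with the text: 13 introns, 15 mRNAs, 14 coding variants matching 14 isoforms, and 1 variant left without a good protein.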

Map: This gene DGCR6 maps on chromosome 22, at 22q11.21|22q11 according to Entrez Gene. In AceView, it covers 6.78 kb, from 18893084 to 18899859 (NCBI 37, August 2010), on the direct strand.
Other names: The gene is also known as DGCR6, LOC8214. It has been described as protein DGCR6, DiGeorge syndrome critical region protein 6.
The closest mouse gene, according to BlastP, is the AceView gene Dgcr6 (e = 6 × 10^-24).
The closest A. thaliana gene, according to BlastP, is the AceView gene AT5G08460 (e = 0.20).
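
As a quick check of the Map entry above, the 6.78 kb figure follows from the two quoted coordinates if they are read as a closed, 1-based interval. That reading is an assumption here, and `span_length` is a hypothetical helper name, not an AceView function.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a closed interval [start, end], assuming inclusive coordinates."""
    return end - start + 1

# Coordinates quoted in the Map entry (NCBI 37).
length_bp = span_length(18893084, 18899859)
length_kb = round(length_bp / 1000, 2)

print(length_bp, length_kb)
```

With inclusive coordinates the span is 6776 bp, which rounds to the 6.78 kb quoted in the text.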
DGCR6 gene expression in 15 primates and 16 tissues, from the NHPRTR project, in sFPKM. [Figure not reproduced: expression heat map by species and tissue, with expression quantile legend, and a plot of log2 expression distributions for this gene versus all genes.] Tissues shown: SkeletalMuscle, WholeBlood, Kidney, Liver, Lung, Spleen, Cerebellum, Brain, Colon, Heart, LymphNode, Ovary, Testis, BoneMarrow, Pituitary, Thymus. RNA-seq gene expression profile across 16 selected tissues from the Non-Human Primates Reference Transcriptome Resource (NHPRTR project).
- Primates: Apes (HUM: Human (Illumina BodyMap 2), CHP: Chimpanzee), Old World monkeys (PTM: Pig-Tailed Macaque, JMI Japanese Macaque, RMI Rhesus Macaque Indian, RMC Rhesus Macaque Chinese, CMM Cynomolgus Macaque Mauritian, CMC Cynomolgus Macaque Chinese, BAB Olive Baboon, SMY Sooty Mangabey); New World monkeys (MST common Marmoset, SQM Squirrel Monkey, OWL Owl Monkey); and Lemurs (MLM Mouse Lemur, RTL Ring-Tailed Lemur).
- The level for significantly expressed genes is color coded in 8 equal-sized bins (light to dark green). Light gray is for weak, not accurately measured expression (2 to 8 reads above intergenic background); dark gray is for no expression or no sequence conservation (0 reads in gene). The plot to the right shows the distribution of measured expression values in all tissues for all genes (blue) and for this gene (green), in Magic index = log2(1000 sFPKM).
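
The Magic index defined in the legend above, log2(1000 sFPKM), can be computed directly. The function name `magic_index` and the choice to reject non-positive values are assumptions made for this sketch, not part of the AceView/Magic software.

```python
import math

def magic_index(sfpkm: float) -> float:
    """Return log2(1000 * sFPKM), the Magic index as defined in the legend."""
    if sfpkm <= 0:
        # log2 is undefined for zero or negative values; unexpressed genes
        # would need separate handling (an assumption of this sketch).
        raise ValueError("sFPKM must be positive")
    return math.log2(1000 * sfpkm)

# An sFPKM of 1.024 corresponds to 1024 on the scaled axis, i.e. index 10.
print(round(magic_index(1.024), 3))
```

On this scale, each unit of the index corresponds to a doubling of sFPKM.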
You may also examine the strand-specific genome coverage plots on the experimental AceView/Magic hub at UCSC, by tissue or by species. Tracks may be slow to load; please reload if some tracks come up yellow-greenish. Thanks to UCSC for the great work.
mRNA variant mRNA matching the genome Best predicted protein 5' UTR 3' UTR uORF Upstream sequence Transcription
aAug10 1213 bp 220 aa 152 bp 398 bp 2kb including Promoter 5865 bp 1kb
bAug10 1297 bp 212 aa 657 bp 2kb 6319 bp 1kb
cAug10 1452 bp 192 aa 470 bp 403 bp 432 bp 2kb including Promoter 5750 bp 1kb
dAug10 596 bp 172 aa 32 bp 75 bp 2kb probably including promoter 4618 bp 1kb
eAug10 1022 bp 155 aa 7 bp 547 bp 2kb including Promoter 1102 bp 1kb
fAug10 1573 bp 152 aa 464 bp 650 bp 39 bp 2kb possibly including promoter 6476 bp 1kb
gAug10 745 bp 138 aa 67 bp 261 bp 2kb possibly including promoter 5125 bp 1kb
hAug10 550 bp 131 aa 23 bp 131 bp 2kb including Promoter 4762 bp 1kb
iAug10 499 bp 131 aa 102 bp 2kb including Promoter 2592 bp 1kb
jAug10 783 bp 129 aa 198 bp 195 bp 2kb possibly including promoter 1294 bp 1kb
kAug10-unspliced 581 bp 115 aa 217 bp 16 bp 267 bp 2kb including Promoter 581 bp 1kb
lAug10-unspliced 570 bp 112 aa 233 bp 105 bp 2kb including Promoter 570 bp 1kb
mAug10 194 bp 63 aa 2kb probably including promoter 291 bp 1kb
nAug10 804 bp 152 aa 26 bp 343 bp 2kb probably including promoter 4584 bp 1kb
oAug10-unspliced 781 bp 78 aa 160 bp 384 bp 2kb 781 bp 1kb
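
For rows of the table above that list both a 5' UTR and a 3' UTR, the figures can be sanity-checked by assuming that the mRNA length equals 5' UTR + coding sequence + 3' UTR, with the coding sequence counted as three bases per amino acid plus one stop codon. This relation and the reading of the row values in header order are assumptions of the sketch; rows with a missing UTR entry or a partial protein are not covered by it.

```python
# (mRNA length bp, protein aa, 5' UTR bp, 3' UTR bp), copied from four table rows.
rows = {
    "aAug10": (1213, 220, 152, 398),
    "cAug10": (1452, 192, 470, 403),
    "eAug10": (1022, 155, 7, 547),
    "oAug10-unspliced": (781, 78, 160, 384),
}

def expected_mrna_length(protein_aa: int, utr5: int, utr3: int) -> int:
    """5' UTR + CDS (3 bases per residue plus a stop codon) + 3' UTR."""
    return utr5 + 3 * (protein_aa + 1) + utr3

# Compare the stated mRNA length with the length implied by the other columns.
consistent = {
    name: expected_mrna_length(aa, utr5, utr3) == mrna_bp
    for name, (mrna_bp, aa, utr5, utr3) in rows.items()
}

for name, ok in consistent.items():
    print(name, "consistent" if ok else "mismatch")
```

For these four rows the stated mRNA lengths match the sum of their parts under the stated assumptions.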

