Ataxia-telangiectasia locus: sequence analysis of 184 kb of human genomic DNA containing the entire ATM gene

Genome Res. 1997 Jun;7(6):592-605. doi: 10.1101/gr.7.6.592.

Abstract

Ataxia-telangiectasia (A-T) is an autosomal recessive disorder involving cerebellar degeneration, immunodeficiency, chromosomal instability, radiosensitivity, and cancer predisposition. The genomic organization of the A-T gene, designated ATM, was established recently. To date, more than 100 A-T-associated mutations have been reported in the ATM gene that do not support the existence of one or several mutational hotspots. To allow genotype/phenotype correlations it will be important to find additional ATM mutations. The nature and location of the mutations will also provide insights into the molecular processes that underly the disease. To facilitate the search for ATM mutations and to establish the basis for the identification of transcriptional regulatory elements, we have sequenced and report here 184,490 bp of genomic sequence from the human 11q22-23 chromosomal region containing the entire ATM gene, spanning 146 kb, and 10 kb of the 5'-region of an adjacent gene named E14/NPAT. The latter shares a bidirectional promoter with ATM and is transcribed in the opposite direction. The entire region is transcribed to approximately 85% and translated to 5%. Genome-wide repeats were found to constitute 37.2%, with LINE (17.1%) and Alu (14.6%) being the main repetitive elements. The high representation of LINE repeats is attributable to the presence of three full-length LINE-1s, inserted in the same orientation in introns 18 and 63 as well as downstream of the ATM gene. Homology searches suggest that ATM exon 2 could have derived from a mammalian interspersed repeat (MIR). Promoter recognition algorithms identified divergent promoter elements within the CpG island, which lies between the ATM and E14/NPAT genes, and provide evidence for a putative second ATM promoter located within intron 3, immediately upstream of the first coding exon. The low G+C level (38.1%) of the ATM locus is reflected in a strongly biased codon and amino acid usage of the gene.

MeSH terms

  • Ataxia Telangiectasia / genetics*
  • Ataxia Telangiectasia Mutated Proteins
  • Base Composition
  • Base Sequence
  • Cell Cycle Proteins*
  • Chromosome Mapping
  • Chromosomes, Human, Pair 11
  • Cloning, Molecular
  • CpG Islands
  • DNA Transposable Elements
  • DNA-Binding Proteins
  • Electronic Data Processing
  • Exons
  • Humans
  • Introns
  • Molecular Sequence Data
  • Nuclear Proteins*
  • Polymerase Chain Reaction
  • Promoter Regions, Genetic
  • Protein Serine-Threonine Kinases*
  • Proteins / genetics*
  • Repetitive Sequences, Nucleic Acid
  • Sequence Analysis, DNA
  • Tumor Suppressor Proteins

Substances

  • Cell Cycle Proteins
  • DNA Transposable Elements
  • DNA-Binding Proteins
  • NPAT protein, human
  • Nuclear Proteins
  • Proteins
  • Tumor Suppressor Proteins
  • ATM protein, human
  • Ataxia Telangiectasia Mutated Proteins
  • Protein Serine-Threonine Kinases

Associated data

  • GENBANK/U40887
  • GENBANK/U40888
  • GENBANK/U40889
  • GENBANK/U40890
  • GENBANK/U40891
  • GENBANK/U40892
  • GENBANK/U40893
  • GENBANK/U40894
  • GENBANK/U40895
  • GENBANK/U40896
  • GENBANK/U40897
  • GENBANK/U40898
  • GENBANK/U40899
  • GENBANK/U40900
  • GENBANK/U40901
  • GENBANK/U40902
  • GENBANK/U40903
  • GENBANK/U40904
  • GENBANK/U40905
  • GENBANK/U40906
  • GENBANK/U40907
  • GENBANK/U40908
  • GENBANK/U40909
  • GENBANK/U40910
  • GENBANK/U40911
  • GENBANK/U40912
  • GENBANK/U40913
  • GENBANK/U40914
  • GENBANK/U40915
  • GENBANK/U82828