Evolution and organization of the human protein C gene

Proc Natl Acad Sci U S A. 1986 Feb;83(3):546-50. doi: 10.1073/pnas.83.3.546.

Abstract

We have isolated overlapping phage genomic clones covering an area of 21 kilobases that encodes the human protein C gene. The gene is at least 11.2 kilobases long and is made up of nine exons and eight introns. Two regions homologous to epidermal growth factor and transforming growth factor are encoded by amino acids 46-91 and 92-136 and are precisely delimited by introns, as is a similar sequence in the genes for coagulation factor IX and tissue plasminogen activator. When homologous amino acids of factor IX and protein C are aligned, the positions of all eight introns correspond precisely, suggesting that these genes are the product of a relatively recent gene duplication. Nevertheless, the two genes are sufficiently distantly related that no nucleic acid homology remains in the intronic regions and that the size of the introns varies dramatically between the two genes. The similarity of the genes for factor IX and protein C suggests that they may be the most closely related members of the serine protease gene family involved in coagulation and fibrinolysis.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Base Sequence
  • Biological Evolution
  • Cloning, Molecular
  • Endopeptidases / genetics
  • Genes*
  • Glycoproteins / genetics*
  • Humans
  • Protein C
  • Serine Endopeptidases
  • Transcription, Genetic

Substances

  • Glycoproteins
  • Protein C
  • Endopeptidases
  • Serine Endopeptidases

Associated data

  • GENBANK/M12682
  • GENBANK/M12683
  • GENBANK/M12684
  • GENBANK/M12685
  • GENBANK/M12686
  • GENBANK/M12687
  • GENBANK/M12712