A cosmid containing the entire coding region for human carcinoembryonic antigen has been isolated. Detailed analysis and sequencing have established the gene's organization: nine exons that encode amino acids and one exon that corresponds to a 3' untranslated fragment. Comparison with other family members reveals a complex pattern of homology at the 3' end of the gene. The 5' noncoding region contains numerous purine-rich motifs and possible enhancer elements, and includes a region with properties similar to those of HTF islands.