Evolution of CD4 (A) phylogenetic tree in mammals. The color of the branches indicates the number of Cys of the CSD motif in D2. The CSD is found in species with red branches. Species with purple branches have only the C-terminal Cys. The CSD was likely acquired in primates (A–D) around 75 MYA, and independently in muroid rodents viz mice, rats (O) and hamsters (P), after the divergence of squirrels (Q, sciuridae), also around 75 MYA. Acquisition of the CSD in muroids was accompanied by rapid evolution of the CD4 protein as evidenced by the long branch lengths of this clade (upper red branches). The four major clades of Eutherian mammals are Afrotheria (including elephants E); Xenartha (none depicted); Laurasiatheria (F–L, shown in pink) and Euarchontaglires (primates, A–D, green circle; glires, N–S yellow circle; and treeshrews, M) (Murphy et al., ) Key: A, Lesser apes; B, Greater apes; C, Marmoset (Platyrhines); D, Galago; E, African Elephant; F, Bats; G, Panda/Mink/Ferret; H, Dog and cat; I, Pig; J, Dolphin, Whale and Camel; K, Sheep, Bovine, Goat; L, Horse; M, Tree shrew; N, Rabbit; O, Mouse and rat; P, Chinese hamster; Q, Squirrel/wood chuck; R, Guinea pig; S, Mole rat; T, Wallaby and opossum; U, Platypus. (B) Evolution of CD4 and related molecules showing important thiol-containing sites. Molecules are depicted from the more ancestral forms of the molecule (leftmost) to more recently evolved forms to the right. The CD4 homologs consist of two to four extracellular Ig domains, a single transmembrane region and a short cytoplasmic tail. Three important thiol-bearing functional sites have been identified: the CXC motif in the cytoplasmic tail, the between-sheet disulfide in D1, and the cross-strand disulfide (CSD) in D2. The CXC motif is present in all CD4 orthologs (β–ε), but not in the CD4-like LAG3 molecule. The between-sheet disulfide in D1 is present in CD4 and LAG3 (α–ζ). In some teleosts, such as trout and fugu, the Cys on strand B is lost, leaving a single unpaired Cys on strand F. D1 can be definitively identified in CD4REL because it is distinctively encoded on two exons in molecules β–ε (Laing et al., ). It is involved in interaction with MHC II molecules. Thiols in the D2 site have evolved over time. Originally D2 contained Cys residues on strands B and F, typical of Ig domains (α–γ). Around 250 MYA, the thiol on strand B was lost (δ–ε). Around 75 MYA, a new thiol appeared on strand C allowing the novel CSD to form upon oxidation (ζ). α–γ are CD4-like proteins which evolved in a trout ancestor around 450 MYA. Interacting proteins are shown in gray: all proteins bind MHC class II; the CD4-related molecule LAG3 (α) binds MHC but not LCK. D3 and D4 contain glycosylation sites which may serve to fend off adventitious interactions with other membrane proteins (Laing et al., ). Ig domains are depicted as bisected hexagons with each half representing one sheet: V-like domains are depicted in green; C-like domains in magenta. A CXXC motif in LCK binds to the CXC motif of CD4 via Zn2+ (depicted as a blue diamond) (Lin et al., ). The extracellular and cytoplasmic sides of the membrane are labeled on the right.