Ligand binding site and mutation map of the human type I collagen fibril. Protein sequences of the triple helix are shown (GenBankTM, α1(I) accession #NP000079.2 and α2(I) NP_000080.2; proline and hydroxyproline are designated as P, the latter being the third position in glycine-X-Y sequences. Note that Ala-459, α2(I) results from a single nucleotide polymorphism observed in some patient samples in the non-redundant protein data base, whereas α2(I) Pro-459 appears in the NP_000080.2 sequence. Yet, the association of Ala to Pro-459 substitution mutations with intracranial aneurisms (26) justifies its inclusion here. Ligand binding regions are indicated by rectangular boxes adjacent to relevant sequences; ligand binding to type I procollagen or tropocollagen (gray), or to either α1(I) or α2(I) chains (unshaded). Ligand binding hot spots 1, 2, and 3 are enclosed by dashed-lined black boxes. Highlighted sequences include: the α1β1/α2β1 integrin binding site GFPGER502-507 (maroon rectangle); MMP-1 cleavage site (green arrow); N- and C-terminal intermolecular cross-links (purple rectangles). Studies responsible for mapping many of the functional domains on collagen are cited in the first map publication (9) and in the supplemental materials. New sites on the revised map include endo180 (57), HSP47 (58), micro-unfolding and N-anchor domains (59, 60), MMP-interaction domain (11), prolyl-3-hydroxylase substrate P986 (61), SPARC (62), phosphorphoryn (63, 64), von Willebrand Factor (65), discoidin domain receptor 2 (66), and activities of THPs on angiogenesis (13), endothelial cell activation (67), and osteoblast differentiation (68). Mutations were from sources cited under the “Experimental Procedures.” This figure was modified from a previously published illustration (9).