Sequence of the human FKRP cDNA and analysis of the primary sequence. A, Partial nucleotide sequence and open reading frame are shown. The 4-kb cDNA encodes a protein of 495 amino acids, with a predicted molecular weight of 54.6 kDa. B,Multiple sequence alignment of fukutin homologues. Missense and nonsense mutations that were found to occur within the region aligned are indicated by red asterisks (*) and slashes (/), respectively. Amino acid residues are colored according to an 80% consensus: plus signs (+) indicate positively charged residues (H, K, and R; blue), minus signs (−) indicate negatively charged residues (D and E; red), black asterisks indicate serine or threonine residues (light blue on grey background), “a” indicates aromatic residues (F, H, W, and Y; blue on yellow background), “b” indicates big residues (E, F, H, I, K, L, M, Q, R, W, and Y; blue on yellow background), “c” indicates charged residues (DEHKR; pink), “h” indicates hydrophobic residues (A, C, F, G, H, I, L, M, T, V, W, and Y; black on yellow background), “l” indicates aliphatic residues (I, L, and V; grey on yellow), “p” indicates polar residues (C, D, E, H, K, N, Q, R, S, and T; blue), and s indicates small residues (A, C, D, G, N, P, S, T, and V; green). Residues that are conserved in >80% of sequences are shown as yellow on black. Residues excised from the alignment are indicated by numbers in parentheses. Predicted secondary structures are indicated below the alignment (e/E, extended or β-strand structure; h/H, helix). Residue numbers and GeneInfo identifiers (or else cosmid names) are shown following the alignment. Ca = Candida albicans; Ce = Caenorhabditis elegans; Dm = Drosophila melanogaster; HELP1 = haemolysin erythrocyte lysis protein 1; Hi = Haemophilus influenzae; Hs = Homo sapiens; Ll = Lactobacillus lactis lactis; Pi = Prevotella intermedia; Rp = Rickettsia prowazekii; Sc = Saccharomyces cerevisiae; Sp = Streptococcus pyogenes; Tb = Trypanosoma brucei. Fukutin homologues partially encoded by ESTs in T. brucei (EST AQ655742), T. cruzi(AI077222), and Leishmania major (AQ852136), as well as a Bacillus subtilis sequence similar to L. lactis TrpF C-terminal domain (GeneInfo number 1750106; bases 12108–12419), are not shown. Two C. elegans sequences, marked with a dagger (†), are alternative gene predictions that differ from published versions by the prediction of additional exons.