Send to

Choose Destination
Ann Hum Genet. 2002 May;66(Pt 3):183-93. doi: 10.1017/S0003480002001124.

Determination of probability distribution of diplotype configuration (diplotype distribution) for each subject from genotypic data using the EM algorithm.

Author information

Division of Statistical Genetics, Institute of Rheumatology, Tokyo Women's Medical University, Japan.


Haplotype analysis is important for mapping traits. Recently, methods for estimating haplotype frequencies from genotypes of unrelated individuals based on the expectation-maximization (EM) algorithm have been developed. Our program estimates haplotype frequencies in the population and determines the posterior probability distribution of diplotype configuration (diplotype distribution) for each subject based on the estimated haplotype frequencies. Samples from three ethnic groups for the smoothelin gene (SMTN) and those from three Japanese groups for serum amyloid A genes (SAA@) were analyzed. The estimated diplotype distribution for each individual was concentrated, in most cases, in a single diplotype configuration. The diplotype configuration thus determined was the same as that determined in in vitro experiments, with one exception. Thus, the diplotype configurations determined using the estimated haplotype frequencies from unrelated individuals are reliable. Using this method, the risk of a subject developing a phenotype may be estimated from the diplotype distribution when the phenotype is associated with diplotype configurations.

[Indexed for MEDLINE]
Free full text

Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center