Approaches to handling incomplete data in family-based association testing

Ann Hum Genet. 2007 Mar;71(Pt 2):141-51. doi: 10.1111/j.1469-1809.2006.00325.x. Epub 2006 Nov 10.

Abstract

The high throughput of data arising from the complete sequence of the human genome has left statistical geneticists with a rich and extensive information source. The wide availability of software and the increase in computing power has improved the possibilities to access and process such data. One problem is incompleteness of the data: unobserved or partially observed data points due to technical reasons or reasons associated with the patient's status or erroneous measurements of phenotype or genotype, to name a few. When not properly accounted for, these sources of incompleteness may seriously jeopardize the credibility of results from analyses. In this paper we provide some perspectives on the occurrence and analysis of different forms of incomplete data in family-based genetic association testing.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Data Interpretation, Statistical
  • Family
  • Female
  • Genetics, Medical / statistics & numerical data*
  • Genotype
  • Haplotypes
  • Humans
  • Male
  • Models, Statistical