Ann Appl Stat. 2015;9(2):572-596.

SEX, LIES AND SELF-REPORTED COUNTS: BAYESIAN MIXTURE MODELS FOR HEAPING IN LONGITUDINAL COUNT DATA VIA BIRTH-DEATH PROCESSES.

Author information

1. Department of Biostatistics, Yale School of Public Health.
2. Department of Biostatistics, UCLA Fielding School of Public Health.
3. Department of Biostatistics, UCLA Fielding School of Public Health; Departments of Biomathematics and Human Genetics, David Geffen School of Medicine at UCLA.

Abstract

Surveys often ask respondents to report non-negative counts, but respondents may misremember or round to a nearby multiple of 5 or 10. This phenomenon is called heaping, and the error inherent in heaped self-reported numbers can bias estimation. Heaped data may be collected cross-sectionally or longitudinally and there may be covariates that complicate the inferential task. Heaping is a well-known issue in many survey settings, and inference for heaped data is an important statistical problem. We propose a novel reporting distribution whose underlying parameters are readily interpretable as rates of misremembering and rounding. The process accommodates a variety of heaping grids and allows for quasi-heaping to values nearly but not equal to heaping multiples. We present a Bayesian hierarchical model for longitudinal samples with covariates to infer both the unobserved true distribution of counts and the parameters that control the heaping process. Finally, we apply our methods to longitudinal self-reported counts of sex partners in a study of high-risk behavior in HIV-positive youth.
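
As a rough illustration of the heaping described above, the following Python sketch rounds each true count to the nearest multiple of a grid value with some fixed probability. It is not the reporting distribution proposed in the paper, which is built on birth-death processes and also models misremembering. The names `heap_count`, `simulate_reports`, `grid`, and `p_round` are hypothetical and chosen only for this example.

```python
import random


def heap_count(true_count, grid=5, p_round=0.5, rng=random):
    """Return a reported count for one respondent.

    With probability p_round (an assumed, illustrative parameter) the
    respondent rounds to the nearest multiple of grid, with ties rounded
    up; otherwise the true count is reported unchanged.
    """
    if rng.random() < p_round:
        # Integer arithmetic for nearest-multiple rounding of a non-negative count.
        return grid * ((true_count + grid // 2) // grid)
    return true_count


def simulate_reports(true_counts, grid=5, p_round=0.5, seed=0):
    """Apply heap_count to every true count using a seeded generator."""
    rng = random.Random(seed)
    return [heap_count(c, grid, p_round, rng) for c in true_counts]


if __name__ == "__main__":
    true_counts = [0, 1, 3, 7, 8, 12, 14, 23]
    print(simulate_reports(true_counts, grid=5, p_round=0.6))
```

With `p_round` above zero, a histogram of the simulated reports shows excess mass at multiples of the grid, which is the pattern that biases naive estimates of the true count distribution.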

KEYWORDS:

Bayesian hierarchical model; Coarse data; Continuous-time Markov chain; Heaping; Mixture model; Rounding
