Format

Send to

Choose Destination
Multivariate Behav Res. 2017 Mar-Apr;52(2):259-270. doi: 10.1080/00273171.2017.1279957. Epub 2017 Feb 16.

Modeling Information Content Via Dirichlet-Multinomial Regression Analysis.

Author information

1
a Department of Brain and Behavioural Sciences , University of Pavia.

Abstract

Shannon entropy is being increasingly used in biomedical research as an index of complexity and information content in sequences of symbols, e.g. languages, amino acid sequences, DNA methylation patterns and animal vocalizations. Yet, distributional properties of information entropy as a random variable have seldom been the object of study, leading to researchers mainly using linear models or simulation-based analytical approach to assess differences in information content, when entropy is measured repeatedly in different experimental conditions. Here a method to perform inference on entropy in such conditions is proposed. Building on results coming from studies in the field of Bayesian entropy estimation, a symmetric Dirichlet-multinomial regression model, able to deal efficiently with the issue of mean entropy estimation, is formulated. Through a simulation study the model is shown to outperform linear modeling in a vast range of scenarios and to have promising statistical properties. As a practical example, the method is applied to a data set coming from a real experiment on animal communication.

KEYWORDS:

Dirichlet distribution; Dirichlet-multinomial regression; entropy; information

PMID:
28207283
DOI:
10.1080/00273171.2017.1279957
[Indexed for MEDLINE]

Supplemental Content

Loading ...
Support Center