Using machine learning to model dose-response relationships

Ariel Linden; Paul R Yarnold; Brahmajee K Nallamothu

doi:10.1111/jep.12573

Using machine learning to model dose-response relationships

J Eval Clin Pract. 2016 Dec;22(6):856-863. doi: 10.1111/jep.12573. Epub 2016 May 30.

Authors

Ariel Linden^{1

2}, Paul R Yarnold^{3

4}, Brahmajee K Nallamothu⁵

Affiliations

¹ Linden Consulting Group, LLC, Ann Arbor, MI, USA.
² Division of General Medicine, Medical School-University of Michigan, Ann Arbor, MI, USA.
³ Optimal Data Analysis, LLC, Chicago, IL, USA.
⁴ Southern Network on Adverse Reactions (SONAR), College of Pharmacy, University of South Carolina, Columbia, SC, USA.
⁵ Division of Cardiovascular Diseases, Department of Internal Medicine, Medical School-University of Michigan, Ann Arbor, MI, USA.

PMID: 27240883
DOI: 10.1111/jep.12573

Abstract

Rationale, aims and objectives: Establishing the relationship between various doses of an exposure and a response variable is integral to many studies in health care. Linear parametric models, widely used for estimating dose-response relationships, have several limitations. This paper employs the optimal discriminant analysis (ODA) machine-learning algorithm to determine the degree to which exposure dose can be distinguished based on the distribution of the response variable. By framing the dose-response relationship as a classification problem, machine learning can provide the same functionality as conventional models, but can additionally make individual-level predictions, which may be helpful in practical applications like establishing responsiveness to prescribed drug regimens.

Method: Using data from a study measuring the responses of blood flow in the forearm to the intra-arterial administration of isoproterenol (separately for 9 black and 13 white men, and pooled), we compare the results estimated from a generalized estimating equations (GEE) model with those estimated using ODA.

Results: Generalized estimating equations and ODA both identified many statistically significant dose-response relationships, separately by race and for pooled data. Post hoc comparisons between doses indicated ODA (based on exact P values) was consistently more conservative than GEE (based on estimated P values). Compared with ODA, GEE produced twice as many instances of paradoxical confounding (findings from analysis of pooled data that are inconsistent with findings from analyses stratified by race).

Conclusions: Given its unique advantages and greater analytic flexibility, maximum-accuracy machine-learning methods like ODA should be considered as the primary analytic approach in dose-response applications.

Keywords: adherence; data mining; dose-response; efficacy; machine learning.

MeSH terms

Algorithms
Blood Flow Velocity / drug effects
Cardiotonic Agents / pharmacology
Data Mining
Discriminant Analysis
Dose-Response Relationship, Drug*
Humans
Isoproterenol / pharmacology
Machine Learning*
Male

Substances

Cardiotonic Agents
Isoproterenol