Send to

Choose Destination
J Clin Invest. 2020 Jan 13. pii: 129196. doi: 10.1172/JCI129196. [Epub ahead of print]

The promise and reality of therapeutic discovery from large cohorts.

Author information

Calico Life Sciences LLC, South San Francisco, California, USA.
Department of Genetics and.
Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.


Technological advances in rapid data acquisition have transformed medical biology into a data mining field, where new data sets are routinely dissected and analyzed by statistical models of ever-increasing complexity. Many hypotheses can be generated and tested within a single large data set, and even small effects can be statistically discriminated from a sea of noise. On the other hand, the development of therapeutic interventions moves at a much slower pace. They are determined from carefully randomized and well-controlled experiments with explicitly stated outcomes as the principal mechanism by which a single hypothesis is tested. In this paradigm, only a small fraction of interventions can be tested, and an even smaller fraction are ultimately deemed therapeutically successful. In this Review, we propose strategies to leverage large-cohort data to inform the selection of targets and the design of randomized trials of novel therapeutics. Ultimately, the incorporation of big data and experimental medicine approaches should aim to reduce the failure rate of clinical trials as well as expedite and lower the cost of drug development.

Free full text

Supplemental Content

Full text links

Icon for American Society for Clinical Investigation
Loading ...
Support Center