Bioinformatics. 2016 Jun 15;32(12):i8-i17. doi: 10.1093/bioinformatics/btw243.

What time is it? Deep learning approaches for circadian rhythms.

Author information

1. Department of Computer Science.
2. Department of Statistics.
3. Department of Biological Chemistry, University of California-Irvine, Irvine, CA 92697, USA.
4. Department of Computer Science, Department of Biological Chemistry, University of California-Irvine, Irvine, CA 92697, USA.

Abstract

MOTIVATION:

Circadian rhythms date back to the origins of life, are found in virtually every species and every cell, and play fundamental roles in functions ranging from metabolism to cognition. Modern high-throughput technologies allow the measurement of concentrations of transcripts, metabolites and other species along the circadian cycle, creating novel computational challenges and opportunities. These include the problems of inferring whether a given species oscillates in a circadian fashion, and inferring the time at which a set of measurements was taken.
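
To make the periodicity problem concrete, the following is a minimal sketch of a classical cosinor least-squares fit, which estimates the mean level, amplitude and peak phase of a signal under an assumed 24 h period. It is an illustrative baseline only and is not the deep learning method described in this abstract; the function name `cosinor_fit` and its parameters are hypothetical.

```python
import numpy as np


def cosinor_fit(t_hours, y, period=24.0):
    """Least-squares fit of y ~ mesor + A * cos(2*pi*(t - phase) / period).

    Illustrative baseline (hypothetical helper), not the BIO_CYCLE method.
    Returns (mesor, amplitude, phase), with phase in hours in [0, period).
    """
    t = np.asarray(t_hours, dtype=float)
    y = np.asarray(y, dtype=float)
    w = 2.0 * np.pi / period

    # Linearize: A*cos(w*(t - phase)) = b_cos*cos(w*t) + b_sin*sin(w*t)
    X = np.column_stack([np.ones_like(t), np.cos(w * t), np.sin(w * t)])
    (mesor, b_cos, b_sin), *_ = np.linalg.lstsq(X, y, rcond=None)

    amplitude = float(np.hypot(b_cos, b_sin))
    # Time of peak, wrapped onto one cycle.
    phase = float((np.arctan2(b_sin, b_cos) / w) % period)
    return float(mesor), amplitude, phase


# Synthetic example: samples every 4 h over 48 h, peak at hour 6.
t = np.arange(0, 48, 4)
y = 5.0 + 2.0 * np.cos(2.0 * np.pi * (t - 6.0) / 24.0)
mesor, amplitude, phase = cosinor_fit(t, y)
```

A fit of this kind fixes the period in advance and assumes a sinusoidal shape, which is one reason more flexible methods are of interest for noisy, sparsely sampled data.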

RESULTS:

We first curate several large synthetic and biological time series datasets containing labels for both periodic and aperiodic signals. We then use deep learning methods to develop and train BIO_CYCLE, a system to robustly estimate which signals are periodic in high-throughput circadian experiments, producing estimates of amplitudes, periods, phases, as well as several statistical significance measures. Using the curated data, BIO_CYCLE is compared to other approaches and shown to achieve state-of-the-art performance across multiple metrics. We then use deep learning methods to develop and train BIO_CLOCK to robustly estimate the time at which a particular single-time-point transcriptomic experiment was carried out. In most cases, BIO_CLOCK can reliably predict time, within approximately 1 h, using the expression levels of only a small number of core clock genes. BIO_CLOCK is shown to work reasonably well across tissue types, and often with only small degradation across conditions. BIO_CLOCK is used to annotate most mouse experiments found in the GEO database with an inferred time stamp.
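
Because time of day is circular, a prediction of 23.5 h against a true time of 0.5 h is off by 1 h, not 23 h. The sketch below shows one common way to handle this: encoding time as a point on the unit circle and measuring error as the shorter arc. This is an assumption made for illustration; the abstract does not state how BIO_CLOCK represents time or computes its error, and the helper names here are hypothetical.

```python
import math


def encode_time(hour, period=24.0):
    """Map a time of day to (cos, sin) coordinates on the unit circle."""
    angle = 2.0 * math.pi * (hour % period) / period
    return math.cos(angle), math.sin(angle)


def decode_time(c, s, period=24.0):
    """Map (cos, sin) coordinates back to a time in [0, period)."""
    return (math.atan2(s, c) * period / (2.0 * math.pi)) % period


def circular_error(pred_hour, true_hour, period=24.0):
    """Shortest distance in hours between two times on the cycle."""
    d = abs(pred_hour - true_hour) % period
    return min(d, period - d)


# A prediction just before midnight vs. a true time just after it.
err = circular_error(23.5, 0.5)
```

With this definition, an accuracy of "within approximately 1 h" refers to the distance around the cycle rather than the plain absolute difference of the two clock readings.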

AVAILABILITY AND IMPLEMENTATION:

All data and software are publicly available on the CircadiOmics web portal: circadiomics.igb.uci.edu/

CONTACTS:

fagostin@uci.edu or pfbaldi@uci.edu

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.
