Format

Send to

Choose Destination
Data Min Knowl Discov. 2019;33(1):96-130. doi: 10.1007/s10618-018-0589-3. Epub 2018 Sep 25.

Domain agnostic online semantic segmentation for multi-dimensional time series.

Author information

1
1Department of Computer Science and Engineering, University of California, Riverside, USA.
2
2Department of Computer Science, University of Massachusetts Boston, Boston, USA.
3
3Department of Kinesiology, Recreation, and Sport Studies, The University of Tennessee Knoxville, Knoxville, USA.

Abstract

Unsupervised semantic segmentation in the time series domain is a much studied problem due to its potential to detect unexpected regularities and regimes in poorly understood data. However, the current techniques have several shortcomings, which have limited the adoption of time series semantic segmentation beyond academic settings for four primary reasons. First, most methods require setting/learning many parameters and thus may have problems generalizing to novel situations. Second, most methods implicitly assume that all the data is segmentable and have difficulty when that assumption is unwarranted. Thirdly, many algorithms are only defined for the single dimensional case, despite the ubiquity of multi-dimensional data. Finally, most research efforts have been confined to the batch case, but online segmentation is clearly more useful and actionable. To address these issues, we present a multi-dimensional algorithm, which is domain agnostic, has only one, easily-determined parameter, and can handle data streaming at a high rate. In this context, we test the algorithm on the largest and most diverse collection of time series datasets ever considered for this task and demonstrate the algorithm's superiority over current solutions.

KEYWORDS:

Online algorithms; Semantic segmentation; Time series

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center