Introducing time series snippets: a new primitive for summarizing long time series

Citation DataData Mining and Knowledge Discovery, ISSN: 1573-756X, Vol: 34, Issue: 6, Page: 1713-1743

Publication Year2020

11
Citations
0
Usage
34
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
11
- Citation Indexes
  11
Captures
34
- Readers
  34

Article Description

The first question a data analyst asks when confronting a new dataset is often, “Show me some representative/typical data.” Answering this question is simple in many domains, with random samples or aggregate statistics of some kind. Surprisingly, it is difficult for large time series datasets. The major difficulty is not time or space complexity, but defining what it means to be representative data for this data type. In this work, we show that the obvious candidate definitions: motifs, shapelets, cluster centers, random samples etc., are all poor choices. We introduce time series snippets, a novel representation of typical time series subsequences. Informally, time series snippets can be seen as the answer to the following question. If a user, which could be a human or a higher-level algorithm, only has resources (including human time) to inspect k subsequences of a long time series, which k subsequences should be chosen? Beyond their utility for visualizing and summarizing massive time series collections, we show that time series snippets have utility for high-level comparison of large time series collections.

Bibliographic Details

DOI10.1007/s10618-020-00702-y

URL IDhttp://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85087440344&origin=inward; http://dx.doi.org/10.1007/s10618-020-00702-y; https://link.springer.com/10.1007/s10618-020-00702-y; https://link.springer.com/content/pdf/10.1007/s10618-020-00702-y.pdf; https://link.springer.com/article/10.1007/s10618-020-00702-y/fulltext.html; https://dx.doi.org/10.1007/s10618-020-00702-y; https://link.springer.com/article/10.1007/s10618-020-00702-y

AUTHOR(S)

Shima Imani; Frank Madrid; Eamonn Keogh; Wei Ding; Scott E. Crouter

PUBLISHER(S)

Springer Science and Business Media LLC

TAG(S)

Computer Science

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know