Finding K Most Significant Motifs in Big Time Series Data | Zendy

Zaher Al Aghbari | Zendy; Ayoub Al-Hamadi | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Finding K Most Significant Motifs in Big Time Series Data

Author(s) -

Zaher Al Aghbari,

Ayoub Al-Hamadi

Publication year - 2020

Publication title -

procedia computer science

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.334

H-Index - 76

ISSN - 1877-0509

DOI - 10.1016/j.procs.2020.03.131

Subject(s) - subsequence , computer science , series (stratigraphy) , representation (politics) , big data , time complexity , algorithm , time series , point (geometry) , longest common subsequence problem , data mining , mathematics , machine learning , mathematical analysis , paleontology , politics , political science , law , bounded function , biology , geometry

An efficient discovery algorithm of frequently occurring patterns, called motifs, in a time series would be useful as a tool for summarizing and visualizing big time series databases. In this paper, we propose an efficient approximate algorithm, called DiscMotifs, to discover the K most significant (KMS) motifs from time series. First, the proposed algorithm transforms the time series into a SAX representation and then the algorithm divides the SAX representation into subsequences. Next, these subsequences are linearized by projecting them into a one-dimensional space based on their distances form a randomly selected reference point, or a subsequence. By utilizing the linear ordering of subsequences, DiscMotifs efficiently discovers the KMS motifs. DiscMotifs algorithm requires a storage space linear to the number of subsequences. We demonstrate the feasibility of this approach on several synthetic and real application datasets.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research