Issue |
RAIRO-Oper. Res.
Volume 53, Number 1, January–March 2019
ROADEF 2017
|
|
---|---|---|
Page(s) | 243 - 259 | |
DOI | https://doi.org/10.1051/ro/2018089 | |
Published online | 14 February 2019 |
Grasp heuristic for time series compression with piecewise aggregate approximation
University Clermont Auvergne, CNRS, LIMOS, 63000
Clermont-Ferrand, France
* Corresponding author: vsiyou@gmail.com
Received:
30
April
2017
Accepted:
12
October
2018
The Piecewise Aggregate Approximation (PAA) is widely used in time series data mining because it allows to discretize, to reduce the length of time series and it is used as a subroutine by algorithms for patterns discovery, indexing, and classification of time series. However, it requires setting one parameter: the number of segments to consider during the discretization. The optimal parameter value is highly data dependent in particular on large time series. This paper presents a heuristic for time series compression with PAA which minimizes the loss of information. The heuristic is built upon the well known metaheuristic GRASP and strengthened with an inclusion of specific global search component. An extensive experimental evaluation on several time series datasets demonstrated its efficiency and effectiveness in terms of compression ratio, compression interpretability and classification.
Mathematics Subject Classification: 90C59
Key words: Time series / optimization / compression / classification
© The authors. Published by EDP Sciences, SMAI 2019
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.