Abstract

Time series classification is an essential data processing task that relies on assigning class labels to sequences of temporal data. A fundamental component of any time series classification method is data representation. There exist several approaches to that task ranging from straightforward sequence distance-based methods to neural networks. We focus on symbolic time series representation-based methods. The literature of the domain repeatedly underlines their flexibility and good classification quality. We propose a new approach to convert numeric time series into symbolic ones based on fuzzy clustering. The goal is to reduce noise in the data. The proposed method utilizes cluster membership values to determine symbols that characterize the time series. The new approach was tested in an empirical procedure to validate its correctness while achieving satisfying results.

Recommended Citation

Jastrzebska, A., Matusiewicz, Z. & Nápoles, G. (2024). A New Symbolic Time Series Representation Method Based on Data Fuzzification. In B. Marcinkowski, A. Przybylek, A. Jarzębowicz, N. Iivari, E. Insfran, M. Lang, H. Linger, & C. Schneider (Eds.), Harnessing Opportunities: Reshaping ISD in the post-COVID-19 and Generative AI Era (ISD2024 Proceedings). Gdańsk, Poland: University of Gdańsk. ISBN: 978-83-972632-0-8. https://doi.org/10.62036/ISD.2024.6

Paper Type

Poster

DOI

10.62036/ISD.2024.6

Share

COinS
 

A New Symbolic Time Series Representation Method Based on Data Fuzzification

Time series classification is an essential data processing task that relies on assigning class labels to sequences of temporal data. A fundamental component of any time series classification method is data representation. There exist several approaches to that task ranging from straightforward sequence distance-based methods to neural networks. We focus on symbolic time series representation-based methods. The literature of the domain repeatedly underlines their flexibility and good classification quality. We propose a new approach to convert numeric time series into symbolic ones based on fuzzy clustering. The goal is to reduce noise in the data. The proposed method utilizes cluster membership values to determine symbols that characterize the time series. The new approach was tested in an empirical procedure to validate its correctness while achieving satisfying results.