tsdataleaks

From MaRDI portal
Software:5976368



CRANtsdataleaksMaRDI QIDQ5976368

Exploit Data Leakages in Time Series Forecasting Competitions

Thiyanga S. Talagala

Last update: 6 February 2024

Copyright license: GNU General Public License, version 3.0, GNU General Public License, version 2.0

Software version identifier: 2.1.1



Forecasting competitions are of increasing importance as a mean to learn best practices and gain knowledge. Data leakage is one of the most common issues that can often be found in competitions. Data leaks can happen when the training data contains information about the test data. For example: randomly chosen blocks of time series are concatenated to form a new time series, scale-shifts, repeating patterns in time series, white noise is added in the original time series to form a new time series, etc. 'tsdataleaks' package can be used to detect data leakages in a collection of time series.