Multivariate Singular Spectrum Analysis: A Principled, Practical, and Performant Solution for Time Series Imputation and Forecasting

dc.contributor.advisor: Devavrat Shah
dc.contributor.author: ABDULLAH OMAR MOHAMMAD ALOMAR
dc.date: 2021
dc.date.accessioned: 2022-06-02T16:55:09Z
dc.date.available: 2022-06-02T16:55:09Z
dc.degree.department: Computational Science and Engineering
dc.degree.grantor: Massachusetts Institute of Technology
dc.description.abstract: The analysis of multivariate time series data is of great interest across many domains, including cyber-physical systems, finance, retail, and healthcare. A common goal across these domains is accurate imputation and forecasting of multivariate time series in the presence of noisy or missing data. Given the growing need to embed predictive functionality in high-performance systems, especially in applications with time series data (e.g., financial systems, control systems), it is increasingly vital to build principled prediction algorithms that are statistically and computationally performant and more broadly accessible. To that end, we introduce a novel variant of multivariate singular spectrum analysis (mSSA) that allows for accurate imputation and forecasting of the latent time-varying mean and variance of multivariate time series. We further justify this algorithm by introducing a natural spatio-temporal factor model, under which the algorithm is theoretically analyzed. Specifically, we establish the in-sample prediction error of our mSSA variant for both imputation and forecasting. Further, we propose an incremental variant of the algorithm, on which we build and evaluate tspDB, a real-time prediction system for time series data. tspDB aims to increase accessibility to predictive functionality for time series data through direct integration with existing relational time series databases. Finally, through rigorous experiments, we show that tspDB provides state-of-the-art statistical accuracy while maintaining superior computational performance, with incremental model updates, low model training time, and low latency for prediction queries.
dc.identifier.uri: https://drepo.sdl.edu.sa/handle/20.500.14154/63828
dc.language.iso: en
dc.title: Multivariate Singular Spectrum Analysis: A Principled, Practical, and Performant Solution for Time Series Imputation and Forecasting
sdl.thesis.level: Master
sdl.thesis.source: SACM - United States of America
