Functional principal component analysis for incomplete space-time data
Keywords
Computational learning
Advanced Numerical Methods for Scientific Computing
Statistics
Statistical learning
Geosciences/Protection of Land and Water Resources
Code:
29/2024
Title:
Functional principal component analysis for incomplete space-time data
Date:
Tuesday 19th March 2024
Author(s):
Palummo, A.;, Arnone, E.; Formaggia, L.; Sangalli, L.M.
Abstract:
Environmental signals, acquired, e.g., by remote sensing, often present large gaps of missing observations in space and time. In this work, we present an innovative approach to identify the main variability patterns, in space-time data, when data may be affected by complex missing data structures. We formalise the problem in the framework of Functional Data Analysis, proposing an innovative method of functional Principal Component Analysis (fPCA) for incomplete space-time data. The functional nature of the proposed method permits to borrow information from measurements observed at nearby spatio-temporal locations. The resulting functional principal components are smooth fields over the considered spatio-temporal domain, and can lead to interesting insights in the spatio-temporal dynamic of the phenomenon under study. Moreover, they can be used to provide a reconstruction of the missing entries, also under severe missing data patterns. The proposed model combines a weighted rank-one approximation of the data matrix with a roughness penalty. We show that the estimation problem can be solved using a Majorize-Minimization approach, and we provide a numerically efficient algorithm for its solution. Thanks to a discretization based on finite elements in space and B-splines in time, the proposed method can handle multidimensional spatial domains with complex shapes, such as water bodies with complicated shorelines, or curved spatial regions with complex orography. As shown by simulation studies, the proposed space-time fPCA is superior to alternative techniques for Principal Component Analysis with missing data. We further highlight the potentiality of the proposed method for environmental problems, by applying space-time fPCA to the study of the Lake Water Surface Temperature (LWST) of Lake Victoria, in Central Africa, starting from satellite measurements with large gaps. LWST is considered one of the fundamental indicators of how climate change is affecting the environment, and is recognized as an Essential Climate Variable.
This report, or a modified version of it, has been also submitted to, or published on
Environmental and Ecological Statistics
Environmental and Ecological Statistics