Principal component analysis in Bayes spaces for sparsely sampled density functions with an application to changing temperature distributions

 
Speaker:
Sonja Greven
Affiliation:
Humboldt-Universität zu Berlin
When:
Wednesday 23rd October 2024
Time:
10:00:00
Where:
Aula Saleri - Seminari VI piano
Link to seminar:
Link to seminar will become active ten minutes before due starting time: Click HERE to refresh page
Abstract:
I will present a novel approach to functional principal component analysis (FPCA) in Bayes spaces in the setting where densities are the object of analysis, but only few individual samples from each density are observed. We use the observed data directly to account for all sources of uncertainty, instead of relying on prior estimation of the underlying densities in a two-step approach, which can be inaccurate if small or heterogeneous numbers of samples per density are available. To account for the constrained nature of densities, we base our approach on Bayes spaces, which extend the Aitchison geometry for compositional data to density functions. For modeling, we exploit the isometric isomorphism between the Bayes space and the L2 subspace L2_0 with integration-to-zero constraint through the centered log-ratio transformation. As only discrete draws from each density are observed, we treat the underlying functional densities as latent variables within a maximum likelihood framework and employ a Monte Carlo Expectation Maximization (MCEM) algorithm for model estimation. Resulting estimates are useful for exploratory analyses of density data, for dimension reduction in subsequent analyses, as well as for improved preprocessing of sparsely sampled density data compared to existing methods. The proposed method is applied to analyze the distribution of maximum daily temperatures in Berlin during the summer months for the last 70 years, as well as the distribution of rental prices in the districts of Munich. The talk is based on joint work wit Lisa Steyer, available at https://arxiv.org/abs/2309.11352 This initiative is part of the “Ph.D. Lectures” activity of the project "Departments of Excellence 2023-2027" of the Department of Mathematics of Politecnico di Milano. This activity consists of seminars open to Ph.D. students, followed by meetings with the speaker to discuss and go into detail on the topics presented at the talk. Contatti: laura.sangalli@polimi.it, alessandra.menafoglio@polimi.it, piercesare.secchi@polimi.it, simone.vantini@polimi.it.
PDF: