Modeling time-varying genetic effects on binary disease risk via functional Mendelian Randomization

Keywords

Statistical learning
Health Analytics
Code:
57/2026
Title:
Modeling time-varying genetic effects on binary disease risk via functional Mendelian Randomization
Date:
Monday 29th June 2026
Author(s):
Fontana, N.; Secchi, P.; Di Angelantonio, E.; Ieva, F.
Download link:
Abstract:
Motivation: Genome-wide association studies have identified thousands of genetic variants associated with complex traits, establishing Mendelian Randomization (MR) as a powerful framework for causal inference using variants as natural experiments. However, existing MR methods treat causal effects as static, relying on cross-sectional exposure measurements and ignoring how genetic predispositions to disease operate dynamically across the life course. Recovering age-specific causal effect functions from longitudinal data requires combining functional data representations of exposure trajectories with instrumental variable estimation strategies suitable for binary disease endpoints, a methodological gap that has remained unaddressed. Results: We develop a functional MR framework for binary outcomes that integrates Functional Principal Component Analysis with Two-Stage Residual Inclusion (2SRI), ensuring consistent estimation under the nonlinear logistic link function that renders standard instrumental variable estimators inconsistent. Simulations across different causal effect trajectory shapes, varying measurement densities, and varying instrument strengths demonstrate accurate recovery of time-varying genetically predicted effects with minimal bias. Applied to UK Biobank data, the framework identifies an age-specific causal effect of genetically predicted body mass index on type 2 diabetes risk concentrated in early mid-adulthood and progressively attenuating thereafter. Concordance between the proposed 2SRI estimator applied to type 2 diabetes and the established continuous-outcome functional MR estimator applied to the paired glycated haemoglobin marker in the same cohort provides indirect empirical support for the validity of the proposed approach. Availability and implementation: The method is implemented in the R package mvfmr, with a full tutorial vignette.
This report, or a modified version of it, has been also submitted to, or published on
Bioinformatics (accettato)