Data-driven Differential Equation Identification

Identify differential equation from trajectory data

Data-driven differential equation identification seeks to uncover the governing dynamical models underlying observed phenomena, whether traced from physical experiments or emerging from biological processes.

Our research pursues three interconnected objectives:

  • Algorithm Development — designing effective, robust, and interpretable methods for recovering differential equations from noisy, sparse, or high-dimensional data.
  • Identifiability Theory — establishing mathematical frameworks to characterize identifiability conditions for various differential equations.
  • Uncertainty Quantification — characterizing and propagating uncertainty through the identification pipeline to produce reliable, trustworthy models.

Identification from Noisy Trajectory Data

Unlike classical regression, the feature matrix consists of differential quantities estimated from single noisy trajectory data. Finite difference schemes inherently amplify noise, making accurate feature estimation and reliable model selection highly challenging. We develop robust computational techniques and rigorous model validation methods to accurately recover governing equations from noisy observations.

Noisy finite difference
Denoised derivative (SDD)
Model validation (MTEE)
Accuracy vs. noise level

References

(He et al., 2022)


Varying Coefficient PDE Identification

Description coming soon.

Some identification results
Consistent and Sparse Local Regression (CaSLR) identifies varying coefficient PDEs using local patches, within each of which the varying coefficients are well approximated by constants.

References

(He et al., 2023) (He et al., 2025) (Tang et al., 2025) (He et al., 2024)


Stochastic ODE/PDE Identification

Description coming soon.

Figure placeholder. Replace with your result figure.

References

(Cui & He, 2025)


Two-phase PDE Identification

Description coming soon.

Phase identification
Covering patches(SDD)
Evolution based localization
Uncertainty quantification

References

(Yang & He, 2026)


Identifiability Theory

A fundamental question in data-driven PDE identification is: given observed solution data, when can the underlying PDE be uniquely recovered?

  • Data space characterization — for elliptic operators, all snapshots of a single trajectory stay $\varepsilon$-close to a linear space of dimension $O(|\log \varepsilon|^2)$, revealing the intrinsic ill-conditioning of single-trajectory identification
  • Identifiability from two instants — for PDEs with constant coefficients, the parameters are uniquely determined from solutions at two time instants $u(x,t_1)$, $u(x,t_2)$ if the Fourier support $Q$ satisfies sharp combinatorial and geometric conditions
  • Variable coefficient identifiability — for PDEs with variable coefficients, $\binom{n+d}{d}$ time instants suffice for local recovery, provided the solution contains sufficiently diverse Fourier modes
  • Stability analysis — high-frequency perturbations to elliptic operators have limited impact on the solution, with explicit bounds depending on the operator order and regularity of the initial data

References

(He et al., 2024) (He et al., 2022)

References

2026

  1. Phase-IDENT: Identification of Two-phase PDEs with Uncertainty Quantification
    Edward L Yang and Roy Y He
    arXiv preprint arXiv:2601.11922, 2026

2025

  1. Group projected subspace pursuit for block sparse signal reconstruction: Convergence analysis and applications
    Roy Y He, Haixia Liu, and Hao Liu
    Applied and Computational Harmonic Analysis, 2025
  2. WG-IDENT: Weak group identification of PDEs with varying coefficients
    Cheng Tang, Roy Y He, and Hao Liu
    Journal of Computational Physics, 2025
  3. Stoch-ident: New method and mathematical analysis for identifying spdes from data
    Jianbo Cui and Roy Y He
    arXiv preprint arXiv:2508.19177, 2025

2024

  1. How much can one learn a partial differential equation from its solution?
    Yuchen He, Hongkai Zhao, and Yimin Zhong
    Foundations of Computational Mathematics, 2024

2023

  1. Group projected subspace pursuit for identification of variable coefficient differential equations (GP-IDENT)
    Yuchen He, Sung Ha Kang, Wenjing Liao, Hao Liu, and Yingjie Liu
    Journal of Computational Physics, 2023

2022

  1. Robust identification of differential equations by numerical techniques from a single set of noisy observation
    Yuchen He, Sung-Ha Kang, Wenjing Liao, Hao Liu, and Yingjie Liu
    SIAM Journal on Scientific Computing, 2022
  2. Asymptotic theory of-regularized pde identification from a single noisy trajectory
    Yuchen He, Namjoon Suh, Xiaoming Huo, Sung Ha Kang, and Yajun Mei
    SIAM/ASA Journal on Uncertainty Quantification, 2022