← All initiatives
OR · 02/ Research paper/ In review · workshop track

Causal Structure Recovery under Partial Observability

We show that the standard estimate-then-discover pipeline for causal inference in partially observed dynamical systems is systematically biased when built on RTS-smoothed states. We prove that filtered state estimates preserve the exogeneity conditions required for Granger-style discovery, and demonstrate empirically that switching representation — at zero additional cost — recovers near-oracle structure.

ORIA ResearchCausal Inference · State-space
2026 · workshop trackPreprint · in review
§ 01 · Context

The variables of interest are rarely the ones we observe.

In most real-world settings — power systems, sensor networks, neural recordings — the variables whose causal relationships we want to understand are latent. We observe noisy projections of them through time, and we reconstruct the rest.

The standard pipeline has two steps. First, recover the latent trajectory with a Kalman filter or an RTS smoother. Then, run a causal discovery procedure — typically Granger, partial correlation, or VAR — on the recovered trajectory.

This paper argues that the first step is not neutral. The choice of state representation silently determines whether the second step is valid at all.

§ 02 · Claim

Smoothed states create endogeneity. Filtered states do not.

The core claim is structural, not empirical. RTS smoothing injects information from future observations into the estimate at every timestep:

t|T  =  f( y1, …, yt, …, yT ) (1)

When this estimate is regressed on its own past to test for Granger causality, the regressor is correlated with the error term. The OLS assumption fails. Bias follows.

t|t  =  f( y1, …, yt ) (2)

The filtered estimate depends only on the past. Exogeneity is preserved. The same downstream test — unchanged — returns near-oracle structure.

RTS-smoothed states create endogenous regressors in causal models. Filtered states preserve exogeneity.

The consequence is a pipeline-level, not method-level, error. It is not fixed by changing the discovery algorithm. It is fixed by changing what you feed it.

Figure 01 · Interactive

A three-node system, four ways of looking at it.

A ground-truth causal graph over x, y, z — only two edges — passed through a partially observed linear dynamical system. Switch the state representation and watch which edges the Granger test discovers.

A · Latent trajectoryt = 0 / 200
Noisy linear dynamics with two ground-truth edges: x → y and y → z. Observations are a partial, noisy projection.
B · Discovered causal graphRepresentation · True
Edges are shown with strength proportional to the p-value of the Granger F-test at a 0.05 threshold.
True positives
2 / 2
False positives
0
Estimated p̅ (true edges)
< .001
Verdict
Oracle
§ 04 · Experiment

The same test, four times.

We construct a linear-Gaussian state-space system with a known two-edge causal graph, apply a partial observation matrix, and feed each of four representations into the same battery of discovery methods — Granger F, VAR coefficient tests, partial correlation, and LASSO.

04.1Conditions

  1. Varying observability — rank of the observation matrix from 1 to full.
  2. Varying sample size — 200 to 20,000 timesteps.
  3. Varying noise — process and observation noise independently.
  4. Varying system dimension — 3 to 12 latent variables.

04.2Result

Across every condition, the ranking is the same. True and filtered are near-identical. Smoothed produces systematic false edges. Innovations are causally inert.

Representation Signal Bias Discovered
Truepreservednoneoracle
Filteredpreservednegligiblenear-oracle
Smootheddistortedstructuralfalse edges
Innovationsremovedn/ano edges

Critically, the smoothed-state false-positive rate increases with sample size. This rules out finite-sample noise as the cause. It is a structural artefact of the representation.

§ 05 · Implication

A one-line change. A different answer.

If you run an estimate-then-discover pipeline on a partially observed system, replace the smoothed trajectory with the filtered one. The Kalman recursion already produces it.

In any estimate-then-discover pipeline, use filtered states instead of smoothed states for causal inference.

05.1Limits

  • Analysis assumes linear Gaussian systems with correct model specification.
  • Experiments use synthetic data. Real-world behaviour is the next section of work.
  • Recovery degrades under very low observability or high latent dimension.
  • The nonlinear case remains open and is the direction we are moving toward.

05.2Direction

  • Integrated estimation-and-discovery formulations in which the two steps are jointly aware of each other.
  • Extension to nonlinear state estimation — particle and ensemble filters.
  • Evaluation on real datasets: power networks, neural population recordings, sensor arrays.
Colophon
Set in EB Garamond and JetBrains Mono. Margin notes after Tufte. Figure 01 runs a small linear-Gaussian Kalman simulation client-side; representations are computed on demand.
Status
OR · 02 · Workshop submission in review. Preprint available on request via Apply.