Bridging Correlation and Causation: A Causal Inference Framework for Robust Pharmacovigilance Signal Detection

Pharmacovigilance stands as a critical line of defense in patient safety, tasked with identifying adverse drug reactions (ADRs) after medicines reach the market. Its primary quantitative tool, disproportionality analysis (DPA), sifts through millions of spontaneous reports to find statistical associations between drugs and adverse events. However, a fundamental and persistent challenge has been the chasm between statistical association and true causation. A new methodological paradigm, detailed in the comprehensive paper “Causal Inference Tools for Pharmacovigilance,” offers a rigorous framework to bridge this gap. By integrating Directed Acyclic Graphs (DAGs) from causal inference into pharmacovigilance practice, the authors provide a structured way to map, understand, and mitigate the biases that have long plagued signal detection.


The Core Problem: Why Association Is Not Causation

Spontaneous reporting systems are rich but messy data sources. Reports are influenced by a multitude of factors beyond a drug’s true biological effect: under-reporting, media attention (“notoriety bias”), concomitant diseases and medications, diagnostic suspicion, and more.

Traditional DPA, which calculates metrics like the Reporting Odds Ratio (ROR) or Information Component (IC), identifies signals of “disproportionate reporting.” Yet, as the paper emphasizes, disproportionate reporting can arise from at least five distinct mechanisms other than direct causality:

  1. Reverse Causality: The event leads to the drug’s use (e.g., impulsivity leading to an antipsychotic prescription).
  2. Confounding Bias: A common cause influences both drug exposure and event occurrence (e.g., a disease like schizophrenia causing both haloperidol use and impulsivity).
  3. Collider Bias: Conditioning on a common effect of the drug and event opens a spurious path (e.g., analyzing only fatal reports when a drug improves survival but an infection causes death).
  4. Measurement Error: Imperfect or incomplete reporting of exposures, outcomes, or confounders.
  5. Reporting Bias: Systemic distortions in the reporting process itself (e.g., notoriety bias, masking).

Neglecting these mechanisms leads to either false positives (spurious signals wasting regulatory resources and causing undue alarm) or false negatives (missing true safety risks). The paper argues that merely listing biases in a “limitations” section is insufficient; a proactive, structured approach is needed.
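
The metrics named above can be made concrete with a small sketch (mine, not the paper's); the counts are invented, purely for illustration:

```python
import math

def reporting_odds_ratio(a, b, c, d):
    """ROR from a 2x2 table of report counts:
    a: drug AND event   b: drug, no event
    c: no drug, event   d: neither
    """
    return (a / b) / (c / d)

def information_component(a, n_drug, n_event, n_total):
    """IC = log2(observed / expected-under-independence) report count."""
    expected = n_drug * n_event / n_total
    return math.log2(a / expected)

# Invented counts, for illustration only.
a, b, c, d = 40, 960, 200, 98_800
print(round(reporting_odds_ratio(a, b, c, d), 1))                 # 20.6
print(round(information_component(a, a + b, a + c, 100_000), 2))  # 4.06
```

Note that both metrics only quantify disproportionate reporting: nothing in the arithmetic distinguishes the five non-causal mechanisms listed above from a genuine drug effect.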


The Solution: Directed Acyclic Graphs (DAGs) as a Formal Language

The paper’s central proposition is to use Directed Acyclic Graphs (DAGs) as an “epistemological framework.” A DAG is a visual model where:

  • Nodes represent variables (Drug D, Event E, Confounder C).
  • Directed Arrows (→) represent assumed causal relationships.
  • The graph is Acyclic, meaning no variable can cause itself in a loop.

DAGs are not a new statistical test but a tool for explicitly stating causal assumptions before analysis. They force researchers to translate domain knowledge (e.g., “schizophrenia causes impulsivity and is an indication for haloperidol”) into a testable causal diagram. This transparency is the first step towards robust inference.
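
A DAG of this kind is also easy to encode programmatically. The sketch below (illustrative, not taken from the paper) represents the haloperidol example as an adjacency list, verifies the acyclicity property, and reads off candidate confounders as nodes with arrows into both drug and event:

```python
# Toy encoding of the haloperidol example; edges point from cause to effect.
dag = {
    "schizophrenia": ["haloperidol", "impulsivity"],
    "haloperidol": [],
    "impulsivity": [],
}

def is_acyclic(graph):
    """Depth-first search for back-edges; True iff the graph is a valid DAG."""
    WHITE, GRAY, BLACK = 0, 1, 2
    color = {node: WHITE for node in graph}

    def visit(node):
        color[node] = GRAY
        for child in graph.get(node, []):
            if color[child] == GRAY:  # back-edge found: a cycle
                return False
            if color[child] == WHITE and not visit(child):
                return False
        color[node] = BLACK
        return True

    return all(visit(n) for n in graph if color[n] == WHITE)

# A node pointing into both drug and event opens a back-door path.
confounders = [v for v, children in dag.items()
               if "haloperidol" in children and "impulsivity" in children]

print(is_acyclic(dag))  # True
print(confounders)      # ['schizophrenia']
```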


A Detailed Walkthrough: Formalizing and Tackling Biases

The paper meticulously demonstrates how DAGs formalize specific pharmacovigilance biases, using real-world examples from the FDA Adverse Event Reporting System (FAERS).

  1. Confounding Bias (The Open Back-Door Path): This occurs when a variable (C) causes both exposure to Drug (D) and the occurrence of Event (E). The classic example is confounding by indication.
    • Example: A signal suggests haloperidol → impulsivity. The DAG reveals Schizophrenia → haloperidol and Schizophrenia → impulsivity. Schizophrenia is an open “back-door path” creating a spurious association.
    • Mitigation: To close this path, one must condition on the confounder. The analysis is restricted to reports where schizophrenia is recorded. In this subset, the haloperidol-impulsivity association disappears, correctly indicating the signal was confounded.
  2. Collider Bias (The Opened Path): This occurs when conditioning on a variable (F) that is a common effect of both D and E.
    • Example: Analyzing CAR-T therapy → fatal infections. If you restrict analysis only to fatal reports, you condition on a collider: CAR-T improves survival (a protective D→F effect), while infections increase death risk (E→F). Conditioning on F creates a false positive association.
    • Mitigation: Awareness from the DAG warns against conditioning on the collider (fatality). The crude analysis on all reports is less biased.
  3. Reporting Bias (The Generative Process): The paper provides a nuanced DAG of the reporting mechanism itself. For a report to exist at all, at least one drug (Rd) and at least one event (Re) must be reported. This inherent selection creates colliders.
    • Example – Notoriety Bias: After an FDA warning on aripiprazole and impulse control disorders, reporting for that pair increases. The DAG shows this inflates the IC.
    • Mitigation: Restricting analysis to reports before the regulatory warning removes this inflationary bias.
  4. Measurement Error: DAGs distinguish between the true construct (e.g., having cancer) and its measurement in the database (e.g., a reported cancer indication). Conditioning on a poorly measured confounder may not fully close the back-door path. The paper suggests using composite measures (e.g., cancer indication OR concomitant chemotherapy) to better capture the underlying construct.
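
The confounding scenario in point 1 can be reproduced with a small simulation. In this hypothetical sketch (not from the paper), the drug has no true effect on the event, yet the crude ROR is strongly elevated; only the analysis restricted to the confounder's stratum recovers the null:

```python
import random

random.seed(0)

# Hypothetical simulation: schizophrenia (C) raises the probability of both
# haloperidol exposure (D) and impulsivity (E); D has NO true effect on E.
reports = []
for _ in range(50_000):
    c = random.random() < 0.2
    d = random.random() < (0.6 if c else 0.05)
    e = random.random() < (0.4 if c else 0.05)
    reports.append((c, d, e))

def ror(rows):
    """Reporting odds ratio of D and E over a set of (C, D, E) reports."""
    a = sum(1 for _, d, e in rows if d and e)
    b = sum(1 for _, d, e in rows if d and not e)
    c_ = sum(1 for _, d, e in rows if not d and e)
    d_ = sum(1 for _, d, e in rows if not d and not e)
    return (a / b) / (c_ / d_)

crude = ror(reports)                            # strongly elevated (around 5)
stratified = ror([r for r in reports if r[0]])  # within schizophrenia: near 1
print(round(crude, 2), round(stratified, 2))
```

Conditioning on the confounder closes the back-door path, and the spurious signal disappears, mirroring the haloperidol-impulsivity example above.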

A Practical Workflow: From Knowledge to Action

The authors propose a systematic, multi-step workflow (illustrated in their Figures 7 & 8) for applying DAGs to a real signal assessment, using aripiprazole and impulse control disorders as a case study:

  1. Define & Inquire: Specify drug (aripiprazole) and event (impulse control disorders).
  2. Build the DAG (Knowledge Elicitation):
    • Add confounders (e.g., bipolar disorder, which causes both aripiprazole use and impulsivity).
    • Add colliders (e.g., masking by other dopamine agonists).
    • Add reporting biases (e.g., notoriety from the 2016 FDA warning).
    • Link constructs to their imperfect measurements in FAERS (e.g., using lithium use as a surrogate for bipolar disorder).
  3. Design the Analysis: Use the DAG to plan conditioning strategies. For aripiprazole, this meant:
    • Conditioning on bipolar disorder/lithium (to close the back-door path through this confounder).
    • Restricting to pre-warning reports (to address notoriety bias).
    • Removing reports of competing drugs (to address masking).
  4. Interpret Results: The forest plot shows the signal persists even after these adjustments, strengthening the case for a causal relationship, while transparently acknowledging residual biases from unmeasured variables.
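
In code, the conditioning strategies of step 3 amount to filtering the report set before computing disproportionality. The sketch below uses invented records and hypothetical field names (not the actual FAERS schema), with an approximate warning date:

```python
from datetime import date

# Invented records with hypothetical field names (not the actual FAERS schema).
reports = [
    {"drugs": {"aripiprazole", "lithium"},
     "events": {"impulse control disorder"}, "date": date(2015, 6, 1)},
    {"drugs": {"aripiprazole"}, "events": {"nausea"}, "date": date(2017, 3, 1)},
    {"drugs": {"pramipexole"},
     "events": {"impulse control disorder"}, "date": date(2014, 9, 1)},
]

FDA_WARNING = date(2016, 5, 3)  # approximate date of the 2016 FDA warning
DOPAMINE_AGONISTS = {"pramipexole", "ropinirole"}  # illustrative masking drugs

analysis_set = [
    r for r in reports
    if r["date"] < FDA_WARNING                # notoriety bias: pre-warning only
    and "lithium" in r["drugs"]               # surrogate-condition on bipolar
    and not (r["drugs"] & DOPAMINE_AGONISTS)  # remove competing-drug reports
]
print(len(analysis_set))  # 1
```

The disproportionality metric is then recomputed on this filtered set, and the comparison of crude versus adjusted estimates is what the paper's forest plot summarizes.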

Limitations, Future Directions, and Transformative Potential

The paper is careful to state that DAGs are not a magic bullet. They require deep domain expertise, can be mis-specified, and do not eliminate unmeasured confounding. The “upfront investment” in building them is substantial. However, their benefits are profound:

  • Transparency & Critique: They make assumptions public, enabling constructive scientific debate.
  • Improved Signal Specificity: By proactively adjusting for known biases, DPA becomes more reliable, reducing the flood of spurious signals.
  • Guide for Further Research: DAGs explicitly highlight missing information, guiding the design of targeted pharmacoepidemiological studies to confirm or refute signals.

Future work involves tailoring DAGs further to pharmacovigilance, better mapping reporting processes, and integrating DPA results into formal evidence-synthesis workflows.


Conclusion

This paper marks a significant advancement in pharmacovigilance methodology. It moves the field from passively acknowledging the limitations of disproportionality analysis to actively managing them with tools from formal causal inference.

By adopting DAGs, pharmacovigilance scientists can transform DPA from a blunt hypothesis-generating tool into a more refined, knowledge-driven component of a cumulative evidence synthesis process. The ultimate winner is patient safety, as signals become more reliable, communication more transparent, and regulatory action better informed.

