RemNote Community
Community

Epidemiology - Causal Inference Legal and Advanced Concepts

Understand causal inference principles, legal interpretation of epidemiologic evidence, and advanced epidemiologic concepts such as age adjustment, Mendelian randomization, and spatial analysis.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz

Quick Practice

What is the primary goal of epidemiology beyond merely identifying correlations?
1 of 19

Summary

Causal Inference in Epidemiology Introduction One of epidemiology's most important challenges is determining whether an exposure actually causes disease, rather than merely being associated with it. This distinction matters enormously—in public health policy, clinical practice, and law. This section explores how epidemiologists establish causal relationships and the framework they use to evaluate evidence. Foundations of Causal Inference Why Correlation Isn't Enough At the heart of causal inference is a fundamental principle: correlation alone cannot establish causation. Two variables may be associated without either causing the other. Consider this classic example: ice cream sales and drowning deaths are highly correlated, but neither causes the other. Both are caused by a third variable—warm weather. Epidemiology specifically seeks to identify true causal relationships: situations where an exposure actually produces a change in disease risk, not merely appears alongside it. Types of Causal Conditions Not all causes work the same way. Understanding different types of causation helps us think precisely about how disease develops: Necessary Causes are conditions that must be present for disease to occur. For example, the human papillomavirus (HPV) is considered a necessary cause of cervical cancer—the cancer cannot develop without HPV infection. Sufficient Causes are conditions that, by themselves, always produce disease. For example, a lethal dose of cyanide is sufficient to cause death. However, most environmental and lifestyle exposures are not sufficient causes in this strict sense. Probabilistic Causes are exposures that increase the probability of disease but don't guarantee it. Smoking is a probabilistic cause of lung cancer—most smokers don't develop lung cancer, but smoking increases the risk. This is the most common type of causation encountered in epidemiology. The Causal Pie Model The causal pie model provides a helpful visual framework for understanding how multiple causes combine to produce disease. Imagine disease as a "pie"—the disease occurs when all the necessary component causes come together, like assembling all the pieces of a pie. In this model: No single component cause is sufficient to produce disease by itself All components must be present together for disease to occur Different people's "pies" may have different component causes For example, colorectal cancer might result from the combination of genetic predisposition, chronic inflammation, and dietary factors in one person, but a different combination of factors in another person. This model explains why causation is often complex and multifactorial. Bradford Hill Criteria In 1965, Sir Austin Bradford Hill proposed a set of nine criteria to evaluate whether an observed association is likely to be causal. These criteria remain the gold standard in epidemiology for assessing causal evidence. They are not absolute rules—not all must be met, and their presence doesn't prove causation. Rather, they provide a framework for weighing evidence. Strength of Association Strength of association refers to the magnitude of the relationship between exposure and disease. A strong association (measured by relative risk, odds ratio, or similar measures) is more likely to be causal than a weak one. Why? Weak associations are more easily explained by unmeasured confounding or bias. If an exposure increases disease risk 10-fold, it's harder to explain away as coincidence than an association of 1.1-fold. For example, the association between smoking and lung cancer (relative risk 20) is much stronger than many other suspected relationships, making causation more plausible. However, causation can still exist with weak associations, so this criterion alone is insufficient. Consistency Consistency requires that similar findings appear across different studies, populations, settings, and time periods. If one study shows an association but others don't, we have less confidence in causation. When many researchers in different places using different methods all find the same association, it strengthens the causal argument. For example, the relationship between smoking and lung cancer has been demonstrated consistently across dozens of countries, different age groups, and over decades—this consistency is compelling evidence for causation. Specificity Specificity suggests that causation is more likely when: A specific exposure leads to a specific disease (not multiple diseases) In a specific population For instance, the association between asbestos and mesothelioma is quite specific—asbestos exposure particularly causes this cancer, not a broad range of cancers. This specificity makes a causal relationship more plausible. However, we should note that many causes do produce multiple effects (a single exposure can cause several diseases), so the absence of perfect specificity doesn't rule out causation. Temporality Temporality is perhaps the most fundamental criterion: the exposure must precede the outcome in time. You cannot be exposed to something after disease has already developed (except in rare cases where the disease was subclinical). This is why prospective studies—where exposure is measured before disease develops—provide stronger evidence than retrospective studies. For example, following never-smokers forward in time and measuring who subsequently develops lung cancer is stronger evidence than asking cancer patients to recall their smoking history. Biological Gradient (Dose-Response) Biological gradient (or dose-response relationship) indicates that greater exposure leads to greater disease incidence or severity. If doubling someone's exposure approximately doubles their risk, this suggests causation. For example, research shows that lung cancer risk increases as smoking increases—a person smoking 20 cigarettes daily has higher risk than someone smoking 5 daily. This dose-response relationship supports causation. Biological Plausibility Biological plausibility considers whether a biologically sensible mechanism explains how the exposure produces disease. Does our understanding of biology make the causal pathway reasonable? For smoking and lung cancer, the plausibility is clear: tobacco smoke contains known carcinogens that can damage DNA in lung cells, initiating cancer. Without biological plausibility, we might question whether an observed association reflects causation or is merely coincidental. Coherence Coherence evaluates whether epidemiologic findings align with laboratory evidence and existing biological knowledge. When diverse lines of evidence point in the same direction, causation becomes more likely. For example, if epidemiologic studies show increased cancer risk from an exposure, and laboratory studies confirm that the substance damages DNA in animal models, coherence between epidemiologic and laboratory findings strengthens the causal argument. Experiment Experiment notes that experimental evidence—particularly randomized controlled trials—provides the strongest causal evidence when available. In experimental studies, researchers control which people receive the exposure, eliminating confounding and establishing clear temporal relationships. However, many causal questions in epidemiology cannot be studied experimentally for ethical or practical reasons. We cannot randomize people to smoke or to live with pollution, for example. Analogy Analogy suggests that similar exposures producing similar effects can strengthen causal arguments. If we've established that compound A causes disease X, and compound B (which is chemically or structurally similar) appears associated with disease X, this analogy supports causation. For instance, establishing that one chlorofluorocarbon depletes stratospheric ozone strengthens the argument for similar compounds doing the same. General vs. Specific Causation A crucial distinction in epidemiology—especially in legal and policy contexts—exists between two types of causal questions: General causation asks: "Can this exposure cause this disease in a population?" Epidemiology specializes in answering this question through population-level studies. When we use Bradford Hill criteria to evaluate evidence, we're assessing general causation. Specific causation asks: "Did this exposure cause disease in this particular individual?" Epidemiology cannot directly answer this question. Even if smoking causes lung cancer in general, we cannot definitively prove that Ms. Jones's specific lung cancer was caused by her smoking rather than by genetic factors, radon exposure, or other causes. This distinction is critical in legal cases. A court might establish that Agent Orange can cause disease in general (general causation), but proving it caused disease in a specific plaintiff requires additional evidence beyond epidemiologic data, such as medical history and exposure documentation. Study Design and Weight of Evidence Different epidemiologic study designs provide different levels of evidence for causation. The hierarchy shown here reflects how well each design can establish causality: Experimental studies (at the top) provide the strongest causal evidence because researchers directly control exposure assignment, eliminating confounding and establishing clear temporal relationships. Observational analytical studies (the middle tier—cohort, case-control, cross-sectional) provide progressively stronger evidence as you move from cross-sectional to cohort designs. Cohort studies are particularly strong because they establish temporality clearly. Descriptive studies (at the bottom) describe patterns but provide the weakest evidence for causation. However, they can generate hypotheses that lead to stronger studies. <extrainfo> Related Epidemiologic Concepts While not directly central to establishing causality, these concepts are related to epidemiologic research and disease investigation: Age adjustment is a statistical technique used to compare disease rates between populations with different age structures, since many diseases occur more frequently at certain ages. Contact tracing is an infectious disease control measure that identifies and monitors individuals exposed to a disease case, used particularly during outbreak investigations. Compartmental models are mathematical frameworks describing how individuals move through disease states (susceptible → infected → recovered), useful for understanding disease transmission dynamics. Mendelian randomization is an advanced method using genetic variants as "natural experiments" to infer causation, helping overcome confounding in observational studies. Spatial epidemiology studies how disease is distributed geographically and identifies environmental causes by examining these patterns. The famous 1854 cholera map (shown in the article image) is a classic example: by mapping cholera deaths, John Snow identified that a contaminated water pump was causing an outbreak in London. </extrainfo>
Flashcards
What is the primary goal of epidemiology beyond merely identifying correlations?
Identifying true causal relationships
What are the three types of conditions causes can be for a health outcome?
Necessary conditions Sufficient conditions Probabilistic conditions
Which model visualizes multiple component causes working together to produce a disease?
The causal pie model
How does the strength of association relate to the likelihood of causality?
Larger effect sizes increase the likelihood of causality
What does the criterion of consistency require for findings to be considered causal?
Similar findings across different studies, populations, and settings
When is causation considered more likely according to the specificity criterion?
When a particular exposure leads to a specific disease in a specific population
What is the requirement of the temporality criterion in causal inference?
Exposure must precede the outcome
What does the biological gradient criterion indicate about the relationship between exposure and disease?
Greater exposure generally leads to greater incidence of disease
What factor does the plausibility criterion evaluate regarding the relationship between exposure and effect?
Whether a biologically plausible mechanism exists
Which criterion evaluates the agreement between epidemiologic findings and laboratory evidence?
Coherence
What does the analogy criterion suggest strengthens a causal argument?
Similar exposures producing similar effects
What is the definition of general causation in an epidemiologic and legal context?
Whether an agent can cause disease in a population
Why is epidemiology usually insufficient to prove specific causation?
It does not prove that an agent caused disease in a specific individual
What is the purpose of the age adjustment technique in epidemiology?
To compare populations with different age structures by standardizing rates
What is the primary function of contact tracing?
Identifying and following individuals exposed to an infectious disease case
What do compartmental models describe in the context of disease?
The flow of individuals through disease states (e.g., susceptible, infected, recovered)
How does Mendelian randomization use genetic variants to infer causal relationships?
It uses them as instrumental variables
What are the two main focuses of spatial epidemiology?
Geographic distribution of health outcomes and their environmental determinants
What is the definition of a zoonosis?
A disease transmitted from animals to humans

Quiz

In epidemiology, what does “general causation” refer to?
1 of 10
Key Concepts
Causation and Inference
Causal inference
Bradford Hill criteria
General causation
Specific causation
Mendelian randomization
Epidemiological Methods
Age adjustment
Contact tracing
Compartmental model
Spatial epidemiology
Zoonosis