Applications

Where conformal prediction earns its keep

The site spends a while on what the guarantee doesn’t give you. Here is the other side: the large, high-stakes territory where coverage is precisely the objective, and conformal prediction is hard to beat.

One test decides it. Is your loss a function of whether the truth lands in a set, or of where it lands? When the answer is “whether,” the set is the deliverable, a coverage guarantee is exactly what you want, and conformal prediction supplies it distribution-free and finite-sample. A surprising amount of real work lives in that column. The papers below are ones that, on inspection, use the tool for what it does, drawn from a survey of the literature (the methods behind them are in the theory list).

1 · Selective prediction and human-in-the-loop decisions

The most direct use. Instead of one answer, return a small set guaranteed to contain the truth (say) 95% of the time, and route the hard cases (the large sets) to a human. The contract is exactly a coverage guarantee: the human is handed the right shortlist with controlled probability, and set size becomes a calibrated “how sure are we” signal. This is the form behind medical triage, content moderation, and document review.

Angelopoulos, Bates, Jordan & Malik (2021). Uncertainty sets for image classifiers using conformal prediction (RAPS). ICLR. arXiv:2009.14193: small, stable label sets for deep classifiers.
Cresswell, Sui, Kumar & Vouitsis (2024). Conformal prediction sets improve human decision making. arXiv:2401.13744: a pre-registered trial: people decide better with conformal sets than fixed-size sets of equal coverage.
Straitouri, Wang, Okati & Gomez-Rodriguez (2022). Improving expert predictions with conformal prediction. arXiv:2201.12006: the set as decision support, with a calibrated coverage/size trade-off.
Making deep learning models clinically useful: improving diagnostic confidence in inherited retinal disease with conformal prediction (MICCAI UNSURE workshop, 2024). doi:10.1007/978-3-031-73158-7_5: LAPS/APS/RAPS sets over the Eye2Gene classifier, flagging cases for a second opinion.
Hullman, Wu, Xie, Guo & Gelman (2025). Conformal prediction and human decision making. arXiv:2503.11709: the decision-theoretic audit of this section’s premise — when calibrated sets actually help a person decide, and when they do not.
Sadinle, Lei & Wasserman (2019). Least ambiguous set-valued classifiers with bounded error levels. JASA. arXiv:1609.00451: the smallest-ambiguity prediction sets at a target coverage; the method behind the adaptive prediction sets demo.
Romano, Sesia & Candès (2020). Classification with valid and adaptive coverage (APS). NeurIPS. arXiv:2006.02544: adaptive label sets with markedly better coverage across inputs.

2 · Anomaly, novelty, and out-of-distribution detection

Here conformal prediction is not a forecaster at all but a distribution-free hypothesis test: the conformal \(p\)-value asks “is this new point exchangeable with what I have seen?” and coverage becomes exact Type-I error control. This is conformal prediction on its firmest ground, and it powers fraud detection, fault monitoring, and covariate-shift alarms.

Smith, Nouretdinov, Craddock, Offer & Gammerman (2014). Anomaly detection of trajectories with kernel density estimation by conformal prediction. Springer: distribution-free anomaly \(p\)-values for sequences.
Hu & Lei (2020). A distribution-free test of covariate shift using conformal prediction. arXiv:2010.07147: valid Type-I control for detecting shift, no model assumptions.
Bates, Candès, Lei, Romano & Sesia (2023). Testing for outliers with conformal p-values. arXiv:2104.08279: the formal treatment of conformal outlier testing.

3 · Retrieval, recommendation, and screening

When the product is a candidate set (a shortlist of items, documents, or molecules), coverage is recall, and a distribution-free recall (or false-discovery-rate) guarantee is exactly the deliverable. Screening millions of compounds down to a guaranteed-to-contain-the-hits shortlist is the canonical example.

Angelopoulos, Bates, Fisch, Lei & Schuster (2024). Conformal risk control. ICLR: control expected recall / FNR, not just coverage.
Recommendation with distribution-free reliability guarantees (2022). arXiv:2207.01609: return a set of items with finite-sample false-discovery-rate control.
Cortés-Ciriano & Bender (2019). Concepts and applications of conformal prediction in drug discovery. arXiv:1908.03569: conformal screening and applicability domains in cheminformatics.
Morsomme & Smirnova (2019). Conformal prediction for students’ grades in a course recommender. PMLR: valid intervals as a reliability signal for recommendations.

4 · Language models and structured outputs

A fast-growing, genuinely good fit. Emit a set of candidate answers guaranteed to contain the correct one with high probability, or abstain / ask for help when that set grows too large. Coverage becomes a task-completion guarantee, and set size becomes a principled, distribution-free trigger for deferral, a real handle on hallucination.

Ren, Dixit, Bodrova, …, Sadigh, Zeng & Majumdar (2023). Robots that ask for help: uncertainty alignment for LLM planners (KnowNo). CoRL. arXiv:2307.01928: conformal sets decide when an LLM-driven agent should defer to a human.
Quach, Fisch, Schuster, … Barzilay & Jaakkola (2024). Conformal language modeling. arXiv:2306.10193: generation sets with calibrated correctness guarantees.
API is enough: conformal prediction for LLMs without logit access (2024). arXiv:2403.01216: coverage-guaranteed answer sets from black-box APIs.
Mohri & Hashimoto (2024). Language models with conformal factuality guarantees. ICML. arXiv:2402.10978: back off a generation to the subset of claims that is correct with high probability.

5 · Robotics, control, and safety envelopes

This is the richest seam of correct use. The question is containment: will the true state, trajectory, or obstacle lie inside a region with a chosen probability? That is conformal prediction’s native object. Prediction regions on other agents’ motion become provably safe controllers; conformal reachable sets certify a planner; runtime monitors flag failures with formal confidence.

Lindemann, Cleaveland, Shim & Pappas (2023). Safe planning in dynamic environments using conformal prediction. IEEE RA-L. arXiv:2210.10254: prediction regions on pedestrian trajectories give a provably safe MPC.
Muthali, Shen, Deglurkar, Lim, Roelofs, Faust & Tomlin (2023). Multi-agent reachability calibration with conformal prediction. arXiv:2304.00432: probabilistically safe, feasible prediction sets for planning.
Lin & Bansal (2023). Verification of neural reachable tubes via conformal prediction. arXiv:2312.08604: probabilistic safety guarantees, equivalent to scenario optimization.
Lindemann et al. (2023). Conformal prediction for STL runtime verification. arXiv:2211.01539: formal guarantees for predictive runtime monitoring.
Bay-area safety from sparse feedback (2025). Learning robot safety from sparse human feedback. arXiv:2501.04823: a warning region with a guaranteed miss rate.

6 · Risk control and certified decisions

Generalize coverage from “contain the truth” to “keep any monotone risk below a budget”: false-negative rate, miss rate, expected loss. The deliverable is a certificate on a decision rule, distribution-free and finite-sample, which is precisely what audited and safety-critical systems need.

Bates, Angelopoulos, Lei, Malik & Jordan (2021). Distribution-free, risk-controlling prediction sets (RCPS). JACM. arXiv:2101.02703: the foundation of risk control.
Online conformal risk control for distributed/remote inference (2024). arXiv:2409.07902: worst-case FNR guarantees under communication limits.
Snell, Zollo, Deng, Pitassi & Zemel (2022). Quantile risk control. arXiv:2212.13629: control quantiles of a loss, not just the mean.
Angelopoulos, Bates, Candès, Jordan & Lei (2021). Learn then test: calibrating predictive algorithms to achieve risk control. arXiv:2110.01052: turn any (even non-monotone) risk target into finite-sample control via multiple testing.

7 · Scientific discovery and screening with error control

In drug discovery, design, and high-throughput science the goal is a guaranteed-quality set of candidates to take forward, with the false-discovery rate held in check. Coverage and FDR control are the native objects, and conformal methods deliver them without distributional assumptions on the assay or simulator.

Fannjiang, Bates, Angelopoulos, Listgarten & Jordan (2022). Conformal prediction for the design problem. arXiv:2202.03613: valid uncertainty when you optimize over designs (feedback covariate shift).
Jin & Candès (2023). Selection by prediction with conformal p-values. arXiv:2210.01408: pick candidates above a threshold with FDR control.
Conditional calibration for false-discovery-rate control under dependence (2020). arXiv:2007.10438: finite-sample FDR with dependent statistics.

8 · Causal inference and counterfactuals

Causal questions are coverage questions wearing a disguise. An individual treatment effect, or a counterfactual outcome, is never observed, so the right deliverable is an interval guaranteed to contain it. Conformal inference supplies one without a fully specified outcome model, and the better versions degrade gracefully, widening rather than breaking, when unmeasured confounding is bounded rather than assumed away.

Lei & Candès (2021). Conformal inference of counterfactuals and individual treatment effects. JRSS-B. arXiv:2006.06138: valid intervals for an effect that is never directly seen.
Jin, Ren & Candès (2023). Sensitivity analysis of individual treatment effects: a robust conformal inference approach. PNAS. arXiv:2111.12161: coverage that holds under a bounded amount of unmeasured confounding.

9 · Survival analysis and time-to-event

Time-to-event data are censored, but a guaranteed lower bound on survival time is exactly a one-sided coverage statement, and that is what a clinician or a reliability engineer actually acts on. Conformal methods deliver it distribution-free even when follow-up is incomplete, turning a calibrated prognosis into a statement you can stand behind.

Candès, Lei & Ren (2023). Conformalized survival analysis. JRSS-B. arXiv:2103.09763: calibrated lower predictive bounds on survival time under censoring.
Gui, Hore, Ren & Barber (2024). Conformalized survival analysis with adaptive cut-offs. Biometrika. arXiv:2211.01227: tighter bounds via covariate-adaptive calibration.

10 · Medical imaging and diagnostics

Clinical decision support is a textbook fit: the deliverable is a set or a flag, and what matters is a controlled error rate, not a sharper score. Return the grades or diagnoses consistent with the image at a stated confidence, route the high-uncertainty cases to a clinician, and the coverage guarantee bounds how often the truth is missed, even on scans from a new scanner or site. It also gives fairness a precise handle: equalised coverage across patient subgroups is a coverage statement, not a vague aspiration.

Olsson, Kartasalo, Mulliqi et al. (2022). Estimating diagnostic uncertainty in AI-assisted pathology using conformal prediction. Nature Communications. nature.com: flags unreliable prostate-biopsy gradings, cutting errors sharply on out-of-distribution slides.
Lu, Angelopoulos & Pomerantz (2022). Improving trustworthiness of AI disease-severity rating with ordinal conformal prediction sets. MICCAI. arXiv:2207.02238: guaranteed-coverage ordinal sets for spinal-stenosis grading on MRI, flagging the uncertain cases.
Lu, Lemay, Chang, Höbel & Kalpathy-Cramer (2022). Fair conformal predictors for applications in medical imaging. AAAI. arXiv:2109.04392: equalised coverage across patient skin tones for skin-lesion classification.
Sreenivasan, Vaivade, Noui et al. (2025). Conformal prediction enables disease-course prediction and individualized diagnostic uncertainty in multiple sclerosis. npj Digital Medicine. nature.com: per-patient prediction sets at a stated confidence from electronic health records.

Across all of these the marginal coverage guarantee is not a consolation prize; it is the whole point. The skill is recognizing which column your problem is in. When the answer you need is a set with a controlled error rate, reach for conformal prediction without hesitation; when you need a sharp distribution, that work belongs to the model, and conformal can certify it afterward.

← Back to the demonstrations Methods & theory →

Using conformal prediction in your own project? Tell Claude: “Read https://conformalprediction.net/SKILL.md and create a project skill from it.” It adds a check for whether your coverage is conditionally trustworthy.