About me

I'm an Assistant Professor of AI and Decision-making in the MIT EECS department. I work to understand uncertainty and reliable decision-making with data. In particular, I develop tools for statistical inference with AI models, data impacted by strategic behavior, and settings with distribution shift. In addition, I work on applications in the life sciences and sustainability.

Previously, I was a postdoctoral researcher with Michael I. Jordan in the UC Berkeley Statistics and EECS departments. I completed my Ph.D. in the Stanford Department of Statistics advised by Emmanuel Candès, where I was awarded the Theodore W. Anderson Theory of Statistics Dissertation Award. Before my Ph.D., I studied statistics and mathematics at Harvard University, and spent a year teaching mathematics at NYU Shanghai. Outside research, I enjoy triathlons, sailing, hiking, and reading speculative fiction novels.

Research themes

I believe that the conceptual, algorithmic, and mathematical advances enable us to use data and AI models to better understand complex patterns in the physical and social world and to build reliable automated systems. To this end, I focus on developing statistical principles and formal frameworks to understand challenging types of data that are increasingly important. In particular, I work on

Uncertainty and statistical inference with AI systems. AI models based on deep neural networks are increasingly used in real-world systems. Their use is motivated by the fact that they have the best performance with high-dimensional data, such as image and natural language data. However, the standard statistical toolbox does not apply here; users seeking assurances about the reliability of these models, such as confidence intervals on predictions or bounds on the false discovery rate across multiple decisions, are left with little recourse based on the existing literature. I seek to build out a rich statistical toolbox for AI models, so that researchers can use these powerful systems while remaining on solid statistical ground. My work in this theme builds on core statistical techniques such as resampling methods, multiple hypothesis testing, and empirical process theory. [e.g., 1, 2, 3, 4]

Statistical foundations for agents. Data emerging from systems with human and algorithmic decision-makers is increasingly important, and the possible strategic behavior raises new inferential challenges. For example, profit-sensitive pharmaceutical companies sponsor clinical trials — which are then analyzed according to some statistical protocol — and are heavily rewarded for drugs that are approved. Similarly, AI agents increasingly interact with the real world, taking actions and soliciting information from other agents, each with their own utility function. I am developing statistical principles and methods for such settings, drawing on concepts from decision theory, game theory, and statistics. [e.g., 1, 2, 3, 4]

Shifting distributions and feedback loops. More broadly, data are increasingly collected from dynamic environments with shifting distributions, and these shifts can be caused by changes made to the system or policy. I work to extend statistical methods in such non-I.I.D. settings. For example, consider protein design, where the analyst has access to some set of proteins and an associated fitness score. The goal is to design a new protein that has higher fitness than those seen previously. The analyst might fit a model predicting fitness from protein structure, and then chooses a good candidate protein to synthesize and measure the fitness of in a wet-lab experiment. This process is repeated several times, so there is a feedback loop; the model the analyst fits affects the subsequent data collection. Such, non-I.I.D. settings with shifting distributions are increasingly relevant to modern data analysis, and it is essential to create techniques to address this. [e.g., 1, 2, 3]

I'm especially interested in applications in the life sciences and sustainability.

News

In spring 2026 I'll be teaching special topics course 6.S986 Uncertainty Quantification with AI.

I'll be teaching 6.7730 Modern Mathematical Statistics in fall 2025.

I'm co-organizing the 2025 ICLR workshop Quantify Uncertainty and Hallucination in Foundation Models: The Next Frontier in Reliable AI.

Our book draft of Theoretical Foundations of Conformal Prediction is up! Comments welcome.

I'm co-organizing the 2024 NeurIPS Workshop on Statistical Frontiers in LLMs and Foundation Models.

Postdoc position available: I have an opening for a postdoc to join my research group to work on statistical uncertainty quantification with AI and/or AI for science.

Postdoc position available: I have an opening for a postdoc to join my research group together with Martin Wainwright's group to work on statistical inference with multiple strategic agents.

I'll be teaching 6.S951 Modern Mathematical Statistics in fall 2024.

PhD positions available: I have openings for PhD students to join my research group. If you're at MIT, send me an email and we can find a time to talk about research. If you're not yet at MIT, consider applying to the EECS PhD program and mention me in your application.

How to use AI for discovery — without leading science astray: press about our recent work on how to rigorously use AI predictions as part of scientific analysis.

Putting clear bounds on uncertainty: press about our recent work on uncertainty quantification with generative models.

I'm co-organizing the 2022 ICML Workshop on Distribution-free Uncertainty Quantification, which will take place on Saturday, July 23, 2022.

Check out our new tutorial A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification with the accompanying video:

I'm co-organizing the 2021 ICML Workshop on Distribution-free Uncertainty Quantification, which will take place on Saturday, July 24, 2021.

Select recent papers

“Prediction-Powered Inference with Imputed Covariates and Nonuniform Sampling”: D. Kluger, K. Lu, T. Zrnic, S. Wang, S. Bates.
arXiv preprint, 2025.
[arXiv] [code] [bibtex]

“Sharp Results for Hypothesis Testing with Risk-Sensitive Agents”: F. C. Shi, S. Bates, M. J. Wainwright.
arXiv preprint, 2024.
[arXiv] [bibtex]

“Incentive-Theoretic Bayesian Inference for Collaborative Science”: S. Bates, M. I. Jordan, M.Sklar, J. A. Soloff.
arXiv preprint, 2023.
[arXiv] [bibtex]

“Prediction-Powered Inference”: A. N. Angelopoulos, S. Bates, C. Fannjiang, M. I. Jordan, and T. Zrnic.
Science, 2023.
[arXiv] [free journal version] [journal] [code] [bibtex]

“Conformal Risk Control”: A. N. Angelopoulos, S. Bates, A. Fisch, L. Lei, T. Schuster.
ICLR, 2024. (spotlight presentation)
[arXiv] [code] [bibtex]

“Testing for Outliers with Conformal p-values”: S. Bates, E. Candès, L. Lei, Y. Romano, and M. Sesia.
Annals of Statistics, 2023.
[arXiv] [journal] [code] [bibtex]

“Distribution-Free, Risk-Controlling Prediction Sets”: S. Bates, A. Angelopoulos, L. Lei, J. Malik, and M. I. Jordan.
Journal of the ACM, 2021.
[arXiv] [journal] [video] [blog] [code] [bibtex]

“Cross-validation: what does it estimate and how well does it do it?”: S. Bates, T. Hastie, and R. Tibshirani.
Journal of the American Statistical Association (JASA), 2023.
[arXiv] [journal] [code] [bibtex]

“Causal Inference in Genetic Trio Studies”: S. Bates, M. Sesia, C. Sabatti, and E. Candès.
Proceedings of the National Academy of Sciences of the USA (PNAS), 2020.
[arXiv] [journal] [video] [tutorials+code] [bibtex]
*Selected as a cover article and for invited commentary.