Computational data analysis

Modern astronomy is, increasingly, a data analysis problem. Surveys now deliver spectra and photometry for hundreds of millions of stars, and the bottleneck is rarely the telescope — it's our ability to model that data faithfully, at scale, with honest uncertainties.

A lot of my research is methodological: building models and inference machinery that are fast enough to run on enormous datasets, but principled enough that we can trust what comes out. That spans data-driven models (learning the mapping from data to physical labels), careful likelihood-based inference, and the unglamorous-but-essential work of making methods numerically stable and reproducible.

Themes I keep coming back to:

Scalability without compromise — methods that work on millions of objects but still respect the physics and the noise model.
Honest uncertainties — propagating what we don't know, not just reporting what we do.
Open, reusable software — most of these ideas only matter if other people can pick them up and use them.

Because these tools are general, this work has let me contribute across a surprisingly wide range of domains — from stellar spectroscopy to gravitational waves.

Note

This is a short overview that I am still expanding.

Last updated on 20 June, 2026

Stellar astrophysics Astrometry