Bruno Régaldo-Saint Blancard, Ph.D.

I am a Research Fellow in the Center for Computational Mathematics at the Flatiron Institute in New York, where I work at the interface between (astro)physics and data science. I develop statistical methods for astrophysics, cosmology, and beyond using signal processing and machine learning. I tackle various problems including generative modeling, inference, denoising, and source separation.

These problems naturally emerged from my applied research in modeling interstellar dust emission, analyzing cosmic microwave background data, and studying galaxy clustering (as part of the SimBIG collaboration).

Lately, I have been particularly focused on deep generative models and their application to scientific endeavors. I am also actively involved in the Polymathic AI initiative, which aims to leverage these models for the development of foundation models for science.

Before my fellowship, I earned a Ph.D. in Astrophysics in 2021 from the École Normale Supérieure, Paris. Prior to that, I graduated from the Ecole Polytechnique (X2014) and obtained a Master's degree in Astrophysics from the Observatoire de Paris. Download my CV.

Research Highlights

Bayesian Blind Denoising with Gibbs Diffusion
February 2024

Blind denoising problems are not exclusive to natural image processing; they are also prevalent in many scientific applications where the noise distribution is unknown or hard to model. In our new preprint, we introduce GDiff, a novel solution to blind denoising in a fully Bayesian context. By combining Gibbs sampling and a diffusion model, we build a rigorous method to sample the posterior distribution of the signal and the noise parameters for any kind of diffusion-based signal prior!

We show that GDiff is directly relevant to the analysis of cosmic microwave background (CMB) data, by taking an original view on the problem of separating the CMB from its foregrounds. Have you ever thought of the CMB as the noise of a blind denoising problem, and the foregrounds as the signal? From that perspective, we show that GDiff can directly separate dust and CMB while solving cosmological inference at the same time! Stay tuned for future applications to observational data!

Update 05/24: Accepted at ICML 2024!

Removing Dust from CMB Observations with Diffusion Models
October 2023

Diffusion models have revolutionized the modeling of natural images. Can they also help us to analyze cosmic microwave background (CMB) data? Thanks to my talented intern David Heurtel-Depeiges, and the collaboration of Blaskeley Burkhart and Ruben Ohana, we make a first demonstration of the potential of diffusion models for the separation of Galactic dust and CMB. We show that dust+CMB observations can be seen as the result of a diffusion process that can be reversed in time, thus naturally solving source separation.

We are already working on the next step: a diffusion-based approach for cosmological inference. Stay tuned!

Update 11/23: Spotlight talk at ML4PS NeurIPS 2023 Workshop!

Stacking for Simulation-Based Inference
October 2023

With simulation-based inference, it is typical to end up with a multitude of models/approximations of the same target posterior distribution. This usually results from the investigation of different inference algorithms, different architectures, or can simply be due to the randomness of initialization and stochastic gradients. While most practitioners usually choose to select the best of their models, with Yuling Yao and Justin Domke, we show that there is much better to do, and it's called stacking. We show that models can all be combined at once in a systematic way to improve precision, calibration, coverage, and bias at the same time. Check out our new preprint on Simulation-Based Stacking!

Update 01/24: Accepted at AISTATS 2024!

SimBIG Collaboration: Second Wave of Papers
October 2023

We are taking simulation-based inference for the analysis of galaxy clustering to the next level with our second release of papers! We now explore galaxy clustering data through the lenses of the wavelet scattering transform, convolutional networks, and bispectrum statistics. For each of these, we get new cosmological constraints leveraging non-linear information from the data. Check out our new website for more information!

With Michael Eickenberg, we led the wavelet scattering transform (WST) analysis. The WST statistics capture a wealth of non-Gaussian information from the data improving constraints on cosmological parameters. However, we show in our paper that these statistics might be too rich as they can also capture unrealistic specifics of the forward models, raising model misspecifications issues when applied to observational data. Our next challenge will be to address this in detail!

Update 02/24: Accepted in PRD!

Polymathic AI and Multiple Physics Pretraining
October 2023

I am lucky to be part of the amazing Polymathic AI initiative which aims to create a foundation model for advancing scientific discovery. We recently released a series of paper, check out our blog to find out about it!

In particular, in a project led by Michael McCabe, we introduce “Multiple Physics Pretraining”, an autoregressive task-agnostic pretraining approach for physical surrogate modeling. In this paper, we notably show that a single transformer model trained on a broad range of physical tasks can perform better than task-specific models on a variety of downstream applications.

Statistical Component Separation for Targeted Signal Recovery in Noisy Mixtures
June 2023

In a 2021 paper, we had introduced a new algorithm to separate astrophysical signals with very distinctive statistical natures. Since then, this method has found interest in various astrophysical applications such as the denoising of dust emission maps, the separation of dust and CIB, or the removal of glitches in seismic data from the InSight Mars mission. With Michael Eickenberg, we now explore some mathematical aspects of this method and provide first denoising benchmarks in our new preprint.

Update 02/24: Published in TMLR!

SimBIG: Simulation-Based Inference of Galaxies
November 2022

Glad to announce the release of the two first papers of the SimBIG collaboration (led by ChangHoon Hahn): letter, mock challenge. The SimBIG framework enables the analysis of cosmological information from galaxy surveys on small nonlinear scales using simulation-based inference. It relies on the SimBIG forward model, which connects the cosmological parameters to realistic mock galaxy surveys. Take a look at how this model compares to BOSS data!

Update 10/23: Published in PNAS and JCAP!

Generative Models of Multi-frequency Dust Emission Maps
August 2022

Check out our recent paper, where we use the Wavelet Phase Harmonic statistics to build generative models of multi-frequency dust emission maps from a single example. Want to try this on your own data? Take a look at the code associated with the paper.

Update 01/23: Published in the Astrophysical Journal!

Wavelet Moments for Cosmological Parameter Estimation
April 2022

I was recently involved in Eickenberg et al. paper, which introduced a new set of wavelet statistics, called "Wavelet Moments", to extract non-Gaussian information from 3D cosmological fields. Fisher forecasts based on the Quijote simulations show that these statistics improve constraints on the cosmological parameters by a factor 5 to 10 with respect to the power spectrum baseline.

Ph.D. Thesis: Statistical Modeling of the Polarized Emission of Interstellar Dust
November 2021

I conducted my Ph.D. research at the LPENS, École Normale Supérieure, Paris, under the supervision of François Levrier and François Boulanger. My work was motivated by challenges in analyzing cosmic microwave background (CMB) data. I focused on the statistical modeling of one of the CMB foregrounds, namely the emission of interstellar dust. These foregrounds constitute major obstacles for the next generation of CMB experiments. I developed data-driven models using the wavelet scattering transform — a technique closely related to the mathematics of convolutional neural networks. You can learn more about this in my Ph.D. thesis.

Selected Papers

  1. D. Heurtel-Depeiges, C. C. Margossian, R. Ohana & B. Régaldo-Saint Blancard; Listening to the Noise: Blind Denoising with Gibbs Diffusion; ICML (2024). ArXiv
  2. D. Heurtel-Depeiges, B. Burkhart, R. Ohana & B. Régaldo-Saint Blancard; Removing Dust from CMB Observations with Diffusion Models; ML4PS Workshop at NeurIPS (2023). ArXiv
  3. Y. Yao, B. Régaldo-Saint Blancard & J. Domke; Simulation Based Stacking; AISTATS (2024). ArXiv
  4. B. Régaldo-Saint Blancard, C. Hahn, S. Ho, J. Hou, P. Lemos, E. Massara, C. Modi, A. Moradinezhad Dizgah, L. Parker, Y. Yao & M. Eickenberg; SimBIG: Galaxy Clustering Analysis with the Wavelet Scattering Transform; Physical Review D (2024). ArXiv DOI
  5. C. Hahn, P. Lemos, L. Parker, B. Régaldo-Saint Blancard, M. Eickenberg, S. Ho, J. Hou, E. Massara, C. Modi, A. Moradinezhad Dizgah & D. Spergel; SimBIG: The First Cosmological Constraints from Non-Gaussian and Non-Linear Galaxy Clustering; (2023) ArXiv
  6. M. McCabe, B. Régaldo-Saint Blancard, L. Holden Parker, R. Ohana, M. Cranmer, A. Bietti, M. Eickenberg, S. Golkar, G. Krawezik, F. Lanusse, M. Pettee, T. Tesileanu, K. Cho & S. Ho; Multiple Physics Pretraining for Physical Surrogate Models; AI4Science Workshop at NeurIPS - Oral & Best Paper Award (2023) ArXiv
  7. B. Régaldo-Saint Blancard & M. Eickenberg; Statistical Component Separation for Targeted Signal Recovery in Noisy Mixtures; Transactions on Machine Learning Research (2024). ArXiv
  8. C. Hahn, M. Eickenberg, S. Ho, J. Hou, P. Lemos, E. Massara, C. Modi, A. Moradinezhad Dizgah, B. Régaldo-Saint Blancard & M. Abidi; SimBIG: A Forward Modeling Approach To Analyzing Galaxy Clustering; Proceedings on National Academy of Sciences (2023). ArXivDOI
  9. B. Régaldo-Saint Blancard, E. Allys, C. Auclair, F. Boulanger, M. Eickenberg, F. Levrier, L. Vacher & S. Zhang; Generative Models of Multi-channel Data from a Single Example - Application to Dust Emission; The Astrophysical Journal (2023). ArXivDOI
  10. M. Eickenberg, E. Allys, A. Moradinezhad Dizgah, P. Lemos, E. Massara, M. Abidi, C. Hahn, S. Hassan, B. Régaldo-Saint Blancard, S. Ho, S. Mallat, J. Anden & F. Villaescusa-Navarro; Wavelet Moments for Cosmological Parameter Estimation; ArXiv
  11. N. Jeffrey, F. Boulanger, B. D. Wandelt, B. Regaldo-Saint Blancard, E. Allys & F. Levrier; Single frequency CMB B-mode inference with realistic foregrounds from a single training image; Monthly Notices of the Royal Astronomical Society: Letters (2021). ArXiv DOI
  12. B. Regaldo-Saint Blancard, E. Allys, F. Boulanger, F. Levrier & N. Jeffrey; A new approach for the statistical denoising of Planck interstellar dust polarization data; Astronomy & Astrophysics: Letters (2021). ArXiv DOI
  13. B. Regaldo-Saint Blancard, S. Codis, J. R. Bond & G. Stein; Statistical exploration of halo anisotropic clustering and intrinsic alignments with the mass-Peak Patch algorithm; Monthly Notices of the Royal Astronomical Society (2021). ArXiv DOI
  14. B. Regaldo-Saint Blancard, F. Levrier, E. Allys, E. Bellomi & F. Boulanger; Statistical description of dust polarized emission from the diffuse interstellar medium - A RWST approach; Astronomy & Astrophysics (2020). ArXiv DOI
  15. E. Allys, F. Levrier, S. Zhang, C. Colling, B. Regaldo-Saint Blancard, F. Boulanger, P. Hennebelle & S. Mallat; The RWST, a comprehensive statistical description of the non-Gaussian structures in the ISM; Astronomy & Astrophysics (2019). ArXiv DOI

Software

I am committed to promoting open-source practices to facilitate the reproducibility of my research. You can find all the open-source projects I have been involved in on my GitHub page. I have also developed a few Python packages that you might find useful for your research:

Name Description
PyWST Python package for the statistical analysis of 2D data with the (Reduced) Wavelet Scattering Transform.
PyWPH Python package for GPU-accelerated computations of the Wavelet Phase Harmonic statistics from 2D data.
GalacticWavelets Python package for GPU-accelerated computations of Wavelet Scattering Statistics for 3D fields and Galaxy Surveys.

Teaching

  • 2018 - 2021: Teaching assistant at the École Normale Supérieure, Paris, for the course "Numerical methods for differential equations in Physics" (Master's level, faculty: L. Tuckerman). Exercises.
  • 2019 - 2021: Lecturer at the École Normale Supérieure, Paris, for the course "Physique pour Tous" ("Physics for All") intended for a broad non-scientific audience.
  • 2014 - 2015: Educational coordinator for homework assistance program at Association Le Rocher (primary and secondary levels).

Talks

I try to keep track of some of my past talks in my CV. Fun fact, I gave a TEDx talk during my Ph.D. on the topic "Un Univers sans limite ?" (in French, at Pôle Universitaire Léonard de Vinci, Paris-La Défense). You can watch it here!

Contact