You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

PI:  John F. Wu (website)
Group Website: ISM*@ST
Project Duration: 6-12 months, can evolve into thesis project

Project Abstract: The goal is to design and optimize a neural network capable of generating realistic galaxy images conditioned on their properties such as stellar masses, star formation rates, metallicities, ionization parameters, etc. The project will build on traditional techniques used for inferring galaxy properties, such as spectral energy distribution fitting, and novel deep learning approaches, such as conditional de-noising diffusion models. This work will serve as the basis for a Hierarchical Bayesian model that will jointly infer galaxy properties using imaging data and photometric catalogs.

Background: As galaxies grow and evolve, they leave signatures of their evolution on their spectra as well as their appearances. Galaxy spectra are often observed in several broadband filters, and the total amount of light across the galaxies at each wavelength is measured as a flux in each band. Astronomers have devoted great care to modeling synthetic stellar populations in order to match the observed fluxes across the electromagnetic spectrum; this technique is called spectral energy distribution (SED) fitting (e.g. Conroy 2013). However, the appearances of galaxies are often reduced to just a few morphological parameters (such as concentration, asymmetry, smoothness, Gini coefficients) or human-made classifications, and none of these capture the richness of morphological detail that we readily see in galaxy images.

Wu's research group and collaborators have already exploited survey imaging data in order to study a variety of topics related to galaxy evolution. These include estimating galaxy gas-phase metallicities (Wu & Boada 2019), predicting neutral hydrogen gas content (Wu 2020), separating AGN and star-forming systems (Holwerda, Wu, et al. 2021; Guo, Wu, & Sharon 2022), identifying rare nearby dwarf satellite galaxies (Wu et al. 2022; Darragh-Ford, Wu et al. in prep), and even predicting the entire optical spectrum of galaxies (Wu & Peek 2020, see below). In each case, deep convolutional neural networks play a key role in representing the morphological information found in galaxy images.

Figure 1, from Wu & Peek (2020). Examples of galaxy image cutouts with corresponding observed spectra (black) and CNN-predicted spectra (red).


The current project will use cutting-edge machine learning methods to generate images of galaxies, conditioned on their physical properties. The ultimate goal is to infer galaxy properties using images and photometric catalogs simultaneously.

Student work:


  • No labels