ßÉßɱ¬ÁÏ

This website stores cookies on your computer. These cookies are used to collect information about how you interact with our website and allow us to remember your browser. We use this information to improve and customize your browsing experience, for analytics and metrics about our visitors both on this website and other media, and for marketing purposes. By using this website, you accept and agree to be bound by UVic’s Terms of Use and Protection of Privacy Policy. If you do not agree to the above, you must not use this website.

Skip to main content

Aydan McKay

  • BSc (ßÉßɱ¬ÁÏ, 2022)
Notice of the Final Oral Examination for the Degree of Master of Science

Topic

Extending the Reach of Gaia with Masked Stellar Autoencoders

Department of Physics and Astronomy

Date & location

  • Friday, August 15, 2025
  • 8:00 A.M.
  • Clearihue Building, Room B017

Examining Committee

Supervisory Committee

  • Dr. Sébastien Fabbro, Department of Physics and Astronomy, ßÉßɱ¬ÁÏ (Co-Supervisor)
  • Dr. Kim Venn, Department of Physics and Astronomy, UVic (Co-Supervisor)

External Examiner

  • Dr. Guillaume Thomas, Instituto de Astrofísica de Canarias

Chair of Oral Examination

  • Dr. Daler Rakhmatov, Department of Electrical and Computer Engineering, UVic

Abstract

I present the Masked Stellar Autoencoder, a new data-driven holistic stellar model for Galactic archaeology. The MSA is trained using the complete Gaia DR3 XP spectra catalogue by implementing a self-supervised masking algorithm to enforce the learning of the relationships within the data itself. Photometry from six additional surveys spanning optical and infrared wavelengths are integrated into the dataset, making the model robust to missing spectroscopic and photometric data. This allows the embeddings to retain accuracy beyond the depth of the XP spectra. The model was first pretrained on the ∼220 million stars from Gaia DR3 with photometry for the purpose of reconstructing the information. I then demonstrate the informative embeddings produced by this astronomical foundation model with the predictive task of deriving atmospheric parameters and stellar ages using high-resolution spectroscopic surveys (APOGEE, GALAH). The model achieved mean absolute errors of 92 K in 𝑇eff, 0.08 dex in log 𝑔, and 0.09 dex in [Fe/H], demonstrating its competitive position with XGBoost and transformer-based models trained with APOGEE labels. Furthermore, the model achieved mean absolute errors of 0.05 dex in [𝛼/Fe] and 1.3 Gyr in age, with only marginal increases in metrics when missing XP spectra. The MSA also predicts errors for the stellar parameters, which were shown to be largely representative of the predicted values, with slight underconfidence in the width of the asymmetric errors. The change in the accuracy of the predictions with pretraining dataset size was examined, and the model was leveraged to predict stellar parameters for a subset of open clusters and dwarf galaxies Leo I and Fornax. These estimates displayed a potential improvement in parallax measurements at higher distances and crowded regions. This model effectively bridges the gap between spectroscopic and photometric samples within a single, consistent framework, poised to improve with the inclusion of additional photometric surveys and upcoming Gaia releases.