| Issue |
A&A
Volume 709, May 2026
|
|
|---|---|---|
| Article Number | A241 | |
| Number of page(s) | 20 | |
| Section | Numerical methods and codes | |
| DOI | https://doi.org/10.1051/0004-6361/202557763 | |
| Published online | 19 May 2026 | |
QUEST: Quasar unsupervised encoder and synthesis tool
A machine-learning framework for generating quasar spectra
1
Hamburger Sternwarte, Universität Hamburg,
Gojenbergsweg 112,
21029
Hamburg,
Germany
2
INAF – Osservatorio Astronomico di Trieste,
Via G.B. Tiepolo, 11,
34143
Trieste,
Italy
3
Department of Astronomy, University of Geneva,
Chemin Pegasi 51,
1290
Versoix,
Switzerland
4
Leiden Observatory, Leiden University,
PO Box 9513,
2300 RA
Leiden,
The Netherlands
5
Department of Physics, University of California,
Santa Barbara,
CA
93106,
USA
6
Institute for Theoretical Physics, Heidelberg University,
Philosophenweg 12,
69120
Heidelberg,
Germany
7
Max-Planck-Institut für Astronomie,
Königstuhl 17,
69117
Heidelberg,
Germany
★ Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
20
October
2025
Accepted:
26
February
2026
Abstract
Context. Quasars at the redshift frontier (z > 7.0) are fundamental probes of black hole growth and evolution, but are notoriously difficult to identify. At these redshifts, machine-learning-based selection methods have proven to be efficient, but require appropriate training sets to reach their full potential.
Aims. We present the variational auto-encoder QUEST, which can generate realistic quasar spectra that can be post-processed to generate synthetic photometry and impute spectra.
Methods. We started from the SDSS DR16Q catalogue, pre-processed the spectra, and vetted the sample to obtain a clean dataset. After training the model, we investigated the properties of its latent space to understand whether it has learnt the relevant physics. Furthermore, we provide a pipeline for generating photometry from the sampled spectra, which we compared with actual quasar photometry, and we show the capabilities of the model in reconstructing and extending quasar spectra.
Results. The trained network faithfully reproduces the input spectrum in terms of sample median and variance. By examining the latent space, we found correlations with continuum and bolometric luminosity, black hole mass, redshift, continuum slope, and emission line properties, among others. When we used the network to generate photometry, the results agreed very well with those from the control sample. The model provides satisfactory results in reconstructing emission lines: estimates of the black hole mass from the reconstructed spectra agree well with those from the original SDSS spectra. Furthermore, when spectra with broad absorption line features were reconstructed, the model successfully interpolated over the absorption systems. Compared with previous work, the spectra sampled from our model and the output of their results agree very well. However, QUEST does not require any ad hoc tuning and is capable of reproducing the full variety of spectra available in the training set.
Key words: methods: data analysis / methods: statistical / techniques: photometric / surveys / quasars: general
© The Authors 2026
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. This email address is being protected from spambots. You need JavaScript enabled to view it. to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.