| Issue | A&A Volume 707, March 2026 |
|---|---|
| Article Number | A206 |
| Number of page(s) | 19 |
| Section | Numerical methods and codes |
| DOI | https://doi.org/10.1051/0004-6361/202556748 |
| Published online | 13 March 2026 |
Automated model selection for the spectral fitting of large samples of active galactic nucleus spectra
1 Instituto de Física y Astronomía, Universidad de Valparaíso, Gran Bretaña 1111, Valparaíso, Chile
2 Millennium Nucleus on Transversal Research and Technology to Explore Supermassive Black Holes (TITANS), Chile
3 European Southern Observatory, Karl-Schwarzschild-Str. 2, 85748 Garching, Germany
4 Instituto de Alta Investigación, Universidad de Tarapacá, Casilla 7D, Arica, Chile
5 Departamento de Astronomía, Universidad de Chile, Camino el Observatorio 1515, Santiago, Chile
6 Astronomy Department, Universidad de Concepción, Barrio Universitario S/N, Concepción 4030000, Chile
7 Millennium Institute of Astrophysics (MAS), Nuncio Monseñor Sótero Sanz 100, Providencia, Santiago, Chile
8 Kapteyn Astronomical Institute, University of Groningen, 9700 AV Groningen, The Netherlands
Received: 4 August 2025
Accepted: 29 January 2026
Abstract
Aims. We developed an algorithm to automatically recommend and select the best model to fit active galactic nucleus (AGN) spectra in the ultraviolet/optical wavelength range. The algorithm improves the efficiency of fitting large samples of AGN spectra by replacing the visual inspection and manual selection of the best model.
Methods. We employed the Penalized PiXel-Fitting (pPXF) software to fit AGN spectra using a complete model that includes narrow and broad emission lines (NELs and BELs), Balmer continuum, Balmer high-order emission lines (H8-H50), FeII pseudo-continuum, AGN continuum, and stellar populations for objects with z < 1; we call this model-1. The fit residuals were analyzed using the discrete wavelet transform (DWT), looking for deviations in the DWT coefficients above an empirically determined threshold value. When deviations were detected in regions of interest of a spectrum (i.e., around Hα, Hβ, MgII, CIV, and [OIII]λ4959, 5007), the significance of the residuals and the kinematics were used to recommend (or not) the addition of an extra fitting component to the model. When a new model was recommended, we compared the new fit and the previous one using the root-mean-square (RMS) difference and an F-test. The final outputs of the algorithm are the selected best-fit model and the corresponding fit results. We validated the results of the algorithm using a sample of 800 SDSS AGN spectra. Each object was fitted using model-1 and the results were visually inspected by three human validators. The validators also recommended (or not) the addition of the same additional components that the algorithm is equipped to recommend.
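The residual analysis and model comparison described above can be illustrated with a minimal sketch. This is not the authors' implementation: the Haar wavelet, the MAD-based scatter estimate, the default threshold of 5, and the function names `haar_step`, `flag_deviations`, `rms`, and `f_statistic` are all assumptions made for illustration, since the abstract does not specify the wavelet family, the threshold values, or the exact form of the F-test. The F statistic shown is the standard one for nested models.

```python
import math

def haar_step(signal):
    """One level of the orthonormal Haar DWT: returns (approximation, detail)."""
    values = list(signal)
    if len(values) % 2:
        values.append(values[-1])  # pad odd-length input by repeating the last sample
    s = 1.0 / math.sqrt(2.0)
    approx = [(values[i] + values[i + 1]) * s for i in range(0, len(values), 2)]
    detail = [(values[i] - values[i + 1]) * s for i in range(0, len(values), 2)]
    return approx, detail

def flag_deviations(residuals, levels=3, threshold=5.0):
    """Return (level, first_pixel, last_pixel) for every detail coefficient whose
    magnitude exceeds `threshold` times a robust scatter estimate at that level.
    The threshold and the scatter estimate are illustrative assumptions."""
    n = len(residuals)
    flagged = []
    approx = list(residuals)
    for level in range(1, levels + 1):
        if len(approx) < 2:
            break
        approx, detail = haar_step(approx)
        magnitudes = sorted(abs(d) for d in detail)
        # Upper median of |detail|, scaled as for a MAD-based sigma.
        sigma = magnitudes[len(magnitudes) // 2] / 0.6745
        width = 2 ** level  # pixels spanned by one coefficient at this level
        for k, d in enumerate(detail):
            if abs(d) > threshold * sigma:
                flagged.append((level, k * width, min((k + 1) * width, n) - 1))
    return flagged

def rms(residuals):
    """Root-mean-square of a residual vector."""
    return math.sqrt(sum(r * r for r in residuals) / len(residuals))

def f_statistic(chi2_simple, dof_simple, chi2_complex, dof_complex):
    """F statistic for nested models; the complex model has fewer degrees of freedom."""
    extra_params = dof_simple - dof_complex
    return ((chi2_simple - chi2_complex) / extra_params) / (chi2_complex / dof_complex)

if __name__ == "__main__":
    # Synthetic residuals: low-amplitude ripple plus one unmodeled spike at pixel 31.
    residuals = [0.1 * math.sin(1.7 * i) for i in range(64)]
    residuals[31] += 5.0
    print(flag_deviations(residuals))
    print(f_statistic(120.0, 100, 90.0, 97))
```

In practice the flagged pixel windows would be intersected with the regions of interest around the emission lines, and the F statistic would be converted to a p-value, for example with `scipy.stats.f.sf(F, dof_simple - dof_complex, dof_complex)`.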
Results. When the recommendations of each validator are compared with those of the algorithm, the median coincidence fraction is 0.83-0.88, depending on the threshold value. These values are comparable to the median coincidence fraction of 0.90 between human validators. The computational time of the model recommendation routine was <1 s for 90% of the objects, compared to ≈60 s for visual inspection during the validation exercise.
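As an illustration of the agreement metric, the sketch below computes a coincidence fraction as the share of objects for which two sets of recommendations agree, and then takes the median over validators. The abstract does not give the exact definition used in the paper, so this definition, the function names `coincidence_fraction` and `median_coincidence`, and the toy data are assumptions.

```python
from statistics import median

def coincidence_fraction(first, second):
    """Fraction of objects for which two recommendation lists agree."""
    if len(first) != len(second) or not first:
        raise ValueError("recommendation lists must be non-empty and of equal length")
    return sum(a == b for a, b in zip(first, second)) / len(first)

def median_coincidence(algorithm, validators):
    """Median over validators of the algorithm-versus-validator coincidence fraction."""
    return median(coincidence_fraction(algorithm, v) for v in validators)

if __name__ == "__main__":
    # Toy data: True means "add an extra component" for that object.
    algorithm = [True, False, True, True]
    validators = [
        [True, False, True, False],
        [True, False, True, True],
        [False, False, True, True],
    ]
    print(median_coincidence(algorithm, validators))
```

The same `coincidence_fraction` function can be applied to pairs of validators to obtain the human-versus-human baseline.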
Conclusions. The algorithm is an efficient and effective tool for the spectral fitting of large samples of AGN spectra, with room for improvement and for adaptation to specific studies.
Key words: methods: data analysis / techniques: spectroscopic / galaxies: active
© The Authors 2026
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model.