| Issue |
A&A
Volume 708, April 2026
|
|
|---|---|---|
| Article Number | A64 | |
| Number of page(s) | 14 | |
| Section | Extragalactic astronomy | |
| DOI | https://doi.org/10.1051/0004-6361/202556354 | |
| Published online | 26 March 2026 | |
An automated activity classification tool for optical galaxy spectra
1
Physics Department, and Institute of Theoretical and Computational Physics, University of Crete, 71003 Heraklion, Greece
2
Institute of Astrophysics, Foundation for Research and Technology-Hellas, 71110 Heraklion, Greece
3
Center for Astrophysics | Harvard & Smithsonian, 60 Garden St., Cambridge, MA 02138, USA
4
Astronomical Institute, Academy of Sciences, Boční II 1401, CZ-14131 Prague, Czech Republic
5
ALMA Sistemi Srl, Guidonia, Rome 00012, Italy
6
Quantum Innovation Pc, Chania 73100, Greece
★ Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
10
July
2025
Accepted:
23
February
2026
Abstract
Context. Reliable and versatile galaxy activity diagnostic tools are indispensable for comprehending the physical processes that drive galaxy evolution. Traditional methodologies frequently necessitate extensive preprocessing, such as starlight subtraction and emission-line deblending (e.g., Hα and [N II]), which can introduce substantial biases and uncertainties due to their model-dependent nature. Additionally, numerous diagnostics omit the inclusion of dormant (passive) galaxies.
Aims. This work aims to develop a reliable, automated, and efficient diagnostic tool capable of distinguishing among star-forming galaxies, active galactic nuclei (AGNs), low-ionization nuclear emission-line regions (LINERs), composite and passive galaxies under one unified scheme.
Methods. We developed a diagnostic tool based on a support vector machine trained on ground-truth data originating from optical emission-line ratios and color selection criteria. Building upon previous literature findings and exploring various combinations of discriminatory feature schemes, we identified the equivalent widths (EWs) of Hβ, [O III] λ5007, and Hα+[N II] λλ6548,84 as key discriminatory features. Additionally, galaxies classified as AGNs can be distinguished as broad- or narrow-line AGNs by measuring the full quarter at the half-maximum of the Hα and [N II] complex.
Results. Employing machine-learning algorithms and three EWs directly measured from the galaxy’s optical spectrum, we have developed a diagnostic tool that encompasses all potential activities of galaxies while simultaneously achieving high-performance scores across all of them. Our diagnostic achieves overall accuracy of ∼83% and recall of ∼79% for star-forming galaxies, ∼94% for AGN, ∼85% for LINERs, ∼77% for composite galaxies, and ∼96% for passive galaxies.
Conclusions. Our diagnostic tool offers significant improvements over the existing galaxy activity diagnostics as it can be applied to large numbers of spectra, eliminates the need for preprocessing (i.e., starlight subtraction or flux calibration) and spectral-line deblending, encompasses all activity classes under one unified scheme, and offers the ability to distinguish between the two main types of AGN. In addition, the omission of starlight subtraction was not found to significantly reduce the diagnostic’s performance. Furthermore, the narrow wavelength range required for its application enables its use over a wide range of redshifts, making it highly relevant to activity studies of high-redshift galaxies.
Key words: galaxies: active / galaxies: formation / galaxies: Seyfert / galaxies: statistics
© The Authors 2026
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. This email address is being protected from spambots. You need JavaScript enabled to view it. to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.