| Issue |
A&A
Volume 708, April 2026
|
|
|---|---|---|
| Article Number | A23 | |
| Number of page(s) | 15 | |
| Section | Galactic structure, stellar clusters and populations | |
| DOI | https://doi.org/10.1051/0004-6361/202558282 | |
| Published online | 27 March 2026 | |
Detection of hot subdwarf binaries and He-poor hot subdwarf stars using machine learning methods and a large sample of Gaia XP spectra
1
Institute of Theoretical Physics and Astronomy, Vilnius University,
Sauletekio av. 3,
10257
Vilnius,
Lithuania
2
Centro de Astrobiología (CSIC-INTA),
Camino Bajo del Castillo s/n,
28692
Villanueva de la Cañada,
Madrid,
Spain
3
Applied Physics Department, Universidade de Vigo,
Campus Lagoas-Marcosende, s/n,
36310
Vigo,
Spain
4
Centro de Investigación Mariña, Universidade de Vigo,
CIM/GEOMA,
36310
Vigo,
Spain
5
Universidade da Coruña (UDC), Department of Computer Science and Information Technologies,
Campus de Elviña s/n,
15071
A Coruña,
Galiza,
Spain
6
IES de Beade, Conseller iá de Educación e Ordenación Universitaria,
Camino do Outeiro 10,
36312
Vigo,
Spain
7
Institute of Applied Mathematics, Vilnius University,
24 Naugarduko st.,
03225
Vilnius,
Lithuania
8
CIGUS CITIC – Department of Computer Science and Information Technologies, University of A Coruña,
s/n,
15071
A Coruña,
Spain
9
CIGUS CITIC – Department of Nautical Sciences and Marine Engineering, University of A Coruña,
Paseo de Ronda 51,
15011
A Coruña,
Spain
10
INAF – Osservatorio Astrofisico di Arcetri,
Largo E. Fermi 5,
50125
Firenze,
Italy
★ Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
27
November
2025
Accepted:
25
January
2026
Abstract
Context. Hot subdwarfs (hot sds) are compact evolved stars near the extreme horizontal branch, and they are key to understanding stellar evolution and the UV excess in galaxies. In this work, we extend our previous analysis of Gaia XP spectra of hot sd stars to a much larger sample, enabling a comprehensive study of their physical and binary properties.
Aims. Our goal is to identify patterns in Gaia XP spectra, investigate binarity, and assess the influence of parameters such as temperature, helium abundance, and variability. We analysed ~20 000 hot sd candidates selected from the literature, combining Gaia XP data with published parameters.
Methods. We applied the uniform manifold approximation and projection technique to the XP coefficients, which represent the Gaia XP spectra in a compact feature-based form, to construct a similarity map. We then used self-organising maps and convolutional neural networks (CNNs) to classify spectra as binaries or singles and as cool/He-poor or hot/He-rich. The spectra were normalised using asymmetric least squares baseline fitting to emphasise individual spectral features.
Results. We found that the BP-RP colour dominates the similarity map, with additional influence from temperature, helium abundance, and variability. Most binaries, which were identified via the Virtual Observatory SED analyser, cluster in two filaments linked to main sequence companions. The CNN classification suggests a strong correlation between variability and binarity, with binary fractions exceeding 60% for active hot sds.
Conclusions. The Gaia XP spectra combined with dimensionality reduction and machine learning effectively revealed patterns in hot sd properties. Our findings indicate that binarity and environmental density strongly shape the evolutionary paths of hot subdwarfs. We identified possible contamination by main sequence and cataclysmic variable stars in our base sample.
Key words: methods: data analysis / techniques: spectroscopic / binaries: general / stars: early-type / subdwarfs / Galaxy: stellar content
© The Authors 2026
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. This email address is being protected from spambots. You need JavaScript enabled to view it. to support open access publication.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.