Detection of hot subdwarf binaries and He-poor hot subdwarf stars using machine learning methods and a large sample of Gaia XP spectra

M. Ambrosch; C. Viscasillas Vázquez; E. Solano; A. Ulla; X. Pérez-Couto; E. Pérez-Fernández; A. Medžiūnas; M. Manteiga; C. Dafonte; A. Drazdauskas; L. Magrini; Š. Mikolaitis; V. Šatas

doi:10.1051/0004-6361/202558282

Open Access

Issue		A&A Volume 708, April 2026


Article Number		A23
Number of page(s)		15
Section		Galactic structure, stellar clusters and populations
DOI		https://doi.org/10.1051/0004-6361/202558282
Published online		27 March 2026

A&A, 708, A23 (2026)

Detection of hot subdwarf binaries and He-poor hot subdwarf stars using machine learning methods and a large sample of Gaia XP spectra

M. Ambrosch¹^★, C. Viscasillas Vázquez¹, E. Solano², A. Ulla³^,4, X. Pérez-Couto⁵^,8, E. Pérez-Fernández³^,6, A. Medžiūnas⁷, M. Manteiga⁹, C. Dafonte⁸, A. Drazdauskas¹, L. Magrini¹⁰, Š. Mikolaitis¹ and V. Šatas¹

¹ Institute of Theoretical Physics and Astronomy, Vilnius University, Sauletekio av. 3, 10257 Vilnius, Lithuania
² Centro de Astrobiología (CSIC-INTA), Camino Bajo del Castillo s/n, 28692 Villanueva de la Cañada, Madrid, Spain
³ Applied Physics Department, Universidade de Vigo, Campus Lagoas-Marcosende, s/n, 36310 Vigo, Spain
⁴ Centro de Investigación Mariña, Universidade de Vigo, CIM/GEOMA, 36310 Vigo, Spain
⁵ Universidade da Coruña (UDC), Department of Computer Science and Information Technologies, Campus de Elviña s/n, 15071 A Coruña, Galiza, Spain
⁶ IES de Beade, Conseller iá de Educación e Ordenación Universitaria, Camino do Outeiro 10, 36312 Vigo, Spain
⁷ Institute of Applied Mathematics, Vilnius University, 24 Naugarduko st., 03225 Vilnius, Lithuania
⁸ CIGUS CITIC – Department of Computer Science and Information Technologies, University of A Coruña, s/n, 15071 A Coruña, Spain
⁹ CIGUS CITIC – Department of Nautical Sciences and Marine Engineering, University of A Coruña, Paseo de Ronda 51, 15011 A Coruña, Spain
¹⁰ INAF – Osservatorio Astrofisico di Arcetri, Largo E. Fermi 5, 50125 Firenze, Italy

^★ Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.

Received: 27 November 2025
Accepted: 25 January 2026

Abstract

Context. Hot subdwarfs (hot sds) are compact evolved stars near the extreme horizontal branch, and they are key to understanding stellar evolution and the UV excess in galaxies. In this work, we extend our previous analysis of Gaia XP spectra of hot sd stars to a much larger sample, enabling a comprehensive study of their physical and binary properties.

Aims. Our goal is to identify patterns in Gaia XP spectra, investigate binarity, and assess the influence of parameters such as temperature, helium abundance, and variability. We analysed ~20 000 hot sd candidates selected from the literature, combining Gaia XP data with published parameters.

Methods. We applied the uniform manifold approximation and projection technique to the XP coefficients, which represent the Gaia XP spectra in a compact feature-based form, to construct a similarity map. We then used self-organising maps and convolutional neural networks (CNNs) to classify spectra as binaries or singles and as cool/He-poor or hot/He-rich. The spectra were normalised using asymmetric least squares baseline fitting to emphasise individual spectral features.

Results. We found that the BP-RP colour dominates the similarity map, with additional influence from temperature, helium abundance, and variability. Most binaries, which were identified via the Virtual Observatory SED analyser, cluster in two filaments linked to main sequence companions. The CNN classification suggests a strong correlation between variability and binarity, with binary fractions exceeding 60% for active hot sds.

Conclusions. The Gaia XP spectra combined with dimensionality reduction and machine learning effectively revealed patterns in hot sd properties. Our findings indicate that binarity and environmental density strongly shape the evolutionary paths of hot subdwarfs. We identified possible contamination by main sequence and cataclysmic variable stars in our base sample.

Key words: methods: data analysis / techniques: spectroscopic / binaries: general / stars: early-type / subdwarfs / Galaxy: stellar content

© The Authors 2026

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. This email address is being protected from spambots. You need JavaScript enabled to view it. to support open access publication.

1 Introduction

Hot subdwarfs (hot sds) are compact evolved stars located on or near the extreme horizontal branch of the Hertzsprung-Russell diagram. They are characterised by their high temperatures (20 000-50 000 K), low masses (∼0.5 M_⊙), and hydrogendeficient atmospheres. This unusual combination of parameters makes them crucial for understanding stellar evolution. Hot sds play a key role in explaining the ultraviolet excess observed in elliptical galaxies and the bulges of spiral galaxies, and they provide insights into binary interactions such as mass transfer and common-envelope ejection. For a detailed discussion on the properties, formation channels, and observational significance of hot sds, we refer to the introduction in Viscasillas Vázquez et al. (2024, hereafter Paper I) and references therein.

Binary interaction is thought to play an important role in the evolution of hot sds, as they are found in binaries with a variety of companions, most commonly low-mass main sequence (MS) stars and white dwarfs (WDs; e.g. Pelisoli et al. 2021) but also brown dwarfs (BDs; e.g. Schaffenroth et al. 2021) and Be stars (Mourard et al. 2015; Castañón Esteban et al. 2024, e.g.). Composite hot sds+MS systems with F-K type companions in wide orbits constitute a well-defined population (Vos et al. 2018, e.g.) and are the focus of this work. More rarely, hot sd systems have been proposed to host compact companions. While theoretical studies have explored systems with neutron-star companions (Wu et al. 2018, e.g.), observational evidence remains scarce, with only two hot sd systems proposed as neutron-star candidates (Mereghetti et al. 2021; Geier et al. 2023); however, it is unclear whether the compact companions in these systems are neutron stars or massive WDs. Claims of black hole companions have also been reported (Geier et al. 2010, e.g.), although their nature remains debated. In contrast, close hot sds+WD and hot sds+dM/BD binaries typically have orbital periods from tens of minutes to several days and generally do not show composite spectra. These systems reveal their binarity through photometric variability, such as eclipses, reflection effects, or ellipsoidal modulation, but they are too close to exhibit astrometric wobble or increased Gaia renormalised unit weight error (RUWE) values. The composite hot sds+MS binaries considered here are wide systems with orbital periods of a few hundred days. Some of them show astrometric wobble and elevated RUWE values, but they usually do not display binary-induced photometric variability. Instead, their variability is dominated by He-poor hot subdwarf (sdB) pulsations or weak rotational modulation of the cool companion (Pelisoli et al. 2020). Overall, both the relative frequencies of the different binary systems and their impact on hot sd evolution remain uncertain.

Astrophysics is one of the most data-intensive sciences, with missions such as Gaia producing petabytes of information. Adopting advanced techniques for large-scale data analysis is essential for analysing these vast amounts of data. Initiatives such as the Virtual Observatory have been crucial in addressing challenges in massive data analysis. Oreiro et al. (2011) developed a Virtual Observatory-based procedure to identify previously uncatalogued hot sds while minimising contamination from other stellar types. Pérez-Fernández et al. (2016) refined this method, successfully identifying new hot sds using hot sd spectra. Solano et al. (2022) used the Virtual Observatory SED Analyzer (VOSA) tool (Bayo et al. 2008) to detect binary systems involving hot sds. VOSA fits stellar models to observed spectral energy distributions (SEDs) to determine physical parameters of the observed object. Observations that reqire more than one stellar model for a good fit are classified as multiple systems by VOSA. The binary hot sds that we consider in this study show composite SEDs, with a flux excess in the red part of the spectrum. This excess is caused by the cooler companion star. Paper I introduced an advanced classification framework for hot sd binaries using AI techniques and Gaia DR3 data. Their methodology combines supervised and unsupervised machine learning approaches, including support vector machines, self-organising maps (SOMs), convolutional neural networks (CNNs), the uniform manifold approximation and projection (UMAP) technique, and, for the first time for this purpose, the cosine similarity metric. Applying these techniques to 2815 hot subdwarfs with Gaia DR3 BP/RP spectra, they achieved high classification accuracy of their sample.

Other studies have also enhanced the classification of hot sds through the use of advanced techniques. Bu et al. (2019) and Tan et al. (2022) demonstrated the effectiveness of CNN and hybrid models for spectral analysis, achieving high precision in the identification of new hot sd candidates. The same or similar techniques have also been shown to be effective in classifying other classes of dwarf stars. Thus, most recently, Kao et al. (2024) used Gaia DR3 BP/RP spectra and the UMAP technique to analyse ∼96 000 WDs, identifying polluted WDs with multiple atmospheric metals. Similarly, Pérez-Couto et al. (2024) used SOMs to identify new polluted WD candidates among ∼66 000 WDs, showcasing their ability to detect heavy elements such as Ca, Mg, Na, and K in cool WDs. Focusing on hot sds, Zhang et al. (2025) developed the Hot Subdwarf Detector (HsdDet), a multiscale object automated detection algorithm applied to images from the Sloan Digital Sky Survey identifying nearly 30 000 hot sd candidates. In addition, Zou & Lei (2024) developed an Morgan-Keenan-like spectral classification system for ∼1200 hot sds using LAMOST DR9 spectra, categorising stars based on helium content and spectral characteristics. Meanwhile, Tahir et al. (2024) employed machine learning methods, specifically kernel support vector machines, to classify ∼11 000 B-type and ∼2400 hot sds. Their analysis revealed that the linear kernel support vector machine achieved the highest accuracy, demonstrating the potential of spectrum-based classification to improve stellar identification. Similar techniques were also effective in detecting the variability of hot sds. Ranaivomanana et al. (2025) used machine learning algorithms to analyse the variability of ∼1500 candidate hot sds using Gaia DR3 and TESS data. Through dimensionality reduction techniques, the authors identified new hot sd variables as well as potential new cataclysmic variables, demonstrating the effectiveness of their approach for classifying variable stars in large surveys. All of these studies collectively highlight that machine learning techniques have become indispensable in the new era of large-scale data analysis in astronomy. They also emphasise the need to refine the techniques while identifying the most appropriate methods for each particular scientific case.

The most extensive and up to date catalogue of hot sds was compiled by Culpan et al. (2022), who identified ∼62 000 hot subluminous stars from Gaia EDR3, representing a substantial improvement in detection in crowded Galactic regions and near the Magellanic Clouds. Building on this new catalogue, in this paper we expand on and further develop the classification techniques introduced in Paper I. With this approach, we aim to refine our understanding of hot sd populations and enhance the analysis of their binary nature and astrophysical properties.

This paper is structured as follows. In Sect. 2, we describe our dataset and the relation between XP coefficients and stellar parameters. Section 3 presents the detection of binary hot subdwarf systems using XP coefficients. In Sect. 4, we normalise the XP spectra to emphasise individual spectral features and classify objects as cool/sdB or hot/He-rich. Section 5 provides a statistical analysis of the SOM and CNN training results, while Sect. 6 presents the class predictions for all 20 061 objects. In the same section, we investigate correlations between binarity and stellar variability, including contamination by cataclysmic variable stars.

2 Data

The basis of our sample is the list of ∼62000 hot sds from Culpan et al. (2022). There, the targets were selected by combining multi-band photometry and astrometric measurements from Gaia EDR3. For a subset of their total sample, Culpan et al. (2022) provide spectroscopically determined effective temperatures (T_eff) and helium abundances (log(Y)). These parameters were compiled from the literature.

As Gaia’s most recent data release (Gaia Collaboration 2023, DR3) has made abundantly clear, Gaia is not only a photometric and astrometric mission, but also a spectroscopic one. Its Radial Velocity Spectrometer (Recio-Blanco et al. 2023, RVS) provides medium-resolution (R∼11500) spectra in the near-infrared Ca II triplet region (845-872 nm), while its photometric instrument includes two slitless prisms, the blue and bed photometers (BP and RP), which disperse light over the ranges 330-680 nm and 640-1050 nm, respectively (Andrae et al. 2023). These BP and RP data, collectively referred to as low resolution (R∼50) XP spectra, were obtained simultaneously with the astrometric and photometric observations for an unprecedented number of stars, ensuring an exceptional level of internal consistency and calibration. Gaia DR3 released about 220 million mean XP spectra and 1 million mean RVS spectra, together with astrophysical parameters derived for hundreds of millions of sources. The agreement of Gaia’s XP-based parameters with ground-based surveys such as LAMOST (Zhang et al. 2023) and of its RVS results with Gaia-ESO (Van der Swaelmen et al. 2024) demonstrates the reliability of Gaia’s spectroscopic data, extending the mission’s impact well beyond astrometry.

In this work, we combine Gaia photometric, astrometric and low-resolution spectroscopic information. Although Gaia’s RVS spectra offer higher resolution than the XP data, only a small fraction (29 sources) of the ∼62 000 hot sds from Culpan et al. (2022) have RVS spectra available. This very limited overlap precludes the use of RVS data for any statistically meaningful analysis. We therefore rely exclusively on Gaia XP spectra, which provide homogeneous spectroscopic coverage for a substantial part of the sample. Our full dataset consists of XP coefficients of 20 061 hot sds which are listed in Culpan et al. (2022). These XP coefficients are the coefficients of 110 basis functions, which in combination represent the full XP spectra (in standard flux versus wavelength format, Carrasco et al. 2021). The spectra cover the wavelength range from 330 to 1050 nm, in bins of 2 nm. In Sections 2.1 and 3 we use the 110 XP coefficients to construct a similarity map of our objects and to detect binary systems in our sample. Following Pérez-Couto et al. (2024), we normalise the coefficients of every hot sds by dividing them by their L2 norm¹. The L2 norm, also known as Euclidean norm, is a measure for the length of a vector. Normalising the training data ensures that every input feature is treated with equal importance during the training process. Normalisation also increases the numerical stability and improves convergence of the training optimisation process (Kim et al. 2025). In Sect. 4, we use the spectra in their flux-wavelength format.

2.1 Similarity map of Gaia XP coefficients

As a first step, we analysed the similarities and differences between the XP coefficients of our sample hot sds by using the UMAP technique (McInnes et al. 2018). The UMAP method is designed to reduce high-dimensional datasets to lower dimensions while keeping the internal relations between data points intact. In our case, the high-dimensional data are the 20 061 sets of XP coefficients with 110 dimensions each (one dimension for every coefficient). To visualise this data, we reduce this set down to two dimensions with UMAP. The resulting similarity map is shown in Fig. 1. In this map, every set of XP coefficients is represented by one point, and the distance between two points depends on the similarity between the represented coefficient sets. The UMAP algorithm places more similar sets closer together, while different sets are placed further apart in the map. We see that our map is not uniform, but separated into different regions. Apart from the main body, which contains most of the stars in our sample, there are two detached islands (A and B) and two filaments F1 and F2.

To investigate the physical differences between the stars in these regions, we colour-code the similarity map by six different stellar parameters from Culpan et al. (2022). The six panels in Fig. 2 show the same map with added colour dimensions. Colour-coding the points reveals correlations between certain areas of the map and different stellar parameters. This is expected since the spectra, and therefore the XP coefficients of an observed star depend on the star’s physical properties. Colour-coding by Gaia BP-RP magnitudes, for example, shows that the similarity map is dominated by the colour-index: The coefficients of the bluest stars on the left side of the similarity map are most different from those of the red stars at the right edge of the map. A smooth colour gradient connects stars in those two extremes. This property is also apparent when we colour-code the processed spectra themselves by the Gaia BP-RP magnitudes (Fig. A.1). Stars with red colours show a higher flux towards longer wavelengths than blue stars. As a consequence of this behaviour, the four UMAP groups A, B, F1, and F2 occupy different areas of the colour-magnitude diagram (Figure A.2).

The distribution of absolute Gaia G magnitudes is more uniform across the whole similarity map. However, the island B, at the bottom of the map, is brighter than the remaining regions. The average G magnitude of island B is 2.65 mag, approximately 2 mag brighter than the average of the other regions (4.68 mag).

The panels in the middle row of Fig. 2 show that the UMAP method can separate the XP coefficients of hot, He-rich stars from the cooler ones with lower He abundance. This correlation between stellar effective temperature and atmospheric He abundance in hot sd stars has been observed before (for example, Németh et al. 2012). In our dataset, the correlation coefficient between T_eff and log(Y) is 0.51. The bottom row of Fig. 2 shows the colour-coding according to the renormalised unit weight error (RUWE) and excess flux error. The RUWE parameter is a dimensionless measure for how much the centre of an object moves between different observations, (Lindegren et al. 2021) and is therefore a measure of astrometric variability. The excess flux error (Gentile Fusillo et al. 2021) is an indicator for the photometric variability of a source. Large values of the excess error indicate an unusually high scatter of measured brightness between different observations, when compared to the brightness scatter of other objects with similar absolute magnitude, colour, and number of observations. A high value of the excess flux error therefore suggests that the measured brightness variations are intrinsic to the object itself, and not resulting from observational errors. The excess flux errors used in this study have been calculated by Culpan et al. (2022) for about 60% of the hot sds in our sample. In Sect. 6.1, we investigate the connection between the RUWE and excess flux errors, as indicators of stellar variability, and the hot sd binarity in our sample.

Almost all stars that have high values for both RUWE and the excess flux error are found in the island A of our similarity map. The island A itself is split between a half that shows indications of variability (right) and an invariable half (left). The filament F2 contains most of the remaining hot sds with large photometric variability.

Fig. 1

UMAP of the 20 061 sets of XP coefficients. The main body of the map is coloured in grey, separate regions are kept in black and labelled A, B, F1, and F2.

Fig. 2

UMAP of the 20061 sets of XP coefficients. Every panel shows the same UMAP but colour-coded by different parameters. Grey points represent spectra for which the respective parameter is not available.

2.2 Island A: Contamination by objects that are not hot subdwarfs

None of the objects in island A have previously been studied in Solano et al. (2022) or in Paper I. Furthermore, none of the references cited in Culpan et al. (2022) report any T_eff or log(Y) values for these stars. To better understand the nature of the island A objects, we analysed several of them with the VOSA tool. Out of the total 561 objects in island A, 58 satisfy the criteria for a reliable VOSA fit. These criteria demand the presence of photometric data in both GALEX (Galaxy Evolution Explorer; Martin et al. 2005) FUV and NUV filters, as well as coverage in the 3D extinction maps provided by Lallement et al. (2022).

The SEDs of all 58 objects can be fitted with models for cool MS stars, specifically BT-Settl models with T_eff ranging from 4000 to 10 000 K (Allard et al. 2012). Four of these objects can be well fitted by a single BT-Settl model, meaning that they are most likely single stars. The remaining 54 require a combination of two models to achieve a good fit and are therefore classified as binaries by VOSA. All 58 objects are too cool to be hot sds, which typically have effective temperatures exceeding 19 000 K. Examples of VOSA fits of a typical binary hot sd and a cooler sample object from island A are shown in Fig. A.5.

We also conducted a visual inspection of the SEDs of the island A objects. While confirmed hot sds exhibit a characteristic negative slope in the UV region, the island A objects display a positive slope. Figure 3 presents the average XP spectra of the island A objects alongside the average spectra of the remaining sample. The most striking difference is the elevated flux in the island A spectra, which is approximately 0.1 units higher than the mean flux of the other spectra. This flux difference remains nearly constant at wavelengths beyond 500 nm.

In the range of ~400—420 nm, the continuum of the mean island A spectrum remains flat (the absorption feature at 410 nm is a hydrogen Balmer line). In contrast, the mean flux of the nonisland A spectra decreases by ~0.1 over the same 20 nm range. This flux drop is the typical negative slope in the blue part of the spectrum seen in hot objects (for example, hot hds). The shallower slope at short wavelengths, combined with the increased flux at longer wavelengths, suggests that the island A objects are cooler than the remaining sample.

This inspection of the XP spectra supports the VOSA analysis of the 54 binary candidates in island A, confirming that they are too cool to be hot sds. We see in the colour-magnidute diagram in Fig. A.2, that island A contains many of the reddest objects in our sample. They lie close to the cut-off line which was defined by Culpan et al. (2022) to separate the hot sd candidates from the MS. This, together with the low temperatures, makes it very likely that most of the island A objects are simply MS stars. However, a detailed investigation into the nature of island A lies beyond the scope of this work.

This analysis demonstrates that similarity maps are a powerful tool for identifying objects that are physically distinct from the main population of our sample. In our reported results (Sect. 6), we include a flag to identify the island A objects.

Fig. 3

Average of Gaia XP spectra of all objects in island A (black) and average XP spectrum of all other samples that are not in island A (grey). The insert panel highlights the region from 400 to 550 nm.

3 Detection of binary hot sd systems based on XP coefficients

Our training set for the binary detection consists of 2695 hot sds, of which 466 (17%) have been classified as binary by VOSA. The distribution of the training set samples in the similarity map is shown in Fig. 4. When we compare the binary distribution in the map to the top left panel of Fig. 2, we see that the main difference between the filament binaries and the non-filament binaries is the BP-RP colour index. For 29 of the VOSA binaries, properties of their companion star have been measured by Solano et al. (2022). This allows us to investigate how the filament binaries differ from the non-filament ones. Of the 29 binaries with companion information, 22 are in one of the two filaments. The remaining seven objects are scattered across the rest of the map. The companion effective temperatures and radii are not significantly different in the two groups. However, the filament binary companions are more luminous. We performed a Mann-Whitney U test to decide if the difference between the luminosity distributions of the two groups is statistically significant. This test is suitable for small samples, for which it is not clear if they are normally distributed. The Mann-Whitney U p-value for our samples is ∼0.01. This is smaller than the critical p-value of 0.05, which confirms the difference between the two groups. The mean bolometric luminosity (L_bol) of the filament companions is equal to 0.44 L_⊙ and they contribute on average ∼3.3% to the total luminosity of the binary system. The non-filament companions have a mean L_bol of 0.18 L_⊙, which amounts to less than 1% of the total system flux, on average. Figure A.3 shows the contribution of the companion to the total bolometric luminosity of the binary system across our similarity map. For ∼131 of our filament binaries, Culpan et al. (2022) report the spectral class of the companion star. All except one of the filament companions are MS stars, either labelled as hot sds +MS, or more specifically, +F, +G, or +K. The remaining companion is a WD. Main sequence companions are also found outside the filaments. However, most of the non-filament companions (64%) are either WDs or low mass M-dwarfs (hot sds+dM). Three of the non-filament companions are BDs (sds +BD). These faint companions do not significantly contribute to the total luminosity, and therefore to the spectral energy distribution of their binary systems. This suggests that the binaries located on the left-hand side of the UMAP diagram are systems in which the contribution of the secondary component is very small, which explains the overlap and confusion between single and binary hot sds in this region. For a more detailed classification of our sample into single and binary hot sd systems, we used two machine learning methods: SOM and CNN. The class predictions by each method for our full sample of 20 061 objects is presented in Sect. 6.

3.1 Binary detection with SOM

We leveraged the SOM algorithm to further explore the different populations of our dataset. The main advantage of the SOM with respect to other unsupervised machine learning techniques is that the SOM is both a dimensionality reduction algorithm and a clustering technique. Indeed, the SOM projects a highdimensional non-linear dataset (such as the XP coefficients of our sources) in a 2D map, preserving the topology order of the feature space, such as the UMAP. However, in the SOM, the 2D map is a finite grid of neurons so that similar elements fall in the same neuron, and neurons with similar elements are placed adjacent to each other. In this manner, we can perform statistics and study each neuron separately, assigning labels (classes) to each neuron according to the predominant class from the labelled data that fall into it. To implement it, we used the MiniSom² (Vettigli 2018). After some hyperparameter tuning we chose a 8 × 8 map size, σ = 1.4, learning rate 0.5, and the Euclidean distance as metric. The number of iterations was 5000. To classify a neuron z(i,j) (and thus the sources contained inside it) with a label X, we required the neuron to contain at least ten labelled sources to have enough statistical power. Any neuron that did not meet that condition we considered an outlier and was not used for classification.

We applied the SOM to the XP coefficients of our sample of ~20000 hot sds. As a result, we present in Fig. 5 the SOM map with colour-coded binarity. We used the training set of ~2700 hot sds with single/binary flag from VOSA for the colours. Sources with unknown binarity are plotted with transparent colour for better visualisation. As can be seen, there are six neurons in the lower left part of the SOM with an evident representation (P_binary > 0.90) of binary hot sds, encompassing a total of 302 hot sds binary candidates, while the rest of the known hot sd binaries are uniformly mixed with single hot sds. Eleven neurons are classified as outliers, and therefore, 38 neurons are ultimately used for the classification.

Fig. 4

Same UMAP as in Fig. 1 but colour-coded by VOSA binarity. Our new dataset has 2695 samples in common with the set from Paper I.

3.2 Binary detection with CNN

Before training our CNN, we created a test set by randomly selecting 1000 of the total 2695 training samples. This test set is not involved in the network training process. It is instead used to assess how well the trained CNN performs on previously unseen data. For the training, the remaining 1695 samples were split into a validation set of 500 samples and the preliminary training set of 1195 samples. We train our CNN until its performance on the validation set does not increase any more. This prevents our CNN from overfitting to the training data. We improve upon our classification model from Paper I by addressing the singlebinary class imbalance in the training set and by fine-tuning the network training process.

Our preliminary training set of 1195 samples is imbalanced, with only 17% belonging to the binary class. To counter this imbalance during the network training, we applied two different methods. First, we supplemented our training set with synthetic binary samples using the synthetic minority over-sampling technique (SMOTE, Lemaître et al. 2017). Using SMOTE, we generated artificial sets of binary samples by interpolating between existing ones. With this technique, we roughly doubled the number of binaries in our training set to a final binary ratio of 33%. Adding even more synthetic binaries to the training set worsens the performance of the trained CNN on the test set. The final training set thus contains 1477 samples, with 492 belonging to the binary class and 985 to the single class. We did not add any artificial samples to the validation and test sets. To counter the remaining class imbalance in the training set, we applied different loss weight factors to the binary and single samples (0.66 for binary samples, 0.33 for single samples). In this way, the network will be penalized more when it misclassifies binaries. This caused the network to pay more attention to the binary samples.

Due to the small number of samples in our training set, combined with the uneven distribution of binaries in the similarity map, it is possible that random sampling splits our data into non-uniform training and validation sets. For example, if a large majority of the selected training set binaries is located in the filament F1 of the similarity map, then the CNN cannot learn to accurately predict labels for the main body of the map. To minimise the effects of this possible imbalance, we trained an ensemble of ten CNNs. We create new validation and training sets before each of the training runs. After all CNNs have been trained, we inspect their performances on the test set. We keep the five CNNs with the highest binary accuracy scores and discard the remaining five. The binary probabilities that we provide in Sect. 6 are the average predicted values from the best five networks.

Fig. 5

SOM map of the XP coefficients with known hot sds colour-coded according to their binarity.

4 Detection of cool, He-poor hot subdwarfs based on normalised XP spectra

In Sect. 2.1, we observed that the position of a hot sd in the coefficient similarity map primarily depends on its BP-RP colour. The same is true when we generate a similarity map for the XP spectra of our sample. This is because the overall shape of a spectrum is dominated by its flux distribution at longer wavelengths, and therefore by the colour of the star. The more subtle features of our spectra are overwhelmed by this dominance of the overall flux distribution. Therefore, the individual features play a less significant role in the grouping of stars in the similarity map. To remove the influence of the intrinsic spectral flux distribution, we normalise our spectra to the continuum. We determine the continuum baselines using the asymmetric least squares smoothing method Eilers & Boelens (2005). By dividing each spectrum by its continuum baseline, we obtain the normalised spectra. We repeat the same UMAP analysis as in Sect. 2.1, this time with the normalised XP spectra. The resulting similarity map is shown in Fig. 6. The colour-coding confirms that the normalisation has reduced the BP-RP colour’s influence on the overall shape of the spectra. This allows individual absorption features to become more prominent in the total spectral shape. The middle panel of Fig. 6 shows that the He-rich hot sds are grouped closer together in this similarity map than they are in the map of the raw XP coefficients. We also observe that the normalised XP spectra of high-temperature hot sds occupy the same space in the map as the He-rich objects. This indicates that in our dataset both of these parameters have the same effect on the normalised spectra.

To see the range of spectral shapes for samples with different He abundances, we plot the mean normalised spectra of all He-rich and He-poor stars in the top panel of Fig. 7. We chose the threshold between He-rich and He-poor hot sds to be log(Y) = −1. The choice of this threshold value is motivated by the He abundance classification scheme for hot sds by Luo et al. (2021). We see that the two categories differ most strongly at the short wavelength part of the spectral range. The He-poor sample shows stronger absorption features than the He-rich sample. This is counter-intuitive because more helium in a stellar atmosphere is expected to absorb more light and therefore cause stronger spectral absorption features. The wavelengths with the largest differences between mean He-rich and He-poor spectra are 410, 434, and 486 nm. These are not the locations of He absorption lines, but the wavelengths of the hydrogen Balmer lines Hδ, Hγ, and Hß. At the low resolution of the XP spectra, the weaker He-absorption features are fully blended with the much stronger hydrogen absorption lines. In the bottom panel of Fig. 7 we observe that the effective temperature has the same effect on the spectral shape as the He abundance. The hydrogen Balmer lines of the hottest stars, with T_eff > 35 000 K, are weaker than those in the cooler spectra. With rising T_eff, a larger fraction of the hydrogen atoms in the stellar atmospheres becomes ionised and can therefore no longer absorb. This results in weaker H-absorption features in hotter stars. We can see from this comparison of both panels in Fig. 7 that our XP spectra do not allow us to disentangle effects of the He-abundance from those of the effective temperature. Therefore, we separate our sample spectra into a sample that is both hot and He-rich (weak absorption features), and a cooler He-poor class (strong absorption features). More than 90% of the samples in the cool/He-poor group are classified as sdB stars in Culpan et al. (2022). We therefore label the samples in this class as ‘sdB’. We define sdB samples as those objects with T_eff <35 000 K and log(Y) < -1. The hotter and more He-rich group is labelled as ‘hot/He-rich’. The majority of stars in this class belong to one of the hot sds subtypes sdO, He-sdO, or He-sdB. As in Sect. 3, we train our SOM and CNN tools to classify our sample objects into sdB stars and hot/He-rich sds. The classification is based on the normalised XP spectra. The results for the whole sample are presented in Sect. 6.

Fig. 6

UMAP of 20 023 normalised XP spectra. All panels show the same UMAP but colour-coded by different parameters. Grey points represent spectra for which the respective parameter is not available.

Fig. 7

Normalised XP spectra. Top panel: mean normalised spectra of He-poor (red) and He-rich (blue) hot sds. Bottom panel: mean normalised spectra of cool (red) and hot (blue) hot sds. The locations of several relevant absorption lines are indicated in the bottom panel. Their wavelengths are 410, 434, and 486 nm (HI); 420, 454, 469 nm (HeII), and 588 nm (HeI). In both panels, the mean spectra are plotted together with 300 randomly selected normalised spectra from our full sample (black).

Fig. 8

SOM map of the XP spectra, normalised to the continuum, with known hot sds colour-coded according to whether they are He-rich or He-poor hot sds.

4.1 Detection of sdB stars with SOM

To use the SOM to select between hot/He-rich and sdB spectra, we repeat the steps in the previous section but using the spectra normalised to the continuum instead of the XP coefficients. As done with the UMAP, the aim of this is to put the focus in the absorption lines instead of the continuum shape. We also normalise the input vector by the L₂ norm and use the ~2000 sources from Culpan et al. (2022) with available T_eff and log Y to colour-code the resulting SOM with hot/He-rich and sdB labels. The result is presented in Fig. 8 and shows that there are two neurons, z_(0,1) and z_(1,0) (row 0 and column 1, and row 1 and column 0, respectively), with P_hot/He-rich > 0.5, while 15 are sdB neurons, and the remaining 32 are outliers.

Fig. 9

Helium abundances log(Y) relative to the effective temperatures T_eff of the stars in the test set. The colour scale shows the probability of belonging to the sdB class (cool/He-poor), as predicted by our CNN ensemble. The dashed lines show our defined class boundaries of T_eff = 35 000 K and log(Y) = −1.0 dex.

4.2 Detection of sdB stars with CNN

Analogous to the search for binary hot sds in Sect. 3.2, we now use CNNs to classify the normalised XP spectra into sdB and hot/He-rich. Of our total 20 074 sample spectra, 1959 have both T_eff and log(Y) available in the data of Culpan et al. (2022). Of these, 250 spectra were assigned to each the test set and validation set. The training set then contains 1459 spectra. The fraction of sdB spectra in the training set is 54%, the rest is classified as hot/He-rich. As for the binary detection task, we train an ensemble of ten CNNs and keep the five with the best performance on the test set. The reported results are the average predictions from the five best CNNs. Since our sdB and hot/He-rich classes are nearly balanced, we do not apply SMOTE sampling or class loss weights in the training phase.

Network gradients show which parts of the input spectra have the most influence on the CNN output. Figure A.4 shows the network gradients for the class sdB, together with average spectra of the two classes in our training set. The gradients have the deepest negative peaks at the positions of the hydrogen absorption lines. Negative sdB gradients indicate that a decrease of the flux (that is, deeper absorption) around these wavelengths increases the CNN probability for the class sdB. We already observed in Fig. 7 that the difference between our two classes is largest at the hydrogen absorption lines. The network gradients therefore show that our CNN can separate sdB from hot/He-rich samples in a physically meaningful and intuitive way.

As the training result, we show the CNN predictions for the test set in Fig. 9. We see that the CNN ensemble predicts high probabilities for belonging to the class sdB for stars that have temperatures and helium abundances within our set thresholds. Close to the class boundaries, the network classification is less certain for several spectra. The probabilities for these samples is closer to 50%. However, the cooler, He-poor stars are clearly not classified as hot/He-rich by our network ensemble. Given the correlation between T_eff and He-abundance in O and B stars, we have no cool stars with high He-abundances in our sample (neither in the test set nor in the training or validation sets). Therefore, the separation between our two classes appears to be based on the temperature alone. Still, as discussed above, we cannot distinguish between the effects of temperature and He-abundance in our set of XP spectra.

5 Statistical analysis of the SOM and CNN training results

Since both CNN and SOM models are probabilistic binary classifiers, there are three main metrics one has to use when evaluating model performance. These are sensitivity (TPR), specificity (TNR) and precision (PPV), which are defined as follows $\begin{aligned} TPR = \frac{# of true positives}{# of true positives + # of false negatives}, \\ TNR = \frac{# of true negatives}{# of true negatives + # of false positives}, \\ PPV = \frac{# of true positives}{# of true positives + # of false positives} . \end{aligned}$ $Mathematical equation: \begin{aligned} \text{TPR}=\frac{\text{\# of true positives}}{\text{\# of true positives}+\text{\# of false negatives}},\\ \text{TNR}=\frac{\text{\# of true negatives}}{\text{\# of true negatives}+\text{\# of false positives}},\\ \text{PPV}=\frac{\text{\# of true positives}}{\text{\# of true positives}+\text{\# of false positives}}. \end{aligned}$ (1)

5.1 ROC and PR curves

Examining the interaction between TPR, TNR and PPV provides a fundamental means of assessing the model’s performance. We begin our analysis by plotting the receiver operating characteristic (ROC) and precision-recall (PR) curves and calculating the corresponding Area Under the Curve (AUC) scores for both models and datasets (see Figs. A.6, A.7, and Table 1). These visualisations illustrate the trade-offs between TPR and TNR (in the ROC curve) and between TPR and PPV (in the PR curve) across different decision threshold values. A higher AUC value, closer to 1, indicates better overall model performance.

As we observe, the CNN model demonstrates strong performance, with ROC AUC values around 0.95 and PR AUC values around 0.9. In contrast, the SOM model performs considerably worse, particularly on the hot/He-rich stars dataset. For both models, classification performance is lower on the hot/He-rich stars dataset compared to the binary star dataset. It is also worth noting the Log Loss value of 1.051 for the SOM model on the hot/He-rich stars dataset, indicating that the model is overly confident when making incorrect predictions. Furthermore, we observe pronounced diagonal segments in the SOM model’s ROC curve for the hot/He-rich stars dataset (see the bottom right panel in Fig. A.6). This indicates that the SOM model cannot effectively distinguish positives from negatives in those regions (i.e. locally random performance).

5.2 Threshold optimisation

Next, we aim to identify the threshold values that yield the best performance for our models. However, this optimisation depends on how “good” performance is defined. To optimise specificity, we employ Youden’s index, defined as $Youden’s index = TPR + TNR - 1,$ $Mathematical equation: \text{Youden’s index} = \text{TPR} + \text{TNR} - 1,$ (2)

while to optimise precision, we used the F₁ score, defined as $F_{1} = \frac{2 \cdot PPV \cdot TPR}{PPV + TPR} .$ $Mathematical equation: F_1 = \frac{2 \cdot \text{PPV} \cdot \text{TPR}}{\text{PPV} + \text{TPR}}.$ (3)

After threshold optimisation (see Table 2), we note that all previously observed trends are maintained. That is, the CNN model consistently outperforms the SOM model in terms of sensitivity, specificity, and precision. Additionally, both models perform better on the binary hot sds dataset than on the hot/He-rich stars dataset.

Table 1

Performance evaluation scores.

Table 2

Optimal model threshold results for Youden’s index and F₁ score.

6 Predicted labels for 20061 hot subdwarfs

In this section, we present the single versus binary and hot/He-rich versus sdB class predictions for our full sample of 20 061 hot sds. We then use these results to analyse the correlation between stellar variability indicators and hot sd binarity. Table A.1 lists the CNN and SOM label predictions for the first few stars in our sample. A full version of this table is available online.

We see in Sect. 5, that the CNN ensemble is more precise than the SOM in predicting the classes for the training sets. The following analysis is therefore based on the CNN predictions, using the probability threshold values that optimise the prediction precision (F1 score, see Table 2).

For the threshold value of P(binary) = 0.745, our CNN ensemble predicts a binary ratio of ∼17 in our full sample. This corresponds to 3359 detected binaries in our full dataset. The found binary ratio in the full sample is the same as in our binarity training set. The CNN ensemble for the hot/He-rich versus sdB classification predicts a ratio of ∼66% sdB in the full sample (using the classification threshold 0.525). This corresponds do a number of 13 174 sdB stars in our sample.

For our CNN ensemble, the reported class probabilities are the average values from the five best-performing models. Along with these averages, we also include the standard deviations (std) of the class probabilities across the five models. These deviations reflect the internal uncertainty of the CNN ensemble (see, for example, Guiglion et al. 2024). We find that this internal uncertainty is low when the predicted class probability is close to 0 or 1. In these cases, the model is confident, and the samples are clearly representative of their respective classes. However, as the predicted probability gets closer to 50%, the uncertainty increases. Figure A.8 shows the internal uncertainty of the CNN ensemble relative to the predicted class probability for the binary and sdB classes.

Fig. 10

Gaia RUWE versus excess flux error of hot sds in our sample. CNN single stars are coloured in blue, CNN binaries are green. Stars from the island A are not included in this plot. The dashed lines show the variability cut-offs at RUWE =1.4 and excess flux error = 4.0 used in Culpan et al. (2022). The resulting regions of the diagram are numbered from 1 to 4.

6.1 Connection between indicators of stellar variability and binarity

In this section, we investigate the connection between hot sds binarity and the indicators for astrometric and photometric variability. Following Culpan et al. (2022), we consider a hot sds as variable if its RUWE parameter is larger than 1.4 or excess flux error >4.0. These thresholds divide the RUWE versus excess flux error diagram into four regions: region 1 with no indication of variability, region 2 showing photometric variability only, region 3 showing both photometric and astrometric variability, and region 4 with only astrometric variability.

Of our 20 061 sample objects, 11639 have measurements of both RUWE and excess flux errors (excluding the island A objects). Figure 10 shows the distribution of this hot sds sample in the RUWE versus excess flux error diagram. Of the 11 639 hot sds, 90.5% are invariable, 6.2% show indications of photometric variability, only 0.6% display both photometric and astrometric variability, and 2.7% show photometric variability alone. We find a correlation between stellar variability and binarity. Starting from a binary ratio of 17.6% in the invariable region 1, the ratio increases in the other regions. Region 2 has a binary ratio of 60.5%, region 3 shows 77.1%, and region 4 has 80.8% binaries.

6.2 Contamination by cataclysmic variable stars

We remind the reader that the binaries that we detect in this work are those with composite spectra. These are generally wide systems with orbital periods of several hundred days. Wide orbits cause increased astrometric wobble, and therefore higher RUWE values (Culpan et al. 2022). This leads to the observed correlation between RUWE values and binary ratios in regions 3 and 4 in Fig. 10. The elevated binary ratio in region 2, which only shows photometric variability, is harder to explain. The distance between hot sd and wide-orbit companion is too large for the two components to interact directly, for example through the transfer of material from the hot sd to the companion. High photometric variability of our detected binary systems is therefore unexpected. A possible explanation is that the region 2 contains contamination by cataclysmic variables. These objects are binary systems, composed of a white dwarf that accretes material from its companion, typically a late-type star (e.g., Smak 2001). Many cataclysmic variables occupy a region in the Hertzsprung-Russell diagram that overlaps with the reddest objects in our sample (BP-RP ~0.5, GMAG −5.0; Abril et al. 2020). To evaluate a possible contamination of our sample by cataclysmic variables, we crossmatched our dataset with several catalogues of cataclysmic variables. These are the Open Cataclysmic Variable Catalog (Jackim et al. 2020), the Gaia DR3 catalogue of variable sources (Eyer et al. 2023), and the catalogue of cataclysmic variables of Inight et al. (2023). In total, we find 203 unique cataclysmic variable sources in our full sample, and all except five of them are located in the region 2 of our Fig. 10. The 198 cataclysmic variables in the region 2 make up 28% of the 720 objects in this variability region. Many cataclysmic variables are known X-ray sources, contributing a large part of the total galactic X-ray emission. To find additional contamination with cataclysmic variables that were not listed in the above catalogues, we crossmatched our sample with the first data release catalogue of the X-ray survey eROSITA (Merloni et al. 2024). We find a match of 120 X-ray sources in our full sample, 67 of them not in any of the cataclysmic variable catalogues. Of these 67 X-ray sources, 45 show high photometric variability. Interestingly, the majority of the found cataclysmic variables in our sample are concentrated in the filament 2 in our UMAP of XP coefficients (region F2 in Fig. 1). This finding suggests that it is possible to search for cataclysmic variables by using Gaia-XP spectra. While a deeper investigation of the cataclysmic variable contamination in our sample is beyond the scope of this current work, it is clear that a large fraction of our detected binaries with high excess flux errors are not hot sds binaries, but couuld be cataclysmic variables. Our binarity probability predictions for samples with RUWE <1.4 and excess flux error > 4 should therefore be used with caution.

7 Summary and conclusions

The initial sample for this study consisted of ~62000 hot sds identified by Culpan et al. (2022). From this catalogue, we selected a subset of 20 061 objects for which Gaia XP spectra, astrometric measurements, and physical parameter estimates were available. We employed several techniques, but our main focus was analysing their possible binary properties and classification.

To gain an overview of our dataset, we used UMAP to reduce the high-dimensional Gaia XP coefficients of 20 061 stars to two dimensions. The resulting similarity map contained several distinct groups of objects with similar XP coefficients.

Colour-coding the map by various stellar parameters revealed the connections between the physical properties of our sample objects and their XP coefficients. We find that stellar BP-RP colour is the dominant parameter that influences the position of an object in the similarity map. To a lesser extent, stellar effective temperatures, helium abundances, and stellar variability indicators are also important parameters that influence the location of a sample in our similarity map. The majority of VOSA classified binary star systems were grouped together in two filamentary structures in the similarity map. The filament companions are exclusively MS stars, that contribute on average ∼3.3% to the total L_bol of their binary systems. Fainter companions like low mass M-dwarfs, BDs, and WDs were not found in the filaments but are scattered across the rest of the similarity map. The UMAP dimensionality reduction revealed a group of 561 objects whose XP coefficients are different from the coefficients of the rest of our sample. The investigation of 58 members of this separate group with the VOSA tool revealed that these objects are too cool to be single or binary hot sds. A visual inspection of the SEDs and Gaia XP-spectra of these objects further showed the absence of the negative UV slope, which is typical for hot objects. Based on these indications of lower stellar temperatures, we conclude that the 561 objects in island A are not hot sds.

To find binary systems among our full sample, we used the machine SOM and CNN machine learning methods. These tools can predict whether a hot sd belongs to the binary or single class, based on its XP coefficients. The training set for the SOM and CNN consisted of a subset of 2695 hot sds, for which VOSA labels were available. For the supervised CNN method, we had to address the class imbalance in the training set, where only 17% are binary class samples. By using SMOTE, we generated synthetic binary samples to increase the binary ratio to 33%. In total, we trained an ensemble of ten CNN models, and used the best five models to predict the binary probability for our sample of 20061 hot sds.

To remove the influence of the flux excess, we normalised the XP spectra of our sample hot sds to the continuum. We determine the continuum baselines using the asymmetric least squares smoothing method. The similarity analysis of the normalised spectra confirmed that the influence of the BP-RP colour on the spectra shape has been significantly reduced, allowing individual absorption features to become more prominent. The spectra of the hot and He-rich hot sds are grouped much closer together in the similarity map of the normalised spectra compared to the coefficient similarity map.

At the low resolution of the Gaia XP spectra, the He absorption features are blended with the strong Balmer absorption lines of hydrogen. As a consequence, we could not distinguish between the influence of temperature and the influence of He abundance on the shape of our spectra. We therefore could only separate the cool/He-poor spectra (equivalent to the hot sd subtype sdB) from the hot/He-rich spectra (He-sdB, He-sdO, and sdO). Analogous to the detection of binaries from XP coefficients, we trained an SOM model and an ensemble of CNNs to classify our normalised spectra into sdB and hot/He-rich samples.

After training the SOM and CNN ensemble, we used them to predict the binarity and sdB versus hot/He-rich classifications for our full sample of 20 061 objects. Our binarity predictions from the CNN ensemble show that there exists a correlation between astrometric and photometric variability indicators and hot sd binarity. The binary ratio among hot sds without indication of astrometric or photometric variability is ∼18%, and it increases to more than 60% for stars with only photometric variability. We find that at least 28% of the objects with only photometric variability are not hot sds. Instead, they could be cataclysmic variable binary systems. The binarity probability predictions for samples with RUWE less than 1.4 and an excess flux error greater than 4.0 should therefore be used with caution. The binary ratio rises to ∼77% for hot sds with both astrometric and photometric variability. At ∼82%, the binary ratio is the highest for hot sds that show only astrometric variability.

As a follow-up to this work, we have recently initiated high-resolution spectroscopic observations with the Vilnius University Echelle Spectrograph (Jurgenson et al. 2016) for selected targets in our sample. These observations will allow us to validate our results and explore their binary properties in greater depth.

Data availability

The XP coefficients and XP spectra of our sample are available for download at https://github.com/mambr-astro/Hot_sds_GaiaXP. Python codes for the construction of the coefficient similarity map, the normalisation of XP spectra, and the training of CNN ensembles for binary and sdB star detection are also available there.

Acknowledgements

We sincerely thank the anonymous referee for her/his valuable guidelines and insightful comments, which have significantly enhanced the quality of this work. The authors acknowledge support from the project “Unveiling the Nature of Hot Subdwarfs through Spectroscopy and Machine Learning” (MSF-JM-17/2025), funded by the Vilnius University Research Promotion Fund. We also acknowledge support from EU HORIZON-CL4-2023-SPACE-01-71 SPACIOUS project, Ref. 101135205; the Spanish Ministry of Science MCIN / AEI / 10.13039/501100011033 and the EU FEDER collaborate through the coordinated grant PID2024-157964OB-C22. We also acknowledge support from the Xunta de Galicia and the European Union ED431B 2024/21, CITIC ED431G 2023/01. Ana Ulla acknowledges partial funding, under grant IFC3002/25, from the Fundación Ceo, Ciencia e Cultura. This research has made use of the Spanish Virtual Observatory (https://svo.cab.inta-csic.es) project funded by MICIU/AEI/10.13039/501100011033 and by ERDF/EU through Grant PID2023-146210NB-I00.

References

Abril, J., Schmidtobreick, L., Ederoclite, A., & López-Sanjuan, C. 2020, MNRAS, 492, L40 [NASA ADS] [CrossRef] [Google Scholar]
Allard, F., Homeier, D., & Freytag, B. 2012, Philos. Trans. Roy. Soc. Lond. Ser. A, 370, 2765 [NASA ADS] [Google Scholar]
Andrae, R., Fouesneau, M., Sordo, R., et al. 2023, A&A, 674, A27 [CrossRef] [EDP Sciences] [Google Scholar]
Bayo, A., Rodrigo, C., Barrado Y Navascués, D., et al. 2008, A&A, 492, 277 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Bu, Y., Zeng, J., Lei, Z., & Yi, Z. 2019, ApJ, 886, 128 [NASA ADS] [CrossRef] [Google Scholar]
Carrasco, J. M., Weiler, M., Jordi, C., et al. 2021, A&A, 652, A86 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Castañón Esteban, A., Steele, I. A., & Jermak, H. E. 2024, MNRAS, 529, 1555 [Google Scholar]
Culpan, R., Geier, S., Reindl, N., et al. 2022, A&A, 662, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Eilers, P., & Boelens, H. 2005, Unpublished Manuscript [Google Scholar]
Eyer, L., Audard, M., Holl, B., et al. 2023, A&A, 674, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Gaia Collaboration (Vallenari, A., et al.) 2023, A&A, 674, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Geier, S., Heber, U., Podsiadlowski, P., et al. 2010, A&A, 519, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Geier, S., Dorsch, M., Dawson, H., et al. 2023, A&A, 677, A11 [Google Scholar]
Gentile Fusillo, N. P., Tremblay, P. E., Cukanovaite, E., et al. 2021, MNRAS, 508, 3877 [NASA ADS] [CrossRef] [Google Scholar]
Guiglion, G., Nepal, S., Chiappini, C., et al. 2024, A&A, 682, A9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Inight, K., Gänsicke, B. T., Breedt, E., et al. 2023, MNRAS, 524, 4867 [NASA ADS] [CrossRef] [Google Scholar]
Jackim, R., Szkody, P., Hazelton, B., & Benson, N. C. 2020, RNAAS, 4, 219 [Google Scholar]
Jurgenson, C., Fischer, D., McCracken, T., et al. 2016, J. Astron. Instrum., 5, 1650003 [Google Scholar]
Kao, M. L., Hawkins, K., Rogers, L. K., et al. 2024, ApJ, 970, 181 [NASA ADS] [CrossRef] [Google Scholar]
Kim, Y.-S., Kim, M. K., Fu, N., et al. 2025, Sustain. Cities Soc., 118, 105570 [Google Scholar]
Lallement, R., Vergely, J. L., Babusiaux, C., & Cox, N. L. J. 2022, A&A, 661, A147 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Lemaître, G., Nogueira, F., & Aridas, C. K. 2017, J. Mach. Learn. Res., 18, 1 [Google Scholar]
Lindegren, L., Klioner, S. A., Hernández, J., et al. 2021, A&A, 649, A2 [EDP Sciences] [Google Scholar]
Luo, Y., Németh, P., Wang, K., Wang, X., & Han, Z. 2021, ApJS, 256, 28 [NASA ADS] [CrossRef] [Google Scholar]
Martin, D. C., Fanson, J., Schiminovich, D., et al. 2005, ApJ, 619, L1 [Google Scholar]
McInnes, L., Healy, J., Saul, N., & Grossberger, L. 2018, J. Open Source Softw., 3, 861 [CrossRef] [Google Scholar]
Mereghetti, S., Pintore, F., Rauch, T., et al. 2021, MNRAS, 504, 920 [NASA ADS] [CrossRef] [Google Scholar]
Merloni, A., Lamer, G., Liu, T., et al. 2024, A&A, 682, A34 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Mourard, D., Monnier, J. D., Meilland, A., et al. 2015, A&A, 577, A51 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Németh, P., Kawka, A., & Vennes, S. 2012, MNRAS, 427, 2180 [Google Scholar]
Oreiro, R., Rodríguez-López, C., Solano, E., et al. 2011, A&A, 530, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Pelisoli, I., Vos, J., Geier, S., Schaffenroth, V., & Baran, A. S. 2020, A&A, 642, A180 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Pelisoli, I., Neunteufel, P., Geier, S., et al. 2021, Nat. Astron., 5, 1052 [NASA ADS] [CrossRef] [Google Scholar]
Pérez-Couto, X., Pallas-Quintela, L., Manteiga, M., Villaver, E., & Dafonte, C. 2024, ApJ, 977, 31 [Google Scholar]
Pérez-Fernández, E., Ulla, A., Solano, E., Oreiro, R., & Rodrigo, C. 2016, MNRAS, 457, 3396 [CrossRef] [Google Scholar]
Ranaivomanana, P., Uzundag, M., Johnston, C., et al. 2025, A&A, 693, A268 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Recio-Blanco, A., de Laverny, P., Palicio, P. A., et al. 2023, A&A, 674, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Schaffenroth, V., Casewell, S. L., Schneider, D., et al. 2021, MNRAS, 501, 3847 [Google Scholar]
Smak, J. 2001, Cataclysmic Variables, eds. F. C. Lázaro, & M. J. Arévalo (Berlin, Heidelberg: Springer Berlin Heidelberg), 110 [Google Scholar]
Solano, E., Ulla, A., Pérez-Fernández, E., et al. 2022, MNRAS, 514, 4239 [NASA ADS] [CrossRef] [Google Scholar]
Tahir, M., Yude, B., Mehmood, T., et al. 2024, Sci. Rep., 14, 16815 [Google Scholar]
Tan, L., Mei, Y., Liu, Z., et al. 2022, ApJS, 259, 5 [NASA ADS] [CrossRef] [Google Scholar]
Van der Swaelmen, M., Viscasillas Vázquez, C., Magrini, L., et al. 2024, A&A, 690, A276 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Vettigli, G. 2018, MiniSom: minimalistic and NumPy-based implementation of the Self Organizing Map [Google Scholar]
Viscasillas Vázquez, C., Solano, E., Ulla, A., et al. 2024, A&A, 691, A223 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Vos, J., Németh, P., Vuckovic, M., Ostensen, R., & Parsons, S. 2018, MNRAS, 473, 693 [NASA ADS] [CrossRef] [Google Scholar]
Wu, Y., Chen, X., Li, Z., & Han, Z. 2018, A&A, 618, A14 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Zhang, X., Green, G. M., & Rix, H.-W. 2023, MNRAS, 524, 1855 [NASA ADS] [CrossRef] [Google Scholar]
Zhang, J., Bu, Y., Wu, H., et al. 2025, AJ, 170, 94 [Google Scholar]
Zou, X., & Lei, Z. 2024, PASJ, 76, 1084 [Google Scholar]

$L 2 (x) = \sqrt{x_{1}^{2} + x_{2}^{2} + . . . + x_{n}^{2}}$ $Mathematical equation: $L2(x) = \sqrt{x_1^2 + x_2^2 + ... + x_n^2}$$ , for a vector x with n components.

https://github.com/JustGlowing/minisom/

Appendix A Complementary figures and tables

Fig. A.1

Gaia XP spectra of 750 randomly selected stars from our sample, colour-coded by Gaia BP-RP magnitudes. The spectra have been scaled to a maximum flux value of 1.0 and truncated to a wavelength range between 400 and 950 nm.

Fig. A.2

Colour-magnitude diagram of our sample objects, based on Gaia DR3 photometry. Top panel: The UMAP filament regions are highlighted. Bottom panel: The UMAP island regions are highlighted. In both panels, the main body of the map is coloured in grey.

Fig. A.3

Similarity map of our 20061 sets of XP coefficients. Where available, colours indicate the ratio of the binary luminosity of the binary companion relative to the hot sds luminosity.

Fig. A.4

Network gradients for the classification of sdB stars. Top panel: Mean spectra of the classes sdB (red) and hot/He-rich (blue) in the CNN training set. Bottom panel: Network gradients for the class sdB. In both panels, dashed vertical lines show the positions of the 410, 434, 486 nm Balmer absorption lines of hydrogen.

Fig. A.5

Examples of VOSA SED fits. Top panel: Fit of the binary hot hds PHL1079 (Gaia DR3 2560685069816099968). For a good fit, the observed SED (red data points) requires a combination of a TMAP model of a hot star (Model 1, purple data points) and a BT-Settl model of a cooler main sequence star (Model 2, cyan data points). Theoretical spectra for both models are shown in lavender colour. The combined model is shown in blue. Bottom panel: Fit of a main sequence BT-Settl model SED (blue) to the observed SED (red) of the island A object with Gaia DR3 4711992372581728128. In both panels, the parameters of the fitted models are given at the top of the plot. The yellow triangles mark upper limits, and are excluded from the model fitting.

Table A.1

Classification probabilities for binary and hot/He-rich subdwarfs.

Fig. A.6

ROC curves for different models and classification tasks.

Fig. A.7

PR curves for different models and classification tasks.

Fig. A.8

Left panel: Internal CNN uncertainties relative to predicted binary probability for our sample of 20061 hot sds. Right panel: Internal CNN uncertainties relative to predicted sdB probability. Darker shade means higher density of data points.

All Tables

Table 1

Performance evaluation scores.

In the text

Table 2

Optimal model threshold results for Youden’s index and F₁ score.

In the text

Table A.1

Classification probabilities for binary and hot/He-rich subdwarfs.

In the text

All Figures

	Fig. 1 UMAP of the 20 061 sets of XP coefficients. The main body of the map is coloured in grey, separate regions are kept in black and labelled A, B, F1, and F2.
In the text

	Fig. 2 UMAP of the 20061 sets of XP coefficients. Every panel shows the same UMAP but colour-coded by different parameters. Grey points represent spectra for which the respective parameter is not available.
In the text

	Fig. 3 Average of Gaia XP spectra of all objects in island A (black) and average XP spectrum of all other samples that are not in island A (grey). The insert panel highlights the region from 400 to 550 nm.
In the text

	Fig. 4 Same UMAP as in Fig. 1 but colour-coded by VOSA binarity. Our new dataset has 2695 samples in common with the set from Paper I.
In the text

	Fig. 5 SOM map of the XP coefficients with known hot sds colour-coded according to their binarity.
In the text

	Fig. 6 UMAP of 20 023 normalised XP spectra. All panels show the same UMAP but colour-coded by different parameters. Grey points represent spectra for which the respective parameter is not available.
In the text

Fig. 7

In the text

	Fig. 8 SOM map of the XP spectra, normalised to the continuum, with known hot sds colour-coded according to whether they are He-rich or He-poor hot sds.
In the text

	Fig. 9 Helium abundances log(Y) relative to the effective temperatures T_eff of the stars in the test set. The colour scale shows the probability of belonging to the sdB class (cool/He-poor), as predicted by our CNN ensemble. The dashed lines show our defined class boundaries of T_eff = 35 000 K and log(Y) = −1.0 dex.
In the text

	Fig. 10 Gaia RUWE versus excess flux error of hot sds in our sample. CNN single stars are coloured in blue, CNN binaries are green. Stars from the island A are not included in this plot. The dashed lines show the variability cut-offs at RUWE =1.4 and excess flux error = 4.0 used in Culpan et al. (2022). The resulting regions of the diagram are numbered from 1 to 4.
In the text

	Fig. A.1 Gaia XP spectra of 750 randomly selected stars from our sample, colour-coded by Gaia BP-RP magnitudes. The spectra have been scaled to a maximum flux value of 1.0 and truncated to a wavelength range between 400 and 950 nm.
In the text

	Fig. A.2 Colour-magnitude diagram of our sample objects, based on Gaia DR3 photometry. Top panel: The UMAP filament regions are highlighted. Bottom panel: The UMAP island regions are highlighted. In both panels, the main body of the map is coloured in grey.
In the text

	Fig. A.3 Similarity map of our 20061 sets of XP coefficients. Where available, colours indicate the ratio of the binary luminosity of the binary companion relative to the hot sds luminosity.
In the text

	Fig. A.4 Network gradients for the classification of sdB stars. Top panel: Mean spectra of the classes sdB (red) and hot/He-rich (blue) in the CNN training set. Bottom panel: Network gradients for the class sdB. In both panels, dashed vertical lines show the positions of the 410, 434, 486 nm Balmer absorption lines of hydrogen.
In the text

Fig. A.5

In the text

	Fig. A.6 ROC curves for different models and classification tasks.
In the text

	Fig. A.7 PR curves for different models and classification tasks.
In the text

	Fig. A.8 Left panel: Internal CNN uncertainties relative to predicted binary probability for our sample of 20061 hot sds. Right panel: Internal CNN uncertainties relative to predicted sdB probability. Darker shade means higher density of data points.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[R1] Abril, J., Schmidtobreick, L., Ederoclite, A., & López-Sanjuan, C. 2020, MNRAS, 492, L40 [NASA ADS] [CrossRef] [Google Scholar]

[R2] Allard, F., Homeier, D., & Freytag, B. 2012, Philos. Trans. Roy. Soc. Lond. Ser. A, 370, 2765 [NASA ADS] [Google Scholar]

[R3] Andrae, R., Fouesneau, M., Sordo, R., et al. 2023, A&A, 674, A27 [CrossRef] [EDP Sciences] [Google Scholar]

[R4] Bayo, A., Rodrigo, C., Barrado Y Navascués, D., et al. 2008, A&A, 492, 277 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R5] Bu, Y., Zeng, J., Lei, Z., & Yi, Z. 2019, ApJ, 886, 128 [NASA ADS] [CrossRef] [Google Scholar]

[R6] Carrasco, J. M., Weiler, M., Jordi, C., et al. 2021, A&A, 652, A86 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R7] Castañón Esteban, A., Steele, I. A., & Jermak, H. E. 2024, MNRAS, 529, 1555 [Google Scholar]

[R8] Culpan, R., Geier, S., Reindl, N., et al. 2022, A&A, 662, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R9] Eilers, P., & Boelens, H. 2005, Unpublished Manuscript [Google Scholar]

[R10] Eyer, L., Audard, M., Holl, B., et al. 2023, A&A, 674, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R11] Gaia Collaboration (Vallenari, A., et al.) 2023, A&A, 674, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R12] Geier, S., Heber, U., Podsiadlowski, P., et al. 2010, A&A, 519, A25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R13] Geier, S., Dorsch, M., Dawson, H., et al. 2023, A&A, 677, A11 [Google Scholar]

[R14] Gentile Fusillo, N. P., Tremblay, P. E., Cukanovaite, E., et al. 2021, MNRAS, 508, 3877 [NASA ADS] [CrossRef] [Google Scholar]

[R15] Guiglion, G., Nepal, S., Chiappini, C., et al. 2024, A&A, 682, A9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R16] Inight, K., Gänsicke, B. T., Breedt, E., et al. 2023, MNRAS, 524, 4867 [NASA ADS] [CrossRef] [Google Scholar]

[R17] Jackim, R., Szkody, P., Hazelton, B., & Benson, N. C. 2020, RNAAS, 4, 219 [Google Scholar]

[R18] Jurgenson, C., Fischer, D., McCracken, T., et al. 2016, J. Astron. Instrum., 5, 1650003 [Google Scholar]

[R19] Kao, M. L., Hawkins, K., Rogers, L. K., et al. 2024, ApJ, 970, 181 [NASA ADS] [CrossRef] [Google Scholar]

[R20] Kim, Y.-S., Kim, M. K., Fu, N., et al. 2025, Sustain. Cities Soc., 118, 105570 [Google Scholar]

[R21] Lallement, R., Vergely, J. L., Babusiaux, C., & Cox, N. L. J. 2022, A&A, 661, A147 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R22] Lemaître, G., Nogueira, F., & Aridas, C. K. 2017, J. Mach. Learn. Res., 18, 1 [Google Scholar]

[R23] Lindegren, L., Klioner, S. A., Hernández, J., et al. 2021, A&A, 649, A2 [EDP Sciences] [Google Scholar]

[R24] Luo, Y., Németh, P., Wang, K., Wang, X., & Han, Z. 2021, ApJS, 256, 28 [NASA ADS] [CrossRef] [Google Scholar]

[R25] Martin, D. C., Fanson, J., Schiminovich, D., et al. 2005, ApJ, 619, L1 [Google Scholar]

[R26] McInnes, L., Healy, J., Saul, N., & Grossberger, L. 2018, J. Open Source Softw., 3, 861 [CrossRef] [Google Scholar]

[R27] Mereghetti, S., Pintore, F., Rauch, T., et al. 2021, MNRAS, 504, 920 [NASA ADS] [CrossRef] [Google Scholar]

[R28] Merloni, A., Lamer, G., Liu, T., et al. 2024, A&A, 682, A34 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R29] Mourard, D., Monnier, J. D., Meilland, A., et al. 2015, A&A, 577, A51 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R30] Németh, P., Kawka, A., & Vennes, S. 2012, MNRAS, 427, 2180 [Google Scholar]

[R31] Oreiro, R., Rodríguez-López, C., Solano, E., et al. 2011, A&A, 530, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R32] Pelisoli, I., Vos, J., Geier, S., Schaffenroth, V., & Baran, A. S. 2020, A&A, 642, A180 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R33] Pelisoli, I., Neunteufel, P., Geier, S., et al. 2021, Nat. Astron., 5, 1052 [NASA ADS] [CrossRef] [Google Scholar]

[R34] Pérez-Couto, X., Pallas-Quintela, L., Manteiga, M., Villaver, E., & Dafonte, C. 2024, ApJ, 977, 31 [Google Scholar]

[R35] Pérez-Fernández, E., Ulla, A., Solano, E., Oreiro, R., & Rodrigo, C. 2016, MNRAS, 457, 3396 [CrossRef] [Google Scholar]

[R36] Ranaivomanana, P., Uzundag, M., Johnston, C., et al. 2025, A&A, 693, A268 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R37] Recio-Blanco, A., de Laverny, P., Palicio, P. A., et al. 2023, A&A, 674, A29 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R38] Schaffenroth, V., Casewell, S. L., Schneider, D., et al. 2021, MNRAS, 501, 3847 [Google Scholar]

[R39] Smak, J. 2001, Cataclysmic Variables, eds. F. C. Lázaro, & M. J. Arévalo (Berlin, Heidelberg: Springer Berlin Heidelberg), 110 [Google Scholar]

[R40] Solano, E., Ulla, A., Pérez-Fernández, E., et al. 2022, MNRAS, 514, 4239 [NASA ADS] [CrossRef] [Google Scholar]

[R41] Tahir, M., Yude, B., Mehmood, T., et al. 2024, Sci. Rep., 14, 16815 [Google Scholar]

[R42] Tan, L., Mei, Y., Liu, Z., et al. 2022, ApJS, 259, 5 [NASA ADS] [CrossRef] [Google Scholar]

[R43] Van der Swaelmen, M., Viscasillas Vázquez, C., Magrini, L., et al. 2024, A&A, 690, A276 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R44] Vettigli, G. 2018, MiniSom: minimalistic and NumPy-based implementation of the Self Organizing Map [Google Scholar]

[R45] Viscasillas Vázquez, C., Solano, E., Ulla, A., et al. 2024, A&A, 691, A223 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R46] Vos, J., Németh, P., Vuckovic, M., Ostensen, R., & Parsons, S. 2018, MNRAS, 473, 693 [NASA ADS] [CrossRef] [Google Scholar]

[R47] Wu, Y., Chen, X., Li, Z., & Han, Z. 2018, A&A, 618, A14 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R48] Zhang, X., Green, G. M., & Rix, H.-W. 2023, MNRAS, 524, 1855 [NASA ADS] [CrossRef] [Google Scholar]

[R49] Zhang, J., Bu, Y., Wu, H., et al. 2025, AJ, 170, 94 [Google Scholar]

[R50] Zou, X., & Lei, Z. 2024, PASJ, 76, 1084 [Google Scholar]