| Issue | A&A Volume 696, April 2025 | |
|---|---|---|
| Article Number | A62 | |
| Number of page(s) | 22 | |
| Section | Stellar atmospheres | |
| DOI | https://doi.org/10.1051/0004-6361/202452048 | |
| Published online | 04 April 2025 | |
Performance of the Stellar Abundances and atmospheric Parameters Pipeline adapted for M dwarfs
I. Atmospheric parameters from the spectroscopic module
1 Observational Astrophysics, Department of Physics and Astronomy, Uppsala University, Box 516, 751 20 Uppsala, Sweden
2 Max Planck Institute for Astronomy, 69117 Heidelberg, Germany
3 Institut de Recherche en Astrophysique et Planétologie, Université de Toulouse, CNRS, IRAP/UMR 5277, 14 Avenue Edouard Belin, 31400, Toulouse, France
4 INAF – Osservatorio Astronomico d’Abruzzo, Via M. Maggini, s/n, 64100 Teramo, Italy; INFN, Sezione di Pisa, Largo Pontecorvo 3, 56127 Pisa, Italy
5 Yunnan Observatories, China Academy of Sciences, Kunming 650216, China; Key Laboratory for the Structure and Evolution of Celestial Objects, Chinese Academy of Sciences, Kunming 650011, China
6 Space sciences, Technologies and Astrophysics Research (STAR) Institute, Université de Liège, Quartier Agora, Allée du 6 Août 19c, Bât. B5c, B4000 Liège, Belgium
7 Departamento de Física, Universidade Federal de Sergipe, Av. Marcelo Deda Chagas, S/N Cep 49.107-230, São Cristóvão, SE, Brazil
8 Centre for Planetary Habitability, Department of Geosciences, University of Oslo, Sem Sælands vei 2b, 0315 Oslo, Norway
9 Institut für Astrophysik, Georg-August-Universität, Friedrich-Hund-Platz 1, 37077 Göttingen, Germany
10 Instituto de Alta Investigación, Universidad de Tarapacá, Casilla 7D, Arica, Chile
11 Instituto de Astrofísica e Ciências do Espaço (IA), CAUP, Universidade do Porto, Rua das Estrelas, 4150-762 Porto, Portugal
12 Centro de Astrobiología (CAB), CSIC-INTA, Camino Bajo del Castillo s/n, 28692 Villanueva de la Cañada (Madrid), Spain
13 Center for Star and Planet Formation, Globe Institute, the University of Copenhagen, Øster Voldgade 5–7, 1350 København K, Denmark
★ Corresponding authors; terese.olander@physics.uu.se; mgent@irap.omp.eu; ulrike.heiter@physics.uu.se
Received: 29 August 2024
Accepted: 10 February 2025
Context. M dwarfs are important targets in the search for Earth-like exoplanets due to their small masses and low luminosities. Several ongoing and upcoming space missions are targeting M dwarfs for this reason, and the ESA PLATO mission is one of these.
Aims. In order to fully characterise a planetary system the properties of the host star must be known. For M dwarfs we can derive effective temperature, surface gravity, metallicity, and abundances of various elements from spectroscopic observations in combination with photometric data.
Methods. The Stellar Abundances and atmospheric Parameters Pipeline (SAPP) has been developed to serve as a prototype for one of the stellar science software components within the PLATO consortium. The pipeline combines results from a spectroscopy, a photometry, an interferometry, and an asteroseismology module to derive stellar parameters for FGK-type stars. We have modified the pipeline to be able to analyse the M dwarf part of the PLATO target sample. The current version of the pipeline for M dwarfs mostly relies on spectroscopic observations. The module processing these data is based on the machine learning algorithm The Payne and fits a grid of model spectra to an observed spectrum to derive effective temperature and metallicity. We use spectra in the H-band, as the near-infrared region is beneficial for M dwarfs because it contains fewer molecular lines and the stars are brighter in this wavelength region than in the optical. A method based on synthetic spectra was developed for the continuum normalisation of the spectra, taking into account the pseudo-continuum formed by numerous lines of the water molecule. Photometry is used to constrain the surface gravity.
Results. We tested the modified SAPP on spectra of M dwarfs from the APOGEE survey. Our validation sample of 26 stars includes stars with interferometric observations and binaries. We found a good agreement between our derived values and reference values from a range of previous studies. We estimate the overall uncertainties in the derived effective temperature, surface gravity, and metallicity to be 100 K, 0.1 dex, and 0.15 dex, respectively.
Conclusions. We find that the modified SAPP performs well on M dwarfs and identify possible areas of future development that should lead to an improved precision of the derived stellar parameters.
Key words: techniques: miscellaneous / stars: fundamental parameters / stars: late-type / stars: low-mass
© The Authors 2025
 Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
1 Introduction
M dwarfs have become prime targets in the hunt for terrestrial exoplanets. This is partly due to their multitude, as over 70% of the stars in the solar neighbourhood are estimated to be M dwarfs (Henry et al. 2006). In addition, their low luminosities and small radii make it easier to find planets around them using both the radial velocity and the transit methods. Observing transits of exoplanets in the habitable zone around M dwarfs and obtaining high-resolution transmission spectra of the planetary atmosphere is also made easier by the shorter orbital periods due to the proximity of the exoplanet to the star. Examples of recent exoplanet transmission spectroscopy observations for M dwarfs can be found in Ridden-Harper et al. (2023), Diamond-Lowe et al. (2023), and Damiano et al. (2022).
However, a key challenge to understanding M dwarf planets lies in the difficulty of spectroscopically characterising the stars themselves. Being at the faint end of the main sequence, M dwarfs have effective temperatures below 4000 K. These low temperatures make it possible for molecules to form, and the photosphere of M dwarfs therefore has many di- and triatomic molecules. The optical wavelength region is dominated by bands of TiO, which hide the atomic lines, and shows spectral features of CaH, MgH, CaOH, and VO (e.g. Gray & Corbally 2009; Brett 1995; Allard et al. 2000; Shields et al. 2016). At longer wavelengths in the near-infrared (NIR) there are some regions with fewer molecular lines but the atomic lines are still severely blended with lines from molecules such as CO, FeH, OH, and H2O (e.g. Gray & Corbally 2009; Rojas-Ayala et al. 2012; Lindgren et al. 2016; Souto et al. 2022). It is however easier to distinguish the atomic lines at these longer wavelengths, since the absorption features of these molecules are not as dense and deep as those of TiO. The plethora of lines, both atomic and molecular, complicates the analysis of M dwarf spectra. This applies in particular to the continuum normalisation process, as it becomes increasingly difficult to identify the continuum for decreasing effective temperature. For example, in the H-band the presence of water molecules suppresses the continuum to form what we call a pseudo-continuum. This has to be taken into account when normalising an observed spectrum (see Sect. 3.2.2 and Sarmento et al. 2021).
Ideally, spectroscopically derived parameters should be verified by comparing with parameters obtained with independent methods based on measurements of fundamental stellar properties. Examples are effective temperatures derived from interferometric angular diameters and surface gravity derived from asteroseismology. However, asteroseismology cannot be used for M dwarfs because their pulsations cannot be detected with current observing capabilities (Rodríguez-López 2019). Interferometry also has some challenges with regard to M dwarfs. The angular diameter is obtained via observations, and then the Stefan-Boltzmann law is used together with the bolometric flux of the star to derive the temperature. At the moment this method can only be used on a few M dwarfs, because of the faintness and small radius of these stars1. Boyajian et al. (2012) and Rabus et al. (2019) give effective temperatures, angular diameters and surface gravities or masses for a range of M dwarfs (see Sect. 4.1).
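The interferometric temperature determination described above can be illustrated with a minimal sketch. It assumes the standard relation F_bol = (θ/2)² σ Teff⁴ between bolometric flux, limb-darkened angular diameter θ (in radians), and effective temperature; the function and constant names are illustrative and are not taken from the SAPP.

```python
import math

SIGMA_SB = 5.670374419e-5  # Stefan-Boltzmann constant [erg s^-1 cm^-2 K^-4]
MAS_TO_RAD = math.pi / (180.0 * 3600.0 * 1000.0)  # milliarcseconds -> radians


def teff_from_angular_diameter(theta_mas: float, fbol_cgs: float) -> float:
    """Effective temperature [K] from angular diameter [mas] and
    bolometric flux at Earth [erg s^-1 cm^-2].

    Inverts F_bol = (theta / 2)^2 * sigma * Teff^4.
    """
    theta_rad = theta_mas * MAS_TO_RAD
    return (4.0 * fbol_cgs / (SIGMA_SB * theta_rad ** 2)) ** 0.25
```

Because Teff scales as θ^(-1/2) and F_bol^(1/4), a 2% error in the angular diameter translates into roughly a 1% error in Teff, which is why small and faint stars are difficult targets for this method.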
Despite these challenges, many spectroscopic studies determining stellar parameters of M dwarfs have presented promising results in recent years. Classical tools for spectrum synthesis and fitting have been applied to low-resolution optical SNIFS (Lantz et al. 2004) and NIR SpeX (Rayner et al. 2003) spectra calibrated to an absolute flux scale (Mann et al. 2015), high-resolution optical HARPS and HARPS-N spectra smoothed to low resolution (Maldonado et al. 2015, 2020), high-resolution optical Keck-HIRES spectra (Rosenthal et al. 2021), as well as high-resolution NIR spectra, including spectra in the J band from the CRIRES spectrograph (Lindgren et al. 2016; Lindgren & Heiter 2017), spectra from the CARMENES spectrograph (optical plus J- and H bands, Passegger et al. 2018, 2019; Rajpurohit et al. 2018; Marfil et al. 2021), spectra from the APOGEE survey (H band, Sarmento et al. 2021; Souto et al. 2022; Melo et al. 2024), and spectra from the SPIRou spectrograph (J-, H-, and K bands, Cristofari et al. 2022a,b). At the same time, machine learning approaches have started to be used for the analysis of optical and NIR spectra, including Antoniadis-Karnavas et al. (2020, 2024, optical spectra from five instruments), Rains et al. (2024, optical WiFeS spectra), Passegger et al. (2020, 2022), Bello-García et al. (2023), and Mas-Buitrago et al. (2024) for CARMENES, as well as Ting et al. (2019) and Birky et al. (2020) for APOGEE. For details on some of these studies, which we used for comparison, see Sect. 4.2.
Current and upcoming large surveys that include M dwarfs in their samples put new demands on deriving accurate parameters and abundances. One such project is PLATO (PLAnetary Transits and Oscillations of stars) (Rauer et al. 2024), a space telescope that is planned to be launched in 2026. Its mission is to find habitable terrestrial exoplanets around solar type stars. In addition, at least 5000 (possibly up to about 25 000) M dwarfs with a V magnitude brighter than 16 are planned to be part of the sample. Additional selection criteria are applied for the M dwarf sample, based on Gaia GBP − GRP colours and absolute G magnitudes (Montalto et al. 2021; Nascimbeni et al. 2022). The V magnitude selection criterion leads to the PLATO sample being dominated by early- to mid-M dwarfs, with very few targets with spectral types later than M4 expected to be retained. PLATO will obtain light curves in order to detect exoplanets and to apply asteroseismology to the host stars (those of FGK-type) in order to derive stellar parameters. There will also be a spectroscopic follow-up of the targets. This requires a fast and reliable method to analyse the spectra of thousands of stars.
In preparation of the mission, the Stellar Abundances and atmospheric Parameters Pipeline (SAPP, Gent et al. 2022) has been developed to serve as a prototype for one of the components of the PLATO stellar science software. This pipeline uses Bayesian inference to obtain accurate stellar parameters such as Teff, log ɡ, [Fe/H], and chemical abundances from spectroscopic, photometric, and asteroseismic data. The SAPP has so far only been tested on FGK stars in the optical wavelength region (Gent et al. 2022). Here we present the results from a modified version of the SAPP capable of analysing M dwarfs in the H-band (see Sect. 3), which does not use the asteroseismic and the Bayesian inference part of the code. Our modified SAPP pipeline uses high-resolution spectroscopic and photometric data to derive effective temperature, surface gravity, and metallicity. We leave the analysis of abundances and the full Bayesian analysis to future work.
In Sect. 2, we present the observed sample we used in order to test the pipeline. In Sect. 3, we present the pipeline and its different components, including the sources for the photometric input data needed in the analysis. We give the results and compare with literature values for the sample stars in Sect. 4. We end with an outlook and conclusions in Sects. 5 and 6, respectively.
2 Sample and spectroscopic data
We tested the M dwarf version of the SAPP on observed spectra from the Apache Point Observatory Galactic Evolution Experiment (APOGEE) survey (Majewski et al. 2017; Jönsson et al. 2020), which is part of the Sloan Digital Sky Survey (SDSS-IV, Blanton et al. 2017). The targets of the survey are primarily red giants, but M dwarfs are among the observed stars. We chose APOGEE due to the large number (of the order of a thousand) of reduced M dwarf spectra available with a resolution sufficient for PLATO’s precision needs (Sarmento et al. 2021). We note that a considerable number (over 300) of reduced CARMENES near-infrared spectra have become available recently (Ribas et al. 2023). A comparative study will be carried out in future work, following this work and Passegger et al. (2022).
Our test sample was chosen to cover a range from early- to mid-M dwarfs, consistent with the properties of the PLATO target stars (see Sect. 1) and consists of 26 stars. Two K dwarfs with APOGEE spectra were included in this sample as their literature stellar parameters are within our parameter limits (see Sect. 3.2.1) and they are in a binary with an M dwarf that is also included in the sample.
All stars in the sample have determinations of atmospheric parameters in the literature. Stars with interferometric effective temperatures as well as M dwarfs in wide binaries with other M dwarfs and with FGK stars are included in our sample. Interferometric measurements give model independent temperatures, and together with reliable distances and mass-luminosity relations surface gravities can be obtained. Binaries are excellent benchmark systems for verifying metallicities and chemical abundances, assuming that the component stars have formed from the same material within a molecular cloud and thus have the same chemical composition (e.g. Desidera et al. 2004, 2006). The literature data are described in detail in Sect. 4.
The APOGEE survey makes use of two multi-object spectrographs (Wilson et al. 2019): APOGEE-N on the Sloan 2.5 m telescope in New Mexico (Gunn et al. 2006) and APOGEE-S on the 2.5 m duPont telescope in Chile (Bowen & Vaughan 1973). APOGEE has a resolving power of R = λ/∆λ ~22 500 and covers the H-band (15 000 to 17 000 Å). Three Hawaii-2RG detectors are used, where each detector covers about a third of the wavelength range. We used the combined spectra from multiple observations (apStar/asStar files) available from SDSS-IV DR14 and DR162. The observed spectra were reduced by the APOGEE pipeline apred (Nidever et al. 2015). The spectra are wavelength calibrated and have had telluric lines removed. The spectra are also radial velocity (RV) corrected. However, we found some discrepancy for some of the stars in our sample, so the SAPP was used to recalculate the RV shift for all stars in the sample. We refer the reader to Gent et al. (2022) for more details on the RV correction.
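To make the quoted resolving power concrete, the short sketch below converts R = λ/∆λ into a resolution element and a velocity resolution, and applies a non-relativistic Doppler correction to shift observed wavelengths to the rest frame. The function names are illustrative; the actual RV determination in the SAPP is described in Gent et al. (2022) and is not reproduced here.

```python
C_KMS = 299_792.458  # speed of light [km/s]


def resolution_element(wavelength: float, resolving_power: float) -> float:
    """Width of one resolution element, in the units of `wavelength`."""
    return wavelength / resolving_power


def velocity_resolution(resolving_power: float) -> float:
    """Velocity width [km/s] corresponding to one resolution element."""
    return C_KMS / resolving_power


def to_rest_frame(wave_obs: float, rv_kms: float) -> float:
    """Shift an observed wavelength to the stellar rest frame
    (non-relativistic Doppler formula, positive RV = receding star)."""
    return wave_obs / (1.0 + rv_kms / C_KMS)
```

For R = 22 500 at 16 000 Å, `resolution_element` gives about 0.71 Å and `velocity_resolution` about 13.3 km/s.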
The entire sample can be seen in Table 1 together with the coordinates, 2MASS Ks magnitude, spectral type, projected equatorial rotational velocity, availability of interferometric data, binary system specification, and S/N. The S/N is between 100 and 400 for most stars, with a few exceptions towards lower and higher values.
3 Method
The SAPP serves as a prototype for one of the components of the stellar science software that will derive stellar parameters and abundances for stars in the PLATO sample. This includes both FGK stars and M dwarfs. The original FGK version of the SAPP consists of modules based on spectroscopy, photometry, and interferometry, together with asteroseismology. In its full version the code uses Bayesian inference on results from the different modules to derive parameters such as Teff, log ɡ, [Fe/H], and chemical abundances. For a thorough description of all the functions the reader is directed to Gent et al. (2022).
The M dwarf version of the SAPP presents several key differences. Due to the unobservable nature of M dwarf pulsations the asteroseismic module cannot be applied. Adopting a log ɡ determined from the granulation properties (Bugnet et al. 2018) also appears unlikely. In addition, the original SAPP operates on optical spectra, which for M dwarfs are blanketed by many molecular lines. We therefore modified the spectroscopic module such that NIR spectra in the H band are analysed within the parameter range of M dwarfs. Furthermore, the spectroscopic analysis of M dwarfs is prone to degeneracies, a well-known one being the Teff−[Fe/H] degeneracy (e.g. Passegger et al. 2018). Passegger et al. (2018) showed that the use of an independent method to constrain log ɡ helps to reduce this degeneracy. Therefore, we derive log ɡ from photometry and stellar evolution models, before using it to determine Teff and [Fe/H] from spectroscopy. It is similar to the pipeline for FGK stars where an external constraint on log ɡ is also used (Gent et al. 2022), as routinely done for asteroseismic targets nowadays (e.g. Lund et al. 2024). In this article we focus on obtaining reliable parameters using the spectroscopic module together with constraints from photometry, while the full Bayesian inference analysis including chemical abundances is left to future developments. In this section we describe the modified version of the SAPP capable of analysing M dwarf spectra.
3.1 Model isochrones and photometry
The photometric module is used to estimate fundamental stellar parameters by fitting model isochrones to broadband photometric data. The stellar evolution models for M dwarfs adopted in the present work were specifically calculated for the analysis of M dwarfs among the PLATO targets. They represent an extension of the set of models for very low-mass (VLM) stars presented in Hidalgo et al. (2018) and Pietrinferni et al. (2021) in the framework of the updated BaSTI library3 (Bag of Stellar Tracks and Isochrones). For a detailed discussion of the input physics and numerical assumptions adopted in performing the evolutionary computations, we refer the interested reader to the mentioned references. Here we only briefly summarise the input physics most relevant for the computations of the M dwarf models adopted in the present work. The adopted solar metal mixture is that provided by Caffau et al. (2011), supplemented by the abundances given by Lodders (2010), see Table 1 in Hidalgo et al. (2018). By adopting this metal mixture, the calibration of the Solar Standard Model (SSM) provides the initial chemical composition for the Sun as Zini = 0.01721 and Yini = 0.2695, while the actual surface metallicity of the Sun is Z⊙ = 0.0153.
Superadiabatic convection in the outer layers is treated according to the Böhm-Vitense (1958) flavour of the mixing length theory (MLT). The value of the free mixing length parameter αml was fixed to 2.006 by the SSM calibration. We note that in any case the calibration of the MLT is not an issue in the VLM stellar regime, as these stars are largely adiabatic along their whole interior structure. On the other hand, a crucial issue concerning M dwarf stellar models is the treatment of the outer boundary conditions, as extensively discussed by Baraffe et al. (1995); Brocato et al. (1998); Chabrier & Baraffe (2000); Cassisi & Salaris (2013, and references therein): for structures with a mass lower than about 0.45 M⊙ it is crucial to determine the outer boundary conditions via accurate non-grey model atmospheres in order to retrieve reliable and precise evolutionary predictions. For the present models we adopted the outer boundary conditions provided by the PHOENIX model atmosphere repository (Allard et al. 2012; Husser et al. 2013). For more details on this topic we refer to the discussion in Hidalgo et al. (2018).
The thermodynamical properties were obtained by using the FreeEOS equation of state by A. Irwin (Cassisi et al. 2003; Hidalgo et al. 2018), in the configuration that provides the most accurate predictions in the thermal regime of high density and low temperature suitable for VLM stars. The sources for the radiative Rosseland opacity are the same as for the more massive stellar structures in the BaSTI library: opacities were taken from the OPAL calculations (Iglesias & Rogers 1996) for temperatures larger than log(T) = 4.0, whereas for lower temperatures the predictions provided by Ferguson et al. (2005) were adopted. The adopted conductive opacities were taken from the tabulations given by Cassisi et al. (2007, 2021).
The photometric module uses the same set of photometric bands as in Gent et al. (2022), that is, Johnson B and V (Koen et al. 2010; Monet et al. 2003; Zacharias et al. 2012), Gaia G, GBP, GRP (Gaia Collaboration 2016, 2023), and 2MASS J, H, Ks magnitudes (Cutri et al. 2003). Future updates of the SAPP for M dwarfs will likely also incorporate further NIR bands, such as W1 and W2 at 3.4 and 4.6 µm from the Wide-field Infrared Survey Explorer (WISE, Marocco et al. 2021), to better capture the peak of the spectral energy distribution (SED) of M dwarfs. The photometry is combined with photogeometric distances derived by Bailer-Jones et al. (2021). If distances are not available in that source, Gaia parallaxes (Gaia Collaboration 2023) are used by the pipeline to calculate the distances. The Stilism tool (Capitanio et al. 2017) was used to obtain line-of-sight reddening corresponding to the given distances, in order to derive and correct for interstellar extinction (see below). We note that the M dwarfs in our sample are all nearby, and their reddening values are consistent with zero.
We computed a grid of model isochrones with synthetic photometry in the bands described above. The grid spans ages between 0.5 Gyr and 15 Gyr in steps of 0.5 Gyr, masses from 0.1 to 0.75 M⊙ with steps of 0.005 M⊙, and [Fe/H] from −2.45 to +0.14 dex (+0.28 dex for the 0.5 Gyr isochrone) with steps of 0.01 dex. We chose a relatively coarse age grid because the stability of M dwarfs throughout their long main-sequence lifetimes means the age sensitivity of the isochrones is small. This range accounts for significant changes in log ɡ, specifically radius inflation for larger masses. The isochrones will be extended to higher [Fe/H] in future updates of the pipeline.
The synthetic photometry from the isochrones is then compared to observed absolute magnitudes using reddening E(B−V) and distance d to derive extinction in all available photometric bands. Except for Gaia bands, the extinction is derived using R values adopted from Casagrande et al. (2011). If the extinction AG from Gaia DR3 is not available, Gaia GBP − GRP-colour-dependent coefficients presented in Casagrande et al. (2021) are used to derive the extinction for G, GBP, and GRP. By comparing model to observed absolute magnitudes, probability distribution functions (PDFs) are derived for the stellar parameters spanned by the BaSTI isochrones, that is, log(Teff), log ɡ, [Fe/H], log(mass/M⊙), log(luminosity/L⊙), log(radius/R⊙) for all available ages. See Sect. 3.3 in Gent et al. (2022) for the exact formulation for deriving the photometric PDF and the error propagation.
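The comparison between model and observed absolute magnitudes can be sketched as follows. This is a simplified illustration, not the SAPP formulation (which is given in Sect. 3.3 of Gent et al. 2022): it assumes independent Gaussian magnitude errors, a per-band extinction A = R × E(B−V), and hypothetical function names.

```python
import numpy as np


def absolute_magnitude(m_app, distance_pc, ebv, r_band):
    """Extinction-corrected absolute magnitude.

    m_app       : apparent magnitude in one band
    distance_pc : distance in parsec
    ebv         : reddening E(B-V)
    r_band      : extinction coefficient of the band, A_band = r_band * E(B-V)
    """
    extinction = r_band * ebv
    return m_app - 5.0 * np.log10(distance_pc / 10.0) - extinction


def photometric_pdf(model_mags, obs_abs_mags, obs_errs):
    """Normalised probability of each isochrone point given the photometry.

    model_mags   : array (n_models, n_bands) of synthetic absolute magnitudes
    obs_abs_mags : array (n_bands,) of observed absolute magnitudes
    obs_errs     : array (n_bands,) of magnitude uncertainties
    Assumes a Gaussian likelihood with independent bands (an assumption of
    this sketch).
    """
    chi2 = np.sum(((model_mags - obs_abs_mags) / obs_errs) ** 2, axis=1)
    likelihood = np.exp(-0.5 * (chi2 - chi2.min()))  # shift avoids underflow
    return likelihood / likelihood.sum()
```

Each isochrone point carries its own log ɡ, so marginalising this PDF over the other parameters yields a log ɡ distribution.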
In Gent et al. (2022), a set of stellar parameters is passed to the photometric module to define a sub-domain within the parameter space, minimising the number of stellar photometric models that are processed in the code. However, here we define the sub-domain of the M dwarf grid as 3500 ± 800 K, 4.9 ± 1.0 dex, and −0.02 ± 1.0 dex in Teff, log ɡ, and [Fe/H], respectively, regardless of the properties of the star being analysed, and the photometric module is only used to determine log ɡ.
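The fixed sub-domain quoted above amounts to a simple box selection on the isochrone grid. A minimal sketch, with hypothetical variable names and the centre and half-width values taken from the text:

```python
import numpy as np

# Centre and half-width of the fixed M dwarf sub-domain (values from the text).
CENTRE = {"teff": 3500.0, "logg": 4.9, "feh": -0.02}
HALF_WIDTH = {"teff": 800.0, "logg": 1.0, "feh": 1.0}


def in_subdomain(teff, logg, feh):
    """Boolean mask selecting isochrone points inside the sub-domain."""
    teff, logg, feh = (np.asarray(x, dtype=float) for x in (teff, logg, feh))
    return (
        (np.abs(teff - CENTRE["teff"]) <= HALF_WIDTH["teff"])
        & (np.abs(logg - CENTRE["logg"]) <= HALF_WIDTH["logg"])
        & (np.abs(feh - CENTRE["feh"]) <= HALF_WIDTH["feh"])
    )
```

Only the grid points for which the mask is true would then enter the photometric PDF.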
Table 1 Stars in our sample.
3.2 Spectroscopy
The spectroscopic module in the SAPP is based on The Payne, a method developed by Ting et al. (2019) to infer stellar parameters from observed spectra based on a machine learning algorithm. The Payne uses an artificial neural network (ANN) model trained on a grid of synthetic spectra for FGKM dwarf and giant stars. Ting et al. (2019) generated the spectra with ATLAS12 model atmospheres and the SYNTHE spectral synthesis code. The spectra were defined by 25 parameters (or ‘labels’) corresponding to stellar properties, including Teff, log ɡ, turbulence parameters, as well as elemental abundances. The code identifies the set of parameter values that best reproduce the observed spectrum. Ting et al. (2019) tested The Payne on stars in the APOGEE DR14 sample, which includes M dwarfs. For FGK stars (dwarfs as well as giants) the stellar parameters derived with The Payne were consistent with isochrones and the published APOGEE DR14 values.
However, for the M dwarfs the results were diverging from the isochrones and no comparison was done with APOGEE results, because APOGEE DR14 did not provide parameters for these stars. Possible explanations given by Ting et al. (2019) for their mismatch compared with the isochrones are that the adopted line list was not well calibrated for this temperature range and that the used atmospheric models were not suitable for M dwarfs. When testing how well The Payne recovered labels it was found that the deviation from the input labels was about twice as large for stars between 3000 and 4500 K than for hotter stars (see Fig. 5 in Ting et al. 2019). To improve the performance of The Payne framework for M dwarfs, it is important to use a line list adapted for M dwarfs, and the algorithm needs to be retrained on a set of synthetic spectra that better represent low-mass stars.
3.2.1 Model spectra and neural network
In this study we followed a similar procedure for computing the model spectra and training the neural network as demonstrated in Kovalev et al. (2019). The Payne’s ANN was trained to restore the model spectrum corresponding to input labels. The ANN architecture consists of a fully connected three-layer model with nine input units, two hidden layers with 400 and 300 units, and 11 000 output units. The number of input units corresponds to the dimensionality of the spectral grid used for training, while the number of output units corresponds to the number of wavelength points of the training spectra. A ReLU (rectified linear unit) activation function is used for the hidden units, while a sigmoid activation function is used for the output units.
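The architecture described above can be written down as a plain forward pass. The sketch below uses NumPy with random, untrained weights purely to show the layer sizes and activation functions; the initialisation scheme and function names are assumptions and do not reflect the trained network used in the SAPP.

```python
import numpy as np

N_LABELS, N_HIDDEN_1, N_HIDDEN_2, N_PIXELS = 9, 400, 300, 11_000


def init_params(rng):
    """Random (untrained) weights and zero biases for the three layers."""
    sizes = [N_LABELS, N_HIDDEN_1, N_HIDDEN_2, N_PIXELS]
    return [
        (rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out))
        for n_in, n_out in zip(sizes[:-1], sizes[1:])
    ]


def forward(params, labels):
    """Map a label vector (length 9) to a normalised spectrum (length 11 000)."""
    (w1, b1), (w2, b2), (w3, b3) = params
    h1 = np.maximum(labels @ w1 + b1, 0.0)          # ReLU, 400 units
    h2 = np.maximum(h1 @ w2 + b2, 0.0)              # ReLU, 300 units
    return 1.0 / (1.0 + np.exp(-(h2 @ w3 + b3)))    # sigmoid, 11 000 units


params = init_params(np.random.default_rng(0))
spectrum = forward(params, np.full(N_LABELS, 0.5))
n_weights = sum(w.size + b.size for w, b in params)
```

The sigmoid output keeps every predicted flux value between 0 and 1, which suits continuum-normalised spectra. With these layer sizes the network has about 3.4 million free parameters, most of them in the final layer.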
For training and validation a random uniform grid of synthetic spectra was computed using Turbospectrum as described in Gerber et al. (2023) together with MARCS atmospheric models (Model Atmospheres with a Radiative and Convective Scheme), the APOGEE DR16 line list (Smith et al. 2021), and the water line list by Polyansky et al. (2018). The grid covers the stellar parameter space of Teff from 2500 to 5500 K, log ɡ/cm s−2 from 4 to 5.4 dex4, [Fe/H] from −2.0 to 0.6 dex5, and microturbulence Vturb from 0.01 to 2.0 km s−1. In addition, the elemental abundances of O, Mg, Ca, Si, and Ti vary within −0.2 and 0.8 dex relative to the solar mixture (Grevesse et al. 2007), such that the distribution of each abundance ratio with respect to iron is uniform across the grid. The model grid consists of 11292 spectra in total, of which 70% were used for training and the remainder for validation. For this study, the stellar spectra were modelled assuming local thermodynamic equilibrium (LTE). Apart from Gaussian instrumental broadening corresponding to the spectral resolution of APOGEE no additional broadening of spectral lines (for example, υ sin i) was applied. We note that departures from LTE as well as rapid rotation can occur for M dwarfs, as discussed in Sect. 5.2 and 5.3. However, we leave a possible non-LTE analysis and the fitting of υ sin i to future follow-up studies.
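As an illustration of how such a random uniform label grid and its training and validation split could be set up, consider the sketch below. The label names, the random seed, and the choice to sample the five abundance ratios directly as [X/Fe] are assumptions; the spectra themselves would be computed with Turbospectrum, which is not shown.

```python
import numpy as np

# Label ranges from the text; abundance ratios are sampled as [X/Fe] (assumed).
LABEL_RANGES = {
    "teff": (2500.0, 5500.0),   # K
    "logg": (4.0, 5.4),         # dex
    "feh": (-2.0, 0.6),         # dex
    "vturb": (0.01, 2.0),       # km/s
    "o_fe": (-0.2, 0.8),
    "mg_fe": (-0.2, 0.8),
    "ca_fe": (-0.2, 0.8),
    "si_fe": (-0.2, 0.8),
    "ti_fe": (-0.2, 0.8),
}
N_SPECTRA = 11292
TRAIN_FRACTION = 0.7

rng = np.random.default_rng(42)
low = np.array([lo for lo, _ in LABEL_RANGES.values()])
high = np.array([hi for _, hi in LABEL_RANGES.values()])

# One row per synthetic spectrum, one column per label.
labels = rng.uniform(low, high, size=(N_SPECTRA, len(LABEL_RANGES)))

# Random 70/30 split into training and validation sets.
order = rng.permutation(N_SPECTRA)
n_train = int(TRAIN_FRACTION * N_SPECTRA)
train_labels = labels[order[:n_train]]
valid_labels = labels[order[n_train:]]
```

The nine labels match the nine input units of the network, and the split gives 7904 training and 3388 validation spectra.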
We performed a validation of the neural network by comparing synthetic spectra from the validation sample with models generated with the neural network using the same set of parameters. We found that the median interpolation error for the majority of models is about 0.1%. A few models deviate by more than 1%. The warmer parts of the grid (i.e. Teff > 4000 K) perform slightly better than the cooler parts of the grid (below 0.1% compared with slightly above 0.1%). This can be seen in Fig. A.1 in Appendix A. The performance of the grid is similar to that reported by Ting et al. (2019), who give a median interpolation error of about 0.1% for their grid and a slightly larger interpolation error for the cooler models.
The spectroscopic module uses normalised observed spectra (see Sect. 3.2.2) and a gradient descent method to find the global minimum in the parameter space of the training labels. In the process, the observed spectrum is compared to synthetic spectra reconstructed using the neural network, following the methodology in Gent et al. (2022).
Fig. 1 Example of H-band synthetic spectra generated with different effective temperatures. The surface gravity was set to 4.7 dex and the metallicity was set to solar. The different colours correspond to different Teff values.
3.2.2 Normalisation including pseudo-continuum
The synthetic spectra used for training the SAPP’s ANN are normalised to the continuum flux. Thus, the SAPP needs normalised observed spectra in order to analyse the stars. However, M dwarf spectra do not have a clear continuum due to the presence of a multitude of molecular lines, both in the optical and, to a lesser extent, in the NIR range. These molecular features suppress the continuum, forming a pseudo-continuum, where the suppression becomes deeper for cooler stars. In Fig. 1 we show how the pseudo-continuum varies with the effective temperature. The synthetic spectra in the figure were generated in the same way as described in Sect. 3.2.1. In the wavelength region shown the highest flux points of the hottest synthetic spectra are almost 0.3 continuum units larger than the highest flux points of the coolest spectra. We also explored how the pseudo-continuum is affected by metallicity. We found that the flux depression is more severe at higher metallicity, although the effect remains within about 0.03 continuum units. This can be seen in Fig. B.1 in Appendix B. For a discussion of the effect of carbon and oxygen abundances on the pseudo-continuum, see Veyette et al. (2016).
The goal is to match the scaling of the observed and synthetic spectra so that they can be compared. The SAPP has a built-in normalisation routine which is described in the appendix of Gent et al. (2022). The routine can be summarised as a piece-wise linear regression algorithm, whereby the un-normalised spectrum is broken into segments. These are defined considering the location of broad and narrow lines. Each segment is fitted taking into account the S/N, and the observed flux is divided by the fit in each segment. Due to the presence of the pseudo-continuum in M dwarfs this normalisation procedure needed to be modified. In order to apply a pseudo-continuum to the observed M dwarf spectra that matches the pseudo-continuum level of the model grid spectra that were used in training The Payne (see Sect. 3.2.1), we generated a grid of synthetic spectra in the same way as those shown in Fig. 1 in the Teff range of 3000 to 4150 K (it is limited to early- to mid-M dwarfs) with a step size of 50 K, log ɡ of 4.4 to 4.9 dex, and metallicity of −0.8 to 0.5 dex6, where log ɡ and [Fe/H] both had a step size of 0.1 dex. We included some variation in the surface gravity in the grid even though its main effect on the spectra is a broadening of the lines, with a minimal influence on the pseudo-continuum. We fit second-degree polynomials to the highest peaks of the generated synthetic spectra disregarding peaks found outside of three sigma from the mean flux of the highest peaks. This results in sets of polynomial coefficients for the different stellar parameters that are used as input to the SAPP. These polynomials are then used to adjust the flux of the observed spectra to the pseudo-continuum level.
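The polynomial fit to the highest peaks can be sketched as follows. The text does not specify how the highest peaks are selected, so this example assumes that the highest flux point in each of a fixed number of wavelength windows is used; the window count and the function name are hypothetical.

```python
import numpy as np


def fit_pseudo_continuum(wave, flux, n_windows=40, n_sigma=3.0):
    """Fit a second-degree polynomial to the highest flux points of a
    continuum-normalised synthetic spectrum.

    The highest point in each of `n_windows` wavelength windows is taken
    as a peak (an assumption of this sketch); peaks further than `n_sigma`
    standard deviations from the mean peak flux are discarded.
    """
    chunks = np.array_split(np.arange(wave.size), n_windows)
    peak_idx = np.array([chunk[np.argmax(flux[chunk])] for chunk in chunks])
    peak_flux = flux[peak_idx]

    # Reject outlying peaks before fitting.
    keep = np.abs(peak_flux - peak_flux.mean()) <= n_sigma * peak_flux.std()

    # Polynomial.fit rescales the wavelength axis internally, which keeps
    # the least-squares problem well conditioned at ~16 000 Angstrom.
    return np.polynomial.Polynomial.fit(wave[peak_idx][keep], peak_flux[keep], deg=2)
```

The returned object is callable, so `poly(wave)` gives the pseudo-continuum level on any wavelength grid; one such polynomial would be stored per combination of Teff, log ɡ, and [Fe/H] in the normalisation grid.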
The normalisation in the SAPP for M dwarfs is done in parallel with parameter determination by first running the spectroscopic module on the observed spectra, where the observed spectra are treated the same way as for FGK stars, namely using the original SAPP normalisation routine. We use the best-fit Teff and [Fe/H] values together with the log ɡ obtained from the photometric module to find the corresponding polynomial derived from the grid of synthetic spectra. This polynomial is then multiplied with the original normalised observed spectrum, which is then analysed again. This process is iterated until convergence, which we define in the following way. We take the differences of parameters between each successive iteration. The differences decrease until they reach zero or the derived parameters oscillate between two fixed sets of values. This oscillation occurs for a minority of stars and was found to be stable for at least 100 iterations. By inspecting the stars in our sample, we found that n = 10 was sufficient as an iteration limit, that is, the difference between steps reached zero or a constant value before n iterations. Uncertainties are based on the values at the iteration step for which convergence is achieved (see Sect. 3.4), and the size of the oscillation, if any, contributes to the final spectroscopic uncertainty.
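The convergence criterion can be illustrated with a short loop. This is a sketch under stated assumptions: `refit` is a hypothetical callable standing in for one pass of re-normalisation plus spectrum fitting, and the tolerance `tol` is introduced only to compare floating-point values.

```python
def close(a, b, tol=1e-6):
    """True when two parameter tuples agree element-wise within tol."""
    return all(abs(x - y) <= tol for x, y in zip(a, b))


def iterate_until_stable(refit, start, max_iter=10):
    """Repeat refit until the parameters stop changing or start oscillating.

    refit: hypothetical callable mapping a parameter tuple, e.g. (Teff, [Fe/H]),
           to the tuple obtained after re-normalising and fitting again.
    """
    history = [start]
    for _ in range(max_iter):
        history.append(refit(history[-1]))
        if close(history[-1], history[-2]):
            # Difference between successive iterations reached zero.
            return history, "converged"
        if len(history) >= 3 and close(history[-1], history[-3]):
            # Parameters alternate between two fixed sets of values.
            return history, "oscillating"
    return history, "limit reached"
```

The returned history is what a spread-based uncertainty over the iterations would be computed from.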
|  | Fig. 2 Normalised observed spectrum of the star GJ 880 as black dashed line, and best-fit model (synthetic spectrum predicted by the Payne’s ANN for the parameters given in Table 2) as orange solid line. Grey shaded areas indicate the location of the line mask we used. Derived parameters for this star are Teff: 3649 K, logɡ: 4.8 dex, and [Fe/H]: 0.25 dex. | 
3.2.3 Line mask
First tests using the complete spectral range of the APOGEE data resulted in derived effective temperatures which were higher by more than 200 K compared to interferometric values for some stars. In addition, a degeneracy between the effective temperature and metallicity was apparent. Inspection of the fits indicated that this was due to some spectral regions which cannot be modelled well, which is compensated for by an inadequate change in stellar parameters. To remedy this, we restricted the application of the fitting procedure to selected spectral ranges within a line mask. The line mask was taken from Sarmento et al. (2021), who compared an observed spectrum of the M4V star Ross 128 (GJ 447) with a synthetic spectrum generated for parameters corresponding to this star. The construction of the line mask is described in their Sect. 3.3. For an illustration see Fig. 2.
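Restricting a fit to a line mask amounts to evaluating the goodness of fit only on pixels inside the selected wavelength windows. The sketch below shows the idea; the window limits are placeholders, and the chi-square statistic is an assumption for illustration rather than the quantity the SAPP necessarily minimises.

```python
import numpy as np


def in_line_mask(wave, windows):
    """Boolean array marking pixels that fall inside any (lo, hi) window."""
    mask = np.zeros(wave.shape, dtype=bool)
    for lo, hi in windows:
        mask |= (wave >= lo) & (wave <= hi)
    return mask


def masked_chi2(obs, model, err, mask):
    """Chi-square computed only over the pixels selected by the mask."""
    resid = (obs[mask] - model[mask]) / err[mask]
    return float(np.sum(resid ** 2))
```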
3.3 SAPP version for M dwarfs
The final parameters for M dwarfs are derived via the spectroscopic module of the SAPP driven by the photometric surface gravity. Specifically, the most probable log ɡ value derived from the photometric PDF (see Sect. 3.1) is passed on to the spectroscopic module and is fixed during the spectrum fitting process. This approach helps to mitigate the strong degeneracies found between log ɡ and other parameters when applying the spectroscopic module alone. Figure 3 shows the covariances for a free fit of the nine parameters in the spectroscopic module for GJ 880, a representative M dwarf in our test sample. The correlations with surface gravity are among the most significant in the figure. Thus, alternative constraints on the parameters are needed. We note that since the ANN model includes a variation in individual element abundances, these are by default given in the output of the SAPP. However, we leave the validation of the derived abundances for M dwarfs to future work.
Figure 4 shows the PDFs for GJ 880 from the two modules of the SAPP. Each PDF shows the likelihood landscape in Teff-log g space at the best-fitting [Fe/H], with the colour scale representing the logarithm of the probability. The correlation between Teff and log g in the spectroscopy module is apparent, while the valid values resulting from the photometric module are restricted to a smaller fraction of the parameter space. The main visual differences in the probability space between the two can be accounted for by how they were calculated. The spectroscopy PDF is built from its best-fit Teff, log g, [Fe/H], and the correlation matrix derived from curvefit (see Fig. 3). These parameters are then compared to the common atmospheric parameters shared by photometry (Teff, log g, and [Fe/H]) to build a PDF space. However, the photometry PDF is the Teff-log g plane of a multidimensional set of isochrones. As in Gent et al. (2022), the photometry PDF is similar in structure to an evolution track. The value of the surface gravity is calculated as the average over all values from the photometric module within the subdomain defined in Sect. 3.1, weighted by the corresponding probabilities.
|  | Fig. 3 Correlation matrix for SAPP’s spectroscopic module without photometric constraint for star GJ 880. The colour scale represents statistical correlation from −1 to 1 for nine ANN parameters. | 
|  | Fig. 4 PDFs calculated for GJ 880 for two different SAPP modules: spectroscopy (left) and photometry (right). The horizontal axis is effective temperature, the vertical axis is surface gravity, and the colour scale is the logarithm of probability. Each PDF is sliced in the [Fe/H] dimension at their maximum probability. White space corresponds to NaN values. | 
3.4 Calculated internal uncertainties
The SAPP includes a calculation of the internal uncertainties arising from the application of the algorithm. These are shown together with our results in Table 2. For an estimation of the overall uncertainties based on comparison with external data from the literature see Sect. 4.5. The internal uncertainty in log g is derived from the photometric PDF which propagates the apparent magnitude, distance and reddening uncertainties. The log g produced from the photometric module directly comes from the weighted average of the PDF. From this posterior distribution we derive a weighted standard deviation which is the reported uncertainty for this parameter.
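The weighted average and weighted standard deviation of log g can be written compactly. This is a minimal sketch assuming the photometric PDF is available as arrays of log g values and their probabilities; the function name is illustrative.

```python
import numpy as np


def weighted_mean_std(values, weights):
    """Probability-weighted mean and standard deviation of a sampled PDF."""
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    mean = np.average(values, weights=weights)
    variance = np.average((values - mean) ** 2, weights=weights)
    return float(mean), float(np.sqrt(variance))
```

For example, log g values of 4.7, 4.8, and 4.9 dex with relative probabilities 1, 2, and 1 give a weighted mean of 4.8 dex and a weighted standard deviation of about 0.07 dex.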
The internal uncertainties in Teff and [Fe/H] are derived by propagating uncertainties from different parts of the SAPP quadratically as follows:
$$\sigma = \sqrt{\sigma_{\rm spec}^{2} + \sigma_{\rm pseudo}^{2} + \sigma_{\rm phot}^{2}} \qquad (1)$$
where σspec is the spectroscopic uncertainty for each parameter derived via the leading diagonal of the co-variance matrix7, corresponding to the square root of the variance. σpseudo is the uncertainty arising from the iterative normalisation procedure allowing for the pseudo-continuum, as described in Sect. 3.2.2. More specifically, it is the standard deviation of the set of parameter values derived in each iteration until convergence. σphot is the uncertainty derived from propagating the uncertainty of the photometric log g through the spectroscopic method. As the log g used in the spectroscopic module is fixed to the maximum likelihood log g from the photometric module, its uncertainty directly contributes to the uncertainties in Teff and [Fe/H]. Furthermore, we do not allow the fitting procedure to go beyond the bounds defined by the photometric log g and its uncertainty.
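Assuming the three contributions defined above are independent and combined as a plain quadrature sum, the total internal uncertainty for one parameter can be computed as in the sketch below. The function name and the numerical values in the usage note are illustrative only and are not taken from Table 2.

```python
import math


def total_internal_uncertainty(sigma_spec, sigma_pseudo, sigma_phot):
    """Quadrature sum of the three internal uncertainty terms."""
    return math.sqrt(sigma_spec ** 2 + sigma_pseudo ** 2 + sigma_phot ** 2)
```

With made-up contributions of 20 K, 20 K, and 10 K, the combined value is 30 K; the largest term dominates, so reducing a minor term changes the total only slightly.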
We note that the procedure for estimating the uncertainties in the SAPP for M dwarfs is different from that used in the SAPP for FGK stars, as the full Bayesian inference scheme is not yet implemented. Furthermore, contrary to what is described in Sect. 3.5.3 in Gent et al. (2022), we do not apply an ‘error model’, owing to the limited number of reference stars.
Table 2 Stellar parameters and their uncertainties derived in this work.
4 Results and discussion
In this section, we describe the results from applying the SAPP for M dwarfs to our test sample in order to assess the performance of our pipeline. We list our derived stellar parameters in Table 2. The star LSPM J1204+1728S is not included in this table as the derived parameters8 were judged to be unreliable due to its fast rotation (see below). We note that the star GJ 105A is an early K dwarf, and our derived stellar atmospheric parameters might therefore be unreliable, as the pipeline is optimised for M dwarfs. The star GJ 338A is classified as a late K dwarf, which is closer to the parameter range targeted by the pipeline. We note that, as mentioned in Sect. 3.2.1 and shown in Fig. 3, the ANN model includes a variation in the microturbulence parameter Vturb, and best-fit values are given in the output of the SAPP. However, in the context of this work we regard the microturbulence as a free nuisance parameter, as its physical meaning is limited and an evaluation with independent reference values is not possible. Summarising the fitting results for the sample as a whole, the Vturb values show a rather flat distribution ranging from 0.03 to 1.55 km s−1, with a median of 0.78 km s−1.
In Fig. 2, we show an example of a best fit model for the star GJ 880 (Teff, log g, [Fe/H] = 3650 K, 4.8 dex, 0.25 dex) in comparison with the normalised observed spectrum. Examples of best fit models for two additional stars can be found in Appendix C. The line mask that is used in the SAPP for M dwarfs is indicated in grey. The fit is generally good in the regions covered by the line mask. Lines found at the edges of the detectors have a slightly worse fit (the edges of the detectors are outside of the ranges shown in Fig. 2). This worse fit is most likely caused by the normalisation routine which behaves worse at the edges of the detectors. We note that APOGEE spectra suffer from persistence effects9, in particular at the shortest wavelengths (Holtzman et al. 2018). We can also see that the two potassium lines at 15 163 Å and 15 168 Å show a slightly worse fit. These lines were shown to be affected by non-LTE effects in Olander et al. (2021) which could explain part of the mismatch.
In Fig. 5, we show the Teff values derived from spectroscopy and the log g values derived from photometry together with the parameters covered by a subset of our grid of stellar evolution models (Sect. 3.1). The models shown correspond to an age of 13 Gyr (the maximum age that is physical for a star) and are colour-coded by metallicity. We recall that when constructing the PDF for surface gravity the photometric module uses the whole grid of models for all available ages. We also note that the stars lie in the region of models colour-coded in red, in agreement with our derived metallicities of −0.7 dex and higher. The star lying furthest outside of the parameter space covered by the evolutionary model grid is the fast rotator LSPM J1204+1728S, discussed in Sect. 4.2. The second outlier in the same sense is BD–06 4756B, indicating that the uncertainty of the effective temperature and/or surface gravity might be underestimated for this star.
In the sections below, we compare our results with literature values from some of the works mentioned in Sect. 1 based on several different techniques. The aim is to cross-check our method, and to understand its accuracy, precision, and scope of applicability.
|  | Fig. 5 Surface gravity versus effective temperature derived by the SAPP (black diamonds with error bars). The K dwarf GJ 105A is not visible since its parameters are outside of the axis ranges. Small dots represent a subset of the grid of stellar evolution models used by the photometric module as described in Sect. 3.1, selected for this illustration to have an age of 13 Gyr, colour-coded by metallicity, with masses increasing from 0.1 M⊙ at the lower right towards the upper left with steps of 0.005 M⊙. We note that when constructing the PDF for surface gravity the photometric module uses the whole grid of models for all available ages. | 
4.1 Comparison based on interferometry
We used Boyajian et al. (2012) and Rabus et al. (2019) to obtain reference parameters based on interferometric measurements. Boyajian et al. (2012) used the CHARA array to obtain limb-darkened angular diameters θLD . Coupled with Hipparcos parallaxes (van Leeuwen 2007) and photometry fitted to spectral templates in order to obtain the bolometric fluxes, they calculated stellar radii and effective temperatures. They obtained stellar masses using an absolute K-band mass-luminosity relation from Henry & McCarthy (1993). Rabus et al. (2019) used the VLTI/PIONIER interferometer to obtain θLD. They used Gaia DR2 parallaxes in their analysis, and bolometric fluxes were obtained by integration over stellar model spectra fitted to photometric observations. For the masses they used an empirical mass-luminosity relation from Mann et al. (2019).
In our sample, 12 stars have angular diameter measurements by Boyajian et al. (2012) and one by Rabus et al. (2019, GJ 447). In addition, Rabus et al. (2019) calculated effective temperatures and radii for five stars using uniform-disk angular diameters from Boyajian et al. (2012) and using their own determinations as described above otherwise (indicated in Table 1 as ‘B, R’ in column ‘Int.’)10. We calculated the surface gravities using the masses M and radii R given in Boyajian et al. (2012)11 and Rabus et al. (2019), and the equation g = GM/R².
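The conversion from mass and radius to log g can be sketched as below. The constants are standard cgs values; the first-order uncertainty propagation assumes uncorrelated errors in mass and radius, which is an assumption of this example and not necessarily the treatment used here. Function names are illustrative.

```python
import math

G_CGS = 6.674e-8     # gravitational constant [cm^3 g^-1 s^-2]
M_SUN_G = 1.989e33   # solar mass [g]
R_SUN_CM = 6.957e10  # nominal solar radius [cm]


def log_g(mass_msun, radius_rsun):
    """log10 of the surface gravity g = GM/R^2 in cgs units."""
    g = G_CGS * mass_msun * M_SUN_G / (radius_rsun * R_SUN_CM) ** 2
    return math.log10(g)


def log_g_uncertainty(mass_msun, sigma_mass, radius_rsun, sigma_radius):
    """First-order error on log g, assuming uncorrelated mass and radius."""
    rel = math.hypot(sigma_mass / mass_msun, 2.0 * sigma_radius / radius_rsun)
    return rel / math.log(10.0)
```

For solar values this gives log g ≈ 4.44 dex. A hypothetical star with M = 0.5 M⊙ and R = 0.5 R⊙ has twice the solar surface gravity, that is, log g ≈ 4.74 dex.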
In Fig. 6, we compare the Teff and log g values derived with the SAPP with the parameters based on interferometric angular diameters. As mentioned above, Rabus et al. (2019) re-calculated Teff using their method and angular diameters from Boyajian et al. (2012) for five of the stars. Therefore the two sets of results are not completely independent. The effective temperature shows a clear linear trend with both Boyajian et al. (2012) and Rabus et al. (2019). Our derived Teff is generally higher in comparison with Boyajian et al. (2012). When comparing with Boyajian et al. (2012) we find a strong outlier – the star GJ 725B, for which Boyajian et al. (2012) derived 3104 K and we obtained 3556 K. Other studies have also derived a higher Teff for this star than what was obtained by Boyajian et al. (2012). Sarmento et al. (2021) derived a temperature of 3544 K, Maldonado et al. (2020) 3291 K, Mann et al. (2015) 3345 K, and Souto et al. (2022) 3400 K. In addition, in most of the studies mentioned above the derived Teff values for the two binary components GJ 725A and GJ 725B are the same to within 100 K, including our results (difference of 28 K). However, Boyajian et al. (2012) derived a difference of about 300 K in Teff between the two stars. It is clear that GJ 725B needs further investigation which is beyond the scope of this work.
We performed a linear fit to the data from Boyajian et al. (2012), excluding GJ 725B for the reasons mentioned above. The linear fit (with slope 0.963 and intercept 238 K) shows an offset of approximately 100 K above the 1:1 line (see Fig. 6). The mean absolute difference (MAD)12 in effective temperature is 116 K. As can be seen in the figure, Rabus et al. (2019) derived slightly higher Teff values for the five stars for which they re-analysed the data from Boyajian et al. (2012), although they agree within uncertainties. When comparing with Rabus et al. (2019) we find a MAD of 74 K. For the majority of the stars in the sample we derived a higher Teff than both interferometric studies. On the other hand, we do not see a systematic difference with other spectroscopic studies, as shown in Sect. 4.2 and Fig. 7. This implies that the offset seems to be a general trend when comparing spectroscopy and interferometric measurements. Therefore, it could be due to the modelling components used in interferometry, such as accounting for limb-darkening or methods for measuring the bolometric flux. An indication of this can be seen in Fig. 6 as the recalculated effective temperatures by Rabus et al. (2019) are higher than those of Boyajian et al. (2012). In this case, the change in modelling has decreased the offset between spectroscopic and interferometric Teff values. A similar discussion can be found in Souto et al. (2020, their Sect. 4.2 and Fig. 4).
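The two summary statistics used in this comparison are simple to reproduce. The sketch below assumes that the linear fit expresses the SAPP Teff as a function of the interferometric Teff, and takes the MAD as the mean of the absolute differences; the function names are illustrative.

```python
import numpy as np


def mean_absolute_difference(a, b):
    """Mean of the absolute differences between two sets of values."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.mean(np.abs(a - b)))


def offset_from_one_to_one(teff_ref, slope=0.963, intercept=238.0):
    """Vertical distance of the quoted linear fit from the 1:1 line [K]."""
    return slope * teff_ref + intercept - teff_ref
```

With the quoted slope and intercept, the fit lies about 105 K above the 1:1 line at a reference Teff of 3600 K (0.963 × 3600 + 238 − 3600 = 104.8), consistent with the offset of approximately 100 K.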
In the bottom panel of Fig. 6, we show the comparison between our derived surface gravity and the one calculated using the mass and radius from Boyajian et al. (2012) and Rabus et al. (2019). We stress that the surface gravity based on the interferometric radius is not as fundamental as the effective temperature because empirical relations are used to calculate the mass of the stars. The stars in our sample generally follow the 1:1 ratio, with a possible small positive offset. The two stars in the binary mentioned above, GJ 725A and B, show the largest deviations of 0.1 (A) to 0.2 (B) dex. The calculated MAD between log g derived with the SAPP and by Boyajian et al. (2012) is 0.032 dex (excluding the outlier GJ 725B). The MAD for the surface gravity when comparing with Rabus et al. (2019) is 0.042 dex. More interferometric measurements and direct mass determinations are needed in order to draw conclusions regarding the accuracy of the surface gravity derived by the SAPP.
We note that in a companion paper to Rabus et al. (2019), Lachaume et al. (2019) used a method to estimate the uncertainties of the measured angular diameters that takes into account correlations between observables and includes systematic errors. This results in larger uncertainties than obtained by the standard method used for instance by Boyajian et al. (2012), in particular for stars with small angular diameters (≲0.6 mas, see Fig. 3 of Lachaume et al. 2019). However, for the stars in the samples of Boyajian et al. (2012) and Lachaume et al. (2019) which show the largest overlap in angular size (∼0.7–0.8 mas) the uncertainties derived in the two works are comparable.
|  | Fig. 6 Comparing Teff (top) and log g (bottom) derived from the SAPP with corresponding parameters based on interferometric angular diameters (Boyajian et al. 2012; Rabus et al. 2019). The black dashed line in both panels corresponds to the 1:1 ratio and the grey dotted line in the top panel corresponds to a linear fit to the values from Boyajian et al. (2012). We excluded the outlier GJ 725B for which the SAPP derived a Teff of roughly 3550 K (leftmost blue square in top panel) from the linear fit, for reasons discussed in the text. We note that the K3 dwarf GJ 105A with Teff ∼4600 K is not shown in the figures. | 
|  | Fig. 7 SAPP results compared with spectroscopic results from Sarmento et al. (2021); Passegger et al. (2019); Mann et al. (2015); Maldonado et al. (2020); Souto et al. (2022); Cristofari et al. (2022a). Values derived using the SAPP are shown on the vertical axis, and the literature values are shown on the horizontal axis. Left: effective temperature. Middle: surface gravity. The uncertainties for Sarmento et al. (2021) and Souto et al. (2022) are represented at the bottom of the figure in grey. One star is located outside of the borders of the figure: LSPM J1204+1728S for which the SAPP value is 4.86 dex and Sarmento et al. (2021) obtained 5.31 dex. Right: metallicity. The black dashed line in all panels corresponds to the 1:1 ratio. | 
4.2 Comparison with classical spectroscopy
4.2.1 Reference values
We compare our results with several spectroscopic studies of M dwarfs, both in the optical and the NIR. Starting with classical spectrum-fitting methods, we include the results of Mann et al. (2015) based on low-resolution spectra calibrated to an absolute flux scale in the optical and in the NIR, complemented by photometry and trigonometric parallaxes. These authors derived bolometric fluxes, Teff, metallicity, stellar radii, and stellar masses (using the empirical mass-luminosity relation from Delfosse et al. 2000) for about 180 nearby K7 to M7 stars, including the majority of the stars in our sample. We calculated the surface gravity and corresponding uncertainty from the mass and radius given by Mann et al. (2015). Another reference study in the optical is Maldonado et al. (2015, 2020), who applied an equivalent width analysis to HARPS and HARPS-N spectra (Mayor et al. 2003; Cosentino et al. 2012) to obtain stellar parameters for about 200 M dwarfs. They used 13 and 47 stars of their sample for Teff and metallicity calibration, respectively.
Turning to high-resolution spectroscopy, Passegger et al. (2018, 2019) used spectra from the CARMENES spectrograph (Quirrenbach et al. 2014) in both the optical and the NIR together with PHOENIX model atmospheres (Husser et al. 2013; Meyer 2017) to determine parameters for about 300 M dwarfs. Passegger et al. (2019) give three sets of parameters based on spectra obtained in the optical, NIR, and the two combined. We compare with the parameters derived using the combined spectra, judged by the authors to give the best results. Cristofari et al. (2022a) analysed spectra of 44 M dwarfs obtained with the SPIRou spectrograph (Donati et al. 2020), using MARCS model atmospheres and Turbospectrum (Alvarez & Plez 1998). They calibrated their method using 12 stars of their sample. Among the remaining stars there are five in common with our work, which we include in the comparison. We also compare with two studies using the same instrument as our work. Sarmento et al. (2021) used APOGEE spectra together with MARCS, Turbospectrum, and a custom line list built partly from the APOGEE line list (Shetrone et al. 2015) to derive parameters for about 300 M dwarfs. Souto et al. (2022, and references therein) also used APOGEE spectra and the behaviour of oxygen abundances as a function of Teff and log g to obtain those parameters for a sample of 21 stars.
A summary of the studies mentioned above can be seen in Table 3. The table lists the references for the published parameters, the instruments used to obtain the spectra, the wavelength ranges, the model atmosphere or technique adopted, and the parameters obtained. For more details regarding the analyses the reader is directed to the individual publications listed in the table.
Table 3 Spectroscopic reference studies. Top part: classical spectroscopy, bottom part: machine-learning approaches.
4.2.2 Effective temperature
In the left panel of Fig. 7, we show the comparison for the effective temperature. For most stars the results agree within uncertainties. The Teff values derived in the SAPP are slightly lower when comparing with Sarmento et al. (2021), with a MAD of 83 K. On the other hand, the SAPP derived on average higher Teff values compared with Maldonado et al. (2020) and Mann et al. (2015), with corresponding MADs of 120 K and 88 K, respectively. We largely agree with Souto et al. (2022), Passegger et al. (2019), and Cristofari et al. (2022a), for which the corresponding MADs are 68 K, 62 K, and 76 K, respectively. Thus, the MADs for the studies included in Fig. 7 are within 100 K in all cases, except for Maldonado et al. (2020). As can be seen in Fig. 7 the uncertainties associated with the SAPP values are higher at lower effective temperatures. This could be due to the normalisation procedure having difficulties differentiating between noise and molecular lines for cooler stars. It could also just be a result of better Teff diagnostics in the warmer M dwarfs (stronger atomic lines and fewer molecular lines).
Two stars are apparent as outliers in Teff in Fig. 7 (LSPM J1204+1728S and GJ 777B). For LSPM J1204+1728S we derived a Teff of 3185 K, while the values derived by Souto et al. (2022) and Sarmento et al. (2021) are significantly higher (3369 K and 3507 K, respectively). Our uncertainty for this star is high, and when visually inspecting the best-fit model we found that the agreement with the observations is poor. The lines in the observed spectra are significantly broader than in the best-fit model, as can be seen in Fig. C.3 in Appendix C. According to Gilhool et al. (2018) this star is a fast rotator, with a υ sin i of about 17 km s−1. The current version of the SAPP is not capable of fitting υ sin i, and our result for this fast rotator is therefore not trustworthy. Future versions of the SAPP for M dwarfs should also fit for rotational broadening of the spectral lines. This fast rotator only exists in the sample overlapping with Sarmento et al. (2021) and Souto et al. (2022). Recalculating the MAD without the star LSPM J1204+1728S results in 72 K for Sarmento et al. (2021) and 62 K for Souto et al. (2022).
For GJ 777B the Teff derived by Sarmento et al. (2021, 3027 K) is lower than ours (3251 K). On the other hand, the values derived by Souto et al. (2022, 3295 K) and Mann et al. (2015, 3144 K) are in good agreement with ours. For this star the fit of our best model looks good. The SAPP uncertainty for this star is the highest in our sample (152 K), disregarding the fast rotator LSPM J1204+1728S. In addition, GJ 777B has one of the lowest effective temperatures in our sample and the observed spectrum has a fairly low S/N (see Table 1). In our tests of the spectroscopic module without a line mask (i.e. using the complete spectral range) we found that for stars with low S/N and low Teff we generally derived temperatures much higher than reference values. It is therefore a possibility that the line mask is not fully appropriate for these types of spectra. However, our derived Teff is very similar to that from Souto et al. (2022). This star also requires further study.
4.2.3 Surface gravity
The middle panel of Fig. 7 shows a comparison for the surface gravity from the same studies as for the effective temperature. The uncertainties for Sarmento et al. (2021) and Souto et al. (2022) are represented at the bottom of the figure as grey markers. The largest deviations are found when comparing with Sarmento et al. (2021) who used a method relying largely on spectroscopy and did not constrain log g with photometry, models and parallaxes as is done in the SAPP. The MAD is 0.11 dex. Souto et al. (2022) used the oxygen abundance as a log g indicator and their values also show a larger spread compared with the other studies in the figure. The MAD to our results is 0.07 dex. Sarmento et al. (2021) and Souto et al. (2022) also quote the largest uncertainties, 0.2 dex in both cases. Our derived log g values agree fairly well with those of Mann et al. (2015), Maldonado et al. (2020), Passegger et al. (2019), and Cristofari et al. (2022a), with an apparent small systematic shift towards higher values in this work. The corresponding MADs are 0.04, 0.05, 0.09, and 0.11 dex, respectively. The differences between what was obtained from the SAPP and by Passegger et al. (2019) or Cristofari et al. (2022a) increase at higher surface gravities. The spread when comparing with Maldonado et al. (2020) also increases at higher surface gravities, as do the uncertainties from Maldonado et al. (2020).
An outlier which is outside of the borders of the figure is the fast rotator mentioned above, LSPM J1204+1728S. For this star, Sarmento et al. (2021) derived 5.31 dex using spectroscopy and we obtained 4.86 dex using photometry and evolutionary models. Souto et al. (2022) obtained 4.82 dex for the same star. Excluding the fast rotator from the sample gives a MAD of 0.10 dex in comparison to Sarmento et al. (2021).
4.2.4 Metallicity
The right panel of Fig. 7 shows the metallicity derived using the SAPP compared with literature values. Since M dwarf metallicities are notoriously difficult to measure reliably (e.g. Sarmento et al. 2021, Passegger et al. 2022), it is not surprising that this comparison shows significant deviations between independently derived values. The metallicity from the spectroscopic module of the modified SAPP is more confined to a region around solar metallicities compared with the other literature studies. Most of the metallicities from the SAPP lie between roughly −0.5 dex and +0.25 dex while the literature sample as a whole ranges from −0.75 to +0.4 dex (with a few additional values outside of these limits)13. We note, however, that the values from Sarmento et al. (2021) cluster towards the lower limit and the Passegger et al. (2019) values are found near the upper limit of the literature range. The SAPP-derived metallicities are in general higher compared to Sarmento et al. (2021) and lower compared to Passegger et al. (2019). The corresponding MADs are 0.23 dex and 0.26 dex. The differences between our results and those of Sarmento et al. (2021) increase towards lower metallicities. Our agreement is better with the studies by Mann et al. (2015), Maldonado et al. (2020), Souto et al. (2022), and Cristofari et al. (2022a), for which the MADs are 0.10, 0.13, 0.13, and 0.09 dex, respectively. The observed spread among the different studies is in line with the discussion in Passegger et al. (2022), who tested various different methods on the same CARMENES spectra. For some stars the results agreed well and for others differences of more than 0.5 dex were found (see their Figs. 2 and 3).
In our case, outliers in metallicity are mainly found in the comparison with Sarmento et al. (2021). For the star BD+00 549B we derived −0.66 dex, whereas Sarmento et al. (2021) derived −1.05 dex and Souto et al. (2022) derived −0.92 dex. Gilhool et al. (2018) obtained a metallicity of −1.0 dex, deriving the rotational velocity together with Teff, log g, and [Fe/H] using a grid of template spectra compared to observed APOGEE spectra. We note that all studies we compare with for this particular star used some form of spectroscopic method to derive log g while we used photometry and evolutionary models. It is possible that the difference is caused by the difference in method and a degeneracy between log g and [Fe/H], but this should then apply to all stars in our sample. We note that this star is in a binary with a G star, see Sect. 4.4. Another outlier is the previously mentioned fast rotator LSPM J1204+1728S, for which we obtained −0.21 dex with the SAPP, while Sarmento et al. (2021) and Souto et al. (2022) obtained −0.76 dex and −0.45 dex, respectively. Excluding this fast rotator in the calculation of the MAD we obtain slightly lower values for both studies (0.22 dex and 0.123 dex, respectively).
Another outlier in comparison with Sarmento et al. (2021) is the star GJ 777B, for which we derived a metallicity of −0.08 dex with the SAPP and Sarmento et al. (2021) obtained a much higher value of +0.40 dex. Mann et al. (2015) derived +0.06 dex and Souto et al. (2022) obtained +0.21 dex. This star was also mentioned as an outlier in effective temperature. We note that GJ 777B is one of the coolest stars in the sample (spectral type M4.5, similar to GJ 324B and GJ 447), at the limits of validity of the usually employed atmospheric models and relations. This may contribute to the large spread in literature [Fe/H] values. This star is also part of a binary and is discussed in Sect. 4.4. We also note that for the stars in our sample overlapping with Maldonado et al. (2020) the authors did not derive a higher metallicity than +0.05 dex. However, they did obtain higher metallicities for other stars in their complete sample.
4.3 Comparison with machine-learning techniques
A growing number of surveys are using machine learning methods to obtain stellar atmospheric parameters for large samples of stars. This includes the SAPP, in which we are using The Payne algorithm with an ANN trained on model spectra. In this section, we compare with the results of Birky et al. (2020), who used another algorithm, The Cannon, trained on observed spectra, and with the results presented in Passegger et al. (2022) based on a deep convolutional neural network trained on synthetic spectra (hereafter referred to as Deep Learning, DL).
The Cannon (Ness et al. 2015; Casey et al. 2016) uses second-degree polynomial generative models trained on observed spectra and is therefore independent of stellar atmospheric models. Instead, it needs well-known benchmark stars. For M dwarfs this can be a problem because of previously mentioned constraints on observing M dwarfs. Birky et al. (2020) used APOGEE spectra of a sample of well-known M dwarfs in their training of The Cannon and training labels from West et al. (2011) and Mann et al. (2015). They subsequently applied their algorithm to other M dwarfs from the APOGEE survey. The derived parameters were Teff and [Fe/H] as well as spectral type. We use the ‘Test’ values presented in their Table 2 for the comparison.
For the DL study, Passegger et al. (2020) constructed a deep convolutional neural network architecture to produce neural network models trained on synthetic PHOENIX spectra. These were used to estimate Teff, log g, metallicity, and projected equatorial rotation velocity v sin i for a sample of M dwarfs from CARMENES spectra. Passegger et al. (2022) applied the same method to 18 well-studied M dwarfs and compared the results to those obtained with several other methods, such as those mentioned in Sect. 4.2. For the comparison we use the DL results from their Runs ‘A’ and ‘C2’, which differ in the spectral range used (a wavelength interval starting at 8800 Å for Run A, and 35 wavelength windows distributed over the optical and J-band regions for Run C2, see Table 2 in Passegger et al. 2022).
A summary of these studies, including the instrument and the wavelength ranges used, is given in Table 3. Figure 8 shows the literature results in comparison with ours for effective temperature and metallicity. The effective temperature derived using the modified version of the SAPP is on average higher than that obtained by Birky et al. (2020), with a MAD of 97 K. There are no clear outliers, but we discuss here the three stars that are at the largest distance from the 1:1 ratio. Two of these are GJ 725A and B, which have been mentioned as outliers above. The Teff values obtained with the SAPP for the (A, B) pair are (3584 K, 3556 K) and those obtained by Birky et al. (2020) are (3384 K, 3387 K). The third star is GJ 105B, for which we obtained 3483 K from the SAPP and Birky et al. (2020) obtained 3241 K. Referring back to the works used for comparison in Sect. 4.2, we find that they also report consistently lower Teff than that from the SAPP for this star, by about 200 K for Passegger et al. (2019), Mann et al. (2015), and Maldonado et al. (2020), and by about 150 K for Sarmento et al. (2021) and Souto et al. (2022). Regarding the Teff values of the Passegger et al. (2022) DL results, they agree well with the SAPP values, with a MAD of 117 K for Run A and 69 K for Run C2. At intermediate temperatures the SAPP values tend to be somewhat lower than the comparison values.
The SAPP derives on average somewhat higher metallicities in the lower metallicity range and somewhat lower metallicities in the higher metallicity range than Birky et al. (2020), with a MAD of 0.09 dex for the whole range. The SAPP metallicities agree well with those from Passegger et al. (2022) DL Run C2 at lower metallicity and with those from DL Run A at higher metallicities, with a MAD of 0.22 dex for Run A and 0.11 dex for Run C2 over the whole range. The outliers at the high-metallicity end are GJ 205 (0.19, 0.50, 0.06 dex), GJ 324B (0.24, 0.46 dex, none), and GJ 880 (0.25, 0.28, −0.07 dex), with values derived by the SAPP, Birky et al. (2020), and Passegger et al. (2022) DL Run C2, respectively, given in parentheses. For GJ 205, the works used for comparison in Fig. 7 report metallicities between 0.00 and 0.57 dex, indicating a need for further investigation of this star. For GJ 324B, Passegger et al. (2019) obtained 0.13 dex and Mann et al. (2015) obtained 0.31 dex. This star is in a binary with an early K star (GJ 324A), for which Montes et al. (2018) derived 0.29 dex. It appears that our derived metallicity is closer to the metallicity of the primary star in the binary than the metallicity of Birky et al. (2020) or Passegger et al. (2022) DL Run C2. We note that Passegger et al. (2022) concluded that the results from their DL Run A were most consistent with those of other methods used in the same work and in the literature. Here, we find a slight preference for DL Run C2 for the limited sample in common, for both Teff and metallicity.
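As an illustration of the statistic used in these comparisons, the sketch below computes a mean absolute difference (taking MAD to mean the mean absolute difference, as in Sect. 4.5) and the mean signed offset. The input values are only the three Teff pairs quoted in the text for GJ 725A, GJ 725B, and GJ 105B, so the result describes the three most discrepant stars and is not the sample-wide MAD of 97 K; the function name is illustrative.

```python
import numpy as np


def mean_absolute_difference(ours, literature):
    """Mean of |ours - literature| over the stars in common."""
    ours = np.asarray(ours, dtype=float)
    literature = np.asarray(literature, dtype=float)
    return float(np.mean(np.abs(ours - literature)))


# Teff in K for GJ 725A, GJ 725B, GJ 105B as quoted in the text.
teff_sapp = [3584.0, 3556.0, 3483.0]
teff_birky = [3384.0, 3387.0, 3241.0]

mad_three = mean_absolute_difference(teff_sapp, teff_birky)
mean_offset = float(np.mean(np.subtract(teff_sapp, teff_birky)))
print(f"MAD = {mad_three:.1f} K, mean offset = {mean_offset:+.1f} K")
```

For these three stars the MAD and the signed offset coincide (about 204 K), because every SAPP value is higher than the comparison value; in general the MAD is larger than or equal to the absolute value of the mean offset.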
|  | Fig. 8 Comparing Teff (top) and [Fe/H] (bottom) derived with the SAPP with the results based on machine-learning techniques from Birky et al. (2020) and Passegger et al. (2022). The black dashed line corresponds to the 1:1 ratio. | 
4.4 Comparison of binary components
Our sample includes M dwarfs in 13 binary systems with other M dwarfs as well as with FGK-type stars. Analysing both stars in a binary allows one to verify the internal consistency, as any metallicity difference between the two components gives an indication of systematics inherent in the analysis method. For the cases where the two components are very different in terms of spectral type, this holds assuming that differential diffusion effects can be neglected. The stellar parameters of the FGK-type primaries are given in Table 4 and were taken from the literature, except for two late K dwarfs which were included in the analysis in this work. We use metallicities from Montes et al. (2018), who analysed optical spectra, and from Mann et al. (2013) based on moderate resolution visible and infrared spectra. When the primary star was within the parameter range of the modified version of the SAPP and an APOGEE spectrum was available it was analysed with the SAPP. This was the case for GJ 338A and B and GJ 725A and B, in which both stars are either late K dwarfs or early M dwarfs, and for GJ 105A and B, which are an earlier K dwarf and an M dwarf.
The results are compared in Fig. 9 where the metallicity of the primary is shown on the horizontal axis and the metallicity of the secondary on the vertical axis. Different symbols distinguish the binaries for which the metallicity comparison data were taken from the literature from the binaries where both stars were analysed with the SAPP. The stars largely follow the 1:1 ratio with a small spread. The majority of the binaries have a difference in metallicity between the primary and secondary smaller than 0.15 dex.
The two systems which deviate most from the 1:1 line in the comparison with literature values are the (A, B) pairs (BD+00 549, BD+00 549B) where the primary has a metallicity of −0.88 dex and the SAPP derived −0.66 dex for the secondary, and (GJ 777A, GJ 777B) with metallicities of (0.21 dex, −0.08 dex), respectively. Both of the secondaries have previously been mentioned as outliers with regard to our derived metallicity. An additional outlier when comparing with FGK-type primaries is the pair (GJ 3194, GJ 3195), where the literature value for the primary is −0.30 dex, and we derived −0.10 dex for the secondary. For comparison, the values derived by Souto et al. (2022) and Sarmento et al. (2021) for the secondary are −0.33 dex and −0.53 dex. We note that our effective temperature of 3570 K deviates from that by Sarmento et al. (2021) of 3708 K for this star. Mann et al. (2015) derived −0.12 dex for the metallicity, which is much closer to our value. The outlier in the sample where both components were analysed with the SAPP is the binary GJ 105 A/B. For the primary, the SAPP derived 0.02 dex, and for the secondary −0.31 dex was obtained. The pipeline is optimised for analysing M dwarfs and the result for the K-type star GJ 105A can therefore be considered to be unreliable. Montes et al. (2018) quote a metallicity of −0.20 dex for the primary component, which is closer to the value obtained for the secondary with the SAPP. We can conclude that our method is applicable at least up to K7 stars in terms of effective temperature (the case of GJ 338A).
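The consistency check described above amounts to taking the metallicity difference between secondary and primary and comparing it with a tolerance. The following minimal sketch does this for the pairs whose values are quoted in the text; the 0.15 dex threshold is the one mentioned for the majority of the binaries, and the dictionary layout and function name are illustrative choices rather than part of the SAPP.

```python
# (primary [Fe/H], secondary [Fe/H]) in dex, values as quoted in the text.
pairs = {
    "BD+00 549 A/B": (-0.88, -0.66),
    "GJ 777 A/B": (0.21, -0.08),
    "GJ 3194 / GJ 3195": (-0.30, -0.10),
    "GJ 105 A/B (SAPP primary)": (0.02, -0.31),
    "GJ 105 A/B (Montes et al. 2018 primary)": (-0.20, -0.31),
}
THRESHOLD_DEX = 0.15


def metallicity_differences(binaries):
    """Return secondary minus primary [Fe/H] for each pair, rounded to 0.01 dex."""
    return {name: round(secondary - primary, 2)
            for name, (primary, secondary) in binaries.items()}


diffs = metallicity_differences(pairs)
outliers = [name for name, diff in diffs.items() if abs(diff) > THRESHOLD_DEX]

for name, diff in diffs.items():
    flag = "outlier" if name in outliers else "consistent"
    print(f"{name:42s} {diff:+.2f} dex  {flag}")
```

With the literature value for GJ 105A the pair falls within the tolerance, whereas with the SAPP value for the primary it does not, in line with the discussion of the K-type primary above.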
Table 4 Parameters of FGK-type primary stars with an M dwarf secondary.
|  | Fig. 9 Comparing the derived [Fe/H] from the SAPP for M dwarf secondary components in a binary (y-axis, Table 2) with the metallicity of the primary (x-axis, Table 4). The blue circles show binaries with an FGK-type primary and an M dwarf secondary. The orange squares show binaries with a KM-type primary and an M dwarf secondary. All stars in these three systems were analysed with the SAPP using APOGEE spectra. The black dashed line corresponds to the 1:1 ratio. | 
4.5 Estimated overall uncertainties
Summarising the results presented in the previous sections, we find a systematic offset in Teff compared to interferometric values of about 100 K. Based on this offset and the mean absolute differences calculated with respect to other studies, we estimate our overall uncertainty in Teff to be 100 K. When comparing with literature values of log g the SAPP surface gravities seem to be about 0.1 dex higher. At lower surface gravities this offset is lower. We therefore estimate the general uncertainty in surface gravity derived by the SAPP to be 0.1 dex. Regarding the metallicity, we find differences between binary components of up to 0.2 dex, but for most stars the difference is below 0.15 dex. The MADs calculated with respect to other studies are between 0.1 and 0.26 dex. The higher absolute differences occur when comparing with Sarmento et al. (2021) and Passegger et al. (2019). The median of the MADs is 0.13 dex and the mean is 0.16 dex. We therefore estimate the overall uncertainty of the SAPP-derived metallicity to be 0.15 dex. This estimate can be viewed in the light of the investigation of uncertainties related to abundance determinations in APOGEE M dwarf spectra presented in Melo et al. (2024, Appendix A). Based on simulated spectra, uncertainties were calculated as a function of S/N, Teff, and shifts in the pseudo-continuum level. For most elements, typical uncertainties were around 0.05 dex for S/N ≳ 100, reaching ∼0.15 dex for lower S/N at the cool end. Typical abundance uncertainties due to continuum shifts of 1% were 0.1 dex. Future development of the SAPP for M dwarfs will aim to improve the precision and accuracy for the derivation of the stellar parameters.
5 Future developments
As shown in the previous sections the results of the modified SAPP for M dwarfs look promising. However, we have identified possible areas of future development that should lead to an improved accuracy and precision of the derived stellar parameters. The capabilities of the pipeline should also be extended to enable the derivation of further parameters, such as rotational velocity and abundances of individual chemical elements.
5.1 Line lists and line mask
It is important to update the line lists used to calculate the grid of synthetic spectra used to train The Payne ANN with the latest atomic and molecular data. In addition, the APOGEE DR16 line list was created for the entire APOGEE survey. It is desirable to design a line list optimised for M dwarfs, which should improve the results of the SAPP. Another factor concerning the synthetic spectra is hyperfine structure splitting, which can severely affect lines in M dwarf spectra, as shown for example by Shan et al. (2021) for vanadium lines. However, hyperfine structure of V and other elements is included in the APOGEE DR16 line list (Smith et al. 2021), and should therefore not be of much concern.
The fitting procedure in the spectroscopic module is applied to selected spectral ranges for which the models are deemed to be most reliable (see Fig. 2). The line mask used in the SAPP version presented here is based on published work done in another context (Sarmento et al. 2021) and needs to be optimised for the PLATO pipeline. Furthermore, in its current preliminary setup the spectroscopic module can only determine Teff and metallicity for M dwarfs. Spectroscopic diagnostics for log g and abundances of individual elements need to be identified. In this regard future work could investigate whether using different line masks when fitting log g, Teff and metallicity could improve the results.
It may also be worthwhile to explore the extension of the analysis to other NIR regions, for example the J-band at 1.1 to 1.4 µm. An example for an abundance analysis of several early- and mid-M dwarfs in the J-band using CARMENES spectra is given by Ishikawa et al. (2020). This region combines the advantages of containing maximum flux and minimal molecular absorption by water for M dwarfs. In practice, this would require an extension of the line list, the synthetic spectra grid, and the training of the ANN, as well as an adaptation of the normalisation procedure.
5.2 Non-LTE
In collision-dominated atmospheres LTE is assumed. This is likely the case for the atmospheres of M dwarfs because of their high density. However, recent studies have shown departures from LTE in M dwarfs for a number of elements. Hauschildt et al. (1997) investigated non-LTE effects on titanium lines for M type stars, both giants and dwarfs. They found that titanium lines in M dwarfs are stronger in non-LTE compared to LTE and that this effect decreases with effective temperature. Abia et al. (2020) studied the elements Rb and Sr. They found an average non-LTE abundance correction of −0.15 dex for Rb lines for a range of stellar parameters covering late K dwarfs and early M dwarfs. They also derived abundance corrections for Sr lines varying between −0.28 and −0.13 dex. The abundance correction decreases with increasing temperature for both elements in this study. Olander et al. (2021) showed that non-LTE effects can cause abundance differences of up to 0.2 dex for potassium lines.
Therefore, it seems clear that non-LTE effects for atomic lines need to be taken into account when analysing M dwarfs. Future training grids to be used in the SAPP for M dwarfs should be generated using non-LTE departure coefficients, as already done for FGK-type stars in Gent et al. (2022). We note that the diagnostic lines in the spectra used in this work include a number of molecular features (mainly from OH, CN, NO, SiH). However, non-LTE studies of molecular line formation are rare, and they have so far focused on CO, CH, and water in the Sun and cool giants (see for example Sect. 5 in the review by Barklem 2016 or Sect. 2.5.2 in Lind & Amarsi 2024). In summary, there is a need for more research regarding non-LTE in M dwarfs.
5.3 Rotation
One star in our sample for which the SAPP tends to give problematic results is LSPM J1204+1728S, which is a fast rotator. The discrepancy between the parameters derived by the SAPP and given in the literature, and the poor fit between the model and observed spectra for this star show that the current version of the SAPP for M dwarfs is not applicable to stars with high rotation velocities. Future developments of the pipeline with regard to M dwarfs should include the rotational velocity as a fitting parameter. In addition, faster rotating stars also tend to have stronger magnetic fields (e.g. Reiners et al. 2012). If fast rotators are to be analysed, care must be taken to avoid magnetically sensitive lines.
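One way in which rotation could enter as a fitting parameter is by convolving each model spectrum with a rotational broadening kernel before comparing it with the observation. The sketch below uses the classical rotational profile with linear limb darkening; this is a standard textbook technique and not something the current SAPP does. The limb-darkening coefficient of 0.6, the use of a single kernel width for the whole wavelength window, and all names are assumptions made for the illustration.

```python
import numpy as np

C_KMS = 299792.458  # speed of light in km/s


def rotation_kernel(wave_centre, step, vsini, epsilon=0.6):
    """Discrete rotational profile (linear limb darkening), normalised to unit sum."""
    if vsini <= 0.0:
        raise ValueError("vsini must be positive")
    half_width = wave_centre * vsini / C_KMS  # maximum Doppler shift in wavelength units
    n_half = int(np.floor(half_width / step))
    offsets = np.arange(-n_half, n_half + 1) * step
    y = np.clip(1.0 - (offsets / half_width) ** 2, 0.0, None)
    kernel = 2.0 * (1.0 - epsilon) * np.sqrt(y) + 0.5 * np.pi * epsilon * y
    return kernel / kernel.sum()


def rotationally_broaden(wave, flux, vsini, epsilon=0.6):
    """Broaden a normalised spectrum sampled on a uniform wavelength grid."""
    step = wave[1] - wave[0]
    kernel = rotation_kernel(np.mean(wave), step, vsini, epsilon)
    # Convolve the line depth so that the continuum edges stay at 1.
    depth = np.convolve(1.0 - flux, kernel, mode="same")
    return 1.0 - depth


# Toy example: one Gaussian line in the H band, broadened by 20 km/s.
wave = np.linspace(15990.0, 16010.0, 401)
flux = 1.0 - 0.5 * np.exp(-0.5 * ((wave - 16000.0) / 0.3) ** 2)
broadened = rotationally_broaden(wave, flux, vsini=20.0)
```

The broadened line is shallower and wider while its equivalent width is preserved, which is the behaviour seen in the observed spectrum of a fast rotator relative to an unbroadened model.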
5.4 Magnetic fields
The presence of a magnetic field broadens spectral lines through the Zeeman effect. In addition, equivalent widths of strong spectral lines with many Zeeman components are increased due to the effect of magnetic intensification. Both of these effects were discussed in the context of M dwarf studies by Kochukhov (2021) and employed to determine mean magnetic fields of hundreds of M dwarfs in a series of studies based on CARMENES spectra (Shulyak et al. 2019; Reiners et al. 2022). These investigations revealed fields in the range from a few hundred gauss in slowly rotating inactive M dwarfs all the way to 6–8 kG in the most active stars. Strong fields in the kilogauss range would certainly have an impact on the fitting procedure in the spectroscopic module. We note that most stars in the sample used in this work are inactive stars with 200–600 G fields (Reiners et al. 2022), which do not produce noticeable line distortions or intensity changes at the resolution of the APOGEE spectra. However, in future analyses with the M dwarf version of the SAPP, when more stars will be targeted, magnetic fields will need to be taken into consideration, for example by avoiding magnetically sensitive lines in the line mask.
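A rough order-of-magnitude check of this statement can be made with the standard expression for the Zeeman displacement of the σ components, Δλ = 4.67 × 10⁻¹³ g_eff λ² B (λ in Å, B in G), compared with the width of one resolution element, λ/R. The sketch below assumes a representative H-band wavelength of 16 000 Å, an effective Landé factor of 1, and a resolving power of about 22 500 for APOGEE; none of these numbers is taken from this section, and real lines span a range of Landé factors.

```python
ZEEMAN_CONST = 4.67e-13  # in 1 / (Angstrom * gauss), for wavelengths in Angstrom


def zeeman_shift(wavelength_aa, b_gauss, g_eff=1.0):
    """Displacement of the sigma components from line centre, in Angstrom."""
    return ZEEMAN_CONST * g_eff * wavelength_aa ** 2 * b_gauss


def resolution_element(wavelength_aa, resolving_power):
    """Width of one resolution element, lambda / R, in Angstrom."""
    return wavelength_aa / resolving_power


WAVELENGTH_AA = 16000.0  # representative H-band wavelength (assumed)
R_APOGEE = 22500.0       # approximate APOGEE resolving power (assumed)

element = resolution_element(WAVELENGTH_AA, R_APOGEE)
for field in (200.0, 600.0, 6000.0):
    shift = zeeman_shift(WAVELENGTH_AA, field)
    print(f"B = {field:6.0f} G: shift = {shift:.3f} A, "
          f"ratio to resolution element = {shift / element:.2f}")
```

Under these assumptions a 600 G field shifts the σ components by about 0.07 Å, roughly a tenth of the 0.71 Å resolution element, whereas a 6 kG field produces a shift comparable to the resolution element itself.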
6 Conclusions
In preparation for the launch of the PLATO telescope a prototype pipeline that derives stellar parameters for FGK stars has been developed, the SAPP. It uses Bayesian inference to combine results from spectroscopy, photometry, and asteroseismology in order to obtain reliable stellar parameters such as effective temperature, surface gravity, metallicity, and abundances. In this article, we present a modified version of the pipeline that is capable of analysing M dwarf spectra in the H-band. We focus on the spectroscopic and photometric parts of the code and leave the full Bayesian analysis to future work. We used the pipeline to derive the three main parameters Teff, log g, and [Fe/H] and assessed its performance on a sample of reference stars with APOGEE spectra and independent parameter determinations from the literature. Other parameters are left to future work.
The surface gravity is constrained using photometry and stellar interior models. We implemented a new grid of stellar evolutionary models specifically calculated for the analysis of M dwarfs in the framework of the BaSTI library. Observed magnitudes in different photometric bands are used together with distances to calculate absolute magnitudes. These are compared to synthetic absolute magnitudes calculated from the evolutionary models. Probability distribution functions are computed to estimate the most probable stellar parameters, and the log g value is passed on to the spectroscopic module.
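The chain from observed magnitudes to a log g estimate can be sketched as follows: apparent magnitudes and a distance give absolute magnitudes through the distance modulus, these are compared with synthetic absolute magnitudes through a χ² statistic, and the resulting weights form a discrete probability distribution over the models. This is a simplified illustration with a flat prior, not the SAPP photometric module; the three-model grid, its magnitudes and log g values, and the observed values are invented numbers.

```python
import numpy as np


def absolute_magnitude(apparent_mag, distance_pc, extinction=0.0):
    """Distance modulus: M = m - 5 log10(d / 10 pc) - A."""
    return apparent_mag - 5.0 * np.log10(distance_pc) + 5.0 - extinction


def grid_probabilities(obs_abs_mags, obs_errors, model_abs_mags):
    """Normalised likelihood of each model, assuming Gaussian errors and a flat prior."""
    residuals = (model_abs_mags - obs_abs_mags) / obs_errors
    chi2 = np.sum(residuals ** 2, axis=1)
    weights = np.exp(-0.5 * (chi2 - chi2.min()))
    return weights / weights.sum()


# Hypothetical grid: three models with absolute magnitudes in three bands.
model_logg = np.array([4.7, 4.8, 4.9])
model_abs_mags = np.array([[7.0, 6.4, 6.2],
                           [7.5, 6.9, 6.7],
                           [8.0, 7.4, 7.2]])

# Hypothetical star at 10 pc with 0.03 mag errors in each band.
apparent = np.array([7.52, 6.88, 6.71])
observed = absolute_magnitude(apparent, distance_pc=10.0)
probabilities = grid_probabilities(observed, 0.03, model_abs_mags)

best_logg = model_logg[np.argmax(probabilities)]
mean_logg = float(np.sum(probabilities * model_logg))
```

In a real application the grid would be far denser and the probability distribution would have a finite width, from which an uncertainty on log g could be read off.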
The spectroscopic module of the SAPP fits synthetic spectra based on a model grid to observed spectra, in order to derive atmospheric parameters. The code does not use the synthetic spectra from the grid directly, but uses a fast model-reconstruction technique, based on the machine learning algorithm ‘The Payne’. We used Turbospectrum, MARCS atmospheric models, and the APOGEE DR16 line list together with the water line list by Polyansky et al. (2018) to generate a grid of synthetic spectra in the H-band covering the M dwarf range of atmospheric parameters, which was used for training an ANN used by The Payne. In preparation for the fitting process the observed spectra need to be normalised. The built-in normalisation procedure in the SAPP was adjusted in order to take the pseudo-continuum encountered in M dwarf spectra into account. We generated a grid of synthetic spectra, using the same setup as for generating the training grid. A second-degree polynomial was then fitted to the upper envelope of the flux as a function of wavelength for each synthetic spectrum, resulting in a set of polynomials for different effective temperatures. The code starts with a fit to the observed spectrum that has been normalised by the original SAPP normalisation routine. The resulting initial stellar parameters are used to find the corresponding polynomial and the continuum is adjusted accordingly. The adjusted spectrum is then used in a new fit. This procedure is repeated iteratively to convergence. The fitting procedure is applied to selected spectral ranges within a line mask considered to be suitable for M dwarfs.
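The envelope-fitting step of this normalisation can be illustrated with a small sketch that fits a second-degree polynomial to the upper envelope of a synthetic spectrum. The way the envelope is located here (the highest flux point in each of 20 wavelength bins) is an assumption chosen for simplicity and is not necessarily how the SAPP does it; the toy spectrum, its pseudo-continuum coefficients, and the function name are likewise invented. The iterative loop over fitted parameters is not reproduced.

```python
import numpy as np
from numpy.polynomial import Polynomial


def upper_envelope(wave, flux, degree=2, n_bins=20):
    """Fit a polynomial through the highest flux point of each wavelength bin."""
    bins = np.array_split(np.arange(wave.size), n_bins)
    peaks = np.array([idx[np.argmax(flux[idx])] for idx in bins])
    return Polynomial.fit(wave[peaks], flux[peaks], degree)


# Toy synthetic spectrum: a slowly varying pseudo-continuum below the true
# continuum (which is 1), multiplied by a forest of Gaussian absorption lines.
wave = np.linspace(15200.0, 15800.0, 3000)
x = (wave - 15500.0) / 300.0
pseudo_continuum = 0.95 + 0.02 * x - 0.03 * x ** 2

rng = np.random.default_rng(1)
centres = np.arange(15205.0, 15800.0, 7.0)
depths = rng.uniform(0.05, 0.4, size=centres.size)
absorption = sum(d * np.exp(-0.5 * ((wave - c) / 0.5) ** 2)
                 for d, c in zip(depths, centres))
synthetic = pseudo_continuum * (1.0 - absorption)

envelope = upper_envelope(wave, synthetic)
max_rel_error = np.max(np.abs(envelope(wave) / pseudo_continuum - 1.0))
print(f"largest relative deviation from the input pseudo-continuum: {max_rel_error:.1e}")
```

In this idealised case the lines are well separated, so the bin maxima lie on the pseudo-continuum and the polynomial recovers it closely. In real M dwarf spectra line blanketing is much heavier, which is the reason the polynomials are derived from synthetic spectra where the true continuum level is known.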
In summary, the adaptations of the SAPP comprise the following calculations and procedures specific to M dwarfs: the evolutionary models and synthetic photometry, the synthetic spectra grid in the NIR, the trained ANN model, the normalisation of the observed spectra, and the line mask. The results derived with the modified SAPP for our sample of reference M dwarfs generally agree well with the results from a number of literature studies that applied a variety of methods. Our derived effective temperatures seem to be about 100 K higher than those calculated from interferometric angular diameters and bolometric fluxes. For the surface gravity, the SAPP produces values consistent with those derived by other studies based on photometry, for example Mann et al. (2015). To assess the metallicity performance we compared the metallicities of the component stars in binary systems with an FGK-type primary and an M dwarf secondary. We find an agreement within about 0.15 dex. In future work, we plan to analyse the primary stars of the binary sample with the FGK-version of the SAPP. This will allow us to compare the performance of the two channels of the SAPP, aiming for a smooth transition in the overlap region of spectral types (K dwarfs).
The performance of the pipeline for M dwarfs is expected to improve following future development. New grids of model spectra will be calculated, updated line lists implemented, and the effects of rotation and non-LTE taken into account. The setup of the spectroscopic module will be refined to enable diagnostics for surface gravity and abundances of individual elements, and to mitigate the effects of magnetic fields. The photometric module will be extended to longer-wavelength bands and to higher metallicities. Finally, the full Bayesian inference scheme will be implemented, combining probability distribution functions from both the spectroscopic and the photometric module to deliver the most reliable properties for the PLATO M dwarfs sample.
Acknowledgements
We thank Andrew Mann, Vera Passegger, and Denis Mourard for useful discussions. T.O., U.H., O.K., and N.J.M. acknowledge support from the Swedish National Space Agency (SNSA/Rymdstyrelsen). O.K. also acknowledges support by the Swedish Research Council (grant agreement no. 2019-03548) and the Royal Swedish Academy of Sciences. M.B. is supported through the Lise Meitner grant from the Max Planck Society. This project has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (Grant agreement No. 949173). E.M. acknowledges support by the Collaborative Research centre SFB 881, Heidelberg University, of the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation). S.C. has been funded by the European Union – “NextGenerationEU” RRF M4C2 1.1 n: 2022HY2NSX. “CHRONOS: adjusting the clock(s) to unveil the CHRONO-chemo-dynamical Structure of the Galaxy” (PI: S. Cassisi), and by INAF Theory grant “Lasting” (PI: S. Cassisi). T.M. acknowledges financial support from Belspo for contract PRODEX PLATO mission development. H.S.W. acknowledges support from the Carlsberg Foundation’s Semper Ardens grant (‘FIRSTATMO’; PI: A. Johansen). D.S. thanks the National Council for Scientific and Technological Development – CNPq. B.R.-A. acknowledges funding support from the ANID Basal project FB210003. E.D.M. acknowledges the support by the Ramón y Cajal contract RyC2022-035854-I funded by the Spanish MICIU/AEI/10.13039/501100011033 and by ESF+. This research was partially supported by the project AI4Research at Uppsala University. Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the U.S. Department of Energy Office of Science, and the Participating Institutions. SDSS-IV acknowledges support and resources from the Center for High Performance Computing at the University of Utah. The SDSS website is www.sdss4.org.
SDSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, Center for Astrophysics / Harvard & Smithsonian, the Chilean Participation Group, the French Participation Group, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU) / University of Tokyo, the Korean Participation Group, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatário Nacional/MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement.
Appendix A Validation of the generative ANN model
As mentioned in Sect. 3.2.1 the ANN model used by The Payne was validated by comparing synthetic spectra from the validation sample to spectra predicted by the ANN model using the same set of parameters. Figure A.1 shows the distribution of the median flux error using 3300 of the models from the validation set. It can be seen that the majority of the models have a median interpolation error just above 0.1%.
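The validation statistic described here can be sketched as follows, assuming that the median flux error of a model is the median over all pixels of the absolute difference between the ANN-predicted and the directly computed synthetic flux (the exact definition is not restated in this appendix). The mock spectra and the noise level of 0.15% are invented stand-ins for the ANN output, and the function names are illustrative.

```python
import numpy as np


def median_flux_error(reference, predicted):
    """Median over pixels of |predicted - reference|, one value per model."""
    diff = np.abs(np.asarray(predicted) - np.asarray(reference))
    return np.median(diff, axis=1)


def cumulative_fraction(values):
    """Sorted values and the fraction of models at or below each value."""
    ordered = np.sort(values)
    fraction = np.arange(1, ordered.size + 1) / ordered.size
    return ordered, fraction


# Mock validation set: 200 normalised spectra with 1000 pixels each.
rng = np.random.default_rng(42)
reference = rng.uniform(0.5, 1.0, size=(200, 1000))
predicted = reference + rng.normal(scale=0.0015, size=reference.shape)

errors = median_flux_error(reference, predicted)
ordered, fraction = cumulative_fraction(errors)
print(f"typical median flux error: {100.0 * np.median(errors):.2f}%")
```

The two returned arrays of `cumulative_fraction` correspond to the horizontal and vertical axes of a cumulative distribution such as the one in the bottom panel of Fig. A.1.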
|  | Fig. A.1 Top: Distribution of median flux error for 3300 models from the validation set of The Payne’s ANN, bottom: cumulative distribution. | 
Appendix B Effect on pseudo-continuum from metallicity
Figure B.1 shows synthetic spectra with varying metallicity, generated with Turbospectrum (Gerber et al. 2023), MARCS atmospheric models, the APOGEE DR16 line list (Smith et al. 2021), and the water line list by Polyansky et al. (2018). The effective temperature was set to 3800 K and the surface gravity to 4.7 dex. The pseudo-continuum varies by about 0.03 continuum units between the lowest and highest metallicity.
|  | Fig. B.1 Example of H-band synthetic spectra generated with different metallicities. The Teff and log g values were set to 3800 K and 4.7 dex, respectively. The different colours correspond to different [Fe/H] values. | 
Appendix C Examples of other fitted spectra
Figures C.1 and C.2 show observed spectra for the stars GJ 447 and GJ 526 and the corresponding best-fit models obtained by the modified SAPP. The first star is a cooler M dwarf for which we derived a Teff of 3243 K and the second star is warmer with a Teff of 3729 K. For GJ 447 we obtained a surface gravity of 5.066 dex and for GJ 526 4.792 dex. Both stars have a sub-solar metallicity, with −0.13 dex for GJ 447 and −0.32 dex for GJ 526. The regions outside of the line mask are not shown in the figures. For most of the lines the fit is good. It is slightly worse for the cooler star GJ 447, especially in the redder region beyond 15700 Å. This could be due to missing molecular line data. We can also see discrepancies at the edges of the detector, as was mentioned in Sect. 4. Figure C.3 shows the spectrum of the fast rotator mentioned in Sect. 4 (LSPM J1204+1728S). We can see in the figure that the fit is poor as the observed lines are wider than in the best-fit synthetic spectrum.
|  | Fig. C.1 Normalised observed spectrum of the star GJ 447 in dashed black and best-fit model in solid orange. Grey shaded areas indicate the location of the used line mask. The axes are on the same scale as in Fig. 2. We obtained a Teff of 3243 K, a log g of 5.066 dex, and a metallicity of −0.13 dex. | 
|  | Fig. C.2 Same as Fig. C.1 but for the star GJ 526. We obtained a Teff of 3729 K, a log g of 4.792 dex, and a metallicity of −0.32 dex. | 
|  | Fig. C.3 Same as Fig. C.1 but for the fast rotating star LSPM J1204+1728S. Since the modified SAPP cannot fit for rotation the derived parameters are judged to be unreliable. | 
References
- Abia, C., Tabernero, H. M., Korotin, S. A., et al. 2020, A&A, 642, A227 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Allard, F., Hauschildt, P. H., & Schwenke, D. 2000, ApJ, 540, 1005 [Google Scholar]
- Allard, F., Homeier, D., & Freytag, B. 2012, Philos. Trans. Roy. Soc. Lond. Ser. A, 370, 2765 [NASA ADS] [Google Scholar]
- Alonso-Floriano, F. J., Morales, J. C., Caballero, J. A., et al. 2015, A&A, 577, A128 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Alvarez, R., & Plez, B. 1998, A&A, 330, 1109 [NASA ADS] [Google Scholar]
- Antoniadis-Karnavas, A., Sousa, S. G., Delgado-Mena, E., et al. 2020, A&A, 636, A9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Antoniadis-Karnavas, A., Sousa, S. G., Delgado-Mena, E., Santos, N. C., & Andreasen, D. T. 2024, A&A, 690, A58 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Bailer-Jones, C. A. L., Rybizki, J., Fouesneau, M., Demleitner, M., & Andrae, R. 2021, AJ, 161, 147 [Google Scholar]
- Baraffe, I., Chabrier, G., Allard, F., & Hauschildt, P. H. 1995, ApJ, 446, L35 [NASA ADS] [CrossRef] [Google Scholar]
- Barklem, P. S. 2016, A&A Rev., 24, 9 [NASA ADS] [CrossRef] [Google Scholar]
- Bello-García, A., Passegger, V. M., Ordieres-Meré, J., et al. 2023, A&A, 673, A105 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Bidelman, W. P. 1985, ApJS, 59, 197 [Google Scholar]
- Birky, J., Hogg, D. W., Mann, A. W., & Burgasser, A. 2020, ApJ, 892, 31 [Google Scholar]
- Blanton, M. R., Bershady, M. A., Abolfathi, B., et al. 2017, AJ, 154, 28 [Google Scholar]
- Böhm-Vitense, E. 1958, ZAp, 46, 108 [NASA ADS] [Google Scholar]
- Bowen, I. S., & Vaughan, A. H. J. 1973, Appl. Opt., 12, 1430 [Google Scholar]
- Bowler, B. P., Hinkley, S., Ziegler, C., et al. 2019, ApJ, 877, 60 [NASA ADS] [CrossRef] [Google Scholar]
- Boyajian, T. S., von Braun, K., van Belle, G., et al. 2012, ApJ, 757, 112 [Google Scholar]
- Brett, J. M. 1995, A&A, 295, 736 [NASA ADS] [Google Scholar]
- Brocato, E., Cassisi, S., & Castellani, V. 1998, MNRAS, 295, 711 [NASA ADS] [Google Scholar]
- Bugnet, L., García, R. A., Davies, G. R., et al. 2018, A&A, 620, A38 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Caffau, E., Ludwig, H. G., Steffen, M., Freytag, B., & Bonifacio, P. 2011, Sol. Phys., 268, 255 [Google Scholar]
- Cannon, A. J., & Pickering, E. C. 1993, VizieR Online Data Catalog: III/135A [Google Scholar]
- Capitanio, L., Lallement, R., Vergely, J. L., Elyajouri, M., & Monreal-Ibero, A. 2017, A&A, 606, A65
- Casagrande, L., Schönrich, R., Asplund, M., et al. 2011, A&A, 530, A138
- Casagrande, L., Lin, J., Rains, A. D., et al. 2021, MNRAS, 507, 2684
- Casey, A. R., Hogg, D. W., Ness, M., et al. 2016, arXiv e-prints [arXiv:1603.03040]
- Cassisi, S., & Salaris, M. 2013, Old Stellar Populations: How to Study the Fossil Record of Galaxy Formation (Wiley-VCH)
- Cassisi, S., Salaris, M., & Irwin, A. W. 2003, ApJ, 588, 862
- Cassisi, S., Potekhin, A. Y., Pietrinferni, A., Catelan, M., & Salaris, M. 2007, ApJ, 661, 1094
- Cassisi, S., Potekhin, A. Y., Salaris, M., & Pietrinferni, A. 2021, A&A, 654, A149
- Chabrier, G., & Baraffe, I. 2000, ARA&A, 38, 337
- Cosentino, R., Lovis, C., Pepe, F., et al. 2012, SPIE Conf. Ser., 8446, 84461V
- Cristofari, P. I., Donati, J. F., Masseron, T., et al. 2022a, MNRAS, 516, 3802
- Cristofari, P. I., Donati, J. F., Masseron, T., et al. 2022b, MNRAS, 511, 1893
- Cutri, R. M., Skrutskie, M. F., van Dyk, S., et al. 2003, VizieR Online Data Catalog: II/246
- Damiano, M., Hu, R., Barclay, T., et al. 2022, AJ, 164, 225
- Delfosse, X., Forveille, T., Ségransan, D., et al. 2000, A&A, 364, 217
- Desidera, S., Gratton, R. G., Scuderi, S., et al. 2004, A&A, 420, 683
- Desidera, S., Gratton, R. G., Lucatello, S., & Claudi, R. U. 2006, A&A, 454, 581
- Diamond-Lowe, H., Mendonça, J. M., Charbonneau, D., & Buchhave, L. A. 2023, AJ, 165, 169
- Donati, J. F., Kouach, D., Moutou, C., et al. 2020, MNRAS, 498, 5684
- Ferguson, J. W., Alexander, D. R., Allard, F., et al. 2005, ApJ, 623, 585
- Gaia Collaboration (Prusti, T., et al.) 2016, A&A, 595, A1
- Gaia Collaboration (Vallenari, A., et al.) 2023, A&A, 674, A1
- Gent, M. R., Bergemann, M., Serenelli, A., et al. 2022, A&A, 658, A147
- Gerber, J. M., Magg, E., Plez, B., et al. 2023, A&A, 669, A43
- Gilhool, S. H., Blake, C. H., Terrien, R. C., et al. 2018, AJ, 155, 38
- Gray, R. O., & Corbally, C. J. 2009, Chapter 9. M Dwarfs and L Dwarfs, by J. Davy Kirkpatrick, in Stellar Spectral Classification (Princeton: Princeton University Press), 339
- Gray, R. O., Corbally, C. J., Garrison, R. F., McFadden, M. T., & Robinson, P. E. 2003, AJ, 126, 2048
- Gray, R. O., Corbally, C. J., Garrison, R. F., et al. 2006, AJ, 132, 161
- Grevesse, N., Asplund, M., & Sauval, A. J. 2007, Space Sci. Rev., 130, 105
- Grieves, N., Ge, J., Thomas, N., et al. 2018, MNRAS, 481, 3244
- Gunn, J. E., Siegmund, W. A., Mannery, E. J., et al. 2006, AJ, 131, 2332
- Hauschildt, P. H., Allard, F., Alexander, D. R., & Baron, E. 1997, ApJ, 488, 428
- Henry, T. J., & McCarthy, D. W., Jr. 1993, AJ, 106, 773
- Henry, T. J., Jao, W.-C., Subasavage, J. P., et al. 2006, AJ, 132, 2360
- Hidalgo, S. L., Pietrinferni, A., Cassisi, S., et al. 2018, ApJ, 856, 125
- Holtzman, J. A., Hasselquist, S., Shetrone, M., et al. 2018, AJ, 156, 125
- Houk, N., & Swift, C. 1999, Michigan Spectral Survey, 5, 0
- Husser, T. O., Wende-von Berg, S., Dreizler, S., et al. 2013, A&A, 553, A6
- Iglesias, C. A., & Rogers, F. J. 1996, ApJ, 464, 943
- Ishikawa, H. T., Aoki, W., Kotani, T., et al. 2020, PASJ, 72, 102
- Jönsson, H., Holtzman, J. A., Allende Prieto, C., et al. 2020, AJ, 160, 120
- Keenan, P. C., & McNeil, R. C. 1989, ApJS, 71, 245
- Kesseli, A. Y., Kirkpatrick, J. D., Fajardo-Acosta, S. B., et al. 2019, AJ, 157, 63
- Kirkpatrick, J. D., Henry, T. J., & McCarthy, D. W., Jr. 1991, ApJS, 77, 417
- Kochukhov, O. 2021, A&A Rev., 29, 1
- Koen, C., Kilkenny, D., van Wyk, F., & Marang, F. 2010, MNRAS, 403, 1949
- Kovalev, M., Bergemann, M., Ting, Y.-S., & Rix, H.-W. 2019, A&A, 628, A54
- Lachaume, R., Rabus, M., Jordán, A., et al. 2019, MNRAS, 484, 2656
- Lantz, B., Aldering, G., Antilogus, P., et al. 2004, SPIE Conf. Ser., 5249, 146
- Lee, S. G. 1984, AJ, 89, 702
- Lépine, S., Hilton, E. J., Mann, A. W., et al. 2013, AJ, 145, 102
- Lind, K., & Amarsi, A. M. 2024, ARA&A, 62, 475
- Lindgren, S., & Heiter, U. 2017, A&A, 604, A97
- Lindgren, S., Heiter, U., & Seifahrt, A. 2016, A&A, 586, A100
- Lodders, K. 2010, Astrophys. Space Sci. Proc., 16, 379
- Lund, M. N., Basu, S., Bieryla, A., et al. 2024, A&A, 688, A13
- Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94
- Maldonado, J., Affer, L., Micela, G., et al. 2015, A&A, 577, A132
- Maldonado, J., Micela, G., Baratella, M., et al. 2020, A&A, 644, A68
- Mann, A. W., Brewer, J. M., Gaidos, E., Lépine, S., & Hilton, E. J. 2013, AJ, 145, 52
- Mann, A. W., Feiden, G. A., Gaidos, E., Boyajian, T., & von Braun, K. 2015, ApJ, 804, 64
- Mann, A. W., Dupuy, T., Kraus, A. L., et al. 2019, ApJ, 871, 63
- Marfil, E., Tabernero, H. M., Montes, D., et al. 2021, A&A, 656, A162
- Marocco, F., Eisenhardt, P. R. M., Fowler, J. W., et al. 2021, ApJS, 253, 8
- Mas-Buitrago, P., González-Marcos, A., Solano, E., et al. 2024, A&A, 687, A205
- Mayor, M., Pepe, F., Queloz, D., et al. 2003, The Messenger, 114, 20
- Melo, E., Souto, D., Cunha, K., et al. 2024, ApJ, 973, 90
- Meyer, M. 2017, PhD thesis, Universität Hamburg, Hamburg, Germany
- Monet, D. G., Levine, S. E., Canzian, B., et al. 2003, AJ, 125, 984
- Montalto, M., Piotto, G., Marrese, P. M., et al. 2021, A&A, 653, A98
- Montes, D., González-Peinado, R., Tabernero, H. M., et al. 2018, MNRAS, 479, 1332
- Mourard, D., Berio, P., Pannetier, C., et al. 2022, SPIE Conf. Ser., 12183, 1218308
- Nascimbeni, V., Piotto, G., Börner, A., et al. 2022, A&A, 658, A31
- Ness, M., Hogg, D. W., Rix, H. W., Ho, A. Y. Q., & Zasowski, G. 2015, ApJ, 808, 16
- Nidever, D. L., Holtzman, J. A., Allende Prieto, C., et al. 2015, AJ, 150, 173
- Olander, T., Heiter, U., & Kochukhov, O. 2021, A&A, 649, A103
- Passegger, V. M., Reiners, A., Jeffers, S. V., et al. 2018, A&A, 615, A6
- Passegger, V. M., Schweitzer, A., Shulyak, D., et al. 2019, A&A, 627, A161
- Passegger, V. M., Bello-García, A., Ordieres-Meré, J., et al. 2020, A&A, 642, A22
- Passegger, V. M., Bello-García, A., Ordieres-Meré, J., et al. 2022, A&A, 658, A194
- Pietrinferni, A., Hidalgo, S., Cassisi, S., et al. 2021, ApJ, 908, 102
- Polyansky, O. L., Kyuberis, A. A., Zobov, N. F., et al. 2018, MNRAS, 480, 2597
- Quirrenbach, A., Amado, P. J., Caballero, J. A., et al. 2014, SPIE Conf. Ser., 9147, 91471F
- Rabus, M., Lachaume, R., Jordán, A., et al. 2019, MNRAS, 484, 2674
- Rains, A. D., Nordlander, T., Monty, S., et al. 2024, MNRAS, 529, 3171
- Rajpurohit, A. S., Allard, F., Rajpurohit, S., et al. 2018, A&A, 620, A180
- Rauer, H., Aerts, C., Cabrera, J., et al. 2024, arXiv e-prints [arXiv:2406.05447]
- Rayner, J. T., Toomey, D. W., Onaka, P. M., et al. 2003, PASP, 115, 362
- Reiners, A., Joshi, N., & Goldman, B. 2012, AJ, 143, 93
- Reiners, A., Shulyak, D., Käpylä, P. J., et al. 2022, A&A, 662, A41
- Ribas, I., Reiners, A., Zechmeister, M., et al. 2023, A&A, 670, A139
- Ridden-Harper, A., Nugroho, S. K., Flagg, L., et al. 2023, AJ, 165, 170
- Rodríguez-López, C. 2019, Front. Astron. Space Sci., 6, 76
- Rojas-Ayala, B., Covey, K. R., Muirhead, P. S., & Lloyd, J. P. 2012, ApJ, 748, 93
- Rosenthal, L. J., Fulton, B. J., Hirsch, L. A., et al. 2021, ApJS, 255, 8
- Sarmento, P., Rojas-Ayala, B., Delgado Mena, E., & Blanco-Cuaresma, S. 2021, A&A, 649, A147
- Shan, Y., Reiners, A., Fabbian, D., et al. 2021, A&A, 654, A118
- Shetrone, M., Bizyaev, D., Lawler, J. E., et al. 2015, ApJS, 221, 24
- Shields, A. L., Ballard, S., & Johnson, J. A. 2016, Phys. Rep., 663, 1
- Shulyak, D., Reiners, A., Nagel, E., et al. 2019, A&A, 626, A86
- Smith, V. V., Bizyaev, D., Cunha, K., et al. 2021, AJ, 161, 254
- Souto, D., Cunha, K., Smith, V. V., et al. 2020, ApJ, 890, 133
- Souto, D., Cunha, K., Smith, V. V., et al. 2022, ApJ, 927, 123
- Ting, Y.-S., Conroy, C., Rix, H.-W., & Cargile, P. 2019, ApJ, 879, 69
- van Leeuwen, F. 2007, A&A, 474, 653
- Veyette, M. J., Muirhead, P. S., Mann, A. W., & Allard, F. 2016, ApJ, 828, 95
- Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Methods, 17, 261
- West, A. A., Morgan, D. P., Bochanski, J. J., et al. 2011, AJ, 141, 97
- Wilson, J. C., Hearty, F. R., Skrutskie, M. F., et al. 2019, PASP, 131, 055001
- Zacharias, N., Finch, C. T., Girard, T. M., et al. 2012, VizieR Online Data Catalog: I/322A
The magnitude limit of current interferometric instruments is about H = 7, corresponding to V ≈ 11 for M dwarfs. The smallest measurable angular diameter is about 0.3 mas under the best conditions, corresponding to stellar radii of ~0.3 R⊙ at a distance of 10 pc, ~0.6 R⊙ at 20 pc, or ~1 R⊙ at 30 pc (e.g. Boyajian et al. 2012; Lachaume et al. 2019; Mourard et al. 2022).
This is derived using the function curve_fit from the Python module scipy.optimize (Virtanen et al. 2020).
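As an illustration of this kind of fit, the following is a minimal sketch of a weighted straight-line fit with `scipy.optimize.curve_fit`. The linear model, the variable names, and the synthetic data are assumptions made for the example; they are not the quantities or values used in the paper.

```python
import numpy as np
from scipy.optimize import curve_fit


def line(x, slope, intercept):
    """Straight-line model y = slope * x + intercept (assumed model)."""
    return slope * x + intercept


# Synthetic placeholder data, NOT measurements from the paper.
rng = np.random.default_rng(0)
x = np.linspace(3200.0, 4000.0, 12)      # e.g. reference Teff values in K
sigma_y = np.full_like(x, 50.0)          # assumed 1-sigma uncertainties in K
y = line(x, 0.9, 350.0) + rng.normal(0.0, sigma_y)

# Weighted least-squares fit; absolute_sigma=True treats sigma_y as true
# uncertainties, so pcov is not rescaled by the reduced chi-square.
popt, pcov = curve_fit(line, x, y, p0=[1.0, 0.0],
                       sigma=sigma_y, absolute_sigma=True)
perr = np.sqrt(np.diag(pcov))            # 1-sigma parameter uncertainties

print(f"slope     = {popt[0]:.3f} +/- {perr[0]:.3f}")
print(f"intercept = {popt[1]:.0f} +/- {perr[1]:.0f} K")
```

Passing `sigma` weights each point by its uncertainty, and the square roots of the diagonal of `pcov` give the formal uncertainties of the fitted slope and intercept.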
Because different limb-darkening coefficients were used, the values for θLD by Rabus et al. (2019) are lower than those of Boyajian et al. (2012) by about 1% for these stars (2% for GJ 205). Similarly, the absolute difference in bolometric flux ranges from about 1 to 4%.
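To put these differences in context, the sketch below uses the standard relation between effective temperature, limb-darkened angular diameter, and bolometric flux, Teff = (4 Fbol / (σ θLD²))^(1/4). Since Teff scales as θLD^(−1/2) and Fbol^(1/4), a 1% smaller θLD raises Teff by about 0.5%, and a 1 to 4% change in Fbol shifts Teff by about 0.25 to 1%. The function name and the input values are hypothetical and do not correspond to any star in the sample.

```python
import math

SIGMA_SB = 5.670374419e-5                        # Stefan-Boltzmann constant, erg s^-1 cm^-2 K^-4
MAS_TO_RAD = math.pi / (180.0 * 3600.0 * 1000.0)  # milliarcseconds to radians


def teff_from_interferometry(theta_ld_mas, f_bol):
    """Teff in K from theta_LD (mas) and bolometric flux (erg s^-1 cm^-2)."""
    theta_rad = theta_ld_mas * MAS_TO_RAD
    return (4.0 * f_bol / (SIGMA_SB * theta_rad**2)) ** 0.25


# Hypothetical inputs, chosen only to land in the M-dwarf temperature range.
theta_ref = 1.0      # mas
f_bol_ref = 5.0e-8   # erg s^-1 cm^-2

teff_ref = teff_from_interferometry(theta_ref, f_bol_ref)

# Relative change in Teff for a 1% smaller diameter and a 4% larger flux.
ratio_theta = teff_from_interferometry(0.99 * theta_ref, f_bol_ref) / teff_ref
ratio_fbol = teff_from_interferometry(theta_ref, 1.04 * f_bol_ref) / teff_ref

print(f"Teff (reference)      = {teff_ref:.0f} K")
print(f"theta_LD lower by 1%  -> Teff x {ratio_theta:.4f}")
print(f"F_bol higher by 4%    -> Teff x {ratio_fbol:.4f}")
```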
Combining Gaia parallaxes with the angular diameters of Boyajian et al. (2012) yields stellar radii that agree with the published values at the 1% level (except for the binary GJ 338).
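The conversion between angular diameter, parallax, and linear radius used in these notes can be sketched as follows, with R = θ d / 2. The function name is hypothetical, the distance is taken as the simple inverse of the parallax (an assumption that is reasonable only for nearby stars with precise parallaxes), and the solar radius is the IAU 2015 nominal value. The example reproduces the radii quoted above for an angular diameter of 0.3 mas.

```python
import math

MAS_TO_RAD = math.pi / (180.0 * 3600.0 * 1000.0)  # milliarcseconds to radians
PC_IN_M = 3.0856775814913673e16                   # parsec in metres
R_SUN_IN_M = 6.957e8                              # IAU 2015 nominal solar radius in metres


def radius_rsun(theta_ld_mas, parallax_mas):
    """Stellar radius in solar radii from angular diameter and parallax (both in mas)."""
    distance_pc = 1000.0 / parallax_mas           # simple parallax inversion (assumption)
    diameter_m = theta_ld_mas * MAS_TO_RAD * distance_pc * PC_IN_M
    return 0.5 * diameter_m / R_SUN_IN_M


# Smallest resolvable radius for a 0.3 mas angular diameter at 10, 20, and 30 pc.
limits = {d_pc: radius_rsun(0.3, 1000.0 / d_pc) for d_pc in (10, 20, 30)}

for d_pc, radius in limits.items():
    print(f"d = {d_pc} pc -> R = {radius:.2f} Rsun")
```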
All Tables
Spectroscopic reference studies. Top part: classical spectroscopy, bottom part: machine-learning approaches.
All Figures
Fig. 1 Example of H-band synthetic spectra generated with different effective temperatures. The surface gravity was set to 4.7 dex and the metallicity was set to solar. The different colours correspond to different Teff values.
Fig. 2 Normalised observed spectrum of the star GJ 880 (black dashed line) and best-fit model (synthetic spectrum predicted by the Payne’s ANN for the parameters given in Table 2; orange solid line). Grey shaded areas indicate the location of the line mask we used. Derived parameters for this star are Teff = 3649 K, log g = 4.8 dex, and [Fe/H] = 0.25 dex.
Fig. 3 Correlation matrix for SAPP’s spectroscopic module without photometric constraint for star GJ 880. The colour scale represents statistical correlation from −1 to 1 for nine ANN parameters.
Fig. 4 PDFs calculated for GJ 880 for two different SAPP modules: spectroscopy (left) and photometry (right). The horizontal axis is effective temperature, the vertical axis is surface gravity, and the colour scale is the logarithm of probability. Each PDF is sliced in the [Fe/H] dimension at its maximum probability. White space corresponds to NaN values.
Fig. 5 Surface gravity versus effective temperature derived by the SAPP (black diamonds with error bars). The K dwarf GJ 105A is not visible since its parameters are outside of the axis ranges. Small dots represent a subset of the grid of stellar evolution models used by the photometric module as described in Sect. 3.1, selected for this illustration to have an age of 13 Gyr, colour-coded by metallicity, with masses increasing from 0.1 M⊙ at the lower right towards the upper left with steps of 0.005 M⊙. We note that when constructing the PDF for surface gravity the photometric module uses the whole grid of models for all available ages.
Fig. 6 Comparing Teff (top) and log g (bottom) derived from the SAPP with corresponding parameters based on interferometric angular diameters (Boyajian et al. 2012; Rabus et al. 2019). The black dashed line in both panels corresponds to the 1:1 ratio and the grey dotted line in the top panel corresponds to a linear fit to the values from Boyajian et al. (2012). We excluded the outlier GJ 725B for which the SAPP derived a Teff of roughly 3550 K (leftmost blue square in top panel) from the linear fit, for reasons discussed in the text. We note that the K3 dwarf GJ 105A with Teff ∼4600 K is not shown in the figures.
Fig. 7 SAPP results compared with spectroscopic results from Sarmento et al. (2021); Passegger et al. (2019); Mann et al. (2015); Maldonado et al. (2020); Souto et al. (2022); Cristofari et al. (2022a). Values derived using the SAPP are shown on the vertical axis, and the literature values are shown on the horizontal axis. Left: effective temperature. Middle: surface gravity. The uncertainties for Sarmento et al. (2021) and Souto et al. (2022) are represented at the bottom of the figure in grey. One star is located outside of the borders of the figure: LSPM J1204+1728S for which the SAPP value is 4.86 dex and Sarmento et al. (2021) obtained 5.31 dex. Right: metallicity. The black dashed line in all panels corresponds to the 1:1 ratio.
Fig. 8 Comparing Teff (top) and [Fe/H] (bottom) derived with the SAPP with the results based on machine-learning techniques from Birky et al. (2020) and Passegger et al. (2022). The black dashed line corresponds to the 1:1 ratio.
Fig. 9 Comparing the derived [Fe/H] from the SAPP for M dwarf secondary components in a binary (y-axis, Table 2) with the metallicity of the primary (x-axis, Table 4). The blue circles show binaries with an FGK-type primary and an M dwarf secondary. The orange squares show binaries with a KM-type primary and an M dwarf secondary. All stars in these three systems were analysed with the SAPP using APOGEE spectra. The black dashed line corresponds to the 1:1 ratio.
Fig. A.1 Top: distribution of median flux error for 3300 models from the validation set of The Payne’s ANN. Bottom: cumulative distribution.
Fig. B.1 Example of H-band synthetic spectra generated with different metallicities. The Teff and log g values were set to 3800 K and 4.7 dex, respectively. The different colours correspond to different [Fe/H] values.
Fig. C.1 Normalised observed spectrum of the star GJ 447 in dashed black and best-fit model in solid orange. Grey shaded areas indicate the location of the used line mask. The axes are on the same scale as in Fig. 2. We obtained a Teff of 3243 K, a log g of 5.066 dex, and a metallicity of −0.13 dex.
Fig. C.2 Same as Fig. C.1 but for the star GJ 526. We obtained a Teff of 3729 K, a log g of 4.792 dex, and a metallicity of −0.32 dex.
Fig. C.3 Same as Fig. C.1 but for the fast-rotating star LSPM J1204+1728S. Since the modified SAPP cannot fit for rotation, the derived parameters are judged to be unreliable.