The effect of JWST/NIRSpec data reduction on the retrieval of WASP-39b atmospheric properties

J. Roy-Perez; S. Pérez-Hoyos; N. Barrado-Izagirre; H. Chen-Chen

doi:10.1051/0004-6361/202558193

Home

All issues

Volume 707 (March 2026)

A&A, 707 (2026) A297

Full HTML

Open Access

Issue		A&A Volume 707, March 2026


Article Number		A297
Number of page(s)		18
Section		Planets, planetary systems, and small bodies
DOI		https://doi.org/10.1051/0004-6361/202558193
Published online		23 March 2026

A&A, 707, A297 (2026)

The effect of JWST/NIRSpec data reduction on the retrieval of WASP-39b atmospheric properties

J. Roy-Perez^★, S. Pérez-Hoyos, N. Barrado-Izagirre and H. Chen-Chen

Escuela de Ingeniería de Bilbao, Universidad del País Vasco (UPV/EHU), Bilbao, Spain

^★ Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.

Received: 20 November 2025
Accepted: 3 February 2026

Abstract

Context. The James Webb Space Telescope (JWST) provides exoplanetary transit observations with an unprecedented spectral range coverage, offering exceptional data on exoplanet atmospheres. Nevertheless, the presence of systematics in the data reduction process introduces small, but significant uncertainties that propagate into atmospheric retrievals. Understanding how these reduction choices affect interpreted atmospheric properties is essential.

Aims. We aim to quantify the impact of different JWST/NIRSpec PRISM data-reduction processes, as well as the relevance of saturation on the retrieved atmospheric properties of WASP-39b. We also assess whether or not these biases are comparable to those introduced by assumptions made in atmospheric modelling, particularly in the treatment of aerosol extinction. Compared with previous similar efforts, we use the highest number of alternative data reductions and discuss the role of the saturated spectral region, without adding any additional data from other sources or random noise, in contrast to previous works.

Methods. We performed a series of nested-sampling Bayesian retrievals using MultiNest and forward models generated with the Planetary Spectrum Generator. We analysed six independently reduced NIRSpec/PRISM spectra and compare retrievals using the full wavelength range as well as versions excluding the saturated 0.69-1.91 μm region. We further tested the effect of including three different cloud-opacity parametrisations.

Results. Significant discrepancies arise among retrievals based on different calibrations, affecting key atmospheric parameters such as temperature, molecular abundances, and cloud opacity, often at a level exceeding one order of magnitude. Excluding the saturated region reduces the inter-pipeline dispersion; conversely, however, it also increases degeneracies between parameters. Differences introduced by data reduction are comparable in magnitude to those produced by distinct cloud-opacity models. Bayesian evidence systematically favours non-flat aerosol extinction, although the preferred spectral behaviour depends on the specific calibration used.

Conclusions. Variations among JWST/NIRSpec data-reduction pipelines produce measurable and often substantial differences in retrieved atmospheric properties of WASP-39b. These biases are similar in scale to the uncertainties on modelling assumptions, underscoring the importance of robust and homogeneous calibration procedures. The results also confirm that JWST data possess the sensitivity required to probe aerosol spectral behaviour, although such constraints remain strongly dependent on the adopted data-reduction strategy.

Key words: radiative transfer / methods: data analysis / techniques: spectroscopic / planets and satellites: atmospheres / planets and satellites: individual: WASP-39b

© The Authors 2026

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. This email address is being protected from spambots. You need JavaScript enabled to view it. to support open access publication.

1 Introduction

The characterisation of exoplanets is one of the primary objectives of the James Webb Space Telescope (JWST). Since its launch, JWST has observed more than a hundred exoplanets, with many additional observations in planning. Most of these are transit observations, where the data are analysed using transit spectroscopy to infer atmospheric properties (Barstow et al. 2015).

The observation of WASP-39b during the Early Release Science (ERS) programme demonstrated the potential of the space telescope. The planet, with a mass of 0.28 M_J and a radius of 1.27 R_J, has been extensively studied by the scientific community since its discovery in 2011 by Faedi et al. (2011). Its first transit spectrum was presented by Sing et al. (2016) using the Space Telescope Imaging Spectrograph (STIS) instrument onboard the Hubble Space Telescope (HST). Based on the absorption features of Na and K observed in the data (also seen in ground-based observations from Nikolov et al. 2016), a number of studies (Fischer et al. 2016; Heng 2016; Barstow et al. 2016) investigated whether the planet has a clear atmosphere or it is covered by clouds, but no clear conclusion were obtained. The observations of the planetary transit in the near-infrared (NIR) with the Wide Field Camera 3 (WFC3) allowed Wakeford et al. (2018) to better constrain the abundance of H₂O in the atmosphere. These data undoubtedly showed that the atmosphere was cloudy and later analyses (e.g. Tsiaras et al. 2018; Pinhas et al. 2018a; Fisher & Heng 2018; Pinhas et al. 2018b; Kirk et al. 2019) were instrumental in attempts to determine its characteristics, while also improving existing constraints on the abundance of water.

In July 2022, during the observations of the ERS programme, the scientific team measured the transmission spectra of WASP-39b using the NIRCam (Ahrer et al. 2023b), NIRISS (Feinstein et al. 2023), NIRSpec PRISM (Rustamkulov et al. 2023), and NIRSpec G395H (Alderson et al. 2023) instrumental modes. In addition to the previously detected Na, K, and H₂O abundances, these observations led to the detection of the expected species CO₂ and CO, as well as, unexpectedly, SO₂ (see also Ahrer et al. 2023a; Grant et al. 2023; Tsai et al. 2023a).

The observations also showed an absorption feature that could be produced by H₂S, but this still lacks a robust confirmation.

The data collected during the observations have been extensively used by different teams. Some of them focused their studies on the formation and evolution of the planet (Louca et al. 2023; Khorshid et al. 2024). Differences in the terminators have been clearly detected by Espinoza et al. (2024), which has also motivated the study of the atmospheric transport of chemical species and clouds with global circulation models (Tsai et al. 2023b). The wide spectral range of the data has also motivated the study of aerosol properties (Roy-Perez et al. 2025) that also include microphysical models (Arfaux & Lavvas 2024).

However, it is also important to be aware of the differences between observations of the same object and, particularly, between their data processing. For example, Lueber et al. (2024) made a comparison of the retrieved atmospheric properties of the planet when using the data from different JWST instrument modes. Similarly, Davey et al. (2025) studied the effect of binning the spectroscopic observations on the atmospheric retrievals. The effect of the reduction process was studied for the first time in Constantinou et al. (2023) comparing only two data reductions from NIRSpec instrument, and by Powell et al. (2024) for MIRI observations considering three different data reductions. Recently, Schleich et al. (2025) also studied the effect of random noise in the processed spectra of NIRSpec observations. These works show the strong influence that the observation and reduction process may have on the retrieved characteristics of a given planet. However, the number of alternative calibration procedures has been increasing steadily without a comprehensive recap of their expected impact in the retrievals.

Moreover, during the NIRSpec/PRISM observations, a region of the detector was partially saturated, as shown in Rustamkulov et al. (2023). Each data reduction process published followed specific prescriptions to recover the lost information and safely use the whole spectral range for the atmospheric analysis. However, it is still unclear whether the different efforts to recover that information were fully successful or not and to which extent they affect the conclusions draw by previous works. Some works avoided the problem of the saturated region by using only a subset of the data (e.g. from 2.5 to 5 μm as in Schleich et al. 2025), occasionally adding data from other sources, such as HST to improve their results (see Constantinou et al. 2023). Again, there is no systematic analysis that compares each different approach and their results.

Thus, the purpose of the present work is to determine how different reduction processes applied to the same observations affect the retrieved atmospheric properties, even when they appear to be almost equivalent to each other. Here, we include a higher a number of reduced spectra than previous works, so the variability of atmospheric retrievals can be properly addressed. To complement Constantinou et al. (2023) and Schleich et al. (2025), we chose to include the region below 3 microns as dealt with by each calibration process. This approach avoids introducing uncertainties derived from the calibration of other instruments and tests the self-consistency of the JWST data. We also aim to study the bias that the saturated region of the spectra introduces in the retrieved properties of the planet. To do so, we present a comparison of the results obtained when removing that specific region of the spectral data, without adding any additional sources of information.

Lastly, we try to put these reduction biases into context by comparing them with those introduced at the modelling stage; for instance, when using different cloud extinction models. Different works have claimed that JWST data has enough potential to constrain information about aerosols present on exoplanet atmospheres. Specifically, Constantinou et al. (2023); Lueber et al. (2024); Roy-Perez et al. (2025), amongst others, have used different aerosol opacity definitions to study their role on WASP-39b transmission spectra, arguing that this increased degree of complexity is supported by the data. Thus, our aim is to provide more information on the properties of clouds present on the exoplanet, to complement the study of their effect on the retrieved atmosphere and check that the conclusion that non-flat extinction aerosols are supported by the JWST data still holds.

To do so, we have structured the text as follows. In Sect. 2, we present the set of spectral data used in this work. We briefly explore the different processing steps that can turn into different spectral characteristics for all of them. In Sect. 3, we show how we model the atmosphere of the planet and how we perform our retrievals based on Bayesian inference. In Sect. 4, we compare and discuss the retrieval differences obtained. In Sect. 5, we summarise the main conclusions of this work and how they are related to previous works.

2 Data

The original data for the transit of WASP-39b used in this work correspond to the ERS programme of the JWST presented in Rustamkulov et al. (2023). The 8.23 hour observations were taken on 10 July 2022 using the PRISM mode of the NIRSpec instrument. The ERS team used NIRSpec’s Bright Object Time Series (BOTS) mode with the NRSRAPID readout pattern, the S1600A1 slit (1.6″ × 1.6″) and the SUB512 sub-array. Throughout the exposure, 21 500 integrations were recorded, each with five 0.28 s groups up the ramp. As mentioned in the introduction, a region of the detector saturated during the observations. The position of the saturated pixels in the detector corresponds to a wavelength range between 0.63 and 2.06 μm. The saturated region is represented in Fig. 1 as a gray shadowed area. However, the saturation was not homogeneous at all pixels and this is represented by the different gray shades used in the figure. For the lightest region, only one group per integration was affected by saturation, while for the darkest region, up to four of the five groups were affected. As mentioned in Rustamkulov et al. (2023), the saturation produced a large point-to-point scatter of several thousand ppm in the transmission spectra. Applying various custom steps outside the regular data reduction process helped for potentially recovering the spectral information. Carter et al. (2024) performed a deeper analysis of this saturated region and discussed this correction in more detail.

Rustamkulov et al. (2023) presented four alternative calibrations of the original data, which we selected as a starting point for this work. They were obtained using four main reduction pipelines: FIREFLy (Rustamkulov et al. 2022), Tshirt (Schlawin & Glidic 2025), Tiberius (Kirk et al. 2017, 2021), and Eureka!+ExoTEP (Bell et al. 2022; Benneke et al. 2017). Henceforth, we refer to these transmission spectra by the name of the pipeline used for their extraction.

Later in 2024, Carter et al. (2024) reanalysed the observations of this transit. They did an analysis of the orbital and stellar parameters combining data taken by the four different instrumental modes of the JWST. From their work, we also take the reduced NIRSpec/PRISM spectrum, which we refer to in the following as ‘Carter’. An additional effort was made to recover the saturated region of the observations more effectively and accurately, which resulted in a spectral dependent modification of the affected part of the spectrum. We also include this modified version of the spectrum, hereafter referred to as ‘Carter-modified’.

At this point, it is important to note that the present work is not a comparison of the pipelines themselves. Its purpose is to evaluate how different data reduction procedures influence the atmospheric analysis results. Thus, we use the name of the pipelines just as a simpler way to name the different spectra (and not to evaluate the pipeline itself). It must be noted that beyond the chosen pipeline, the data reduction process consists of a series of steps, each one requiring its own input parameters. An interested reader is referred to original works from the ERS programme and the pipelines documentation. Here, we briefly summarise these steps to highlight possible differences that could lead to disparities in the final spectral data.

The process is divided in different stages. Detector corrections at group level are performed in Stage 1 for each integration. The bias subtraction, removal of bad pixels produced by cosmic rays or saturation, and/or the frequency-dependent noise correction are performed in this first stage. After cleaning the images, the flux for each integration is calculated fitting the ramp produced by its compounding groups. The wavelength calibration required to relate wavelength values to each pixel is executed in Stage 2. A second extraction of the background and outlier detection can be performed in Stage 3, this time at the integration level. At this third stage, the tracing of the source is identified to extract its spectrum from each integration, which are then used to generate the light curves of the transit. From this point on, each pipeline divides the remaining steps in a different number of stages, but they all perform similar processes. First, the spectral grid is defined to set the number of channels and, consequently, the binning for the final spectra. Then, the white light curve is computed. The next step consists of retrieving the orbital and stellar parameters fitting the white light curve with models of transits, stellar limb-darkening, and systematics. Once these parameters are obtained, the light curves of the different spectral channels are fitted separately. Lastly, the final spectrum is obtained from the planet sizes computed at each channel.

All these stages provide numerous opportunities to diverge during the reduction process. Some of them can be spotted even at the paper by Rustamkulov et al. (2023). At the pixel level, there are many choices that can lead to dissimilarities in the extracted stellar spectra; for example, the chosen criteria for identifying a saturated pixel or a cosmic ray event or the pixel group selection for determining both the source trace and the background level. The selection of the channels for the spectral binning is also very relevant (see also Davey et al. 2025). Then, the choices made for fitting the models to the light curves extracted from the images can also affect heavily the retrieved planetary sizes. Different transit models are used for the retrievals and different systematics are considered among the different data reductions. As mentioned in Rustamkulov et al. (2023), the limb darkening can have a huge impact on the extracted spectrum. This is the reason for all the reduction processes to fit it using the same parametrisation, but not retrieving always the same parameter values. In addition, when fitting the light curves of each wavelength channel, some reductions fix the limb darkening parameters to those obtained with the white light curve, while others fit them again.

Figure 1 shows a comparison of the six different spectra used in this work. At first glance, they all look very similar with particular differences mainly below the error bars. All of them capture the same shape of the planetary absorption throughout the spectral range. Not surprisingly, the largest difference can be seen in the saturated region, where Carter and Carter-modified spectra are able to retrieve more atmospheric absorption following their deeper analysis of the detector saturation. It is also relevant to highlight the different spectral grids chosen for each spectrum and this is not always with the same number of spectral points. Furthermore, there seems to be a noticeable dispersion between reductions at longer wavelengths, due to the lower brightness of the star at those wavelengths. It is also possible to appreciate a slight difference in the transit depth base reference value between data reductions. This simple vertical offset could be produced by the difference on the retrieved orbital parameters during the light curve fitting as highlighted in Rustamkulov et al. (2023).

Figure 2 shows a quantitative comparison of the data reductions used. We interpolated the spectra to a common spectral grid with resolving power of 200 and used the FIREFLy spectrum as reference. This choice was made simply for a direct comparison with the analysis from Rustamkulov et al. (2023).

For Tshirt, Tiberius, and Eureka, the residuals indicate that only a few peaks fall outside the error bars. Most of these peaks fall in the saturated regions, indicative of the different treatments of the saturated region of the detector. The spectra from Carter et al. (2024), in addition to those peak differences, also show a clear offset that is not compatible with the error bars along most of the spectrum, possibly due to the additional offset introduced to cross-calibrate these data with other instruments.

Fig. 1

Representation of the different spectral data used for the purpose of this work. The gray shades in the 0.63-2.06 micron range are related to the level of saturation: for the lightest region, only one group per integration was affected by saturation, while for the darkest region, up to four of the five groups were affected.

Fig. 2

Comparison of the spectra with the FIREFLy data reduction as a reference and their respective residuals. Coloured regions at the residuals represent error bar propagation. Grey regions show the saturated spectral range (0.63-2.06 μm) as in Fig. 1.

3 Methodology

The core idea of this work is to perform atmospheric retrievals using each of the calibrated spectra introduced in the preceding section. Comparing the retrieved parameters and their uncertainties provide information on how the data reduction process is affecting those retrievals and, hence, the specific parameters that will give robust results.

The first input we used were the spectra detailed in Sect. 2. However, as the saturated region could compromise the results and was sometimes removed, as a second step, we simply eliminated the saturated region to check how this change would affect the results. We removed the spectral points between 0.69 and 1.91 μm, forming the ‘persistently saturated region’ defined in Carter et al. (2024). It must be noted that, henceforth, there is no difference between the spectra labelled as ‘Carter’ and ‘Carter-modified’ as they are exactly the same outside the saturated region.

So far, only differences in the data have been proposed. We carried out an extra loop of simulations related to changes in the model atmosphere. With this last set of retrievals, we were able to compare the magnitude of the differences introduced in the retrieved atmospheric properties using both the data reduction and model aspects. While we used simple flat cloud absorption models for the first and second iterations, in the third, we used more complex cloud extinction models. This additional loop is intended to offer further insight and validation of the results presented in Roy-Perez et al. (2025), helping to assess how sensitive cloud models are to data calibration.

3.1 Tools

The retrievals in the present work are based on the methodology described in Roy-Perez et al. (2025). The forward simulations are generated with the Planetary Spectrum Generator (PSG; Villanueva et al. 2018). To generate the simulated spectra, we used the Planetary and Universal Model of Atmospheric Scattering (PUMAS; Villanueva et al. 2015; Smith et al. 2013), as the core radiative-transfer model, which uses a one-dimensional and plane-parallel description of the atmosphere. The spectra are computed with a resolving power of 200 using a correlated-k method. Higher resolutions were tested, but they produced negligible differences at the cost of significantly increasing the computation time. We used the line lists for H₂O (Polyansky et al. 2018), CO₂ (Yurchenko et al. 2020), CO (Li et al. 2015), SO₂ (Gordon et al. 2022), Na (Allard et al. 2019), H₂S (Gordon et al. 2022), CH₄ (Yurchenko et al. 2024), and K (Allard et al. 2016). Rayleigh scattering (Sneep & Ubachs 2005), UV broad absorption, and collision-induced absorption by H₂ – H₂ (Abel et al. 2011) and H₂ – He (Abel et al. 2012) are also included in the simulations. Multiple scattering is also taken into account by the radiative-transfer model.

The retrievals were performed with MultiNest (Feroz et al. 2009; Buchner et al. 2014), a Bayesian inference tool based on nested sampling Monte Carlo algorithms (Metropolis et al. 1953; Hastings 1970; Skilling 2006). We used the Bayesian evidence ln Z for model comparison. When comparing two specific models, following Jeffrey’s scale (Jeffreys 1998; Trotta 2008), values of the Bayes factor, ln B_1,2 = ln Z₁ – ln Z₂, that are greater than 5.0 indicate that the first model is decisively better than the second, but differences greater than 1.0 are enough to claim that one model is preferred. Instead, values lower than 1.0 indicate that there is not enough evidence to claim which one is better. In this last case, Occam’s razor reasoning is followed: the model with the lowest number of free parameters is favoured.

3.2 Atmospheric description

Several parameters must be included in the retrievals to simulate the transit spectra. The size of the star, R^*, which determines the total amount of light reaching the planet and the planetary diameter, D_pl, which constrains the planetary surface blocking the star light, are included as free parameters with the tight prior Gaussian distribution constraints calculated in Faedi et al. (2011): 179 000 ± 7000 km (i.e. 1.27 ± 0.04 R_J) for D_pl and 0.895 ± 0.023 R_⊙ for R^*.

To describe the atmosphere, we defined a log-uniform pressure profile with 40 levels between 10⁻⁶ and 10¹ bar. We tested that increasing the grid to 100 vertical levels did not produce any significant differences, while increasing the computation time. Its vertical extension is described by the scale height, which depends on the planetary gravity, the atmospheric mean molecular weight and temperature (Kreidberg 2018). For the planetary gravity, we assumed a Gaussian prior distribution of log g = 0.63 ± 0.05 (with g in m s⁻²) in base to the calculations of Faedi et al. (2011). Then, D_pι and log g are referenced to the pressure of 10 bar, which corresponds to the lowest layer of our model.

Despite the fact that more complex temperature profiles were recently retrieved for WASP-39b (e.g. Ma et al. 2025, combining data from different JWST instruments), here we follow the discussion and calculations of Roy-Perez et al. (2025) and we used an isothermal profile. The rationale for doing so is that a model selection analysis comparing different vertical profiles based on the Kitzmann et al. (2020) parameterisation for NIRSpec/PRISM observations showed that the isothermal profile provided the highest Bayesian evidence; therefore, it was favoured against more sophisticated vertical profiles for this dataset. Similar results were obtained by Lueber et al. (2024), which are also compatible with the results of Constantinou et al. (2023). Therefore, we include the atmospheric temperature, T_iso, with a uniform prior distribution between 500 and 2000 K in our retrievals.

Our model atmosphere is mainly composed of H₂ and He at fixed abundances. Following the discussion from Roy-Perez et al. (2025), we also included H₂O, CO₂ CO, SO₂, Na, and H₂S. The abundances are described as uniform profiles throughout the entire atmosphere and their volume mixing ratio is treated as a free parameter with uniform prior distributions between 10⁻¹² and 10⁻¹. We performed free chemistry retrievals where the chemical composition was allowed to vary without composition constraints. Thus, the value of the mean molecular weight is computed based on the retrieved chemical abundances.

3.3 Cloud parametrisation

Clouds largely determine the height at which the atmosphere becomes opaque in transit geometry. Following the results of the cloud profile model selection of Roy-Perez et al. (2025), we used a uniform vertical profile to describe the abundance X_cl of aerosols. Similarly to what has been discussed on the thermal vertical profile, the work by Roy-Perez et al. (2025) discussed model selection using Bayesian evidence for a number of aerosol vertical profiles, which strongly favour a uniform vertical distribution. While this is most likely an oversimplification of the real situation, it is justified by the fact that the retrieval is only being sensitive to a reduced region of the atmosphere (for this particular dataset) and, hence, a more realistic vertical description is not supported by the information content.

For the spectral behaviour of the aerosols, we used three different parametrisations of the extinction cross-section, Q_ext, which characterises how effectively the aerosol removes radiation from the incoming radiation beam. First, we used a simple flat parametrisation, where Q_ext is constant with wavelength. When performing retrievals with this description, we fixed the value of Q_ext. Thus, the cloud opacity was determined only by its abundance, Xcι . The second model of cloud extinction is an exponential dependence of the optical thickness with wavelength (Ängström 1929) as in Roy-Perez et al. (2025). In this case, the cloud opacity is defined by the free parameters X_cl and the Ängström exponent (α). This model has already been used in exoplanet atmospheric studies (see Lecavelier Des Etangs et al. 2008; MacDonald & Madhusudhan 2017; Pinhas et al. 2018a; or Barstow 2020). It has the advantage of a very simple parameterisation that mimics the behaviour of aerosols across a range of sizes: the smaller the value of α, the bigger the aerosol it resembles. As shown in Roy-Perez et al. (2025), this approach has the potential to be sensitive to the overall dependence of opacity with wavelength.

Lastly, for a more realistic approach to aerosol opacity, we included a Mie parametrisation of the optical properties of the aerosol using the external database Modelled Optical Properties of enSeMbles of Aerosol Particles (MOPSMAP; Gasteiger & Wiegner 2018). We followed the assumptions made in Roy-Perez et al. (2025) and we defined the aerosols in our simulations as spherical particles with a log-normal size distribution. To describe this distribution, we used the effective radius, r_eff, and the effective variance, v_eff, which are the mean and deviation of a size distribution weighted by the cross-section of the particles (Hansen & Travis 1974). For simplicity, we fixed v_eff to 0.1 during the retrieval calculations. The real and the imaginary part of the refractive index were fixed to n_r = 1.4 and n_i = 0.0001. Constantinou et al. (2023) assumed a number of possible compositions for the aerosols during their retrievals, based on thermochemical expectations (Morley et al. 2013) and using the refractive index data listed in Wakeford & Sing (2015). Similarly, assuming the composition of the aerosols as the estimated condensable species MgSiO₃ and SiO₂, Ma et al. (2025) found that using a combination of different homogeneous particle-size distributions is needed for an optimal fit to the combined data of the different JWST instruments. However, this adds an extra layer of complexity that could mask other features in the Q_ext investigation that we propose. Furthermore, there is a long list of potential thermochemical candidates even for the solar system clouds that have not been detected so far, particularly in the giant planets, possibly masked by the complex photochemistry above the cloud decks (Irwin 2009). For this reason, we stuck to a simpler approach disregarding the cloud composition, namely, the dependence of refractive indices with wavelength, but taking into account the role of particle size in the opacity.

Thus, in this last parameterisation, the Q_ext dependency on wavelength is only due to the particle-size distributions and not to the absorption properties of the aerosols. Therefore, when using this parametrisation, the cloud opacity calculated during retrievals is defined by the free parameters X_cl and r_eff. Table 1 summarises the prior probability distributions introduced in MultiNest for every free parameter included in the simulations.

Fig. 3

Comparison of the retrieval results obtained when using the different spectra as input (shown in different colours). See Sect. 2 for a full reference of the spectra. The gray values represent the reference values from Faedi et al. (2011). The planetary radius at 100 mbar, the mean atmospheric molecular weight and the aerosol opacity at 1 μm were computed from actual outputs and not directly retrieved.

Table 1

Prior probability distributions for the free parameters included in the retrievals.

4 Results and discussion

In this section, we present the results obtained for the different sets of retrievals. As already discussed, these include three main different approaches: (1) full spectral range; (2) exclusion of the saturated 0.69-1.91 μm range; (3) same as (2) but using a number of cloud extinction models. To display the results, we assume that, following the law of large numbers, the most likely value for each parameter is the median of the marginalised posterior probability distribution. Similarly, the error bars are computed as 1-σ deviations from the median.

4.1 Impact of data reduction on retrieved parameters

Figure 3 shows the resulting parameters (y-axis) for each of the input spectral data (x-axis) mentioned in Sect. 2. This set of parameters includes those that physically describe the planet (planetary radius referenced to the 100 mbar pressure level, star radius, planetary gravity, and isothermal temperature), its composition (mean atmospheric molecular weight, abundances for H₂O, CO₂ CO, SO₂, Na and H₂S), and the aerosols’ opacity at 1 μm, expressed as the pressure level at which the cloud reaches an optical depth τ = 1 in nadir-viewing geometry. For the full analysis, we refer to the corner plots shown in Figs. A.1–A.6 (black lines).

It must be noted that we are including some parameters that are directly used as inputs for the model and some others that are computed afterwards from the outputs, as they have a physical meaning that can provide valuable insights into the best-fitting results. The planetary radius at 100 mbar is calculated combining the retrieved planetary radius at the reference pressure of 10 bar and the atmospheric scale height of the atmosphere. We show the results at this pressure level for an easier comparison with the bibliography. The mean atmospheric weight was calculated using all the fixed and retrieved gases from the modelled atmosphere. Finally, the pressure level at which the cloud reaches τ = 1 in nadir geometry was determined by computing the optical depth at each atmospheric layer using the retrieved cloud abundance. This parameter indicates the altitude at which the atmosphere becomes optically thick with the sole contribution of the aerosols. As we are using a uniform cloud vertical profile, the bigger the value of the pressure level, the thinner the cloud, as it needs more depth to reach the same opacity.

As a reference, we plotted the values of the planetary diameter, the star radius and the gravity of the planet obtained by Faedi et al. (2011) at the discovery of the planet. We also plotted the stratospheric temperature (Barstow et al. 2016) computed from the equilibrium temperature of Faedi et al. (2011) because we are mostly sensitive to the upper atmospheric levels (Roy-Perez et al. 2025). We note that the stratospheric temperature is always lower than the equilibrium temperature.

As a first conclusion from this inspection, we see that there are clear discrepancies among the retrieved parameters obtained from each data reduction process. Some of them show deviations higher than the retrieved error bars. We go on to analyse the details of some of the most significant differences below.

Both the planetary diameter and the star radius show high compatibility among all the retrieved values and with the reference value, which makes them robust inferences from all modelling efforts. However, the FIREFLy calibration throws a substantially higher value for the gravity, not compatible with any of the other computed values. Regarding the atmospheric temperature, there is also a substantial spread seen across the results, as values differ in some 300 K from the hottest (Tiberius) to the coldest (Carter-modified) model. In this case, both Carter-modified and Tshirt appear to be much cooler than expected for the planet Faedi et al. (2011), while the rest of calibrations provide models that are compatible with previous values, with all uncertainties taken into account.

Regarding the molecular abundances, there are also big differences. In some cases, the deviations are even bigger than two orders of magnitude. In the particular case of FIREFLy, abundances are systematically higher, thus resulting in an anomalously high mean molecular weight, possibly incompatible with what we might expect for this planet. This could be related to the scale height and gravity anomalies for these data, as already discussed.

When inspecting abundances based on single, narrow absorption bands, such as Na, we also found substantial differences. In this case, it is obvious that retrievals are heavily affected by the spectral grids chosen during data reduction and/or for our simulations. While this is common sense, a word of caution should be raised here for these kinds of absorption features, as the spectral resolution could lead to inconsistent values.

There is also a very interesting case seen for the H₂S retrieval. Half of the retrievals (FIREFLy, Tiberius, and Carter) have the capacity to determine its abundance to certain precision, while the other half (Tshirt, Eureka, and Carter-modified) are compatible with no H₂S at all in the atmosphere and do not require its presence. While this kind of retrieval is not the ideal way to detect such narrow-banded species, the inclusion or exclusion of certain species is heavily influenced by the calibration process as well.

Finally, there are also differences in the retrieved opacity of the cloud. While the differences are significant, we have found a physically common ground for most of the cases, as they place the limit of an aerosol, optically thick atmosphere for pressure levels at some 100s mbar.

Thus, in short, we find that there is a clear effect of the data reduction process in the model parameters that we retrieved. That can be traced even to the most basic physical parameters of the planet (e.g. gravity or mean molecular weight) and undoubtedly affect the abundances of many species. Even if we can correct the most simple issues by wisely choosing an adequate spectral grid, there is still an uncertainty of at least one order of magnitude that remains. This encourages to always take the values for a given retrieval within the overall context, including the process followed to extract the spectra from the original data.

Fig. 4

Comparison of the results obtained when using the different spectra in retrievals (shown in different colours). See Sect. 2 for a full reference of the spectra. The filled points correspond to the spectra covering the full spectra range and the unfilled ones to the spectra with the saturated spectral range removed. The gray values represent the reference values from Faedi et al. (2011). The planetary radius at 100 mbar, the mean atmospheric molecular weight and the aerosols opacity at 1 μm were computed from actual outputs and not directly retrieved.

4.2 The effect of the saturated region

We show in Fig. 4 the same information as in Fig. 3, but adding the results obtained from the retrievals excluding the saturated region of the spectra. We note that without the saturated region, Carter and Carter-modified are identical and only Carter results were computed at this set of retrievals. For the full analysis, we refer to the corner plots shown in Figs. A.1–A.6 (coloured lines).

The first thing worth highlighting is the fact that the planetary diameter, star radius, and the gravity of the planet do not suffer relevant variations. There are slight differences in the present analysis, but always smaller than the error bars. A similar behaviour was achieved for the aerosol extinction; even if we found some decay in the cloud opacity, it was always compatible with previous results.

However, there are significant differences for the molecular abundances and temperatures, mainly in line with two opposite trends. On the one hand, for the FIREFLy and Carter spectra, the temperature decreases while the molecular abundances are increased. On the other hand, for the rest of the spectra, the temperature increases significantly while the molecular abundances decrease considerably; in some cases, up to an order of magnitude. This anti-correlation can be clearly identified in the mentioned corner plots (see e.g. Figs. 5 or A.4). While it was also present when using the full spectral range, removing the saturated regions increased it substantially. This anti-correlation may be based in the increase of scale height for a hotter atmosphere, which requires us to decrease gaseous abundances for the same extinction to happen.

There are two main aspects that must be considered regarding the saturated region. First of all, it covers mainly some H₂O absorption features (see Fig. 4 from Rustamkulov et al. 2023). Without the saturated region, the H₂O abundance retrieval relies heavily on the absorption peak at 2.8 μm, which is shared with CO₂ and H₂S absorption, and on the continuum at the red end of the spectra, which is dominated by CO. This fact increases the degeneracy between these parameters, along with temperature (as summarised in Fig. 5), which leads to lower accuracy when constraining their values. On the other hand, SO₂ and Na are mostly unaffected by the removal of the saturated region. This supports the idea of removing the saturated region, as its inclusion seems to be a substantial source of dispersion for the different spectral data.

It is important to highlight the increased homogeneity of the results from the retrievals without the saturated region, relative to the case presented in the preceding section. Results that diverged in terms of more than an order of magnitude, when using the full range of the spectra, differ by less than half of that when the saturated region is removed.

In summary, taking care of saturation provides a better constrain on parameter degeneracies, but this comes at the cost of increasing result dispersion, depending on the exact procedures for the data calibration. While some prescriptions for the recovery of the saturated region are probably better or more sophisticated than others, comparing calibrations that include this particular region of the spectrum increases the dispersion of the retrieval results, particularly regarding the molecular abundances.

Fig. 5

Partial corner plot extracted from Fig. A.4, using Eureka spectrum as the input for the retrievals. Black areas are used for the results including the whole spectral range, while red is used for those excluding the saturated 0.69-1.91 μm range. The temperature shows negative correlations with molecular abundances, while the abundances of H₂O, CO₂ and CO are positively correlated with each other.

Table 2

Bayes factor values obtained for the cloud extinction models when compared to the flat model for each of the input spectral data without the saturated region.

4.3 Cloud extinction model effect

Figure 6 is similar to Fig. 4 but showing different cloud extinction parametrisations. Once we demonstrated that including the saturated region in the retrievals introduces a certain level of noise, we then repeated the retrievals excluding the saturation from the spectral data. The ‘flat’ cloud extinction model is the same as in Fig 4. In this figure, we include the results obtained when using the MOPSMAP database and the Ångström parametrisation. The disparity in the results for a single reference spectrum when using the different cloud extinction parametrisations is (for most of the atmospheric parameters) at the same level as the dispersion obtained when using the flat cloud extinction model for the different data reductions. This highlights that the biases introduced by data reduction processes in the retrieved atmospheric properties are at the same level as those that can be introduced in a modelling stage.

The results shown in Fig. 6 allow us to extend the discussion about the role that different cloud extinction models play on the retrieved atmospheric properties, as done in Roy-Perez et al. (2025). The Bayes factors obtained when including the cloud parametrisations with respect to the flat extinction model are shown in Table 2. Following Jeffrey’s scale criteria (Jeffreys 1998; Trotta 2008), we find that the Bayesian evidence supports a cloud extinction model more complex than the common ‘flat’ assumption. For the FIREFLy, Tiberius, and Eureka spectra, the Mie scattering parametrisation is strongly favoured, while for the T-shirt and Carter spectra, the Ängström parametrisation has higher Bayes factor values; however, the differences are not large enough to claim that one parametrisation is statistically favoured over the other.

The bottom-right panel in Fig. 6 provides a first glimpse over the properties of the retrieved aerosols for each model, showing their opacity at λ = 1.0 μm. Figure 7, instead, shows the retrieved cloud optical depth at the 1 bar layer along the whole spectral range for each of the input spectral data and for both aerosol parametrisations. We can identify three main aerosol spectral trends, which is partially correlated with the aerosol opacity, as seen in Fig. 6.

The first group is formed by the clouds defined using MOPSMAP for the Tiberius, Eureka, and Carter spectra. Their overall opacity value is comparable with that retrieved when using flat parametrisations (see Fig. 6). However, in terms of the dependence with wavelength, they are not producing flat extinctions anymore. With similar retrieved mean particle size close to r_eff ≈ 3.5 μm, they reproduce an opacity slightly growing with wavelength. As mentioned in Roy-Perez et al. (2025), the small bump in the spectra, which is statistically favoured with respect to a flat extinction, can hide the contribution from the H₂S and prevent its detection.

The second group is formed by the clouds retrieved for the FIREFLy, Tiberius, Eureka, and Carter spectra, but this time using the Ängström parametrisation. Figure 7 shows that for these cases the opacity grows with wavelength. However, with this parametrisation the contrast between the opacity at short and long wavelengths is much higher than that achieved for the first group of clouds. While the optical depth at longer wavelengths is similar to the previous group, the extinction at shorter wavelengths is significantly smaller. We can check Fig. 6 to see that these models do indeed compensate for the lack of aerosol opacity by increasing the planetary diameter and gravity, while decreasing in temperature. This fact would flatten the spectra and, thus, H₂O and CO₂ abundances would also increase. Despite the low opacity of these clouds, its increase with wavelength also hides the contribution from the H₂S. In some cases, this effect can even make it difficult to properly constrain the abundances of CO and SO₂, which leave their fingerprints on the longest wavelengths of the spectrum.

Lastly, the third group of cloud models is formed by both clouds (Mie and Ängström) retrieved for the Tshirt spectrum and by the MOPSMAP cloud parametrisation for the FIREFLy spectrum, which are very different from previous groups in terms of their spectral properties. Cloud opacities are heavily decreasing with wavelength, corresponding to small particles of sizes r_eff ≈ 0.03 μm. Inversely to the clouds from the second group, these aerosols produce high optical depths at the shortest wavelengths of the spectrum, while causing almost no extinction at the longest wavelengths. This lower opacity results in the same parameter changes described above: increased diameter, gravity and main molecular abundances, and a decrease in temperature. In this case, as there was no bump towards the longest wavelengths, the retrievals were able to constrain the abundance of H₂S.

These three aerosol behaviours seem to be related with two different types of retrieved planetary atmospheres by the algorithm. On the one hand, there is the case where the retrieved cloud optical depth is relevant at all the wavelengths of the spectrum. The algorithm evolves towards a planet with lower values of the planetary diameter, but with an extended atmosphere produced by the lower values of the gravity of the planet and the high retrieved temperature. On the other hand, we found a planet with optically thin clouds at some regions of the spectrum. Here, the algorithm retrieves a higher planetary diameter at the reference pressure, but with a more compact atmosphere produced by the high values of the gravity of the planet and its lower atmospheric temperatures. The two solution families are shown in Fig. A.7.

With this information, it is not possible to support which of the two possible atmospheres is statistically favoured. FIREFLy and T-shirt spectra favour the case with a compact atmosphere. On the other hand, Tiberius and Eureka! spectra favour the extended atmosphere obtained when using the MOPSMAP cloud parametrisation. The Carter spectrum shows an intermediate behaviour. Despite achieving cloud opacities similar to those with Tiberius and Eureka!, it simultaneously favours a compact atmosphere. It must be noted, however, that the Bayes factor is just slightly higher than 1, so the evidence is not as clear as in the other cases.

As already mentioned, the parameters defining the extension of the upper atmosphere of the planet are correlated with the abundances of the chemical species present in it. Those retrievals that lead to models with compact atmospheres, in general terms, require higher abundances, especially of H₂O and CO₂. For these models, those with a cloud opacity increasing with wavelength partially hide the contribution from CO, SO₂, and H₂S, even completely masking their contribution in some cases. Nevertheless, those with the strongly decreasing cloud optical depth with wavelength require higher opacities also from those three molecules. This effect translates, specifically for the FIREFLy spectrum, into unexpectedly high atmospheric mean molecular weights. In contrast, the models with more extended atmospheres do not require such high values.

Fig. 6

Comparison of the results obtained when using the different spectra excluding the saturated region in retrievals. Each colour corresponds to a different input spectral data (see Sect. 2 for a full reference of the spectra) and each marker shape correspond to a different cloud extinction parametrisation model. The gray values represent the reference values from Faedi et al. (2011). The planetary radius at 100 mbar, the mean atmospheric molecular weight and the aerosols opacity at 1 μm were computed from actual outputs and not directly retrieved.

Fig. 7

Comparison of the retrieved optical depths in nadir geometry along the whole spectral range using cloud extinction parametrisations defined in Sect. 3.3. Each colour corresponds to a different input spectral data. Continuous lines represent optical depth retrieved with the MOPSMAP database and dashed lines represent those computed with the Ångström parametrisation.

4.4 Comparison with previous works

Our findings are in general agreement with the conclusions of Constantinou et al. (2023); Powell et al. (2024) and Schleich et al. (2025), specifically finding that there are relevant differences in the retrieved atmospheric properties when using different spectral data as input. We also agree that the most affected parameters are the molecular mixing ratios. Using the Tiberius and Eureka spectra at the 3-5 μm range, Constantinou et al. (2023) found differences in the retrieved abundances of 1 dex or more. Including other spectra, as FIREFLy or Carter, we find that these differences can be even bigger. Schleich et al. (2025) claimed that no other parameters in their retrievals showed any significant variations under perturbations of the transmission spectrum. Nevertheless, Constantinou et al. (2023) found how PT profiles could differ up to 2 σ from each other, which is similar to the results presented in the present work. In fact, for some cases we could find even bigger differences. We note that we are using the full NIRSpec/PRISM range, compared to the limited range in Schleich et al. (2025) and the use of HST data in Constantinou et al. (2023). Then, for the log g parameter, which is usually fixed for the retrievals in previous works, we also obtained substantial differences in its retrieved values. In some cases, especially when including different cloud extinction parametrisations, we obtained substantially higher values not compatible with the previous measurements from Faedi et al. (2011) or Mancini et al. (2018), as also reported by Lueber et al. (2024) using the NIRSpec/PRISM data.

When removing the saturated region of our input spectra, we get data with reliable spectral points in the NIR and midwavelength IR spectral regions. Using these data allows us to reach a higher homogeneity between our results in the retrievals, at the cost of an increased parameter degeneracy. Constantinou et al. (2023) suggested that to obtain accurate abundance estimates with JWST 3-5 μm data, complementary observations at shorter wavelengths are needed. Combining their Tiberius and Eureka spectra with the 0.8-1.7 μm data from HST, they also managed to obtain a better agreement between the retrieved mixing ratios.

Regarding the role of aerosols in the transmission spectrum, Constantinou et al. (2023) also found evidence suggesting that JWST data have enough potential to constrain the cloud properties. However, their results do not clearly favour any particular condensate or spectral behaviour and they also depend on the data reduction adopted. They obtained aerosols with significant opacity at the wavelengths of HST/WFC3, but with an opacity window at the 3-5 μm range of NIRSpec/PRISM that could be compatible with those from the third group aerosol behaviours (mentioned in Sect. 4.3).

Lastly, there is a huge range of molecular abundances in the retrievals published so far. For example, Fisher et al. (2024) showed that there is a big range of H₂O abundances that are consistent with the data. This is indeed the case for our results: our highest values are compatible with those obtained by Ahrer et al. (2023a); Powell et al. (2024) or Fisher et al. (2024) with the different JWST instruments during the ERS; our lowest values are in agreement with those reported by Constantinou et al. (2023) using NIRSpec or by Pinhas et al. (2018a) using HST and Spitzer. For the other molecular species there are not as many references (see e.g. Ahrer et al. 2023a or Constantinou et al. 2023), but the conclusions seem similar to those found for H₂O. In summary, the dispersion of values agrees with our results when only considering calibration, while not taking into account other modelling or observing factors.

5 Conclusion

The main conclusion of this work is that data reduction process of the JWST observations plays a substantial role for atmospheric retrievals. This is consistent with previous results from other instruments, as Powell et al. (2024) did for MIRI observations. For a set of six different data calibrations of the NIRSpec/PRISM observations of WASP-39b, we find that most of the retrieved atmospheric parameters are heavily affected by the spectrum that has been used as input. Physical parameters such as planetary gravity or temperature show relevant differences among the different retrievals that could lead to different atmospheres. The abundances of the chemical species are heavily influenced by the reference spectrum, showing deviations up to 2 dex in some cases. Thus, it is mandatory to always interpret the values from a given retrieval in the context of the process followed to extract the spectra from the original data.

For the case of WASP-39b, one of the crucial steps during the data reduction process was the recovery of the saturated region of the spectrum. We focus here on the JWST data and the efforts to minimise or remove such saturation. When comparing with retrievals that exclude this spectral region, we find that it introduces more dispersion in the finally retrieved atmospheric parameters, mostly in the abundances of the chemical species of the atmosphere. However, the cost of reducing retrieval dispersion increases the degeneracy between model parameters, as expected from reducing the spectral range. Ideally, a proper correction of the saturated region or, alternatively, a reliable source of information for the same range would be the only solution for removing degeneracies. Meanwhile, combining poor corrections with precise data will just increase the noise for the retrieved range of parameters.

An additional conclusion of this work is that the role of the data processing choices in retrievals is comparable to that of the assumptions and decisions made during the modelling stage.

The differences in the retrieved results introduced by choosing different input data is of the same order, if not bigger, than employing different cloud parametrisations when modelling the atmosphere. In fact, both issues are correlated. Including more complex cloud parametrisation is always statistically supported for all the spectral data analysed here, which is a relevant result by itself. However, we show in this work that the FIREFLy and Tshirt spectra, retrievals favour a reduced cloud opacity at some regions of the spectrum together with a compact atmosphere. On the other hand, retrievals with Eureka or Tiberius spectra favour a relevant cloud optical depth along the whole spectral range, which result in atmospheres with higher vertical extensions.

When analysing the retrieved results from each calibration and modelling effort, we find that there are a few possible atmospheres that reproduce the data similarly, but based on quite different versions of the planet. These families or solutions may point to hotter or colder atmospheres, extended or concentrated, and even clear or cloudy at most wavelengths. Leaving aside that some of these combinations may be discarded taking into account further information, present or future, we are faced again with the uncertainty of having to interpret all the data within the context of the reduction process and not just that of the observations.

All in all, a sustained effort to produce reliable calibrations is required. Such an ideal calibration should be in agreement with those provided by other JWST instruments and configurations, as well as with other sources of data, particularly if they complement the spectral range or resolution. While JWST data clearly have the capability to provide information on aspects of the atmosphere, that have not been studied in much detail so far (such as aerosol properties), this still requires an additional effort to achieve an agreement in terms of a general calibration of the data.

Acknowledgements

This work was supported by Grupos Gobierno Vasco IT1742-22. It has also been supported by grant PID2023-149055NB-C31 funded by MICIU/AEI/10.13039/501100011033 and FEDER, UE. J. Roy-Perez acknowledges a PhD scholarship from UPV/EHU. This work is based on observations made with the NASA/ESA/CSA James Webb Space Telescope. The data were obtained from the Mikulski Archive for Space Telescopes at the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-03127 for JWST. These observations are associated with program JWST-ERS-01366. The data used in this paper are associated with JWST program ERS 1366, available from the Mikulski Archive for Space Telescopes (https://mast.stsci.edu). The authors acknowledge the ERS team for developing their observing program with a zero-exclusive-access period. We thank Dr. G. Villanueva and the PSG team for the development of the tool and the support provided to the users. We would also like to thank E. Ahrer and D. Christie for the discussions during the early stages of this work.

References

Abel, M., Frommhold, L., Li, X., & Hunt, K. L. C. 2011, J. Phys. Chem. A., 115, 6805 [NASA ADS] [CrossRef] [Google Scholar]
Abel, M., Frommhold, L., Li, X., & Hunt, K. L. C. 2012, J. Chem. Phys., 136, 044319 [NASA ADS] [CrossRef] [Google Scholar]
Ahrer, E.-M., Alderson, L., Batalha, N. M., et al. 2023a, Nature, 614, 649 [NASA ADS] [CrossRef] [Google Scholar]
Ahrer, E.-M., Stevenson, K. B., Mansfield, M., et al. 2023b, Nature, 614, 653 [NASA ADS] [CrossRef] [Google Scholar]
Alderson, L., Wakeford, H. R., Alam, M. K., et al. 2023, Nature, 614, 664 [NASA ADS] [CrossRef] [Google Scholar]
Allard, N. F., Spiegelman, F., & Kielkopf, J. F. 2016, A&A, 589, A21 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Allard, N. F., Spiegelman, F., Leininger, T., & Molliere, P. 2019, A&A, 628, A120 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Ängström, A. 1929, Geogr. Ann., 11, 156 [Google Scholar]
Arfaux, A., & Lavvas, P. 2024, MNRAS, 530, 482 [NASA ADS] [CrossRef] [Google Scholar]
Barstow, J. K. 2020, MNRAS, 497, 4183 [Google Scholar]
Barstow, J. K., Aigrain, S., Irwin, P. G. J., Kendrew, S., & Fletcher, L. N. 2015, MNRAS, 448, 2546 [NASA ADS] [CrossRef] [Google Scholar]
Barstow, J. K., Aigrain, S., Irwin, P. G. J., & Sing, D. K. 2016, ApJ, 834, 50 [NASA ADS] [CrossRef] [Google Scholar]
Bell, T., Ahrer, E.-M., Brande, J., et al. 2022, J. Open Source Softw., 7, 4503 [NASA ADS] [CrossRef] [Google Scholar]
Benneke, B., Werner, M., Petigura, E., et al. 2017, ApJ, 834, 187 [Google Scholar]
Buchner, J., Georgakakis, A., Nandra, K., et al. 2014, A&A, 564, A125 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Carter, A. L., May, E. M., Espinoza, N., et al. 2024, Nat. Astron., 8, 1008 [Google Scholar]
Constantinou, S., Madhusudhan, N., & Gandhi, S. 2023, ApJ, 943, L10 [NASA ADS] [CrossRef] [Google Scholar]
Davey, J. J., Yip, K. H., Al-Refaie, A. F., & Waldmann, I. P. 2025, MNRAS, 536, 2618 [Google Scholar]
Espinoza, N., Steinrueck, M. E., Kirk, J., et al. 2024, Nature, 632, 1017 [CrossRef] [Google Scholar]
Faedi, F., Barros, S. C. C., Anderson, D. R., et al. 2011, A&A, 531, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Feinstein, A. D., Radica, M., Welbanks, L., et al. 2023, Nature, 614, 670 [NASA ADS] [CrossRef] [Google Scholar]
Feroz, F., Hobson, M. P., & Bridges, M. 2009, MNRAS, 398, 1601 [NASA ADS] [CrossRef] [Google Scholar]
Fischer, P. D., Knutson, H. A., Sing, D. K., et al. 2016, ApJ, 827, 19 [CrossRef] [Google Scholar]
Fisher, C., & Heng, K. 2018, MNRAS, 481, 4698 [Google Scholar]
Fisher, C., Taylor, J., Parmentier, V., et al. 2024, MNRAS, 535, 27 [NASA ADS] [CrossRef] [Google Scholar]
Gasteiger, J., & Wiegner, M. 2018, GMD, 11, 2739 [NASA ADS] [Google Scholar]
Gordon, I., Rothman, L., Hargreaves, R., et al. 2022, J. Quant. Spec. Radiat. Transf., 277, 107949 [NASA ADS] [CrossRef] [Google Scholar]
Grant, D., Lothringer, J. D., Wakeford, H. R., et al. 2023, ApJ, 949, L15 [NASA ADS] [CrossRef] [Google Scholar]
Hansen, J. E., & Travis, L. D. 1974, Space Sci. Rev., 16, 527 [Google Scholar]
Hastings, W. K. 1970, Biometrika, 57, 97 [Google Scholar]
Heng, K. 2016, ApJ, 826, L16 [NASA ADS] [CrossRef] [Google Scholar]
Irwin, P. G. J. 2009, Giant Planets of Our Solar System (Springer International Publishing) [Google Scholar]
Jeffreys, H. 1998, The Theory of Probability (OuP Oxford) [Google Scholar]
Khorshid, N., Min, M., Polman, J., & Waters, L. B. F. M. 2024, A&A, 685, A64 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Kirk, J., Wheatley, P. J., Louden, T., et al. 2017, MNRAS, 468, 3907 [NASA ADS] [CrossRef] [Google Scholar]
Kirk, J., López-Morales, M., Wheatley, P. J., et al. 2019, ApJ, 158, 144 [Google Scholar]
Kirk, J., Rackham, B. V., MacDonald, R. J., et al. 2021, AJ, 162, 34 [NASA ADS] [CrossRef] [Google Scholar]
Kitzmann, D., Heng, K., Oreshenko, M., et al. 2020, ApJ, 890, 174 [Google Scholar]
Kreidberg, L. 2018, Exoplanet Atmosphere Measurements from Transmission Spectroscopy and Other Planet Star Combined Light Observations, eds. H. J. Deeg, & J. A. Belmonte (Cham: Springer International Publishing), 2083 [Google Scholar]
Lecavelier Des Etangs, A., Pont, F., Vidal-Madjar, A., & Sing, D. 2008, A&A, 481, L83 [NASA ADS] [CrossRef] [Google Scholar]
Li, G., Gordon, I. E., Rothman, L. S., et al. 2015, ApJS, 216, 15 [NASA ADS] [CrossRef] [Google Scholar]
Louca, A. J., Miguel, Y., & Kubyshkina, D. 2023, ApJ, 956, L19 [NASA ADS] [CrossRef] [Google Scholar]
Lueber, A., Novais, A., Fisher, C., & Heng, K. 2024, A&A, 687, A110 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Ma, S., Saba, A., Faris Al-Refaie, A., et al. 2025, arXiv e-prints [arXiv:2504.07823] [Google Scholar]
MacDonald, R. J., & Madhusudhan, N. 2017, MNRAS, 469, 1979 [Google Scholar]
Mancini, L., Esposito, M., Covino, E., et al. 2018, A&A, 613, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., & Teller, E. 1953, J. Chem. Phys., 21, 1087 [Google Scholar]
Morley, C. V., Fortney, J. J., Kempton, E. M.-R., et al. 2013, ApJ, 775, 33 [CrossRef] [Google Scholar]
Nikolov, N., Sing, D. K., Gibson, N. P., et al. 2016, ApJ, 832, 191 [NASA ADS] [CrossRef] [Google Scholar]
Pinhas, A., Madhusudhan, N., Gandhi, S., & MacDonald, R. 2018a, MNRAS, 482, 1485 [Google Scholar]
Pinhas, A., Rackham, B. V., Madhusudhan, N., & Apai, D. 2018b, MNRAS, 480, 5314 [Google Scholar]
Polyansky, O. L., Kyuberis, A. A., Zobov, N. F., et al. 2018, MNRAS, 480, 2597 [NASA ADS] [CrossRef] [Google Scholar]
Powell, D., Feinstein, A. D., Lee, E. K. H., et al. 2024, Nature, 626, 979 [NASA ADS] [CrossRef] [Google Scholar]
Roy-Perez, J., Pérez-Hoyos, S., Barrado-Izagirre, N., & Chen-Chen, H. 2025, A&A, 694, A249 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Rustamkulov, Z., Sing, D. K., Liu, R., & Wang, A. 2022, ApJ, 928, L7 [CrossRef] [Google Scholar]
Rustamkulov, Z., Sing, D. K., Mukherjee, S., et al. 2023, Nature, 614, 659 [NASA ADS] [CrossRef] [Google Scholar]
Schlawin, E., & Glidic, K. 2025, tshirt: Time Series Helper and Integration Reduction Tool, Astrophysics Source Code Library [record ascl:2501.004] [Google Scholar]
Schleich, S., Boro Saikia, S., Changeât, Q., et al. 2025, A&A, 704, A223 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Sing, D. K., Fortney, J. J., Nikolov, N., et al. 2016, Nature, 529, 59 [Google Scholar]
Skilling, J. 2006, Bayesian Anal., 1, 833 [Google Scholar]
Smith, M. D., Wolff, M. J., Clancy, R. T., Kleinböhl, A., & Murchie, S. L. 2013, JGR Planets, 118, 321 [CrossRef] [Google Scholar]
Sneep, M., & Ubachs, W. 2005, J. Quant. Spec. Radiat. Transf., 92, 293 [NASA ADS] [CrossRef] [Google Scholar]
Trotta, R. 2008, Contemp. Phys., 49, 71 [Google Scholar]
Tsai, S.-M., Lee, E. K. H., Powell, D., et al. 2023a, Nature, 617, 483 [CrossRef] [Google Scholar]
Tsai, S.-M., Moses, J. I., Powell, D., & Lee, E. K. H. 2023b, ApJ, 959, L30 [NASA ADS] [CrossRef] [Google Scholar]
Tsiaras, A., Waldmann, I. P., Zingales, T., et al. 2018, AJ, 155, 156 [NASA ADS] [CrossRef] [Google Scholar]
Villanueva, G. L., Mumma, M. J., Novak, R. E., et al. 2015, Science, 348, 218 [Google Scholar]
Villanueva, G. L., Smith, M. D., Protopapa, S., Faggi, S., & Mandell, A. M. 2018, J. Quant. Spec. Radiat. Transf., 217, 86 [NASA ADS] [CrossRef] [Google Scholar]
Wakeford, H. R., & Sing, D. K. 2015, A&A, 573, A122 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Wakeford, H. R., Sing, D. K., Deming, D., et al. 2018, AJ, 155, 29 [Google Scholar]
Yurchenko, S. N., Mellor, T. M., Freedman, R. S., & Tennyson, J. 2020, MNRAS, 496, 5282 [NASA ADS] [CrossRef] [Google Scholar]
Yurchenko, S. N., Owens, A., Kefala, K., & Tennyson, J. 2024, MNRAS, 528, 3719 [CrossRef] [Google Scholar]

Appendix A Additional figures

Fig. A.1

Full set of posterior distributions from our nested sampling retrievals performed on the FIREFLy data reduction. Black posteriors refer to retrievals including the whole spectral range of the data and coloured posteriors refer to retrievals excluding the saturated 0.69-1.91 μm range. A simple flat cloud absorption model is considered.

Fig. A.2

Same as Fig. A.1, but for the Tshirt data reduction.

Fig. A.3

Same as Fig. A.1, but for the Tiberius data reduction.

Fig. A.4

Same as Fig. A.1, but for the Eureka data reduction.

Fig. A.5

Same as Fig. A.1, but for the Carter data reduction.

Fig. A.6

Same as Fig. A.1, but for the Carter-modified data reduction.

Fig. A.7

Full set of posterior distributions for the best cloud extinction model from our nested sampling retrievals performed on the different data reductions considered in the present work. Retrievals have been performed excluding the saturated 0.69-1.91 μm range from the data.

All Tables

Table 1

Prior probability distributions for the free parameters included in the retrievals.

In the text

Table 2

Bayes factor values obtained for the cloud extinction models when compared to the flat model for each of the input spectral data without the saturated region.

In the text

All Figures

	Fig. 1 Representation of the different spectral data used for the purpose of this work. The gray shades in the 0.63-2.06 micron range are related to the level of saturation: for the lightest region, only one group per integration was affected by saturation, while for the darkest region, up to four of the five groups were affected.
In the text

	Fig. 2 Comparison of the spectra with the FIREFLy data reduction as a reference and their respective residuals. Coloured regions at the residuals represent error bar propagation. Grey regions show the saturated spectral range (0.63-2.06 μm) as in Fig. 1.
In the text

	Fig. 7 Comparison of the retrieved optical depths in nadir geometry along the whole spectral range using cloud extinction parametrisations defined in Sect. 3.3. Each colour corresponds to a different input spectral data. Continuous lines represent optical depth retrieved with the MOPSMAP database and dashed lines represent those computed with the Ångström parametrisation.
In the text

	Fig. A.1 Full set of posterior distributions from our nested sampling retrievals performed on the FIREFLy data reduction. Black posteriors refer to retrievals including the whole spectral range of the data and coloured posteriors refer to retrievals excluding the saturated 0.69-1.91 μm range. A simple flat cloud absorption model is considered.
In the text

	Fig. A.2 Same as Fig. A.1, but for the Tshirt data reduction.
In the text

	Fig. A.3 Same as Fig. A.1, but for the Tiberius data reduction.
In the text

	Fig. A.4 Same as Fig. A.1, but for the Eureka data reduction.
In the text

	Fig. A.5 Same as Fig. A.1, but for the Carter data reduction.
In the text

	Fig. A.6 Same as Fig. A.1, but for the Carter-modified data reduction.
In the text

	Fig. A.7 Full set of posterior distributions for the best cloud extinction model from our nested sampling retrievals performed on the different data reductions considered in the present work. Retrievals have been performed excluding the saturated 0.69-1.91 μm range from the data.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[R1] Abel, M., Frommhold, L., Li, X., & Hunt, K. L. C. 2011, J. Phys. Chem. A., 115, 6805 [NASA ADS] [CrossRef] [Google Scholar]

[R2] Abel, M., Frommhold, L., Li, X., & Hunt, K. L. C. 2012, J. Chem. Phys., 136, 044319 [NASA ADS] [CrossRef] [Google Scholar]

[R3] Ahrer, E.-M., Alderson, L., Batalha, N. M., et al. 2023a, Nature, 614, 649 [NASA ADS] [CrossRef] [Google Scholar]

[R4] Ahrer, E.-M., Stevenson, K. B., Mansfield, M., et al. 2023b, Nature, 614, 653 [NASA ADS] [CrossRef] [Google Scholar]

[R5] Alderson, L., Wakeford, H. R., Alam, M. K., et al. 2023, Nature, 614, 664 [NASA ADS] [CrossRef] [Google Scholar]

[R6] Allard, N. F., Spiegelman, F., & Kielkopf, J. F. 2016, A&A, 589, A21 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R7] Allard, N. F., Spiegelman, F., Leininger, T., & Molliere, P. 2019, A&A, 628, A120 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R8] Ängström, A. 1929, Geogr. Ann., 11, 156 [Google Scholar]

[R9] Arfaux, A., & Lavvas, P. 2024, MNRAS, 530, 482 [NASA ADS] [CrossRef] [Google Scholar]

[R10] Barstow, J. K. 2020, MNRAS, 497, 4183 [Google Scholar]

[R11] Barstow, J. K., Aigrain, S., Irwin, P. G. J., Kendrew, S., & Fletcher, L. N. 2015, MNRAS, 448, 2546 [NASA ADS] [CrossRef] [Google Scholar]

[R12] Barstow, J. K., Aigrain, S., Irwin, P. G. J., & Sing, D. K. 2016, ApJ, 834, 50 [NASA ADS] [CrossRef] [Google Scholar]

[R13] Bell, T., Ahrer, E.-M., Brande, J., et al. 2022, J. Open Source Softw., 7, 4503 [NASA ADS] [CrossRef] [Google Scholar]

[R14] Benneke, B., Werner, M., Petigura, E., et al. 2017, ApJ, 834, 187 [Google Scholar]

[R15] Buchner, J., Georgakakis, A., Nandra, K., et al. 2014, A&A, 564, A125 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R16] Carter, A. L., May, E. M., Espinoza, N., et al. 2024, Nat. Astron., 8, 1008 [Google Scholar]

[R17] Constantinou, S., Madhusudhan, N., & Gandhi, S. 2023, ApJ, 943, L10 [NASA ADS] [CrossRef] [Google Scholar]

[R18] Davey, J. J., Yip, K. H., Al-Refaie, A. F., & Waldmann, I. P. 2025, MNRAS, 536, 2618 [Google Scholar]

[R19] Espinoza, N., Steinrueck, M. E., Kirk, J., et al. 2024, Nature, 632, 1017 [CrossRef] [Google Scholar]

[R20] Faedi, F., Barros, S. C. C., Anderson, D. R., et al. 2011, A&A, 531, A40 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R21] Feinstein, A. D., Radica, M., Welbanks, L., et al. 2023, Nature, 614, 670 [NASA ADS] [CrossRef] [Google Scholar]

[R22] Feroz, F., Hobson, M. P., & Bridges, M. 2009, MNRAS, 398, 1601 [NASA ADS] [CrossRef] [Google Scholar]

[R23] Fischer, P. D., Knutson, H. A., Sing, D. K., et al. 2016, ApJ, 827, 19 [CrossRef] [Google Scholar]

[R24] Fisher, C., & Heng, K. 2018, MNRAS, 481, 4698 [Google Scholar]

[R25] Fisher, C., Taylor, J., Parmentier, V., et al. 2024, MNRAS, 535, 27 [NASA ADS] [CrossRef] [Google Scholar]

[R26] Gasteiger, J., & Wiegner, M. 2018, GMD, 11, 2739 [NASA ADS] [Google Scholar]

[R27] Gordon, I., Rothman, L., Hargreaves, R., et al. 2022, J. Quant. Spec. Radiat. Transf., 277, 107949 [NASA ADS] [CrossRef] [Google Scholar]

[R28] Grant, D., Lothringer, J. D., Wakeford, H. R., et al. 2023, ApJ, 949, L15 [NASA ADS] [CrossRef] [Google Scholar]

[R29] Hansen, J. E., & Travis, L. D. 1974, Space Sci. Rev., 16, 527 [Google Scholar]

[R30] Hastings, W. K. 1970, Biometrika, 57, 97 [Google Scholar]

[R31] Heng, K. 2016, ApJ, 826, L16 [NASA ADS] [CrossRef] [Google Scholar]

[R32] Irwin, P. G. J. 2009, Giant Planets of Our Solar System (Springer International Publishing) [Google Scholar]

[R33] Jeffreys, H. 1998, The Theory of Probability (OuP Oxford) [Google Scholar]

[R34] Khorshid, N., Min, M., Polman, J., & Waters, L. B. F. M. 2024, A&A, 685, A64 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R35] Kirk, J., Wheatley, P. J., Louden, T., et al. 2017, MNRAS, 468, 3907 [NASA ADS] [CrossRef] [Google Scholar]

[R36] Kirk, J., López-Morales, M., Wheatley, P. J., et al. 2019, ApJ, 158, 144 [Google Scholar]

[R37] Kirk, J., Rackham, B. V., MacDonald, R. J., et al. 2021, AJ, 162, 34 [NASA ADS] [CrossRef] [Google Scholar]

[R38] Kitzmann, D., Heng, K., Oreshenko, M., et al. 2020, ApJ, 890, 174 [Google Scholar]

[R39] Kreidberg, L. 2018, Exoplanet Atmosphere Measurements from Transmission Spectroscopy and Other Planet Star Combined Light Observations, eds. H. J. Deeg, & J. A. Belmonte (Cham: Springer International Publishing), 2083 [Google Scholar]

[R40] Lecavelier Des Etangs, A., Pont, F., Vidal-Madjar, A., & Sing, D. 2008, A&A, 481, L83 [NASA ADS] [CrossRef] [Google Scholar]

[R41] Li, G., Gordon, I. E., Rothman, L. S., et al. 2015, ApJS, 216, 15 [NASA ADS] [CrossRef] [Google Scholar]

[R42] Louca, A. J., Miguel, Y., & Kubyshkina, D. 2023, ApJ, 956, L19 [NASA ADS] [CrossRef] [Google Scholar]

[R43] Lueber, A., Novais, A., Fisher, C., & Heng, K. 2024, A&A, 687, A110 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R44] Ma, S., Saba, A., Faris Al-Refaie, A., et al. 2025, arXiv e-prints [arXiv:2504.07823] [Google Scholar]

[R45] MacDonald, R. J., & Madhusudhan, N. 2017, MNRAS, 469, 1979 [Google Scholar]

[R46] Mancini, L., Esposito, M., Covino, E., et al. 2018, A&A, 613, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R47] Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., & Teller, E. 1953, J. Chem. Phys., 21, 1087 [Google Scholar]

[R48] Morley, C. V., Fortney, J. J., Kempton, E. M.-R., et al. 2013, ApJ, 775, 33 [CrossRef] [Google Scholar]

[R49] Nikolov, N., Sing, D. K., Gibson, N. P., et al. 2016, ApJ, 832, 191 [NASA ADS] [CrossRef] [Google Scholar]

[R50] Pinhas, A., Madhusudhan, N., Gandhi, S., & MacDonald, R. 2018a, MNRAS, 482, 1485 [Google Scholar]

[R51] Pinhas, A., Rackham, B. V., Madhusudhan, N., & Apai, D. 2018b, MNRAS, 480, 5314 [Google Scholar]

[R52] Polyansky, O. L., Kyuberis, A. A., Zobov, N. F., et al. 2018, MNRAS, 480, 2597 [NASA ADS] [CrossRef] [Google Scholar]

[R53] Powell, D., Feinstein, A. D., Lee, E. K. H., et al. 2024, Nature, 626, 979 [NASA ADS] [CrossRef] [Google Scholar]

[R54] Roy-Perez, J., Pérez-Hoyos, S., Barrado-Izagirre, N., & Chen-Chen, H. 2025, A&A, 694, A249 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R55] Rustamkulov, Z., Sing, D. K., Liu, R., & Wang, A. 2022, ApJ, 928, L7 [CrossRef] [Google Scholar]

[R56] Rustamkulov, Z., Sing, D. K., Mukherjee, S., et al. 2023, Nature, 614, 659 [NASA ADS] [CrossRef] [Google Scholar]

[R57] Schlawin, E., & Glidic, K. 2025, tshirt: Time Series Helper and Integration Reduction Tool, Astrophysics Source Code Library [record ascl:2501.004] [Google Scholar]

[R58] Schleich, S., Boro Saikia, S., Changeât, Q., et al. 2025, A&A, 704, A223 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R59] Sing, D. K., Fortney, J. J., Nikolov, N., et al. 2016, Nature, 529, 59 [Google Scholar]

[R60] Skilling, J. 2006, Bayesian Anal., 1, 833 [Google Scholar]

[R61] Smith, M. D., Wolff, M. J., Clancy, R. T., Kleinböhl, A., & Murchie, S. L. 2013, JGR Planets, 118, 321 [CrossRef] [Google Scholar]

[R62] Sneep, M., & Ubachs, W. 2005, J. Quant. Spec. Radiat. Transf., 92, 293 [NASA ADS] [CrossRef] [Google Scholar]

[R63] Trotta, R. 2008, Contemp. Phys., 49, 71 [Google Scholar]

[R64] Tsai, S.-M., Lee, E. K. H., Powell, D., et al. 2023a, Nature, 617, 483 [CrossRef] [Google Scholar]

[R65] Tsai, S.-M., Moses, J. I., Powell, D., & Lee, E. K. H. 2023b, ApJ, 959, L30 [NASA ADS] [CrossRef] [Google Scholar]

[R66] Tsiaras, A., Waldmann, I. P., Zingales, T., et al. 2018, AJ, 155, 156 [NASA ADS] [CrossRef] [Google Scholar]

[R67] Villanueva, G. L., Mumma, M. J., Novak, R. E., et al. 2015, Science, 348, 218 [Google Scholar]

[R68] Villanueva, G. L., Smith, M. D., Protopapa, S., Faggi, S., & Mandell, A. M. 2018, J. Quant. Spec. Radiat. Transf., 217, 86 [NASA ADS] [CrossRef] [Google Scholar]

[R69] Wakeford, H. R., & Sing, D. K. 2015, A&A, 573, A122 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[R70] Wakeford, H. R., Sing, D. K., Deming, D., et al. 2018, AJ, 155, 29 [Google Scholar]

[R71] Yurchenko, S. N., Mellor, T. M., Freedman, R. S., & Tennyson, J. 2020, MNRAS, 496, 5282 [NASA ADS] [CrossRef] [Google Scholar]

[R72] Yurchenko, S. N., Owens, A., Kefala, K., & Tennyson, J. 2024, MNRAS, 528, 3719 [CrossRef] [Google Scholar]