| Issue |
A&A
Volume 706, February 2026
|
|
|---|---|---|
| Article Number | A137 | |
| Number of page(s) | 8 | |
| Section | Planets, planetary systems, and small bodies | |
| DOI | https://doi.org/10.1051/0004-6361/202556540 | |
| Published online | 09 February 2026 | |
Stellar chemistry and planet size: Insights from GALAH DR4
1
School of Physics and Astronomy, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University,
Tel Aviv
6997801,
Israel
2
Department of Geophysics, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University,
Tel Aviv
6997801,
Israel
3
Institut für Astrophysik, Universität Zürich,
Winterthurerstr. 190,
8057
Zurich,
Switzerland
4
Astrophysics Group, Cavendish Laboratory, University of Cambridge,
JJ Thomson Avenue,
Cambridge
CB3 0HE,
UK
★ Corresponding authors: This email address is being protected from spambots. You need JavaScript enabled to view it.
; This email address is being protected from spambots. You need JavaScript enabled to view it.
; This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
22
July
2025
Accepted:
28
November
2025
The well-known correlation between stellar metallicity and the presence of planets is strongest for giant planets and weaker for smaller planets, suggesting that detailed elemental patterns beyond [Fe/H] may be relevant. Using abundances from the fourth data release of the GALAH spectroscopic survey, we analyzed 104 host stars with 141 confirmed transiting planets. We divided the planets at rp = 2.6 R⊕, the theoretical threshold radius above which planets are unlikely to be pure-water worlds. We find that large-planet hosts are enriched by ~0.2 dex in iron and show a possible excess of highly volatile elements (C, N, and O), though these measurements are affected by observational limitations, whereas small-planet hosts exhibit an enhanced contribution of the classical rock-forming elements (Mg, Si, Ca, and Ti) relative to iron; this corresponds to a modest [Rock/Fe] offset of 0.06 dex, which is statistically significant, with a p value of 10−4. These offsets remain significant for alternative radius cuts. A matched control sample of non-planet-host stars shows only weak and mostly statistically insignificant similar trends, confirming that the stronger chemical signatures are linked to the planetary characteristics. As our study relies on transiting planets, it mainly probes short-period systems (P < 100 days). These results refine the planet–metallicity relation, highlighting the role of the relative balance between iron, volatiles, and rock-forming elements in planet formation.
Key words: methods: statistical / techniques: spectroscopic / planets and satellites: composition / stars: abundances / planets and satellites: formation / planetary systems
© The Authors 2026
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. This email address is being protected from spambots. You need JavaScript enabled to view it. to support open access publication.
1 Introduction
A key empirical result in exoplanetary science is the correlation between stellar metallicity and the presence of planets. Radial-velocity surveys have shown that gas-giant planets are several times more common around stars with [Fe/H]≳ +0.3 dex than around solar-metallicity stars (Santos et al. 2004; Fischer & Valenti 2005; Johnson et al. 2010). This metallicity–planet relation provided crucial support for the core accretion paradigm, in which the enhanced heavy-element content of a protoplanetary disk accelerates the formation of massive planetary cores before disk gas dispersal. In contrast, disk instability scenarios predict only a weak metallicity dependence, since giant planets are assumed to form through the gravitational collapse of gas rather than core buildup (Boss 1997, 2002). Thus, the observed trend is widely regarded as one of the strongest observational pillars of the core accretion framework.
With the advent of Kepler and TESS (Transiting Exoplanet Survey Satellite), it became clear that this trend extends to smaller planets as well, although with a weaker slope. The frequency of 1–5 R⊕ planets still rises with host-star metallicity, albeit more moderately than for gas giants (Wang & Fischer 2015). Large studies have also shown that compact, close-in planets are ubiquitous even around stars of near-solar metallicity, suggesting that the formation of small planets requires less extreme heavy-element enrichment (Mayor et al. 2009; Buchhave et al. 2012). At the same time, population studies have revealed that small close-in planets are more common around α-rich thin-disk stars, emphasizing that detailed stellar chemistry, not only overall metallicity, modulates planet formation efficiency (Bashi & Zucker 2019; Bashi et al. 2020). These findings point toward the importance of considering more complex chemical characteristics rather than treating metallicity as a single number.
This view can be linked to the condensation-temperature framework of protoplanetary disks. Major compounds of different elements condense at characteristic temperatures in the protoplanetary disk (Lodders 2003), and the resulting volatile and refractory inventories provide the raw materials for planetary interiors. Highly volatile species such as carbon, nitrogen, and oxygen dominate the composition of ices and atmospheres, while refractories such as magnesium, silicone, and calcium form the planetary silicate mantle. Among these, iron plays the most distinctive: it is the only major condensate to form in metallic form at nebular temperatures, and it dominates the core mass fraction of rocky planets, strongly influencing their density and ability to retain atmospheres. Iron is also the element for which stellar abundances are measured most precisely, making [Fe/H] both a convenient tracer and a physically meaningful parameter in studies of planet occurrence.
Meanwhile, it was determined that the distribution of planet sizes shows a pronounced “radius valley” near rp ≃ 1.8–2.0 R⊕ (Fulton et al. 2017; Van Eylen et al. 2018; MacDougall et al. 2023). This is usually interpreted as the outcome of atmospheric mass-loss processes, driven either externally by stellar irradiation (photoevaporation; Owen & Wu 2013; Lopez & Fortney 2013) or internally by the cooling luminosity of the planetary core (core-powered mass loss; Ginzburg et al. 2016, 2018). Recent simulations also suggest the involvement of orbital evolution, dynamical instabilities, and late giant impacts in forming and shaping the radius valley (e.g., Venturini et al. 2020; Burn et al. 2024; Shibata & Izidoro 2025). Interior modeling suggests that a related composition-driven transition occurs near ~2.6 R⊕, separating predominantly rocky planets from volatile-rich ones (Lozovsky et al. 2018). Thus, both evolutionary and primordial explanations have been put forward, and the true origin of the valley is likely a combination of these mechanisms. Importantly, the depth and morphology of the valley itself depend on stellar chemistry: for example, it is more pronounced at higher metallicities and greater α-enhancement (Chen et al. 2022), suggesting that chemical composition not only seeds planet formation but also influences long-term planetary evolution.
Large homogeneous spectroscopic surveys now make it possible to test these hypotheses systematically. While large compilations such as the Hypatia Catalog (Hinkel et al. 2014; Hinkel & Unterborn 2018; Hinkel et al. 2019) have provided valuable insights into host-star chemistry, their heterogeneous literature origins limit the precision of comparative studies. The GALAH (GALactic Archaeology with HERMES) survey (Buder et al. 2025) now offers homogeneous abundances for up to 32 elements across nearly a million stars, enabling sensitive differential tests of volatile-to-refractory hypotheses at scale. In this work we cross-matched the catalog of the fourth data release of GALAH (GALAH DR4) with confirmed exoplanet hosts to investigate how stellar abundances correlate with the sizes of the planets they host. We find that large-planet hosts appear enriched in iron and possibly in highly volatile elements, though the quantifying the latter may be affected by observational limitations (Sect. 2.3). This iron-dominated refractory enrichment, together with a potential volatile enhancement, refines the classical metallicity paradigm and highlights the role of stellar chemistry in shaping planetary sizes (and therefore the planetary type).
The paper is organized as follows. Section 2 describes the data and methodology, including the construction of the planet sample, the stellar abundances from GALAH DR4, the composite abundance indices, the weighting scheme, the statistical tests, and the control-sample comparison. Section 3 presents the results of the element-by-element analysis, the composite abundance indices, and the control-sample results. Section 4 discusses the implications and limitations of these findings, and Sect. 5 summarizes our conclusions.
2 Data and methods
2.1 Sample
We cross-matched the stars included in the GALAH DR4 database (Buder et al. 2025) with the NASA Exoplanet Archive table of confirmed planets (Christiansen et al. 2025) on April 10, 2025, yielding Nhost = 104 main-sequence planet-hosting stars and Npl = 141 confirmed planets. Since our analysis requires precise radii, we are restricted to transiting planets. Consequently, our sample is inherently biased toward short-period systems; in practice, nearly all of our planets have orbital periods shorter than 100 days.
Following the compositional transition identified by Lozovsky et al. (2018), we divided the sample at rp = 2.6 R⊕. We considered the 78 planets with rp < 2.6 R⊕ as small and the 63 planets with rp ≥ 2.6 R⊕ as large.
2.2 Abundances
The GALAH survey provides high-resolution (R ≃ 28 000) optical spectroscopy for nearly a million stars, with DR4 delivering abundances for up to 32 elements. Typical uncertainties for well-measured species are ~0.03–0.07 dex.
The catalog reports [Fe/H] directly, while all other abundances were expressed relative to iron, [X/Fe]1. We recover absolute abundances as [X/H] = [X/Fe] + [Fe/H].
The GALAH limited availability of some of the abundance indices constrained the power of the single-element approach. To maximize sample size while preserving physical meaning, after a preliminary element-by-element study (whose results will be presented in Sect. 3.2), we regrouped elements by condensation temperature, adopting the grouping suggested by Lodders (2003), and keeping only elements that would not reduce the sample size too much:
Following Lodders (2003), we collapsed the measured elements into four condensate groups: (i) highly volatile – C, N, and O; (ii) volatile – C, N, O, Na, K, Cu, and Zn; (iii) refractory – Mg, Al, Si, Ca, Sc, Ti, V, Cr, Fe, Co, Ni, and Ba; and (iv) rock-forming – Mg, Al, Si, Ca, and Ti. We note that these groupings are not mutually exclusive. In particular, the highly volatile elements (C, N, and O) also enter the broader “volatile” category, and the classical rock-forming species (Mg, Si, Ca, and Ti) are a subset of the wider refractory pool. This overlap reflects the hierarchical nature of condensation chemistry: broader groups trace the bulk volatile or refractory budget, while the nested subgroups isolate specific components (e.g., the rock-forming silicates). By retaining this overlap, we can test both the overall volatile-refractory balance and the distinct role of individual subgroups within it. To ensure statistical robustness, we restricted the group-composite indices to include only elements measured for a sufficiently large fraction of stars, so that the host-star sample retained more than 80 stars for each composite index.
We excluded iron from our group of rock-forming elements since iron warrants special attention. It is the only major condensate to appear in metallic form at nebular temperatures (~1350 K), it dominates the core mass fraction that controls the mass-radius relation of rocky planets (e.g., Seager et al. 2007; Valencia et al. 2007; Zeng et al. 2016), and owing to its rich line spectrum in cool, near-solar-metallicity stars – the regime mainly probed by GALAH – [Fe/H] remains the most precise and externally calibrated tracer of stellar metallicity and the planet-occurrence trend.
GALAH reports the elemental abundances as [X/Fe] (and [Fe/H]) on the usual logarithmic scale. For our purposes, we combined individual elements into mass-weighted composite indices, following the prescription detailed by Gonzalez (2009):
Convert each element abundance to linear, solar-normalized abundance:
![$\[R_{\mathrm{X}}=10^{[\mathrm{X} / \mathrm{H}]}.\]$](/articles/aa/full_html/2026/02/aa56540-25/aa56540-25-eq1.png)
Apply atomic-mass weighting and restore the solar scale:
where AX is the atomic mass and A(X) is the solar photospheric abundance from the “Reference” column of Table 8 by Buder et al. (2025).Form the group composite (referring to the groups listed above) and return to the standard logarithmic form:
![$\[\operatorname{Group} / \mathrm{H}]=\log _{10}\left(\sum_{\mathrm{X}} M_{\mathrm{X}}\right)-\log _{10}\left(\sum_{\mathrm{X}} M_{\mathrm{X}, \odot}\right).\]$](/articles/aa/full_html/2026/02/aa56540-25/aa56540-25-eq4.png)
Calculate the ratio index between two groups, normalized to the solar reference:
![$\[\text { [Group1/Group2] }=\log _{10}\left(\frac{M_{\text {Group1,star }}}{M_{\text {Group2,star }}}\right)-\log _{10}\left(\frac{M_{\text {Group} 1, \odot}}{M_{\text {Group} 2, \odot}}\right).\]$](/articles/aa/full_html/2026/02/aa56540-25/aa56540-25-eq5.png)
This procedure yields composite indices such as [Vol/H], [Ref/H], [Rock/H], and their ratios [Vol/Ref] and [Rock/Fe]. These indices summarize the relative enrichment of different condensation-temperature groups while minimizing the impact of missing or uncertain individual element measurements.
2.3 Quality flags
To further ensure sample integrity, we inspected the element-specific quality flags provided by GALAH DR4 in the catalog galah_dr4_allstar_240705 (Buder et al. 2025). For each element, a corresponding flag, flag_X_fe, accompanies the reported abundance ratio [X/Fe] and encodes possible issues in its determination. Values of flag_X_fe = 0 denote reliable measurements, while nonzero values identify potential problems such as poor spectral fits, low S/N, or incomplete wavelength coverage. In constructing our main sample, we adopted only abundances with flag_X_fe = 0 for all elements except carbon and nitrogen.
For C and N, particular caution is required. As explained by Buder et al. (2025), the flags flag_X_fe = 32 and 33 indicate abundance determinations affected by incomplete spectral coverage or possible instrumental artifacts in the relevant molecular regions. Since the subset of stars with flag_c_fe = 0 and flag_n_fe = 0 constitutes only about 1% of the entire GALAH DR4 sample, restricting to those values would render the carbon and nitrogen datasets statistically negligible. We therefore retained stars with flag_c_fe = 32 or 33 and flag_n_fe = 32 or 33, while treating the resulting volatile indices with due caution in the interpretation (Sect. 5).
2.4 Statistical treatment of multiplicity
Our cross-match between the NASA Exoplanet Archive and GALAH DR4 yielded a total of 141 planets orbiting 104 host stars. Because planetary systems often contain multiple detected planets, a straightforward analysis at the planet level would over-weight such systems relative to single-planet hosts. To avoid this bias, we adopted a weighting scheme in which each host star contributed a total weight of unity, regardless of the number of planets it hosts:
If host star j has mj planets, then each of its planets k = 1, ..., mj was assigned weight wj,k = 1/mj. This scheme ensured that the weights of all planets belonging to a given host would sum to unity:
![$\[\sum_{k=1}^{m_j} w_{j, k}=1.\]$](/articles/aa/full_html/2026/02/aa56540-25/aa56540-25-eq6.png)
This way, every host star contributes equally to the statistics and we can still make use of all the confirmed planets in the system. We verified that the main chemical trends reported in this study are robust to alternative schemes (such as assigning equal weight to each planet, or restricting the sample to a single planet per host).
2.5 Weighted t test
To quantify the chemical differences between stars hosting small and large planets, we estimated differences in means of elemental abundances and composite indices using weighted two-sample Welch’s t test (e.g., Ruxton 2006), which is robust to unequal sample sizes and variances. For an abundance index X, the weighted test statistic is defined as
(1)
where the weighted mean for a subsample is
![$\[\bar{X}=\frac{\sum w_{j, k} X_j}{\sum w_{j, k}},\]$](/articles/aa/full_html/2026/02/aa56540-25/aa56540-25-eq8.png)
and the effective sample size (considering dependences) is defined as
![$\[N^{\mathrm{eff}}=\frac{\left(\sum w_{\mathrm{j}, \mathrm{k}}\right)^2}{\sum w_{j, k}^2}.\]$](/articles/aa/full_html/2026/02/aa56540-25/aa56540-25-eq10.png)
2.6 p-value significance
To avoid assumptions about the distributions of our samples, we decided to use a permutation test to assess the significance of our statistics (Good 1994). To account for planet multiplicity, we adapted the classic permutation test and used a clustered permutation test:
Group planets by host star: randomly shuffle the list of host stars so that planets orbiting a certain star will all remain affiliated to the same star, albeit probably not the original star.
Recompute t (Eq. (1)) for each of the 106 random permutations.
The two-sided p value is then the fraction of permuted |t| values exceeding the originally observed value |tobs|.
2.7 Control sample
To test whether the chemical differences we identify are indeed related to the presence of planets in different sizes or simply reflect biases caused by broader Galactic abundance trends, we constructed a control sample of stars without any detected planets (either transit or radial-velocity planets). A naive control sample drawn randomly from the field-star population would risk introducing systematic differences in stellar parameters. We therefore matched each host star to a control star from GALAH DR4 catalog selected to be as similar as possible in key stellar properties.
The matching was performed in the parameter space of effective temperature (Teff), surface gravity (log g), the metallicity ([Fe/H]), and apparent magnitude (mG, taken as the Gaia DR3 G-band mean magnitude, provided in GALAH DR4 as phot_g_mean_mag). For each potential pair of host and control stars, we calculated a normalized Euclidean distance:
(2)
(3)
where σTeff, σlog g, and σmG are the standard deviations of these parameters across the entire catalog. This normalization ensures that differences in scale between parameters are accounted for, and that each contributes equally to the distance metric. For each host star, the control star with the smallest normalized distance was selected to the control sample.
Each control star was used only once (matching without replacement), to ensure statistical independence and prevent oversampling of a small subset of field stars. Note again that in systems with multiple planets, the same control star was assigned to all occurrences of the host, preserving the weighting scheme described above.
After matching, we verified that the distributions of Teff, log g and mG in the control sample closely followed those of the host stars (Fig. 1). Weighted two-sample t tests yielded p values of ~0.8–0.9 in all three cases, indicating no statistically significant differences in means between the original planet-host sample and the matching control sample, in terms of these quantities. This agreement confirms that the procedure isolates abundance differences related to planet hosting, minimizing systematic biases caused by unrelated stellar parameter biases.
Furthermore, a potential bias could have arisen if a significant fraction of stars in either the test sample or the control sample were extremely iron-poor ([Fe/H ≲ −0.6). Such stars are typically α-enhanced thick-disk objects (e.g., Edvardsson et al. 1993; Bensby et al. 2003; Adibekyan et al. 2012). As can be seen in Fig. 1, both our test and control samples contain very few such stars, and thus the statistical comparisons should not be significantly affected by this population effect.
![]() |
Fig. 1 Validation of the control-sample matching. We show the distributions of effective temperature (upper left), surface gravity (upper right), Gaia G-band apparent magnitude (lower left), and iron abundance (lower right) for the sample of planet-host stars and the matched control sample. |
3 Results
3.1 Stellar properties of subsamples
Before comparing abundances, we assessed whether stars hosting small and large planets occupy similar regions of stellarparameter space. Figure 2 shows the distributions of Teff, log g, and mG for the two subsamples. Weighted Welch t tests of equality of means yield p values of 0.006 for log g (mean lower by 0.10 for large-planet hosts) and 0.013 for Teff (mean higher by 224 K for large-planet hosts). The Gaia magnitude mG does not exhibit a significant difference – 0.11 mag, with a p value of 0.356. The modest but statistically significant offsets in effective temperature and surface gravity plausibly reflect detection-completeness effects – transits of small planets are naturally shallower around larger, hotter stars.
3.2 Element-by-element offsets
Table 1 lists Welch’s t-test p values and mean differences between the small-planet and large-planet subsamples for 29 individual abundance ratios (which were available for more than three host stars in our sample). The p values were computed using the clustered permutation test on the weighted two-sample t test as described above. The table also shows the differences in mean abundance between large-planet and small-planet hosts (computed as large minus small), and the number of host stars for which GALAH DR4 contained an estimated abundance for the specific element.
Applying the false-discovery rate procedure (Benjamini & Hochberg 1995) for multiple comparisons, at a false-discovery rate of α = 0.05 (for a total of 29 comparisons), leaves nine significant abundances:
[N/Fe] (Δ = +0.26 dex), the strongest volatile tracer;
[Fe/H] (Δ = +0.15 dex), which anchors the classic metallicity trend;
[Na/Fe] (Δ = 0.06 dex) – a moderately volatile species;

Fig. 2 Distributions of effective temperature (left), surface gravity (middle), and Gaia G-band magnitude (right) for stars that host large (orange) and small (blue) planets. Large-planet hosts are slightly hotter and have lower surface gravities on average, while the apparent-magnitude distributions are similar.
Table 1Abundance offsets of small- and large-planet hosts.
[Mg/Fe], [Si/Fe], [Sc/Fe], [Ca/Fe], [Ti/Fe], and [Ni/Fe] (Δ ~ −0.07 − 0.03 dex) – a generally coherent enhancement of the classical rock-forming and iron-peak inventory.
These nine element offsets point to a possible chemical dichotomy: large-planet hosts are enriched in iron and may show elevated volatile abundances, although the latter should be interpreted with caution (Sect. 2.3). Small-planet hosts appear to be less iron-dominated, with higher ratios of rock-forming elements to iron. This points to a natural connection between enhanced rock-forming inventories and the assembly of smaller, predominantly rocky planets.
Composite abundance index offsets between hosts of small and large planets.
3.3 Composite abundance ratios
Table 2 and Fig. 3 compare eight abundance indices – [Fe/H] together with seven composite ratio indices of groups defined in Sect. 2.2 – between the small-planet and large-planet subsamples. The histograms were constructed with the multiplicity-aware weighting described in Sect. 2.4, so that each host star contributes equally.
The offsets and the p values in Table 2 were estimated using the weighted Welch’s t test and the clustered permutation test as described in Sect. 2. The table also shows the sizes of the two subsamples for each element grouping, which depends on the availability of data pertaining to the comprising elements in GALAH DR4. These quantities, NS and NL, are actually the sums of weights in the small-planet and large-planet subsamples, corresponding to the total area under the histograms in Fig. 3. They are reported here as an intuitive measure of the effective sample coverage, and should not be confused with the formal effective sample sizes used in the t-test definition in Sect. 2.5.
Most of the composites show highly statistically significant mean offsets of ~0.20 dex in the sense that large-planet hosts are more enriched. Two indices depart from this pattern. [Rock/H] rises by only 0.12 dex, and thus [Rock/Fe] is in fact lower for large-planet hosts by 0.06 dex, revealing an iron-skew within the refractory pool. Finally, [Vol/Ref] increases by 0.13 dex, and while it is still quite significant, with a p value of ~0.003, it is less significant than the other indices, perhaps due to a somewhat smaller sample of small-planet hosts.
Since oxygen abundances are not affected by the C and N flags, we also examined oxygen-only composites. These show offsets of ~0.20–0.22 dex with very low p values, indicating that the volatile-like trend is not solely driven by the flagged C and N measurements.
![]() |
Fig. 3 Weighted histograms of the eight abundance indices in Table 2 for hosts of small (blue) and large (orange) planets; each star contributes unit total weight. CNO-based indices are affected by abundance quality flags (Sect. 2.3), and the corresponding trends should therefore be interpreted cautiously. |
3.4 Control sample comparison
To assess whether the abundance offsets identified above are intrinsic to planet hosts or somehow related to broader Galactic chemical trends, we repeated the analysis on the matched control sample described in Sect. 2.7. In this case, each planet host was paired with a GALAH field star of similar Teff, log g, and mG, and the resulting subsamples were compared using the same weighted Welch’s t-test procedure.
Table 3 summarizes the results for the composite indices of the control sample. In contrast to the clear and highly significant offsets found for the planet-host sample, the control stars show only small abundance differences, typically in the range of 0.06–0.11 dex. Most of these differences are found to be statistically insignificant. for example, [Fe/H] differs by only −0.01 dex between the “small” and “large” control subsamples, and the offsets in [(CNO + Fe)/H], [CNO/H], and [(Ref + CNO)/H] are all below 0.1 dex, with p ≳ 0.07. The only index with marginal significance is [Vol/Ref], which increases by 0.11 dex (p ≃ 0.018), though even here the effect is weaker than in the planet-host sample.
These results demonstrate that the strong abundance differences reported above are not generic features of the field-star population. While some residual offsets do appear in the control sample, they are smaller in amplitude and generally fail to reach statistical significance. This supports the conclusion that the more pronounced iron enrichments and potentially also volatiles, together with the iron-skew signature among refractories, are genuinely linked to the presence of planets and their sizes.
Composite abundance index offsets for the matched control sample.
![]() |
Fig. 4 Weighted mean abundance differences (large minus small planet hosts) as a function of the chosen radius cutoff. Colored curves trace the various composite indices, with shaded bands showing uncertainties. The apparent volatile enhancement should be regarded as tentative given the limited reliability of C and N measurements. |
3.5 Robustness to radius cutoff
Although we adopted 2.6 R⊕ as a reference value, motivated by interior-structure models (Lozovsky et al. 2018), our findings are not tied to this exact value. Figure 4 shows the mean abundance offsets between large-planet and small-planet hosts as a function of the cutoff radius. The main trends – enrichment in iron and volatiles among large-planet hosts and relatively strong rock-forming inventories among small-planet hosts – persist across cutoff values between ~2.0 and 3.0 R⊕. We note that we plot the abundance offsets rather than the associated p values, since the latter mostly track the changing subsample size, whereas the offsets more directly illustrate the underlying chemical trends. Thus, toward the edges of the 2.0–3.0 R⊕ range the statistical significance weakens, but this is related to the smaller subsample size. Overall the chemical dichotomy remains. We return to the implications of this test in Sect. 4.
4 Discussion
Our results extend the classical planet–metallicity correlation (Santos et al. 2004; Fischer & Valenti 2005) in two important directions. First, they confirm iron enrichment as a hallmark of large-planet hosts, consistent with the idea that a high heavy-element budget facilitates the assembly of massive cores capable of retaining thick atmospheres. Second, we find tentative evidence of enrichment in the highly volatile elements carbon, nitrogen, and oxygen, although this result relies on flagged abundance measurements (Sect. 2.3) and should be interpreted with caution. In contrast, stars hosting small planets display a stronger relative contribution of the classical rock-forming elements (magnesium, silicone, calcium, titanium).
This chemical dichotomy resonates with the morphology of the exoplanet radius distribution. The “radius valley” near 1.8–2.0 R⊕ is usually attributed to atmospheric mass loss (e.g., Owen & Wu 2013; Lopez & Fortney 2013; Ginzburg et al. 2016). However, interior models suggest that a more fundamental composition-driven transition occurs near ~2.6 R⊕ (Lozovsky et al. 2018), separating rocky from volatile-rich planets. Our findings are consistent with this picture: systems enriched in iron and possibly volatiles preferentially host planets above the break, while those with enhanced rock-formers relative to iron are more likely to host planets below it. Stellar chemistry thus appears to shape not only the efficiency of planet formation, but also the eventual distribution of planetary sizes.
We tested the sensitivity of these trends to varying the planet–radius cutoff. While statistical significance naturally decreases as the sample sizes change, the qualitative separation between the two planet-host-stars subsamples persists, especially between 2.0 and 3.0 R⊕. However, because the signals remain similar regardless of the adopted boundary, the present sample is not rich enough to constrain the exact location of the cutoff radius itself. In this sense, our results point to the existence of a chemically defined transition, but do not constrain its exact location.
As a further robustness test, we compared our host sample with a matched control sample of non-host stars. The matched control sample of non-planet-hosting stars shows only weak and mostly insignificant versions of the same patterns, suggesting that the stronger signals in the host sample are genuinely linked to the presence and sizes of planets, rather than to Galactic chemical evolution.
Some limitations must be acknowledged. The host-star sample is modest (104 stars), which restricts the ability to resolve finer subdivisions (e.g., by orbital period or multiplicity). Second, while GALAH DR4 provides abundances for up to 32 elements, coverage is incomplete and uncertainties vary, leading to sample-size differences between indices. In addition, the individual C and N abundances in GALAH DR4 carry specific quality flags (Sect. 2.3), and hence, trends involving CNO should be regarded as tentative. A similar offset is obtained when using oxygen alone, whose abundances are unflagged, suggesting that the volatile-related signal is not exclusively dependent on the C and N determinations.
We also examined each abundance index as a function of Teff and log g. These checks revealed mild parameter-dependent trends that were present in both the host and control samples. Importantly, the chemical offsets between large-planet and small-planet hosts persist even after accounting for the small differences in Teff and log g, indicating that these stellar-parameter variations do not explain the observed abundance differences. In our preliminary explorations of GALAH DR4 we also saw hints of a possible dependence of planet size on stellar age, although the substantial age uncertainties and biases prevented a meaningful analysis at this stage. This remains an interesting direction for future work.
Note that our study is limited to transiting planets, and therefore predominantly probes short-period systems (P < 100 days). Whether the same chemical trends extend to longer-period planets requires additional data and remains to be tested in future research. Furthermore, small-planet transits are shallower around larger, hotter stars, which reduces completeness – the lower log g and higher Teff for large-planet hosts are therefore consistent with selection effects rather than intrinsic evolutionary differences between the host populations.
5 Summary and conclusions
We analyzed chemical abundances from GALAH DR4 for 104 host stars with 141 confirmed transiting planets. We divided the sample into stars that host “small” and “large” planets using a working threshold of rp = 2.6 R⊕. Our main findings are:
Stars that host large planets tend to be enriched in iron and show tentative indications of higher abundances of the highly volatile elements (C, N, and O), while small-planet hosts appear less iron-dominated, with higher ratios of rock-forming refractories (Mg, Si, Ca, and Ti) to iron.
The results remain statistically significant for similar dividing radius choices, but the current sample is too small for us to determine the precise location of the chemical transition.
A matched control sample of field stars shows only weak and mostly insignificant abundance trends, confirming that the stronger offsets in the host sample are not caused by the underlying stellar-population trends.
Our results refine the classical planet–metallicity relation. While iron remains central to the planet–metallicity relation, its influence is intertwined with volatile and rock-forming abundances. This chemistry-driven perspective situates exoplanet demographics within the broader framework of Galactic stellar populations and provides a path toward predicting planetary architectures from stellar abundances. We can therefore conclude that the outcome of planet formation also depends on the balance between iron, volatiles, and rock-forming elements.
Finally, we emphasize an important limitation: although our analysis clearly indicates iron enrichment, and tentatively a volatile enhancement, the CNO abundances in GALAH DR4 carry significant quality flags (Sect. 2.3) and thus remain less reliable. The volatile signal should therefore be regarded as provisional. However, oxygen-only indices, which are unaffected by the C and N observational issues, show comparable ~0.2 dex offsets and thus provide partial support for a volatile-related trend.
In contrast to volatiles, the [Rock/Fe] offset, which quantifies the excess of classical rock-forming elements (Mg, Si, Ca, and Ti) relative to iron by ≲0.1 dex, is statistically robust and unaffected by these caveats. This persistent ratio suggests that the balance between silicate-forming material and metallic iron, rather than the uncertain volatile content, provides the most reliable chemical fingerprint of planet size, and motivates future high-precision abundance analyses of CNO elements to test this emerging picture.
Future surveys conducted by, for example, PLATO, Roman, and ELT (Extremely Large Telescope) can test these trends using larger and more diverse stellar populations, and enhance our understanding of how stellar composition influences planet formation.
Acknowledgements
This research was supported by the ISRAEL SCIENCE FOUNDATION (grant No. 1404/22). This research has made use of the NASA Exoplanet Archive, which is operated by the California Institute of Technology, under contract with the National Aeronautics and Space Administration under the Exoplanet Exploration Program. This work made use of the Fourth Data Release of the GALAH Survey. The GALAH Survey is based on data acquired through the Australian Astronomical Observatory, under programs: A/2013B/13 (The GALAH pilot survey); A/2014A/25, A/2015A/19, A2017A/18 (The GALAH survey phase 1); A2018A/18 (Open clusters with HERMES); A2019A/1 (Hierarchical star formation in Ori OB1); A2019A/15, A/2020B/23, R/2022B/5, R/2023A/4, R2023B/5 (The GALAH survey phase 2); A/2015B/19, A/2016A/22, A/2016B/10, A/2017B/16, A/2018B/15 (The HERMES-TESS program); A/2015A/3, A/2015B/1, A/2015B/19, A/2016A/22, A/2016B/12, A/2017A/14, A/2020B/14 (The HERMES K2-follow-up program); R/2022B/02 and A/2023A/09 (Combining asteroseismology and spectroscopy in K2); A/2023A/8 (Resolving the chemical fingerprints of Milky Way mergers); and A/2023B/4 (s-process variations in southern globular clusters). We acknowledge the traditional owners of the land on which the AAT stands, the Gamilaraay people, and pay our respects to elders past and present. This paper includes data that has been provided by AAO Data Central (datacentral.org.au). We thank the anonymous referee for the wise and constructive comments, which improved this paper substantially.
References
- Adibekyan, V. Z., Sousa, S. G., Santos, N. C., et al. 2012, A&A, 545, A32 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Bashi, D., & Zucker, S. 2019, AJ, 158, 61 [NASA ADS] [CrossRef] [Google Scholar]
- Bashi, D., Zucker, S., Adibekyan, V., et al. 2020, A&A, 643, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Benjamini, Y., & Hochberg, Y. 1995, J. R. Stat. Soc. Ser. B, 57, 289 [Google Scholar]
- Bensby, T., Feltzing, S., & Lundström, I. 2003, A&A, 410, 527 [CrossRef] [EDP Sciences] [Google Scholar]
- Boss, A. P. 1997, Science, 276, 1836 [Google Scholar]
- Boss, A. P. 2002, ApJ, 567, L149 [Google Scholar]
- Buchhave, L. A., Latham, D. W., Johansen, A., et al. 2012, Nature, 486, 375 [NASA ADS] [Google Scholar]
- Buder, S., Kos, J., Wang, X. E., et al. 2025, PASA, 42, e051 [Google Scholar]
- Burn, R., Mordasini, C., Mishra, L., et al. 2024, Nat. Astron., 8, 463 [Google Scholar]
- Chen, D.-C., Xie, J.-W., Zhou, J.-L., et al. 2022, AJ, 163, 249 [NASA ADS] [CrossRef] [Google Scholar]
- Christiansen, J. L., McElroy, D. L., Harbut, M., et al. 2025, Planetary Sci. J., 6, 186 [Google Scholar]
- Edvardsson, B., Andersen, J., Gustafsson, B., et al. 1993, A&A, 275, 101 [NASA ADS] [Google Scholar]
- Fischer, D. A., & Valenti, J. 2005, ApJ, 622, 1102 [NASA ADS] [CrossRef] [Google Scholar]
- Fulton, B. J., Petigura, E. A., Howard, A. W., et al. 2017, AJ, 154, 109 [Google Scholar]
- Ginzburg, S., Schlichting, H. E., & Sari, R. 2016, ApJ, 825, 29 [Google Scholar]
- Ginzburg, S., Schlichting, H. E., & Sari, R. 2018, MNRAS, 476, 759 [Google Scholar]
- Gonzalez, G. 2009, MNRAS, 399, L103 [NASA ADS] [Google Scholar]
- Good, P. I. 1994, Permutation Tests : a Practical Guide to Resampling Methods for Testing Hypotheses, Springer series in statistics (New York: Springer) [Google Scholar]
- Hinkel, N. R., & Unterborn, C. T. 2018, ApJ, 853, 83 [NASA ADS] [CrossRef] [Google Scholar]
- Hinkel, N. R., Timmes, F. X., Young, P. A., Pagano, M. D., & Turnbull, M. C. 2014, AJ, 148, 54 [NASA ADS] [CrossRef] [Google Scholar]
- Hinkel, N. R., Unterborn, C., Kane, S. R., Somers, G., & Galvez, R. 2019, ApJ, 880, 49 [Google Scholar]
- Johnson, J. A., Aller, K. M., Howard, A. W., & Crepp, J. R. 2010, PASP, 122, 905 [Google Scholar]
- Lodders, K. 2003, ApJ, 591, 1220 [Google Scholar]
- Lopez, E. D., & Fortney, J. J. 2013, ApJ, 776, 2 [Google Scholar]
- Lozovsky, M., Helled, R., Dorn, C., & Venturini, J. 2018, ApJ, 866, 49 [Google Scholar]
- MacDougall, M. G., Petigura, E. A., Gilbert, G. J., et al. 2023, AJ, 166, 33 [NASA ADS] [CrossRef] [Google Scholar]
- Mayor, M., Bonfils, X., Forveille, T., et al. 2009, A&A, 507, 487 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Owen, J. E., & Wu, Y. 2013, ApJ, 775, 105 [Google Scholar]
- Ruxton, G. D. 2006, Behav. Ecol., 17, 688 [Google Scholar]
- Santos, N. C., Israelian, G., & Mayor, M. 2004, A&A, 415, 1153 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Seager, S., Kuchner, M., Hier-Majumder, C. A., & Militzer, B. 2007, ApJ, 669, 1279 [NASA ADS] [CrossRef] [Google Scholar]
- Shibata, S., & Izidoro, A. 2025, ApJ, 986, L25 [Google Scholar]
- Valencia, D., Sasselov, D. D., & O’Connell, R. J. 2007, ApJ, 665, 1413 [Google Scholar]
- Van Eylen, V., Agentoft, C., Lundkvist, M. S., et al. 2018, MNRAS, 479, 4786 [Google Scholar]
- Venturini, J., Guilera, O. M., Haldemann, J., Ronco, M. P., & Mordasini, C. 2020, A&A, 643, L1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Wang, J., & Fischer, D. A. 2015, AJ, 149, 14 [NASA ADS] [CrossRef] [Google Scholar]
- Zeng, L., Sasselov, D. D., & Jacobsen, S. B. 2016, ApJ, 819, 127 [Google Scholar]
All Tables
All Figures
![]() |
Fig. 1 Validation of the control-sample matching. We show the distributions of effective temperature (upper left), surface gravity (upper right), Gaia G-band apparent magnitude (lower left), and iron abundance (lower right) for the sample of planet-host stars and the matched control sample. |
| In the text | |
![]() |
Fig. 2 Distributions of effective temperature (left), surface gravity (middle), and Gaia G-band magnitude (right) for stars that host large (orange) and small (blue) planets. Large-planet hosts are slightly hotter and have lower surface gravities on average, while the apparent-magnitude distributions are similar. |
| In the text | |
![]() |
Fig. 3 Weighted histograms of the eight abundance indices in Table 2 for hosts of small (blue) and large (orange) planets; each star contributes unit total weight. CNO-based indices are affected by abundance quality flags (Sect. 2.3), and the corresponding trends should therefore be interpreted cautiously. |
| In the text | |
![]() |
Fig. 4 Weighted mean abundance differences (large minus small planet hosts) as a function of the chosen radius cutoff. Colored curves trace the various composite indices, with shaded bands showing uncertainties. The apparent volatile enhancement should be regarded as tentative given the limited reliability of C and N measurements. |
| In the text | |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.
![$\[M_{\mathrm{X}, \odot}=A_{\mathrm{X}} 10^{A(x)-12}.\]$](/articles/aa/full_html/2026/02/aa56540-25/aa56540-25-eq3.png)
![$\[s^2=\frac{\sum w_{j, k}\left(X_j-\bar{X}\right)^2}{\sum w_{j, k}},\]$](/articles/aa/full_html/2026/02/aa56540-25/aa56540-25-eq9.png)


