| Issue |
A&A
Volume 701, September 2025
|
|
|---|---|---|
| Article Number | A64 | |
| Number of page(s) | 18 | |
| Section | Planets, planetary systems, and small bodies | |
| DOI | https://doi.org/10.1051/0004-6361/202452485 | |
| Published online | 03 September 2025 | |
The New Generation Planetary Population Synthesis (NGPPS)
VII. Statistical comparison with the HARPS/Coralie survey
1
Weltraumforschung und Planetologie, Universität Bern,
Gesellschaftsstrasse 6,
3012
Bern,
Switzerland
2
Universitäts-Sternwarte München, Ludwig-Maximilians-Universität München,
Scheinerstraße 1,
81679
München,
Germany
3
Lunar and Planetary Laboratory, University of Arizona,
1629 E. University Blvd.,
Tucson,
AZ
85721,
USA
4
Observatoire Astronomique de l’Université de Genève,
51 ch. de Pegasi,
1290
Versoix,
Switzerland
5
Max Planck Institute for Astronomy,
Königstuhl 17,
69117
Heidelberg,
Germany
6
Steward Observatory, University of Arizona,
933 N. Cherry Ave.,
Tucson,
AZ
85721,
USA
7
IBM Research,
8803
Rüschlikon,
Switzerland
★ Corresponding author: alexandre.emsenhuber@unibe.ch
Received:
4
October
2024
Accepted:
27
May
2025
Context. Planetary population synthesis is a tool that is used to better understand the key processes of planet formation at the statistical level.
Aims. We seek to quantify the fidelity with which modern population syntheses reproduce observations in view of their use as predictive tools.
Methods. We compared synthetic populations from the Generation 3 Bern Model of Planet Formation and Evolution (core accretion, solar-type host stars) and the HARPS/Coralie radial velocity sample. We biased the synthetic planet population according to the completeness of the observed data. We then performed quantitative statistical comparisons and systematically identified agreements and differences.
Results. Our nominal population reproduces many of the main features of the HARPS planets, such as two main groups of planets in the mass-distance diagram (close-in sub-Neptunes and distant giants), a bimodal mass function with a less populated ‘desert’, an observed mean multiplicity of about 1.6, and several key correlations regarding the stellar metallicity dependency, the period ratio distribution, and the eccentricity distribution. Considering that the model was not optimised beforehand to reproduce any particular survey, this indicates that some of the important physical processes governing planetary formation could be captured. The remaining discrepancies that can be quantified thanks to the population synthesis approach point to areas that are not fully captured in the model. For instance, we find that the synthetic population has (1) in absolute terms too many planets by ≈70%, (2) a ‘desert’ that is too deep by ≈60%, (3) a relative excess of giant planets by ≈40%, (4) planet eccentricities that are on average too low by a factor of about two (median of 0.07 versus 0.15), and (5) a metallicity effect that is too weak. Finally, the synthetic planets are overall too close to the star compared to the HARPS sample. The differences allowed us to find model parameters that better reproduce the observed planet masses, for which we computed additional synthetic populations. We find that decreasing the planet formation efficiency by increasing the planetesimal size re-balances the number of sub-Neptunes versus giant planets. Changing the efficiency of gas-driven migration also affects the sub-Neptune to giant planet ratio, with lower migration rates resulting in more giant planets and fewer sub-Neptunes.
Conclusions. However, only modifying the model parameters seems to be insufficient for the model to fully reproduce both the observed mass and distance distributions at the same time. Instead, physical processes appear to be missing. Planets may originate on wider orbits than our model predicts. Mechanisms leading to higher eccentricities and slower disc-limited gas accretion also seem necessary. We also advocate that theoretical models should make a quantitative, rather than merely a qualitative, comparison between the many current and future large surveys and theoretical results to better understand the origins of planetary systems.
Key words: methods: statistical / techniques: radial velocities / planets and satellites: formation
© The Authors 2025
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.
1 Introduction
The formation and evolution of planetary systems is far from being understood. On the observational side, data exist for early processes such as dust growth and radial drift (e.g. Trapman et al. 2020; Manara et al. 2023, based on ALMA data). However, direct observation of forming planets is difficult. Only a handful of cases at most are known, such as the PDS 70 system, with PDS 70b (Keppler et al. 2018; Müller et al. 2018) and PDS 70c (Haffert et al. 2019), and AB Aurigae b (Currie et al. 2022). Other recent studies have to date been unable to find additional accreting planets (Cugno et al. 2019; Zurlo et al. 2020).
On the theoretical side, many mechanisms that are supposed to take place during the later stages of planetary formation have been proposed. This includes how solids grow from pebbles and planetesimals to planets (Greenzweig & Lissauer 1992; Ida & Makino 1993; Ohtsuki et al. 2002; Thommes et al. 2003; Ormel & Klahr 2010; Alibert et al. 2018); how giant planets accrete their envelope (Pollack et al. 1996; Bodenheimer et al. 2000; Ayliffe & Bate 2012; Mordasini et al. 2014; Valletta & Helled 2020; Schib et al. 2022; Nelson et al. 2023); and how planetary migration affects the architecture of the systems (Baruteau et al. 2016; Chrenko et al. 2022; Paardekooper et al. 2023; Weiss et al. 2023). However, the lack of direct observations of these processes makes it difficult to constrain them.
An alternative approach is to compare the end product from theoretical models (namely the planetary systems) with observations. However, it is difficult to reproduce single exoplanetary systems. While the models can predict many observables, there are usually only a few quantities that have actually been observed (the orbits, masses, or radii of each planet), and even less is known about the initial conditions of a specific system (e.g. the disc metallicity, which is likely related to the stellar value, Gáspár et al. 2016).
Instead, one can leverage the thousands of discovered planets (Zhu & Dong 2021; Weiss et al. 2023) and protoplanetary discs (Tripathi et al. 2017; Andrews et al. 2018; Tobin et al. 2020; Manara et al. 2023) to perform a statistical comparison. For the observational samples, it is best to use systematic surveys with a large sample size, as they offer valuable information about the demographics of exoplanets. This includes:
the fractions of stars with detected planets and the multiplicity of planetary systems;
the distributions of planet properties, such as their masses, periods, and eccentricities;
correlations between these properties; and
correlations between stellar and planet properties, such as the increasing frequency of giant planets with stellar metallicity (Gonzalez 1997; Santos et al. 2004; Fischer & Valenti 2005) or the role of stellar mass and age.
For such an undertaking, one can perform planetary population synthesis (Ida & Lin 2004; Mordasini et al. 2009a; Benz et al. 2014; Mordasini 2018; Emsenhuber et al. 2023) using a global numerical model. This requires several ingredients: (1) initial conditions that are representative of protoplanetary discs, (2) a global formation and evolution model that is able to predict final planetary system properties from the initial conditions, (3) an observational sample, and (4) a synthetic detection bias to mimic the detection capabilities of the survey in order to qualitatively compare the synthetic and observed populations.
In Emsenhuber et al. (2021a, hereafter NGPPS I), we presented the Generation III Bern model of planetary formation and evolution. This model includes disc evolution concurrently with accretion of both solids and gas by the protoplanets, planetary interior structures, gas-driven migration, and dynamical interactions. In addition, the model is able to predict many directly observable planetary properties (such as orbital elements, mass, radius, or magnitudes) from giant planets to compact terrestrial-planet systems, provided the initial number of embryos is sufficient.
Then in Emsenhuber et al. (2021b, hereafter NGPPS II), we discussed the initial conditions for the model so that they are representative of known protoplanetary discs and compared different populations where we varied the initial number of embryos per system, one of the free parameters of the model. Some simple qualitative comparisons with the detected planets were performed, which revealed that migration is likely too efficient in the model. However, testing of these populations in a rigorous quantitative manner against the known exoplanets is yet to be conducted.
The ultimate goal of these synthetic populations is to compare them with systematic surveys to determine if the characteristics of the observed exoplanets population can be reproduced. Mismatches should lead to changes in the model or its parameters to provide better agreements (Mordasini et al. 2015; Mordasini 2018) and ultimately improve our understanding of planet formation. Synthetic populations that agree well with the observed planets can then, at least in some aspects, be used to make predictions about the not-yet-observed parts of the population.
Such a comparison was already performed in Mordasini et al. (2009b) for the first generation of the Bern model, but results were limited by the number of exoplanets known at that time. In Mulders et al. (2019), an earlier version of this model was compared with the Kepler survey results (Thompson et al. 2018). The outcome was that the synthetic and observed populations were similar in multiplicity and inclinations, for instance, although the synthetic population contained too many planets by a factor of about five.
Here, we aim to compare the populations of NGPPS II with the radial velocity (RV) technique of planetary detection. We chose to compare with the combined HARPS/Coralie survey, whose results were presented in Mayor et al. (2011, hereafter M11)1. The survey has both a baseline of more than 10 years, which allows the detection of planets at several astronomical units, and a high sensitivity of about 1 m s−1. It is a volume-limited sample for a representativity of stars (Udry et al. 2000).
However, the synthetic and observed exoplanet populations cannot be directly compared, and observational biases need to be accounted for when making the comparison. This means that either the synthetic systems need to be treated as if they were observed by the same technique as used for the detection of the actual exoplanetary systems, which means that a synthetic detection bias must be applied to the model systems, or the completeness of the survey must be taken into account to infer the underlying distribution of the observed planets. Here, we take the former approach. It is a key feature of our work that we do not try to de-bias the observed population (and therefore extrapolate to non-detectable or poorly detectable regions) but bias the synthetic population.
The organisation of this paper is as follows. We describe the coupled formation and evolution model, the way we set the initial conditions for our population synthesis, and how we applied an observational bias in Sect. 2. The nominal population presented in NGPPS II is compared with the HARPS/Coralie survey in Sect. 3. Then, we investigate in Sect. 4 how to adapt some parameters of the formation model to better reproduce the observed planets. A summary and the conclusions are given in Sect. 5.
2 Methods
2.1 Formation and evolution model
Our combined formation and evolution model, the Generation III Bern model, was extensively described in NGPPS I. We hence provide only a brief overview here. This model derives from the work of Alibert et al. (2004, 2005) for the single-embryo formation (Generation I), Mordasini et al. (2012b) for the long-term evolution part (Generation Ib), and Alibert et al. (2013) for the multi-embryo formation without long-term evolution (Generation II).
Planet formation was modelled for a fixed time interval (either 20 Myr or 100 Myr, see Sect. 2.2), during which gas and planetesimal discs evolve together with the forming planets. The planets accrete from both discs, migrate, and dynamically interact, leading to scatterings, giant impacts, or capture into resonances. After the time elapsed, the evolution stage took place. Here, the thermodynamical evolution (cooling and contraction) of each planet was followed individually to 10 Gyr, along with atmospheric escape and tidal migration.
At its basis, the formation model tracks the evolution of a viscous accretion disc (Lüst 1952; Lynden-Bell & Pringle 1974), where the turbulent viscosity is provided by the standard α parametrisation (Shakura & Sunyaev 1973). The disc midplane temperature was estimated following Nakamoto & Nakagawa (1994), including as sources stellar irradiation and viscous dissipation. Internal (Clarke et al. 2001) and external (Matsuyama et al. 2003) photoevaporation were included as well. The solid component was represented by planetesimals, whose dynamical state is governed by the drag from the gas and the stirring produced by the other planetesimals and the planets (Rafikov 2004; Chambers 2006; Fortier et al. 2013).
The model assumes planets form by core accretion (Perri & Cameron 1974; Mizuno 1980), by accreting planetesimals in the oligarchic regime (Ida & Makino 1993; Ohtsuki et al. 2002; Inaba & Ikoma 2003; Thommes et al. 2003; Fortier et al. 2013), and by giant impacts (planet-planet collisions). The accretion of gas is first constrained by the ability of the forming planets to radiate away the binding energy released during this process. The envelope mass is then obtained by solving the internal structure equations (Bodenheimer & Pollack 1986). Once the planet reaches the critical mass as found by solving the structure equations (usually on the order of 10 M⊕; Stevenson 1982), cooling becomes efficient and runaway gas accretion can occur. When this happens, the envelope can no longer remain in equilibrium with the surrounding gas disc and contracts instead (Bodenheimer et al. 2000; Mordasini et al. 2012b,a). In that phase, gas accretion is limited by the supply of the gas disc, while the internal structure equations are used to determine the radius and luminosity.
The protoplanets undergo gas-driven planetary migration. For Type I migration, we adopted the prescription of Coleman & Nelson (2014), while Type II migration was computed using the fully suppressed prescription of Dittkrist et al. (2014), and the transition between the two regimes was done according to Crida et al. (2006). Multiple embryos could form concurrently in each system, and the gravitational interactions were modelled using the mercury N-body package (Chambers 1999).
After the formation stage is finished, the model switches to the evolution stage, where the planets were evolved individually to 10 Gyr following a procedure similar to Mordasini et al. (2012b). This stage mainly tracks the cooling and contraction. It also includes atmospheric escape (Jin et al. 2014; Jin & Mordasini 2018) and tidal migration (Ferraz-Mello et al. 2008; Jackson et al. 2009; Benítez-Llambay et al. 2011). However, since we are mostly interested in the planet masses in this work, the relevance of the evolution stage is limited compared to planet radii probed in transit surveys, for instance.
2.2 Population synthesis
The procedure to compute a full population was presented in NGPPS II, which represents an update from Mordasini et al. (2009a). The model assumes that the systems form around single stars, and we used observational data to constrain the initial conditions of the formation and evolution model. The initial conditions were the disc masses (Tychoniec et al. 2018), disc sizes (Andrews et al. 2010), inner edge of the disc (Venuti et al. 2017), protoplanetary disc lifetimes, and the dust-to-gas ratio (from the observed stellar [Fe/H] as described in Mordasini et al. 2009a, but with a solar reference value of the dust-to-gas ratio of 0.0149, Lodders 2003, corresponding to [Fe/H]= 0).
Other model parameters were kept constant in one population, such as the parameter of the turbulent viscosity, α = 2 × 10−3 (to reproduce the relationship between disc masses and stellar accretion rates, Mulders et al. 2017; Manara et al. 2019), and the initial slope of the solids’ surface density, which is steeper than the one of the gas disc (Ansdell et al. 2018).
In line with our already-published results, we mainly analyse the nominal synthetic planetary population that was presented in NGPPS II, called NG76. This population starts with 100 planetary embryos per disc and was computed for a duration of the formation stage (which includes the tracking of N-body interactions) of 20 Myr. However, to take into account that dynamical instabilities can take longer to occur (e.g. Izidoro et al. 2021), we also analyse a population where the formation stage was extended to 100 Myr. This population, called NG76longshot, was already discussed in Emsenhuber et al. (2023) and Burn et al. (2024). Finally, we also consider in Sect. 4 a third optimised population, NG192, for which several model parameters were varied to obtain a better agreement with the HARPS/Coralie observations. The results presented in this work are based on the state of the populations at 5 Gyr.
Figure 1 shows the (unbiased) mass-distance diagram of the NG76longshot population. Compared to the situation in NG76 at 20 Myr (see NGPPS II), little change of the a − M diagram occurs for Jovian and (sub)-Neptunian planets. However, there is a clear reduction of the number of low-mass planets (≲4 M⊕) at orbital distances of about less than 3 au. This is due to numerous giant impacts involving (proto)planets with masses typically between Mars and Earth. These impacts mainly occur in systems consisting only of low-mass planets that have not migrated much (Class I systems according to the planetary system architecture classification of Emsenhuber et al. 2023). This evolution is clearly seen in Fig. 2 which shows the unbiased planetary mass function at three moment in time: at τdisc when the gas disc disappears (so 1–10 Myr with an average of about 3 Myr), at 20 Myr, and at 100 Myr. One sees the strong reduction of the number of low-mass (mainly sub-Earth) protoplanets occurring mainly between τdisc and 20 Myr but continuing also in the following 80 Myr. The associated mild increase in the number of more massive super-Earth and (sub-)Neptunian planets is also visible. In the giant planet mass range, only minor evolution occurs between τdisc and 20 Myr, and even less occurs between 20 and 100 Myr. However, as becomes clear below, for the subject of this paper which is the statistical comparison of NGPPS and the HARPS/Coralie survey, the changes between 20 and 100 Myr are of minor importance. The reason is that the survey is hardly sensitive to the very low-mass planets where significant changes occurs.
![]() |
Fig. 1 Mass-distance of the (unbiased) NG76longshot population at an age of 5 Gyr. Each point corresponds to one synthetic planet. The population consists of 1000 synthetic planetary systems. Colours represent the bulk composition: red are planet where the H/He mass is larger than the mass of solids (the core mass). Open circles are planets containing H/He but with a core mass higher than the envelope mass. Filled circles correspond to planets without H/He. Additionally, blue (green) indicates that planets have accreted more (less) than 1% of their core mass as ices. Horizontal bars reach from the peri- to the apoastron to represent eccentricities. Finally, black crosses show the Solar System for comparison. |
![]() |
Fig. 2 Unbiased mass function for planets within a < 3 au at three moments in time. The blue solid line shows the distribution at 100 Myr (NG76longshot) while the black dotted one is at 20 Myr (NG76). The red dashed line additionally shows the situation at the moment when the gas disc disappears in the systems (corresponding to 3 Myr on average). The reduction of the number of low-mass (proto)planets of mainly (sub-)Mars to Earth mass and the associated increase of more massive super-Earth and (sub-)Neptunian planets is visible. It is a consequence of giant impacts during the post-disc phase. |
2.3 Observational bias
To compare our populations, we applied an observational bias using a technique that is updated from Mordasini et al. (2009b). The main changes relative to this work are: (1) accounting for the relative inclinations of the planets with respect to the plane of the disc, (2) an updated map of detection probabilities, and (3) performing multiple observations of the same population to take into account different orientations.
For each observation of the full synthetic population, we randomly selected 822 systems (the same as in the combined HARPS/Coralie sample) without replacement from the population of the 1000 systems and applied the following procedure. For each system, we randomly determined the direction of the observer with two angles, θ and ϕ, that represent the latitude and longitude of the line of sight with respect the systems’ reference frame, which is given by the unit vector
. Then, for each planet in the system, we computed the inclination i of the orbit with respect to that vector. For this, we determined the unit vector parallel to its orbital angular momentum
from its inclination to the reference plane ip and longitude of the ascending node ΩP. The unit vectors are then
(1)
and the inclination of the orbit with respect to the observer is the angle between the vectors
and
. This permitted us to retrieve the sin i value with
(2)
It can be seen that in the case the planet orbits the star in the reference plane (i.e. iP = 0), this method simplifies to the simple case sin i = | sin θ|.
Then, it is possible to compute the effective mass of the planet M sin i (for clarity, we here refer to the planet mass as M rather than Mplanet in NGPPS II). To retrieve its detection probability, we use the probabilities derived by M11 for the full HARPS/Coralie survey that depend on M sin i and the orbital period P. The detection probability is provided in Fig. 3 for reference. This synthetic detection bias was obtained by adding a signal from a planet with the corresponding mass and circular orbit with the prescribed period in the actual observations of the star (with RV signals of detected planet removed) and with different phases, and then checking that the signal is larger than a corresponding 1% false alarm probability. Long-period planets with incomplete coverage are treated in the same way. In this work, we just made one adjustment from the detection probabilities: If it is less than 1% (the white region in Fig. 3), we unconditionally deemed the planet as not detected. This is to avoid obtaining planets far from the usual detection region of HARPS.
There are a couple of items that should be noted. First, the bias used here was derived from the mean across the stars in the HARPS/Coralie survey. It does not account for the variations between the individual stars and the planetary system architecture. It also does not account for the planet eccentricities. Second, it is possible for massive objects to appear with planetary mass when sin i ≈ 0, a situation that the synthetic populations cannot account for. However, such a scenario is statistically unlikely (Jorissen et al. 2001), and we therefore do not account for it here.
![]() |
Fig. 3 Average probability of planet detection from the HARPS/Coralie survey as function of a planet’s period and minimum mass (Mayor et al. 2011). |
2.4 Stars and planets from the HARPS/Coralie survey
The comparison dataset is provided by the High-Accuracy Radial Velocity Planetary Searcher (HARPS, Mayor et al. 2003) GTO (Guaranteed Time of Observation) survey (M11) which was combined for the statistical results with the Coralie survey (Udry et al. 2000), as described in M11. The sample was defined as volume-limited of FGK stars closer than 50 pc in the southern sky, according to the distances known at the time. The sample has not been revised based on more recent distance measurements, such as those of GAIA. Known visual (detected companion with 6 arcseconds) and spectroscopic (RV signal indicating a companion larger than 13 M4) binaries were excluded. This leaves a sample comprising 822 stars. The mean stellar mass across the whole sample is 0.91 M⊙ with a standard deviation of 0.17 M⊙. The full distribution of stellar masses is provided in Appendix C. Concerning the metallicity, the mean value is −0.11 with a standard deviation of 0.28.
The analysis was based on the data from M11, with the inclusion of four more years of observations (2011 to 2015). The other items were left as they were used in M11. This updated analysis retrieved Nobs = 169 planets and candidates around 105 host stars/planetary systems. Detailed parameters about the survey along the list of planets are provided in Appendix C.
Among the detected planets, 24 of them in 20 systems are in known binary systems. It was found that close binaries result in smaller and shorter lived discs (e.g. Harris et al. 2012; Cox et al. 2017). Recent studies find that discs become insensitive to binary separation above a distance of about 140 au (Ricciardi et al. 2025). Of the 20 systems, only three system are known to be in a binary closer this limit: HD142 (Tokovinin & Kiyaeva 2016), HD4113 (Cheetham et al. 2018) and HD7449 (Rodigas et al. 2016). Thus, the initial conditions of the synthetic populations, although nominally obtained for single stars, should also be representative of binary systems except the three mentioned here, out of 105. We believe this should have only a small effect on the statistical comparison.
2.5 Statistical analysis
To assess whether the synthetic and observed populations are different, we performed a series of statistical tests. The null hypothesis of these tests is that the observed sample is one random draw from the synthetic population with observational bias applied. For this, we applied the procedure outlined in Sect. 2.3 to obtain 1000 samples that can be compared to detected sample.
For single-quantity comparisons, we performed a two-sided Kolmogorov-Smirnov (KS) test between each pair of synthetic and observed sample using the SciPy package (Virtanen et al. 2020). To compute the significance of the test, we followed Hope (1968) and compute the KS distance at the significance level S that we chose and compared this value to the expected distance from the KS distribution. If the obtained distance is larger than that of the KS distribution, then the null hypothesis is rejected. This is also an improvement relative to Mordasini et al. (2009b), who only computed one observed sample for the single-variable comparison.
For the multi-variable comparisons (such as the mass-period diagram), we proceed in the same way as Mordasini et al. (2009b): we implement the algorithm of Press et al. (2002). We generate 1000 samples with 169 synthetic planets each, which are put together to form the comparison population. Then the distance between each of the sample as well as the HARPS/Coralie sample against the comparison population is computed. This enabled us to compute the significance of the 2D KS test directly.
3 Comparison of the nominal population
We based our results on two populations from previous works (Sect. 2.2). First, the NG76 population from NGPPS II. This is the one with initially 100 lunar-mass embryos per system; the highest number of embryos per system that was considered. Second, we have the NG76longshot population where the formation stage has been extended from 20 Myr (as in NG76) to 100 Myr to allow for N-body interactions to capture later instabilities.
These synthetic populations were computed for a stellar mass of M⋆ = 1 M⊙, which is the most similar value that we have in the synthetic populations to those in the sample (Sect. 2.4). Populations around lower-mass stars are found in Burn et al. (2021, NGPPS IV). The distribution of disc metallicities (that we take to be the same as stellar metallicity) is slightly higher than that of the sample, as the mean value is −0.02 (Santos et al. 2005). Nevertheless, the values are similar enough to provide for a good comparison.
The analysis was performed on 1000 Monte Carlo samples of the biased synthetic population, each containing 822 systems, the same number of systems as in the HARPS/Coralie sample. In each sample, the systems were selected without replacement from the 1000 systems of the synthetic population. We used the 1000 samples to build a distribution and determine the confidence intervals. Table 1 summarises the results on the base statistics found in the different populations.
3.1 Number of detected planets, multiplicity
To begin with the comparison of absolute numbers, we note that we detect
planets (95% confidence interval) in our synthetic observed population, compared to 169 planets (including candidates) actually detected in the combined HARPS/Coralie sample. Thus, we have close to the double (factor 1.70) of planets in our synthetic population compared to observations. Comparisons with the Kepler data have already found that populations computed with the Bern model generally produce too many planets. Mulders et al. (2019) has shown that populations obtained using an older version of the model had to be scaled down by a factor of about five to match Kepler observations. Analyses performed on NG76 by Matuszewski et al. (2023) found a similar factor, while Mishra et al. (2021, NGPPS VI) found that, on average, 1.7 planets per star could be detected in the synthetic population compared to an average of 1.32 in Kepler DR25 (Thompson et al. 2018). This corresponds to an overabundance by a factor of roughly 1.3. Thus, both the HARPS (RV) and Kepler (transits) comparisons find an overabundance of planets in NG76, while the actual factor depends on the method used.
We stress here that the model parameters or initial conditions were not selected to match either. The main parameter that controls the efficiency of planetary growth in the current model, the radius of the planetesimals, was selected according to Fortier et al. (2013). The initial conditions were selected to best match disc observation, about their mass, extend, and lifetimes (NGPPS II), rather than for the resulting systems. Considering this, we find that the number of planets is in a fairly good agreement with observational data.
In addition, the observed multiplicity is similar in the synthetic and observed populations. This is important, because multiplicity is a proxy for system architecture. The synthetic population has
planets around
stars, giving a mean multiplicity of
, while in the observed population there are 169 planets (including candidates) in 105 systems, for a mean multiplicity of 1.61. This means that the synthetic populations has too many systems harbouring each a number of detectable planets similar to the observations, rather than a larger multiplicity in systems with detectable planets. The latter would correspond to different, incompatible system architectures in the model versus the observations.
To support this, we provide in Fig. 4 histograms of the number of planets per system. The left panel, which provide an absolute comparison, shows a lack of systems without planets and with large multiplicities, while systems with one, two, or three are all a bit over-represented. The right panels provide a relative comparison of the occurrences of multiplicities in systems with planets, and shows that systems with two planets are the only ones that are over-represented, while systems with three planets or five and more are underrepresented.
As Fig. 4 shows, the model is unable to reproduce the systems with five or more close-packed sub-Neptunes (such as HD 10180; Lovis et al. 2011). However, this discrepancy should be taken with caution. For one reason, we use the average observational biases. In RV surveys, once a (low-mass) planet (or multiple planets for that matter) has been detected, the system will likely be observed more intensely, potentially revealing planets that could otherwise have been missed. This introduces a human factor in the selection process of the target stars, which is not accounted for here. To determine whether this difference is significant, a more elaborate scheme to mimic the selection process would be required.
Base characteristics of the populations analysed in this work.
![]() |
Fig. 4 Histograms of the number of detected planets per system in both the HARPS/Coralie surveys (red) and the biased synthetic population (black). The histogram of the synthetic system is based on 1000 samples of the synthetic population, with the solid line denoting the median while grey band shows the 95% confidence interval. The left panel compares the absolute number of systems, while the right panel compares the relative number of systems that have detected planets. |
3.2 The planetary mass function
The planetary mass function is a central quantity, holding many clues for planet formation models (Ida & Lin 2004; Mordasini et al. 2009a; Suzuki et al. 2018; Nayakshin et al. 2019; Emsenhuber et al. 2023). It allows for comparison of the relative number of planets of different masses. For this purpose, we show in Fig. 5 the comparison of the masses of the 1000 biased samples from the synthetic population (in black) and the detected ones (in red). The left panel provides a kernel density estimation (KDE), though it is scaled by the number of planets in each sample for absolute comparison). To compute the bandwidth of the KDE, we perform a search of the best value using scikit-learn (Pedregosa et al. 2011) on the observed sample, which returns a value of roughly 0.19 (in units of decimal exponent). This same bandwidth is then used for all the synthetic observations. As we discussed at the beginning of this section, the number of observable planets in the synthetic population is higher than the number of detected by a factor of roughly two. To provide a relative comparison, the re-scaled synthetic planets to the total number of observed ones in shown in blue. The right panel shows the same data, but in the form of a cumulative distribution. We also perform a two-sample KS test with a rejection criterion for p-value below 5% according to the procedure outlined in Sect. 2.5. We find that the value at the 95% of the distribution of KS distances of 2.34. As this value is larger than the limit of 1.36, we therefore rejected the hypothesis that observed HARPS/Coralie planets are a random draw of the synthetic planets with observational bias applied.
The general form of the planet mass distribution with two peaks is, however, clearly retained: one peak at about 10 M⊕ (Neptunian planets) and one for the giant planets larger than about 300 M⊕. Nevertheless, the following differences remain: (1) an overabundance of giants in relation to sub-Neptunes in synthetic planets compared to the HARPS/Coralie data, (2) a larger dip at intermediate masses in the synthetic planets (as was already pointed out by Nayakshin et al. (2019), Bennett et al. (2021), and also found by Schlecker et al. (2022) in a low-mass star population from our model (Burn et al. 2021, NGPPS IV), and (3) lower masses for the sub-Neptunes in the synthetic planets.
For the first item, the cumulative distribution on the right panel of Fig. 5 reveals the amount of the imbalance between sub-Neptunes and giants. If we take the transition between the two categories at 100 M⊕, we obtain the following values of the cumulative distributions: 0.41 ± 0.05 for the biased synthetic population compared to 0.56 in the HARPS/Coralie sample. This means that if the number of sub-Neptunes remains the same, the number of giants would need to be reduced by about 45% to balance back the two categories. Alternatively, the number of sub-Neptunes would need to be increased by about 83%.
Now considering the depth of the planetary desert, we compute the fraction of planets between 20 and 200 M⊕ (the shallowest part in the cumulative distribution of the synthetic population). We find that in the synthetic population, 14% of the planets are in that range while for the HARPS/Coralie sample, that value is 22%. This means that the number of planets in the desert would have to be increased by 57% (assuming the number of planets outside the desert remains the same) to balance the synthetic population. This is a factor of less than two, which we think is not a strong disagreement with the observations, given the following fact: our model starts with 100 moon-mass embryos per disc and all further evolution is given by the physical processes included in the global model in a low-dimensional description (like an axisymmetric disc or radially symmetric planets) (NGPPS I).
We also emphasise that the presence of a planetary desert in the observed population is subject to debate. For instance, Bennett et al. (2021) suggest that no such desert exists and that the populations presented here have much larger differences on the relative number of planets in the desert than the 57% we obtained. Conversely, Bertaux & Ivanova (2022) find that the planetary desert does exist and their analysis of synthetic populations from an earlier version of the same model (Mordasini 2018) found the desert in the synthetic populations to not be large enough. Our results are consistent with the second work (i.e. Bertaux & Ivanova 2022) in that we do find a planetary desert, even though the depth of the desert is too deep in the synthetic population analysed here. As already discussed in several works (Mordasini et al. 2011; Suzuki et al. 2018; Nayakshin et al. 2019), the too deep desert suggests that our model overestimates gas accretion rates of intermediate mass planets. In this mass regime, planets change from the planet-limited (or attached phase) to disc-limited (or detached phase) gas accretion.
The offset in stellar masses between the synthetic (where it is fixed to M⋆ = 1 M⊙) and the CORALIE/HARPS sample (mean of about 0.91 M⊙) might explain a part of the discrepancy. Burn et al. (2021) analysed the effect of the stellar mass on synthetic populations from the same model. They found that decreasing the stellar mass tend to decrease the number of sub-Neptunes and giants, though the latter are more affected. A synthetic population with a lower stellar mass would reduce the number of planets overall and the imbalance between sub-Neptunes and giants. However, this effect would not be sufficient to resolve the discrepancy entirely.
For the location of the peaks, we note that the synthetic population exhibits a peak centred slightly below 1000 M⊕ while the HARPS/Coralie planets peak is located closer to 500 M⊕. These planets are dominated by their envelopes. Thus, the envelope mass of the planets in the synthetic population is too large. This points to disc-limited gas accretion rates that are overestimated for planets in this mass range. The aforementioned lack of planets in the intermediate regime points towards the same conclusion. There are several possible explanations: differences in the accretion rates obtained in 1D versus 3D simulations (Ayliffe & Bate 2012; Schib et al. 2022), higher dust opacities than assumed here (Mordasini 2014; Schulik et al. 2019), or a quenching of the gas accretion rate because of more efficient gap opening in low-viscosity discs (e.g. Aoyama & Bai 2023). To explore the latter point, we have recently added a MHD-wind driven disc model in the Bern model (Weder et al. 2023). Future work will show whether this will result in a less dry desert that is in better agreement with observations (see also Suzuki et al. 2018).
For sub-Neptunes, however, the mass difference has to be caused by the accretion of solids, as these planets are core dominated. As it can be seen in the right panel of Fig. 7, the synthetic planets are ice-rich for the majority. These planets have formed outside the ice line and subsequently migrated to their final location without accreting much material at the same time. Planets whose masses are about 10 M⊕ cannot grow from a static solid disc in-situ at the location they are observed (inside about 400 d or 1 au) because there is not enough mass available. The masses of the sub-Neptunes is linked to the behaviour of gas-driven migration. In this range, migration becomes more efficient as mass increases. The planet masses thus reflect the equality mass (Emsenhuber et al. 2023) at which the migration timescale becomes smaller than the accretion timescale. The sub-Neptunes end near the inner edge of the disc, where further growth is nearly absent. If the actual sub-Neptunes indeed follow a similar formation track, then the lower masses of the sub-Neptunes in the synthetic planets relative to the HARPS observations indicates that equality mass is underestimated in the model, which in turn observationally constrains the ratio of the migration to the accretion timescale. Here, the difference indicates a shorter growth and/or longer migration timescale than in the model.
![]() |
Fig. 5 Comparison of the planet masses between the synthetic biased population (NG76, black) and the HARPS/Coralie sample (red). The data of the synthetic population is based on 1000 Monte Carlo synthetic observations, with the bold line showing the median of these, the grey region showing the 95% confidence interval, and the thin black lines indicating ten individual realisations. Left: kernel density estimate (scaled to be absolute) with a smoothing length of 0.19 dex, with the addition of the synthetic population scaled down to the same total number of planets as the observed sample (blue). Right: cumulative distributions of the same data. The statistical analysis shows that the samples from the synthetic population do not match the detected ones at the 5% level because the model under-predicts the number of sub-Neptunes and over-predicts the number of giants. |
3.3 Periods
The other main quantity retrieved from RV surveys is the orbital period. To check for how the synthetic population and the Coralie-HARPS sample, we proceeded as for the minimum mass. The comparison of the KDEs is provided in Fig. 6.
The result shows that the two populations again exhibit similar features, with the two peaks for the sub-Neptunes and giant planets. Giant planets are overrepresented in the synthetic populations, as discussed previously.
The match between the periods in the synthetic and observed populations is not as good as for the masses. For instance, the planets in the synthetic population are found at shorter periods than the observed planets. This is true for both the sub-Neptunes (between 1 and 30 d versus 3 and 100 d for the observed planets) and giants (between 100 and 1000 d versus 300 and 3000 d). Such a difference was already seen in the unbiased population, as discussed in NGPPS II. This inward shift of the low-mass planets is consistent with Mulders et al. (2019), who found the same feature in the comparison with the Kepler planets. Further, the peaks in the period distribution are narrower in the synthetic population.
For the sub-Neptunes, their final location is near the inner edge of the gas disc. This means that either the choice of the disc inner edges is incorrect or the sub-Neptunes stop migrating before reaching the inner edge. In the synthetic populations, the disc inner edges were assumed to be at the corotation radius stellar rotation and used a rotation period distribution of young stellar objects Venuti et al. (2017). As discussed in NGPPS II, this provides good agreement with gas tracers; we thus do not think this is the reason for the discrepancy. There are, however, proposed mechanisms to halt the inward migration outside of the inner edge. For instance, there could be viscosity transitions which act as migration traps, such as near the silicate evaporation line, as suggested by Flock et al. (2019). We shall investigate this in future work.
For the giant planets, their locations change only marginally once they undergo runaway gas accretion. Thus, the difference in their final location is most likely due to the migration when planets are at an intermediate mass range (~10 M⊕) or the planets originate further out than what our model predicts. We investigate the effect of migration strength in Sect. 4.
![]() |
Fig. 6 Comparison of the planet periods between the biased synthetic population (NG76, black and blue lines) and the observed HARPS/Coralie sample. The curves have the same meaning as in the left panel of Fig. 5. |
![]() |
Fig. 7 Left: comparison of the minimum mass (M sin i) versus orbital period of the biased synthetic 100-embryo population (NG76, black circles) and the actual HARPS/Coralie survey (red crosses; M11). Both samples have 822 systems. Right: same but only for the synthetic population and with different colours and symbols depending on planet properties. Red circles indicate planets whose H/He envelope content is larger than the core; otherwise, green circles are for planets whose cores do not contain any ices, and blue is for the planets whose cores contain ices. In the latter case, circles are for planets whose envelope mass is at least 10% that of the core, and crosses are for the others. |
3.4 Mass-period diagram
The main quantities retrieved from planets detected with the RV technique are the minimum mass and the orbital period. We thus want to check whether our model can reproduce both these quantities at the same time. A comparison of these two properties between one random observation of the synthetic population and the HARPS planets is shown in Fig. 7. We also perform the 2D KS testing procedure and find that its significance is <0.001, meaning that the KS distance of the observed sample is larger than any of the synthetic samples.
From Fig. 7, we observe that both synthetic and observed populations have the same similar shape with a group of sub-Neptunes at short orbital period (between 2 and 30 M⊕ and inside of 100 d) and a second group of more distant giant planets (above 100 M⊕ and 100 d). However, there are several differences, which explain the difference that is highlighted by the KS test.
First, the positions of the planets in the synthetic population are more grouped than that of the observed planets. The formation of distinct clusters of planets – despite the stochasticity introduced by N-body interactions – is a general feature of our model. These clusters form early and trace different growth and migration tracks during their formation phases (see Schlecker et al. 2021b (NGPPS V) and Emsenhuber et al. 2023).
Second, the synthetic population does not cover the whole parameter space of the observed planets. For instance, the synthetic population does not contain any hot-Jupiter planet (within 10 d and above 100 M⊕). This effect is not due to the sampling of the underlying population, but the population itself does not contain any such planet (NGPPS II). A possible reason for the mismatch is that the model only considers disc migration as formation channel, while other channels, such as eccentric migration (Dawson & Johnson 2018), are not included. In the latter case, it is because the model does not include tidal circularisation. As an estimate of the number of hot-Jupiters that could potentially form by this channel, we can take planets larger than 100 M⊕ that have collided with their star. We find that 20 such planets, of which had an inclination of more than 45° shortly before impact. For comparison, we would expect five to ten hot-Jupiters in our 1000-star population (hot-Jupiters have an occurrence rate of 0.5–1%, M11, Wright et al. 2012). Thus, it is likely the model could form hot-Jupiters by eccentric migration if all the relevant physical processes were included in it.
3.5 Correlation with stellar metallicity
Giant planets are preferentially found around metal-rich stars (e.g. Santos et al. 2001; Fischer & Valenti 2005; Adibekyan 2019), while such a correlation is weaker or non-existent for low-mass planets (e.g. Petigura et al. 2018). To determine if we recover the same correlation and the same strength, we show a planetary mass versus stellar metallicity diagram for both the combined HARPS/Coralie sample and synthetic population in Fig. 8. In addition, to highlight possible trends in the data, we show the medians for various planet mass bins. As before, the grey zone shows the 95% confidence interval for the medians in each bin.
The HARPS/Coralie planets show a positive correlation between mass and stellar metallicity for planets until about 103 M⊕ and slightly decreasing afterwards, indicating that the giant planets have a larger metallicity effect that the sub-Neptunes. The synthetic planets show a similar, albeit shallower, correlation; there are only two out of eight bins that do not fit the 95% confidence internal.
The two bins that do not fit in the confidence intervals are for sub-Neptune and their median metallicities are too large compared to the HARPS planets. Looking at the individual planets reveals that the synthetic population has an overabundance of sub-Neptunes around high-metallicity stars, while it is missing planets at low stellar metallicities. For instance, the synthetic population cannot reproduce the few systems of sub-Neptunes at metallicities between −0.3 and −0.4. The systems (HD 40307 and HD 136352) have a significant mass in planets (about 23 M⊕ in both cases) for such low metallicities, which indicates that either conversion from dust to planets is efficient or the protoplanetary discs were more massive than what our methodology provides for initial conditions.
Concerning the giant planets, previous studies that used older versions of our formation model, such as Mordasini et al. (2009b), already found lower metallicity effects than in observations. Thus, the shallower metallicity effect that we observe here is not a new feature of our model.
There are multiple possible reasons for the differences between the synthetic population and observations. We may cite a non-linear relation between the metallicity and the usable mass for planetary growth arising during the early phases of planetary growth not considered here (Youdin & Goodman 2005; Lenz et al. 2019) or the impact of envelope enrichment (Venturini et al. 2016) which is neither included in the model. Also, the gas disc properties are assumed to be independent of the metallicity. Finally, metal-poor stars tend to exhibit a higher relative abundance of α-elements (Adibekyan et al. 2013; Delgado Mena et al. 2017), meaning that they are not as metal-poor as indicated by their [Fe/H] only. Thus, using [Fe/H] with scaled solar composition – as we do in the model – results in a mismatch in the total amount of heavy elements, mostly underestimating the total amount of solids for low-[Fe/H] stars. This is of importance, as the total mass of solids is a key quantity in determining the resulting planetary systems (Emsenhuber et al. 2023).
![]() |
Fig. 8 Stellar metallicity versus planet mass (M sin i) for the biased 100-embryo population (NG76, in black) and the HARPS/Coralie survey (M11, in red). The scatter plot shows one random observation of the synthetic population against the HARPS/Coralie planets. The lines show the medians in the respective range. For the synthetic planets, the values are obtained over 1000 simulated observations of the underlying population and showing the 95% confidence interval. |
3.6 Period ratios of adjacent planets
For systems with multiple planets, the spacing between the adjacent planets can be quantified by the ratio of their orbital periods. Figure 9 provides the comparison for this quantity between our synthetic population and the HARPS/Coralie sample. We stress that only systems with multiple detected planets can be used to compute period ratios and that we do not include any bias for the planet orbital periods. As such, the distribution is based on a limited number of planet pairs.
The general outcome of the comparison is that the two curves are in good agreement, with the exception of the pile-up of period ratios past the location of the 2:1. As shown in the histogram (the right panel of Fig. 9), the number of planets in this pile-up is similar in both synthetic and observed populations. However, the period ratios occupy a much narrower in the synthetic population than in the HARPS/Coralie planets (up to about 2.5). Concerning the other resonances, the synthetic population shows other pile-ups of planets near the 3:1 and 4:1 resonances, though in this case, there is no large discrepancy in the fraction of planet pairs located there between the synthetic and observed systems.
Convergent gas-driven migration will lead to pile-ups at the location of mean-motion resonances (MMRs). In previous populations that were generated with the older version of our formation model (e.g. Alibert et al. 2011 and Mulders et al. 2019), the number of pairs close to MMRs was found to be too large compared to the observed exoplanets (e.g. Fabrycky et al. 2014, for results from the Kepler observatory). Further integration past the dispersal of the gas disc reduces the number of planet pairs locked in MMRs (see Fig. 15 of Weiss et al. 2023), which is what we are doing in this work.
The standard model NG76 follows the dynamical interactions up to 20 Myr (NGPPS I). We also performed the same calculations but where the formation stage of the model, during which the N-body interactions are simulated, has been extended to 100 Myr (NG76longshot). This allows us to determine if dynamical interactions at later times would be able to break the resonances. As can be seen in Fig. 9, we find that this is not the case, and the cumulative distribution of the period ratios is very similar. This means that the N-body interactions after 20 Myr are unable to break the resonances in the period regime accessible to the RV technique. This result is the opposite of that of Weiss et al. (2023), who analysed the same population, but from the perspective of transit observations. As RV surveys are only sensitive to more massive planets, they are less affected by long-term dynamical effects (NGPPS II). Thus, the different results between our study and that of Weiss et al. (2023) are not contradictory.
This means that we are left with the possibilities that there are other sources of excitation not taken into account in our model, or that fewer planets start off locked in resonances in the first place. This could be the case if migration is reduced overall, as might be expect for MHD-winds driven discs (Suzuki et al. 2010; Ogihara et al. 2018) compared to classical α-discs as used here. Another possibility is the strength of eccentricity damping that was found to affect the proximity of the planet pairs from MMRs (Charalambous et al. 2022).
3.7 Eccentricity
3.7.1 Overall distribution
Exoplanets have overall higher eccentricities than the planets of the Solar System (e.g. Udry & Santos 2007). Gravitational interactions between the proto-planets stir their eccentricities, and close encounters (or scattering) can lead to very eccentric orbits, up to unity (Carrera et al. 2019). Scatterings are able to reproduce the eccentricity distribution of the observed exoplanets (Chatterjee et al. 2008; Jurić & Tremaine 2008; Ford & Rasio 2008).
To compare the eccentricities between the synthetic population and the observation data, we provide in Fig. 10 the cumulative distributions of the two. For the HARPS/Coralie sample, we draw two curves: one where undetermined eccentricities are neglected and upper limits are taken at those value (which gives an upper boundary) and one where eccentricities in the same situations are taken as zero (which gives a lower limit). The general form of the two curves is reasonably similar. However, we find a difference in the near-zero eccentricities. We observe that many of the synthetic planets have low eccentricities, with a median value of 0.07. While the uncertainties in the HARPS/Coralie surveys can account for a certain percentage of low-eccentricity planets, it cannot account for all the amount there is in the synthetic population. As the histogram shows, the bin that contains the largest number of observed planets is between 0.1 and 0.2, while in the synthetic population, it is for eccentricities below 0.1, which contains nearly 60% of the planets. Conversely, the eccentricities of the observed planets almost reach unity, with the largest value being 0.93 for HD 20782b (Jones et al. 2006). This is not the case for the synthetic population where the largest value is about 0.6. Our populations do have planets with large eccentricities; they are, however, further away than what is detectable by HARPS.
The discrepancy may come from observations, the model, or both. For RV observations, it is well known (Lucy & Sweeney 1971) that when the eccentricities are small and the S/N is not excellent, planet eccentricities are easily overestimated (Hara et al. 2019). For instance, Zakamska et al. (2011) found in their more sophisticated analysis that about (38 ± 9)% of exoplanets detected by RV have e < 0.05, compared to 17% in standard analysis. The former value is closer to our synthetic population. Comparison with Kepler data also hints that eccentricities in RV could be overestimated. For instance, Weiss et al. (2023) find that planets in multiple systems as found by the Kepler survey have eccentricities of 0.03 to 0.05, which is even lower than our synthetic populations. Another item is that in RV surveys, it is impossible to distinguish a pair of circular planets on a 2:1 MMR from a single, eccentric planet. This means that some planet pairs could be misinterpreted as high-eccentric planets. However, we note that the planet pairs close to the 2:1 MMR are similar in the synthetic and observed planets (Fig. 9, right panel), so we do not believe this affects many systems. Concerning the models, the only source of eccentricity in our simulations is gravitational interaction between the protoplanets. Planet-disc interactions are another potential source, such as for high-mass planets (Kley & Dirksen 2006) and turbulent stirring for low-mass planets (Ormel & Kobayashi 2012; Kobayashi et al. 2016), but we do not include these effects in our model. Hence, as for the period ratio, we perform the analysis for the population NG76longshot where the formation stage has been extended to 100 Myr to allow for late dynamical instabilities to occur (e.g. Izidoro et al. 2021). We find that the eccentricity distribution between the two populations is very similar. This result is consistent with our findings from the period ratios in the previous section in that later dynamical interactions have a limited effects on the system’s dynamical state. Again, this indicates that eccentricities should either be stirred at early times, for instance with a less efficient damping by the gas disc, or that another (potentially external) process currently not included in our model is responsible for pumping the eccentricities. This could be stellar encounters (Malmberg et al. 2011; Stock et al. 2020) or the Kozai–Lidov mechanism caused by an external massive companion. Support for the latter hypothesis comes from the fact that many of the large-eccentricity planets in the sample are actually in binary systems, such as HD 20782 (Desidera & Barbieri 2007), HD4113 (Cheetham et al. 2018), HD 156846 (Tamuz et al. 2008) and HD 7449 (Rodigas et al. 2016).
![]() |
Fig. 9 Cumulative distribution (left) and histogram (right) of the period ratios of adjacent planets for the biased 100-embryo population (NG76, in black), the same population but with N-body interactions extended to 100 Myr (NG76longshot, in blue), and the HARPS/Coralie surveys (M11, in red). For the synthetic planets, the values are obtained over 1000 simulated observations of the underlying population and showing the 95% confidence interval. The number of observable pairs is limited, as only systems with multiple planets are usable. The locations of several MMRs are highlighted. |
![]() |
Fig. 10 Cumulative distribution (left) and histogram (right) of the planets’ eccentricities for the biased 100-embryo population (NG76, in black), the same population but with N-body interactions extended to 100 Myr (NG76longshot, in blue), and the HARPS/Coralie surveys (M11, in red). For the synthetic planets, the values are obtained over 1000 simulated observations of the underlying population and showing the 95% confidence interval. |
![]() |
Fig. 11 Eccentricity as a function of planet mass (M sin i) for the biased 100-embryo population (NG76, in black) and the HARPS/Coralie surveys (M11, in red). The scatter plot shows one random observation of the synthetic population against the HARPS/Coralie planets. The lines show the medians in the respective range, but one should note the low number of planets in the bins on the right. For the synthetic planets, the values are obtained over 1000 simulated observations of the underlying population and showing the 95% confidence interval. |
3.7.2 Mass–eccentricity relation
In observational results, higher mass planets tend to have higher eccentricities (Wright et al. 2009; Mayor et al. 2011). To compare our results with these findings, we provide Fig. 11, which is a scatter plot of the eccentricity as a function of mass, along with a running median for each population.
The median of the HARPS/Coralie planets shows two different regions. For intermediate-mass planets (less than about 300 M⊕), the median eccentricity is relatively constant with a value of roughly 0.1. For larger masses, however, the median eccentricity increases with the mass, reaching a value larger than 0.6 at the Deuterium burning limit. The synthetic planets, however, do not exhibit a change of behaviour at 300 M⊕. Rather, the median eccentricity generally increases with the mass. This is particularly striking at masses 100 M⊕ where many of the synthetic planets have eccentricities on the order of 0.1, whereas many of the HARPS/Coralie planets have eccentricities of 0.4 or more.
As we noted above, the synthetic population is lacking planets with eccentricities larger than 0.6. Thus, a median eccentricity comparable to that of the HARPS/Coralie planets for the largest masses cannot be achieved in the first place. Thus, the giant planets in the synthetic population are dynamically cold compared to the HARPS/Coralie planets. There are synthetic planets with very large eccentricities in the underlying synthetic population, but they are starting just further out than the detection limit of HARPS, which is about 15 au (period of about 2 × 104 d). The implication is that the model is unable to produce eccentric giant planets without increasing their semi-major axis as well (as the giant planets mainly form within 10 au). Meanwhile, there are several cases of planets with e > 0.8 and periods less than 103 d in the HARPS sample.
4 Towards a better match of observational data
In the population synthesis scheme (Mordasini et al. 2015; Mordasini 2018; Chambers 2018), one aim is to find model parameters that result in a population that best reproduce the observed exoplanets. In this section, we build on the analysis of the nominal population to perform modifications to the model that result in better agreement with the HARPS/Coralie data.
Although the model has a significant number of parameters, there are none that affects one aspect only of the populations. For instance, one of the main quantities affecting the accretion rate of the planets is the planetesimals’ sizes. Increasing their sizes decreases the solids accretion rate of the embryos, thereby leaving a lower number of cores reaching a large enough mass to undergo gas runaway accretion before the dispersal of the gas disc. However, this is not the only effect of the planetesimals size. When planets take longer to form, their migration pattern will also be different. For example, a lower solid accretion rate results in a lower equality mass (see Emsenhuber et al. (2023) for the definition), which is the mass where the solid accretion and the migration timescales are equal. At this mass, the tracks in the a − M diagram bend inward, and often rapid Type I migration sets in. This might cause the protoplanet end up as a close-in (sub-) Neptunian planet, and not just as a lower-mass, but still distant, planet.
To better reproduce the HARPS/Coralie results, we computed several new populations, using only 20 embryos per system. This is different than the canonical population – which has 100 embryos per system – to limit the computational requirements. We verify in Appendix A that this modification does not affect the planetary mass function. This is because the number of embryos affects mostly the Earth-mass planets, while it only has limited effects on the most massive planets (NGPPS II). We therefore anticipate that the change of the number of embryos will not affect the results presented here.
We highlight the effects of the planetesimals size and the overall efficiency of gas-driven migration in Appendix B. Here, our goal is find a combination of parameters that best reproduces the total number of planets and their mass distribution (that is, not their location). The final optimised population for the HARPS/Coralie planets was obtained assuming the following parameters:
The planetesimals radius Rplan has been set to 2 km instead of 300 m in the nominal population. This is mainly to reduce the efficiency of the model at forming giant planets.
Gas-driven migration has been reduced to 7/8 of its nominal rate, for both type I and type II migration.
The maximum gas accretion rate is computed as the minimum of the Bondi rate and the radial flow of the gas disc, while in the canonical population it was the minimum of the Bondi rate and the local reservoir rate. This is to limit the gas accretion rate in the detached phase and better reproduce the masses of the giant planets.
The resulting population, which we refer to as NG192, contains 1000 systems and has been analysed in the same manner as the original one presented in the previous section. The histogram of the number of observed planets per system comparing this population with the HARPS/Coralie data is provided in Fig. 12, the cumulative distribution of the planet masses in Fig. 13, and the mass-period diagram of one draw in Fig. 14.
The comparison of the number of planets in the new population provides are nearly matching those of the HARPS/Coralie systems, as there are
planets in
systems (as reminder HARPS/Coralie has 169 planets and candidates in 105 systems). Also, Fig. 12 shows that the distribution of system multiplicity is a good match with the combined HARPS/Coralie sample, although the synthetic populations shows too many systems with two planets. Unlike the nominal population, however, this population can produce one system with 6 observed planets (as indicated by the green region on the bottom right of Fig. 12), albeit not in the majority of the samples.
The planetary mass function of the new population better reproduces the observed population overall. The remaining discrepancy lies with the most massive planets (more than 10 Jovian masses), which the synthetic population fails to reproduce. Thus, we note that it is difficult to find a prescription for the maximum gas accretion rate that is able to produce at the same time giants with an occurrence peak at about 500 M⊕ and the presence of very massive giant planets. Nevertheless, the KS distance at the 95% of the random observation is 1.35 compared to a limit value of 1.36, which means that we are just below the rejection threshold for the hypothesis that the planets are drawn from the same sample.
However, the improvements in the reproduction of the planet mass function are not reflected in all the trends we investigated in the previous section. For instance, the planets are still closer-in as in the nominal population (Fig. 14). This is reflected in the 2D (mass-period) KS test, as we find that the distance of the observed sample is larger than any of the synthetic samples, meaning that the p < 0.001 for the hypothesis that the observed sample is drawn for the synthetic population. We thus find that our model is unable to produce a synthetic population that reproduces at the same time the number of planets and their mass and period distributions. The issue here is that the reduction of the model’s efficiency by increasing the size of the planetesimals also lengthen the formation time. The increased formation time make the planets more subject to planetary migration. Thus, the strongly coupled nature of the problem implies that it is generally not possible to “force” one outcome in one direction without also introducing many other consequences.
Also, the eccentricities are even lower than in the nominal population, as there are fewer massive planets (which are the source of excitation in our model). Conversely, the metallicity effect becomes stronger, as the decreased efficiency result in more selective formation around higher-metallicity stars.
Other possible improvements lie in a better understanding of the observed sample. This could be used to better inform the model about its properties and thus having initial conditions for the synthetic systems that better resembles the observations. Avenues in these directions include having a distribution of stellar masses, binary stars, and different elemental composition.
![]() |
Fig. 12 Histogram of the number of planets per systems in both the HARPS/Coralie surveys (red) and the optimised synthetic population NG192 (green). |
![]() |
Fig. 13 Cumulative distribution of the planet masses in the optimised synthetic population NG192 (in green), the canonical 100 embryo population (in black), and the observed population (in red). |
![]() |
Fig. 14 Mass–period diagram comparing one random mock observation of the optimised synthetic population NG193 (in green) and the actual HARPS/Coralie planets (in red). |
5 Summary and conclusion
In this work, we have compared the synthetic planet population obtained in NGPPS II with the large HARPS/Coralie RV survey (M11). We applied an observational bias derived from the survey to perform quantitative comparisons. The purpose of this endeavour was to statistically confront model predictions with observations and determine whether some physical processes that occur during planet formation are not yet well understood (Mordasini et al. 2009b) and which aspects can in contrast be reproduced.
We find that the nominal synthetic population reproduces several of the main characteristics of the HARPS sample, namely, the two clusters of close-in sub-Neptunes and more distant giants in the mass-distance diagram; a bimodal mass function; an observed mean multiplicity of about 1.6; and several important trends in the stellar metallicity dependency, period ratio distribution, and eccentricity distribution. This could indicate that the model contains at least some of the physical mechanisms governing planetary formation. This is a remarkable result considering that the model was not optimised beforehand to reproduce any particular survey. Instead, the model starts from 100 moon-mass embryos per disc, with all following an evolution governed by the physics included in the model. The narrower discrepancies are informative for the calibration of the various processes in the model and the identification of missing physics, and we discuss them in this section. To be able to observe these differences, in our work it is crucial that we can make a quantitative, and not only qualitative, comparison between theory and observation.
First, in terms of the absolute number of detectable planets, the nominal synthetic population has a greater number of planets by a factor of 1.7 compared to the observed sample. Thus, the nominal formation model is too efficient, albeit not by as much as found in Mulders et al. (2019) for the Kepler survey. This could be linked to model parameters controlling the efficiency of solid accretion or the initial conditions. The second discrepancy concerns the mass distribution of planets. We find that 1) the desert of planet masses between that of Neptune and Saturn is too deep by about 60%, 2) the mass of the peaks do not match exactly – synthetic sub-Neptunes have too-low masses, while synthetic giants are too massive by a factor of about two in both cases – and 3) the ratio between giants and sub-Neptunes is too large in the synthetic population (compare Schlecker et al. 2021a, NGPPS III). A third discrepancy relates to the location of the planets: The synthetic planets are overall too close to the star compared to the HARPS sample.
To better reproduce the number and mass distribution of planets in the synthetic population, we study in Appendix B the effects of two parameters: the planetesimal size and the efficiency of gas-driven migration. In addition, we modify the gas accretion prescription to include the radial flow of gas through the disc, but it has little effect on the number or planet masses (Fig. B.1).
Increasing the planetesimal size reduces the overall number of observed planets. In addition, this leads to an increase in the relative amount of sub-Neptunes compared to giants. Reducing the efficiency of gas-driven migration also changes the relative amount of sub-Neptunes and giants, with lower migration rates resulting in more giant planets and fewer sub-Neptunes. In our model, detectable sub-Neptunes and giants originate from the same region (beyond the ice line) and migrate to their final location. In this situation, the question of whether a planetary embryo becomes a sub-Neptune or a giant depends on the relative timescales of accretion versus migration (Emsenhuber et al. 2023). If migration is dominant (for instance with larger planetesimals, since that reduces the solid accretion rate), then sub-Neptunes are favoured, while the opposite leads to more giant planets as the study of the effects of migration efficiency has shown. This illustrates that modifying model parameters will affect not only a single outcome of the model but may also have consequences for other observable quantities in a way that might not be desirable for the goal of concurrently reproducing as many observational constraints as possible.
This outcome is caused by the many links and the feedback between the different physical processes that are included in the model (see the many arrows in Fig. 2 of NGPPS I). It implies that despite the many free parameters in a global model, it is difficult to force the model’s outcome towards a desired direction just by varying parameters. This also shows that the depth of the planetary desert is not determined by runaway gas alone. The more efficient the migration is, the less amount of time planets can remain in this intermediate mass range without being taken to the inner edge of the disc. This finding has implications for how to analyse the HARPS/Coralie planets. The fact that the vast majority of the observable planets in our model come from a similar region also means that the whole population must be analysed together, as opposed to analysing sub-Neptunes and giant planets separately.
We find that larger planetesimals and a slight reduction of gas-driven planetary migration enable better reproduction of the masses of the HARPS/Coralie planets. However, the synthetic planets generally remain too close-in. Yet, migration cannot be reduced further, as was suggested by Ida et al. (2018) because, at least in our model, that would also lead to the formation of too many giant planets. This is a case where just varying the model parameters is insufficient to bring the syntheses into full agreement with the combined observational constraints, and it may suggest that planets originate from further away than they do in our model. However, that is problematic solely with planetesimal accretion, which suffers from long accretion timescales at large separations. Thus, it is an indication of missing physics in the modal and that a process such as pebble accretion (Ormel & Klahr 2010) and/or structured discs (Lau et al. 2022) might be at play for giant planet formation.
Regarding the constraints beyond the masses and orbital distances, we analysed the distribution of the period ratios of adjacent planets (Fig. 9) and the eccentricity distribution (Fig. 10). Regarding the period ratios, the overall synthetic and observed distributions are in good agreement, with the exception of the detailed structure of the pile-up of period ratios past the location of the 2:1 MMR. While the overall number of planets in or near this pile-up is similar, the period ratios occupy a much narrower region (close to exactly 2.0) in the synthetic population than in the HARPS/Coralie planets (up to about 2.5).
Regarding the eccentricity distribution, the median eccentricity of the detectable synthetic planets is 0.07, compared to 0.15 in the HARPS/Coralie sample. In other words, the synthetic population contains a significant number of planets with circular or nearly circular orbits. Apart from this overall shift, the distributions are similar.
Taken together, the period ratios and the eccentricities consistently show that the synthetic systems are too dynamically cold compared to HARPS/Coralie planets. This remains true even when the N-body run-time is extended to 100 Myr in the simulations (compared to NGPPS II, where the interactions were followed up to only 20 Myr). This indicates that we are missing an effect for the excitation of the planet orbits and that this effects occurs either early during the formation stage or has an external origin. The discrepancy could thus be caused either by a too strong eccentricity and inclination damping while the planets are embedded in the gas disc, as was found by Bitsch et al. (2020). External excitation by stellar encounters could also be an explanation (Malmberg et al. 2011). There is an additional alternative explanation on the observational rather than the model side: It could also be that the eccentricities in the HARPS/Coralie sample are overestimated given the well-known bias of the RV method (Lucy & Sweeney 1971).
Finally, the links to the host star properties can also be studied and contrasted. Concerning the metallicity of the systems harbouring observable planets, we find that both nominal and modified populations reproduce the overall metallicity dependence of the planet mass quite well but differ again on a finer scale. Namely, in the model the correlation is weaker, though only two out of eight mass bins do not fit the observational 95% confidence interval. The difference stems from sub-Neptune planets that have too-high median metallicities in the model.
The reduced planet growth efficiency when increasing the planetesimal size results in a higher fraction of systems coming from high-metallicity systems. This affects all masses, however, such that the median metallicity increases from 0.09 to 0.14 without significantly affecting the mass dependence. This means that our model favours overall high-metallicity systems for both sub-Neptunes and giant planets, and we find it difficult to form the close-packed systems with many sub-Neptunes at low metallicity (HD 40307; Tuomi et al. 2013, HD 136352; Udry et al. 2019). Again, this points to physical processes that are currently missing in the model, such as mechanisms affecting the formation of the solid building blocks (Voelkel et al. 2021, 2022) or the effect of envelope enrichment (e.g. Venturini et al. 2016; Mol Lous et al. 2024).
Looking ahead, many large space- and ground-based instruments and surveys (e.g. NIRPS, GAIA, PLATO, ROMAN, ARIEL, etc.) will release observational datasets that are ideal for future statistical comparison with theoretical population-level predictions due to the homogeneous way the data are obtained (ExoPAG Science Interest Group et al. 2023). Theoretical planet formation models should use these surveys as an opportunity to go beyond the qualitative comparison of specific aspects and perform quantitative statistical comparison in a global way, that is, by comparing the same synthetic population in many aspects to all the different observational datasets. This will reveal the success and limitations of our current understanding of planet formation much more clearly and help answer the many fundamental questions still open (Mordasini & Burn 2024).
Data availability
Table C.1 is available at the CDS via https://cdsarc.cds.unistra.fr/viz-bin/cat/J/A+A/701/A64
Acknowledgements
We thank the anonymous reviewer, whose comments helped improve the quality of the manuscript. A.E. and C.M. acknowledge support from the Swiss National Science Foundation under grant 200021_204847 “PlanetsInTime”. A.E. and E.A. acknowledge the support from The University of Arizona. Parts of this work have been carried out within the framework of the NCCR PlanetS supported by the Swiss National Science Foundation under grants 51NF40_182901 and 51NF40_205606. R.B. acknowledges the financial support from DFG under Germany’s Excellence Strategy EXC 2181/1-390900948, Exploratory project EP 8.4 (the Heidelberg STRUCTURES Excellence Cluster). The results reported herein benefited from collaborations and/or information exchange within the program “Alien Earths” (supported by the National Aeronautics and Space Administration under agreement No. 80NSSC21K0593) for NASA’s Nexus for Exoplanet System Science (NExSS) research coordination network sponsored by NASA’s Science Mission Directorate. Calculations were performed on the Horus cluster at the University of Bern. The plots shown in this work were generated using matplotlib (Hunter 2007).
Appendix A Effect of the initial number of embryos
NGPPS II presented multiple populations with different initial number of embryos per system: 1, 10, 20, 50, and 100. It was found that the giant planet population was relatively independent of the initial number of embryos per system as long as there are at least 10, except for a slight shift in the location of the giants. The 100-embryo population was taken as the nominal population, and this is the one that is mainly analysed here. However, obtaining such a population requires large computational resources, such that a parameter study is unfeasible. To perform such a parameter study with the available computational resources, we have to lower the number of embryos per system to 20.
To check the effect of the initial number of embryos for comparison with HARPS, we consider the cumulative distribution of the observed planet masses, which is shown in Fig. A.1. The comparison shows that the observed mass distribution is insensitive to that parameter, except for the masses of the most massive planets, which are lower in the population with fewer embryos initially. Nevertheless, the difference is of the order of the confidence interval and lower than the difference with the observed planets.
Thus, we confirm the discussion in NGPPS II that the initial number of embryos is significant only for the close-in terrestrial planets. As virtually none of these planets are observable by HARPS because of their too low mass, this model parameter has no significant effects on the results presented in this work. This also means that selecting a lower initial number of embryos for the populations that are used to find a better match with the HARPS sample should not cause any issue.
![]() |
Fig. A.1 Cumulative distribution of the planet masses for biased synthetic populations with different initial number of embryos per system. |
Appendix B Parameter study
To find a population that is optimised for the HARPS/Coralie planets in absolute number and mass distribution, we perform a parameter study with several of the model parameters. Here we present the main results of that study and highlight how we selected the population shown in Sect. 4.
B.1 Planetesimal size
The general efficiency of planet formation in the present model is controlled by the size of the planetesimals. For a given surface density, larger planetesimals would lead to a lower growth rate due to, first, the reduced damping of eccentricities and inclinations by the nebular gas and, second, a smaller enhancement of the capture cross-section by the planet envelope (Podolak et al. 1988).
The results of this study are shown in Fig. B.1, which compare the 20-embryos counterpart of the nominal population (called NG74 in NGPPS II; to keep this parameter the same across the comparison) to other populations where the maximum gas accretion rate criterion is given by the minimum of the Bondi rate and the radial flow of the gas and various planetesimals sizes. We see that the number of giant planets is much more affected by the change of planetesimal size than for the sub-Neptunes. For 300 m planetesimals, giants are dominant, while for 2 km and 3 km the sub-Neptunes are. Increasing the planetesimal size thus solves both the issue of the number of planets and the imbalance between giants and sub-Neptunes. However, the best size for both items is not exactly the same: The population with 2 km planetesimals shows an overall good match for the number of planets while having a small skew towards sub-Neptunes. For the latter point, smaller planetesimals would be beneficial.
Changing the planetesimal size has some additional consequences. For instance the location of the desert slightly shifts towards larger masses as the planetesimal size increases, while the location of the peak of the mass function for giant planets shifts towards lower masses. The latter is due to the longer formation time required with larger planetesimals, resulting in giant planets spending less time in the runaway gas accretion regime.
B.2 Efficiency of migration
As discussed in the Appendix of NGPPS II, Schlecker et al. (2021a) (NGPPS III), and in Sect. 3.4, the synthetic planets are generally too close-in. Hence, we also study if a reduction of the gas-driven migration is able to better reproduce the location of to the observed planets. However, migration not only affects the planets’ locations but also all their formation sequence. Therefore, we also study the effect of planetary migration on the mass function of the synthetic planets. The results of this comparison are shown in Fig. B.2, where we show the mass function obtained by multiplying the migration rate by uniform factors.
Suppressing planet migration allows for more efficient giant planet formation, as already found in simulated planetary systems around very low-mass stars (Schlecker et al. 2022). This skews the distribution of planet masses towards giants. In a sense, it is similar to reducing the planetesimal size, though with limited effects on the overall number of planets. This means that to avoid having too many giant planets, migration cannot be strongly reduced. Even a reduction by 25% (the ‘3/4’ population) leads to too many giant planets. A modest reduction of the order of 10%, however, is beneficial to solving the discrepancy about the optimal planetesimals size for the overall number of planets and the ratio between sub-Neptunes and giants.
As for the planetesimal size, changing the efficiency of gas-driven migration affects the location of the planetary desert and the peak of the mass function of giant planets. There is a positive correlation between migration strength and the location of the gap and the peak. The location of the desert is best reproduced for a migration strength that is nominal or only slightly reduced, while that of the peak could be reproduced with weaker migration. Thus, the slight reduction by about 10% is also beneficial for reproducing the location of gaps and peaks in the planetary mass function.
![]() |
Fig. B.1 Comparison of the planet mass distribution between different biased synthetic populations and the observed population. The nominal population with 20 embryos per system (NG75, same as the orange line in Fig. A.1) is shown with the black line; the others use the minimum of the Bondi rate and radial flow of the gas disc for the disc-limited gas accretion rate with different planetesimal size, as given in the legend. For each of the synthetic populations, we performed 1000 Monte Carlo synthetic observations. The bold line showing the median of these, and the lighter region shows the 95% confidence interval. The results are shown as a kernel density estimate, though rescaled to be absolute. |
![]() |
Fig. B.2 Same as Fig. B.1 but now comparing the effects of the overall efficiency of gas driven migration by multiplying the migration rates by uniform factors given in the legend. All the populations have a planetesimal size of 3 km. The population shown in blue (labelled ‘1/1’) is the same as that in cyan in Fig. B.1. |
Appendix C Survey properties and list of planets
The observed planets used for the comparison in this work are based on the results of the CORALIE and HARPS surveys until the end of 2015. Data from other instruments, such as HIRES/Keck, was not used. The original stellar sample was defined as volume-limited within 50 pc as originally defined in Mayor et al. (2003). Newer measurements of stellar distances, for instance with GAIA parallaxes, were not used alter the sample. This is to maintain the consistency in the baseline duration of the observations across the stars.
The distribution of stellar masses in the sample is provided in Fig. C.1. The vast majority stars in the survey have masses between 0.7 and 1.3 M⊙. The distribution peaks around 0.9 M⊙, with a median value of 0.90 M⊙ and a mean value of 0.92 M⊙.
Binaries, as indicated by a detectable companion within 6 arcsecond or a RV signal that indicate a companion of more than 13 M4 were also excluded. While this criterion remove most binaries, it does not exclude wide binary systems. Among the detected planets, 24 (14% of the total) of them in 20 systems (19%) are in known binary systems.
With the exclusion of the additional four years of data, the analysis was performed as in Mayor et al. (2011). The analysis was performed consistently across the entire sample and the results are reported in Table C.1. For long period planets with incomplete coverage, the eccentricity is forced to zero if it cannot be reliably determined, and this is then used to determine the other parameters.
The data used here may differ from that presented in the discovery publications. For comparison, we provide the values from the literature. Values that differ more than 10% are underlined. Usually, differences in the periods are minimal, while differences in masses can be larger. However, as we are interested in the statistical comparison in this work, this should not be a major concern here.
In addition, the tables also provides the metallicity of the host star. Except for certain stars, the values were taken from SWEET-Cat. In case the entry does not provide a reference for the stellar metallicity, the value was taken from the same reference as for the planet.
![]() |
Fig. C.1 Histogram of the stellar masses in the HARPS/Coralie sample. The median value is indicated with the vertical dashed red line, while the value used for the synthetic population is indicated with the vertical dashed red line. |
References
- Adibekyan, V. 2019, Geosciences, 9, 105 [Google Scholar]
- Adibekyan, V. Z., Figueira, P., Santos, N. C., et al. 2013, A&A, 554, A44 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Alibert, Y., Mordasini, C., & Benz, W. 2004, A&A, 417, L25 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Alibert, Y., Mordasini, C., Benz, W., & Winisdoerffer, C. 2005, A&A, 434, 343 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Alibert, Y., Mordasini, C., & Benz, W. 2011, A&A, 526, A63 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Alibert, Y., Carron, F., Fortier, A., et al. 2013, A&A, 558, A109 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Alibert, Y., Venturini, J., Helled, R., et al. 2018, Nat. Astron., 2, 873 [NASA ADS] [CrossRef] [Google Scholar]
- Andrews, S. M., Wilner, D. J., Hughes, A. M., Qi, C., & Dullemond, C. P. 2010, ApJ, 723, 1241 [Google Scholar]
- Andrews, S. M., Terrell, M., Tripathi, A., et al. 2018, ApJ, 865, 157 [Google Scholar]
- Ansdell, M., Williams, J. P., Trapman, L., et al. 2018, ApJ, 859, 21 [NASA ADS] [CrossRef] [Google Scholar]
- Aoyama, Y., & Bai, X.-N. 2023, ApJ, 946, 5 [NASA ADS] [CrossRef] [Google Scholar]
- Ayliffe, B. A., & Bate, M. R. 2012, MNRAS, 427, 2597 [Google Scholar]
- Baruteau, C., Bai, X., Mordasini, C., & Mollière, P. 2016, Space Sci. Rev., 205, 77 [CrossRef] [Google Scholar]
- Benítez-Llambay, P., Masset, F., & Beaugé, C. 2011, A&A, 528, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Bennett, D. P., Ranc, C., & Fernandes, R. B. 2021, AJ, 162, 243 [NASA ADS] [CrossRef] [Google Scholar]
- Benz, W., Ida, S., Alibert, Y., Lin, D., & Mordasini, C. 2014, in Protostars and Planets VI, eds. H. Beuther, R. S. Klessen, C. P. Dullemond, & T. Henning, 691 [Google Scholar]
- Bertaux, J.-L., & Ivanova, A. 2022, MNRAS, 512, 5552 [NASA ADS] [CrossRef] [Google Scholar]
- Bitsch, B., Trifonov, T., & Izidoro, A. 2020, A&A, 643, A66 [EDP Sciences] [Google Scholar]
- Bodenheimer, P., & Pollack, J. B. 1986, Icarus, 67, 391 [Google Scholar]
- Bodenheimer, P., Hubickyj, O., & Lissauer, J. J. 2000, Icarus, 143, 2 [Google Scholar]
- Burn, R., Schlecker, M., Mordasini, C., et al. 2021, A&A, 656, A72 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Burn, R., Mordasini, C., Mishra, L., et al. 2024, Nat. Astron., 8, 463 [Google Scholar]
- Carrera, D., Raymond, S. N., & Davies, M. B. 2019, A&A, 629, L7 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Chambers, J. 2006, Icarus, 180, 496 [NASA ADS] [CrossRef] [Google Scholar]
- Chambers, J. 2018, ApJ, 865, 30 [Google Scholar]
- Chambers, J. E. 1999, MNRAS, 304, 793 [Google Scholar]
- Charalambous, C., Teyssandier, J., & Libert, A.-S. 2022, MNRAS, 514, 3844 [NASA ADS] [CrossRef] [Google Scholar]
- Chatterjee, S., Ford, E. B., Matsumura, S., & Rasio, F. A. 2008, ApJ, 686, 580 [NASA ADS] [CrossRef] [Google Scholar]
- Cheetham, A., Ségransan, D., Peretti, S., et al. 2018, A&A, 614, A16 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Chrenko, O., Chametla, R. O., Nesvorný, D., & Flock, M. 2022, A&A, 666, A63 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Clarke, C. J., Gendrin, A., & Sotomayor, M. 2001, MNRAS, 328, 485 [NASA ADS] [CrossRef] [Google Scholar]
- Coleman, G. A. L., & Nelson, R. P. 2014, MNRAS, 445, 479 [Google Scholar]
- Cox, E. G., Harris, R. J., Looney, L. W., et al. 2017, ApJ, 851, 83 [Google Scholar]
- Crida, A., Morbidelli, A., & Masset, F. 2006, Icarus, 181, 587 [Google Scholar]
- Cugno, G., Quanz, S. P., Hunziker, S., et al. 2019, A&A, 622, A156 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Currie, T., Lawson, K., Schneider, G., et al. 2022, Nat. Astron., 6, 751 [NASA ADS] [CrossRef] [Google Scholar]
- Dawson, R. I., & Johnson, J. A. 2018, ARA&A, 56, 175 [Google Scholar]
- Delgado Mena, E., Tsantaki, M., Adibekyan, V. Z., et al. 2017, A&A, 606, A94 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Desidera, S., & Barbieri, M. 2007, A&A, 462, 345 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Dittkrist, K. M., Mordasini, C., Klahr, H., Alibert, Y., & Henning, T. 2014, A&A, 567, A121 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Emsenhuber, A., Mordasini, C., Burn, R., et al. 2021a, A&A, 656, A69 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Emsenhuber, A., Mordasini, C., Burn, R., et al. 2021b, A&A, 656, A70 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Emsenhuber, A., Mordasini, C., & Burn, R. 2023, Eur. Phys. J. Plus, 138, 181 [NASA ADS] [CrossRef] [Google Scholar]
- ExoPAG Science Interest Group, Christiansen, J. L., Bennett, D. P., et al. 2023, arXiv e-prints [arXiv:2304.12442] [Google Scholar]
- Fabrycky, D. C., Lissauer, J. J., Ragozzine, D., et al. 2014, ApJ, 790, 146 [Google Scholar]
- Ferraz-Mello, S., Rodríguez, A., & Hussmann, H. 2008, Celest. Mech. Dyn. Astron., 101, 171 [Google Scholar]
- Fischer, D. A., & Valenti, J. 2005, ApJ, 622, 1102 [NASA ADS] [CrossRef] [Google Scholar]
- Flock, M., Turner, N. J., Mulders, G. D., et al. 2019, A&A, 630, A147 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Ford, E. B., & Rasio, F. A. 2008, ApJ, 686, 621 [Google Scholar]
- Fortier, A., Alibert, Y., Carron, F., Benz, W., & Dittkrist, K. M. 2013, A&A, 549, A44 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Gáspár, A., Rieke, G. H., & Ballering, N. 2016, ApJ, 826, 171 [Google Scholar]
- Gonzalez, G. 1997, MNRAS, 285, 403 [Google Scholar]
- Greenzweig, Y., & Lissauer, J. J. 1992, Icarus, 100, 440 [NASA ADS] [CrossRef] [Google Scholar]
- Haffert, S. Y., Bohn, A. J., de Boer, J., et al. 2019, Nat. Astron., 3, 749 [Google Scholar]
- Hara, N. C., Boué, G., Laskar, J., Delisle, J. B., & Unger, N. 2019, MNRAS, 489, 738 [Google Scholar]
- Harris, R. J., Andrews, S. M., Wilner, D. J., & Kraus, A. L. 2012, ApJ, 751, 115 [Google Scholar]
- Hope, A. C. A. 1968, J. Roy. Statist. Soc. Ser. B (Methodological), 968, 582 [Google Scholar]
- Hunter, J. D. 2007, Comput. Sci. Eng., 9, 90 [NASA ADS] [CrossRef] [Google Scholar]
- Ida, S., & Lin, D. N. C. 2004, ApJ, 604, 388 [Google Scholar]
- Ida, S., & Makino, J. 1993, Icarus, 106, 210 [NASA ADS] [CrossRef] [Google Scholar]
- Ida, S., Tanaka, H., Johansen, A., Kanagawa, K. D., & Tanigawa, T. 2018, ApJ, 864, 77 [NASA ADS] [CrossRef] [Google Scholar]
- Inaba, S., & Ikoma, M. 2003, A&A, 410, 711 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Izidoro, A., Bitsch, B., Raymond, S. N., et al. 2021, A&A, 650, A152 [EDP Sciences] [Google Scholar]
- Jackson, B., Barnes, R., & Greenberg, R. 2009, ApJ, 698, 1357 [Google Scholar]
- Jin, S., & Mordasini, C. 2018, ApJ, 853, 163 [Google Scholar]
- Jin, S., Mordasini, C., Parmentier, V., et al. 2014, ApJ, 795, 65 [Google Scholar]
- Jones, H. R. A., Butler, R. P., Tinney, C. G., et al. 2006, MNRAS, 369, 249 [CrossRef] [Google Scholar]
- Jorissen, A., Mayor, M., & Udry, S. 2001, A&A, 379, 992 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Jurić, M., & Tremaine, S. 2008, ApJ, 686, 603 [Google Scholar]
- Keppler, M., Benisty, M., Müller, A., et al. 2018, A&A, 617, A44 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Kley, W., & Dirksen, G. 2006, A&A, 447, 369 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Kobayashi, H., Tanaka, H., & Okuzumi, S. 2016, ApJ, 817, 105 [NASA ADS] [CrossRef] [Google Scholar]
- Lau, T. C. H., Drążkowska, J., Stammler, S. M., Birnstiel, T., & Dullemond, C. P. 2022, A&A, 668, A170 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Lenz, C. T., Klahr, H., & Birnstiel, T. 2019, ApJ, 874, 36 [NASA ADS] [CrossRef] [Google Scholar]
- Lodders, K. 2003, ApJ, 591, 1220 [Google Scholar]
- Lovis, C., Ségransan, D., Mayor, M., et al. 2011, A&A, 528, A112 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Lucy, L. B., & Sweeney, M. A. 1971, AJ, 76, 544 [NASA ADS] [CrossRef] [Google Scholar]
- Lüst, R. 1952, Z. Naturfor. A, 7, 87 [Google Scholar]
- Lynden-Bell, D., & Pringle, J. E. 1974, MNRAS, 168, 603 [Google Scholar]
- Malmberg, D., Davies, M. B., & Heggie, D. C. 2011, MNRAS, 411, 859 [Google Scholar]
- Manara, C. F., Mordasini, C., Testi, L., et al. 2019, A&A, 631, L2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Manara, C. F., Ansdell, M., Rosotti, G. P., et al. 2023, in Astronomical Society of the Pacific Conference Series, 534, Protostars and Planets VII, eds. S. Inutsuka, Y. Aikawa, T. Muto, K. Tomida, & M. Tamura, 539 [Google Scholar]
- Matsuyama, I., Johnstone, D., & Hartmann, L. 2003, ApJ, 582, 893 [NASA ADS] [CrossRef] [Google Scholar]
- Matuszewski, F., Nettelmann, N., Cabrera, J., Börner, A., & Rauer, H. 2023, A&A, 677, A133 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mayor, M., Pepe, F., Queloz, D., et al. 2003, The Messenger, 114, 20 [NASA ADS] [Google Scholar]
- Mayor, M., Marmier, M., Lovis, C., et al. 2011, arXiv e-prints [arXiv:1109.2497] [Google Scholar]
- Mishra, L., Alibert, Y., Leleu, A., et al. 2021, A&A, 656, A74 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mizuno, H. 1980, Progr. Theor. Phys., 64, 544 [CrossRef] [Google Scholar]
- Mol Lous, M., Mordasini, C., & Helled, R. 2024, A&A, 685, A22 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mordasini, C. 2014, A&A, 572, A118 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mordasini, C. 2018, in Handbook of Exoplanets, eds. H. J. Deeg, & J. A. Belmonte (Springer Living Reference Work), 143 [Google Scholar]
- Mordasini, C., & Burn, R. 2024, Rev. Mineral. Geochem., 90, 55 [Google Scholar]
- Mordasini, C., Alibert, Y., & Benz, W. 2009a, A&A, 501, 1139 [CrossRef] [EDP Sciences] [Google Scholar]
- Mordasini, C., Alibert, Y., Benz, W., & Naef, D. 2009b, A&A, 501, 1161 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mordasini, C., Mayor, M., Udry, S., et al. 2011, A&A, 526, A111 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mordasini, C., Alibert, Y., Georgy, C., et al. 2012a, A&A, 547, A112 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mordasini, C., Alibert, Y., Klahr, H., & Henning, T. 2012b, A&A, 547, A111 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mordasini, C., Klahr, H., Alibert, Y., Miller, N., & Henning, T. 2014, A&A, 566, A141 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mordasini, C., Mollière, P., Dittkrist, K. M., Jin, S., & Alibert, Y. 2015, Int. J. Astrobiol., 14, 201 [NASA ADS] [CrossRef] [Google Scholar]
- Mulders, G. D., Pascucci, I., Manara, C. F., et al. 2017, ApJ, 847, 31 [NASA ADS] [CrossRef] [Google Scholar]
- Mulders, G. D., Mordasini, C., Pascucci, I., et al. 2019, ApJ, 887, 157 [NASA ADS] [CrossRef] [Google Scholar]
- Müller, A., Keppler, M., Henning, T., et al. 2018, A&A, 617, L2 [Google Scholar]
- Nakamoto, T., & Nakagawa, Y. 1994, ApJ, 421, 640 [Google Scholar]
- Nayakshin, S., Dipierro, G., & Szulágyi, J. 2019, MNRAS, 488, L12 [NASA ADS] [CrossRef] [Google Scholar]
- Nelson, R. P., Lega, E., & Morbidelli, A. 2023, A&A, 670, A113 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Ogihara, M., Kokubo, E., Suzuki, T. K., & Morbidelli, A. 2018, A&A, 615, A63 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Ohtsuki, K., Stewart, G. R., & Ida, S. 2002, Icarus, 155, 436 [Google Scholar]
- Ormel, C. W., & Klahr, H. H. 2010, A&A, 520, A43 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Ormel, C. W., & Kobayashi, H. 2012, ApJ, 747, 115 [NASA ADS] [CrossRef] [Google Scholar]
- Paardekooper, S., Dong, R., Duffell, P., et al. 2023, in Astronomical Society of the Pacific Conference Series, 534, Protostars and Planets VII, eds. S. Inutsuka, Y. Aikawa, T. Muto, K. Tomida, & M. Tamura, 685 [Google Scholar]
- Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011, J. Mach. Learn. Res., 12, 2825 [Google Scholar]
- Perri, F., & Cameron, A. G. W. 1974, Icarus, 22, 416 [Google Scholar]
- Petigura, E. A., Marcy, G. W., Winn, J. N., et al. 2018, AJ, 155, 89 [Google Scholar]
- Podolak, M., Pollack, J. B., & Reynolds, R. T. 1988, Icarus, 73, 163 [NASA ADS] [CrossRef] [Google Scholar]
- Pollack, J. B., Hubickyj, O., Bodenheimer, P., et al. 1996, Icarus, 124, 62 [NASA ADS] [CrossRef] [Google Scholar]
- Press, W. H., Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. 2002, Numerical Recipes in C++: The Art of Scientific Computing (Cambridge University Press) [Google Scholar]
- Rafikov, R. R. 2004, AJ, 128, 1348 [NASA ADS] [CrossRef] [Google Scholar]
- Ricciardi, G., van Terwisga, S. E., Roccatagliata, V., et al. 2025, A&A, 695, A257 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Rodigas, T. J., Arriagada, P., Faherty, J., et al. 2016, ApJ, 818, 106 [NASA ADS] [CrossRef] [Google Scholar]
- Santos, N. C., Israelian, G., & Mayor, M. 2001, A&A, 373, 1019 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Santos, N. C., Israelian, G., & Mayor, M. 2004, A&A, 415, 1153 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Santos, N. C., Israelian, G., Mayor, M., et al. 2005, A&A, 437, 1127 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Schib, O., Mordasini, C., & Helled, R. 2022, A&A, 664, A138 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Schlecker, M., Mordasini, C., Emsenhuber, A., et al. 2021a, A&A, 656, A71 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Schlecker, M., Pham, D., Burn, R., et al. 2021b, A&A, 656, A73 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Schlecker, M., Burn, R., Sabotta, S., et al. 2022, A&A, 664, A180 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Schulik, M., Johansen, A., Bitsch, B., & Lega, E. 2019, A&A, 632, A118 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Shakura, N. I., & Sunyaev, R. A. 1973, A&A, 24, 337 [NASA ADS] [Google Scholar]
- Stevenson, D. J. 1982, Planet. Space Sci., 30, 755 [NASA ADS] [CrossRef] [Google Scholar]
- Stock, K., Cai, M. X., Spurzem, R., Kouwenhoven, M. B. N., & Portegies Zwart, S. 2020, MNRAS, 497, 1807 [Google Scholar]
- Suzuki, T. K., Muto, T., & Inutsuka, S.-i. 2010, ApJ, 718, 1289 [NASA ADS] [CrossRef] [Google Scholar]
- Suzuki, D., Bennett, D. P., Ida, S., et al. 2018, ApJ, 869, L34 [NASA ADS] [CrossRef] [Google Scholar]
- Tamuz, O., Ségransan, D., Udry, S., et al. 2008, A&A, 480, L33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Thommes, E. W., Duncan, M. J., & Levison, H. F. 2003, Icarus, 161, 431 [NASA ADS] [CrossRef] [Google Scholar]
- Thompson, S. E., Coughlin, J. L., Hoffman, K., et al. 2018, ApJS, 235, 38 [NASA ADS] [CrossRef] [Google Scholar]
- Tobin, J. J., Sheehan, P. D., Megeath, S. T., et al. 2020, ApJ, 890, 130 [CrossRef] [Google Scholar]
- Tokovinin, A., & Kiyaeva, O. 2016, MNRAS, 456, 2070 [NASA ADS] [CrossRef] [Google Scholar]
- Trapman, L., Ansdell, M., Hogerheijde, M. R., et al. 2020, A&A, 638, A38 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Tripathi, A., Andrews, S. M., Birnstiel, T., & Wilner, D. J. 2017, ApJ, 845, 44 [Google Scholar]
- Tuomi, M., Anglada-Escudé, G., Gerlach, E., et al. 2013, A&A, 549, A48 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Tychoniec, Ł., Tobin, J. J., Karska, A., et al. 2018, ApJS, 238, 19 [Google Scholar]
- Udry, S., Mayor, M., Queloz, D., Naef, D., & Santos, N. 2000, in From Extrasolar Planets to Cosmology: The VLT Opening Symposium, eds. J. Bergeron, & A. Renzini, 571 [Google Scholar]
- Udry, S., Dumusque, X., Lovis, C., et al. 2019, A&A, 622, A37 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Udry, S., & Santos, N. C. 2007, ARA&A, 45, 397 [NASA ADS] [CrossRef] [Google Scholar]
- Valletta, C., & Helled, R. 2020, ApJ, 900, 133 [NASA ADS] [CrossRef] [Google Scholar]
- Venturini, J., Alibert, Y., & Benz, W. 2016, A&A, 596, A90 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Venuti, L., Bouvier, J., Cody, A. M., et al. 2017, A&A, 599, A23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Methods, 17, 261 [Google Scholar]
- Voelkel, O., Deienno, R., Kretke, K., & Klahr, H. 2021, A&A, 645, A132 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Voelkel, O., Klahr, H., Mordasini, C., & Emsenhuber, A. 2022, A&A, 666, A90 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Weder, J., Mordasini, C., & Emsenhuber, A. 2023, A&A, 674, A165 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Weiss, L. M., Millholland, S. C., Petigura, E. A., et al. 2023, in Astronomical Society of the Pacific Conference Series, 534, Protostars and Planets VII, eds. S. Inutsuka, Y. Aikawa, T. Muto, K. Tomida, & M. Tamura, 863 [Google Scholar]
- Wright, J. T., Upadhyay, S., Marcy, G. W., et al. 2009, ApJ, 693, 1084 [NASA ADS] [CrossRef] [Google Scholar]
- Wright, J. T., Marcy, G. W., Howard, A. W., et al. 2012, ApJ, 753, 160 [Google Scholar]
- Youdin, A. N., & Goodman, J. 2005, ApJ, 620, 459 [Google Scholar]
- Zakamska, N. L., Pan, M., & Ford, E. B. 2011, MNRAS, 410, 1895 [NASA ADS] [Google Scholar]
- Zhu, W., & Dong, S. 2021, ARA&A, 59, 291 [NASA ADS] [CrossRef] [Google Scholar]
- Zurlo, A., Cugno, G., Montesinos, M., et al. 2020, A&A, 633, A119 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
This paper has not (yet) been published in a refereed journal. The reason is that at the time, 41 planets used in the statistical analysis had not previously been announced in separate refereed detection papers. In the meantime, almost all planets (except seven out of 169 in total in the updated sample) have gone through this process. As the survey continued between 2011 and 2015, new planets were detected around the stars belonging to our sample. These new planets have been included in the table (Appendix C).
All Tables
All Figures
![]() |
Fig. 1 Mass-distance of the (unbiased) NG76longshot population at an age of 5 Gyr. Each point corresponds to one synthetic planet. The population consists of 1000 synthetic planetary systems. Colours represent the bulk composition: red are planet where the H/He mass is larger than the mass of solids (the core mass). Open circles are planets containing H/He but with a core mass higher than the envelope mass. Filled circles correspond to planets without H/He. Additionally, blue (green) indicates that planets have accreted more (less) than 1% of their core mass as ices. Horizontal bars reach from the peri- to the apoastron to represent eccentricities. Finally, black crosses show the Solar System for comparison. |
| In the text | |
![]() |
Fig. 2 Unbiased mass function for planets within a < 3 au at three moments in time. The blue solid line shows the distribution at 100 Myr (NG76longshot) while the black dotted one is at 20 Myr (NG76). The red dashed line additionally shows the situation at the moment when the gas disc disappears in the systems (corresponding to 3 Myr on average). The reduction of the number of low-mass (proto)planets of mainly (sub-)Mars to Earth mass and the associated increase of more massive super-Earth and (sub-)Neptunian planets is visible. It is a consequence of giant impacts during the post-disc phase. |
| In the text | |
![]() |
Fig. 3 Average probability of planet detection from the HARPS/Coralie survey as function of a planet’s period and minimum mass (Mayor et al. 2011). |
| In the text | |
![]() |
Fig. 4 Histograms of the number of detected planets per system in both the HARPS/Coralie surveys (red) and the biased synthetic population (black). The histogram of the synthetic system is based on 1000 samples of the synthetic population, with the solid line denoting the median while grey band shows the 95% confidence interval. The left panel compares the absolute number of systems, while the right panel compares the relative number of systems that have detected planets. |
| In the text | |
![]() |
Fig. 5 Comparison of the planet masses between the synthetic biased population (NG76, black) and the HARPS/Coralie sample (red). The data of the synthetic population is based on 1000 Monte Carlo synthetic observations, with the bold line showing the median of these, the grey region showing the 95% confidence interval, and the thin black lines indicating ten individual realisations. Left: kernel density estimate (scaled to be absolute) with a smoothing length of 0.19 dex, with the addition of the synthetic population scaled down to the same total number of planets as the observed sample (blue). Right: cumulative distributions of the same data. The statistical analysis shows that the samples from the synthetic population do not match the detected ones at the 5% level because the model under-predicts the number of sub-Neptunes and over-predicts the number of giants. |
| In the text | |
![]() |
Fig. 6 Comparison of the planet periods between the biased synthetic population (NG76, black and blue lines) and the observed HARPS/Coralie sample. The curves have the same meaning as in the left panel of Fig. 5. |
| In the text | |
![]() |
Fig. 7 Left: comparison of the minimum mass (M sin i) versus orbital period of the biased synthetic 100-embryo population (NG76, black circles) and the actual HARPS/Coralie survey (red crosses; M11). Both samples have 822 systems. Right: same but only for the synthetic population and with different colours and symbols depending on planet properties. Red circles indicate planets whose H/He envelope content is larger than the core; otherwise, green circles are for planets whose cores do not contain any ices, and blue is for the planets whose cores contain ices. In the latter case, circles are for planets whose envelope mass is at least 10% that of the core, and crosses are for the others. |
| In the text | |
![]() |
Fig. 8 Stellar metallicity versus planet mass (M sin i) for the biased 100-embryo population (NG76, in black) and the HARPS/Coralie survey (M11, in red). The scatter plot shows one random observation of the synthetic population against the HARPS/Coralie planets. The lines show the medians in the respective range. For the synthetic planets, the values are obtained over 1000 simulated observations of the underlying population and showing the 95% confidence interval. |
| In the text | |
![]() |
Fig. 9 Cumulative distribution (left) and histogram (right) of the period ratios of adjacent planets for the biased 100-embryo population (NG76, in black), the same population but with N-body interactions extended to 100 Myr (NG76longshot, in blue), and the HARPS/Coralie surveys (M11, in red). For the synthetic planets, the values are obtained over 1000 simulated observations of the underlying population and showing the 95% confidence interval. The number of observable pairs is limited, as only systems with multiple planets are usable. The locations of several MMRs are highlighted. |
| In the text | |
![]() |
Fig. 10 Cumulative distribution (left) and histogram (right) of the planets’ eccentricities for the biased 100-embryo population (NG76, in black), the same population but with N-body interactions extended to 100 Myr (NG76longshot, in blue), and the HARPS/Coralie surveys (M11, in red). For the synthetic planets, the values are obtained over 1000 simulated observations of the underlying population and showing the 95% confidence interval. |
| In the text | |
![]() |
Fig. 11 Eccentricity as a function of planet mass (M sin i) for the biased 100-embryo population (NG76, in black) and the HARPS/Coralie surveys (M11, in red). The scatter plot shows one random observation of the synthetic population against the HARPS/Coralie planets. The lines show the medians in the respective range, but one should note the low number of planets in the bins on the right. For the synthetic planets, the values are obtained over 1000 simulated observations of the underlying population and showing the 95% confidence interval. |
| In the text | |
![]() |
Fig. 12 Histogram of the number of planets per systems in both the HARPS/Coralie surveys (red) and the optimised synthetic population NG192 (green). |
| In the text | |
![]() |
Fig. 13 Cumulative distribution of the planet masses in the optimised synthetic population NG192 (in green), the canonical 100 embryo population (in black), and the observed population (in red). |
| In the text | |
![]() |
Fig. 14 Mass–period diagram comparing one random mock observation of the optimised synthetic population NG193 (in green) and the actual HARPS/Coralie planets (in red). |
| In the text | |
![]() |
Fig. A.1 Cumulative distribution of the planet masses for biased synthetic populations with different initial number of embryos per system. |
| In the text | |
![]() |
Fig. B.1 Comparison of the planet mass distribution between different biased synthetic populations and the observed population. The nominal population with 20 embryos per system (NG75, same as the orange line in Fig. A.1) is shown with the black line; the others use the minimum of the Bondi rate and radial flow of the gas disc for the disc-limited gas accretion rate with different planetesimal size, as given in the legend. For each of the synthetic populations, we performed 1000 Monte Carlo synthetic observations. The bold line showing the median of these, and the lighter region shows the 95% confidence interval. The results are shown as a kernel density estimate, though rescaled to be absolute. |
| In the text | |
![]() |
Fig. B.2 Same as Fig. B.1 but now comparing the effects of the overall efficiency of gas driven migration by multiplying the migration rates by uniform factors given in the legend. All the populations have a planetesimal size of 3 km. The population shown in blue (labelled ‘1/1’) is the same as that in cyan in Fig. B.1. |
| In the text | |
![]() |
Fig. C.1 Histogram of the stellar masses in the HARPS/Coralie sample. The median value is indicated with the vertical dashed red line, while the value used for the synthetic population is indicated with the vertical dashed red line. |
| In the text | |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

















