Open Access
Issue
A&A
Volume 704, December 2025
Article Number A81
Number of page(s) 14
Section Galactic structure, stellar clusters and populations
DOI https://doi.org/10.1051/0004-6361/202555964
Published online 08 December 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1 Introduction

The study of the Milky Way (MW) provides a unique perspective on galaxy assembly history, serving as a crucial benchmark for theories of galaxy formation and evolution. Among the various stellar systems within the MW, globular clusters (GCs) play a fundamental role as ancient relics that encode essential information about the galaxy’s assembly history. A key aspect of MW GCs is their diverse origins: while some are thought to have formed in situ within the progenitor of the MW, others were accreted (ex situ) through the mergers of satellite galaxies (Forbes & Bridges 2010; Massari et al. 2019; Kruijssen et al. 2019b; Helmi 2020; van den Bergh 2012; Leaman et al. 2013; Mackey & van den Bergh 2005). Studies of GCs in external galaxies, such as the analysis of accreted GCs in M31 by Andersson & Davies (2019), provide complementary insights into the processes of GC accretion and the signatures of past merger events.

Identifying and characterizing these two populations is essential for reconstructing the accretion history and evolutionary pathway of our Galaxy. This has become possible thanks to the full six-dimensional phase-space information available for nearly all Galactic GCs from the third data release of the Gaia mission (Gaia Collaboration 2021), which has provided unprecedented insights into their dynamics (Vasiliev & Baumgardt 2021). In principle, these data contain fundamental information about the origin of the clusters themselves, that is whether a given GC formed within the MW or originated in a satellite galaxy that was later accreted. However, the question that the community is currently facing is how to analyse and interpret these data in order to discern the origin of Galactic GCs. Various methods (see, for example, Massari et al. 2019; Forbes & Bridges 2010; Kruijssen et al. 2020, 2019a; Malhan et al. 2022; Myeong et al. 2018) have been developed to classify GCs as in–situ or ex–situ, leveraging their kinematic, spatial, age, and chemical abundance properties with the latter being provided in part by spectroscopic surveys (e.g. APOGEE Majewski et al. 2017). In particular, the distribution of GCs in integral-of-motion spaces, such as the energy-angular momentum ELz plane, is in-homogeneous. This in-homogeneity suggests the presence of groups of GCs with distinct spatial and kinematic properties. Clusters located close to each other in this integral-of-motion space are thought to share a common origin, potentially reflecting different formation histories, either in situ or through accretion (Chen & Gnedin 2024; Belokurov & Kravtsov 2023; Massari et al. 2019; Malhan et al. 2022; Pfeffer et al. 2020; Sun et al. 2023). More recently, novel approaches which have a complemented analysis in the ELz plane with mean chemical abundance ratios such as [Al/Fe] have also been introduced to refine the quest of accreted and in situ GCs (Belokurov & Kravtsov 2024).

However, Pagnini et al. (2023) challenge the idea of a dynamical coherence among accreted GCs, emphasizing that even the debris from a single merger event can span a wide range in the ELz space, as well as in all kinematic spaces used so far, including actions space. They argue that the assumption that accreted clusters should exhibit a clear dynamical coherence, that is a tendency to cluster in kinematic spaces, remains unproven, and that while physically motivated by the conservation of energy and angular momentum in static, axisymmetric potentials, this expectation may not hold in more realistic, time-dependent galactic environments. It is important to note that E and Lz are not conserved quantities in a time-evolving non-axisymmetric galaxy (Binney & Tremaine 2008), due to the significant growth of the galaxy’s mass over time and the mergers of massive satellites, which perturb both the accreted and in situ components of the galaxy. As a result, solely relying on the present-day (z = 0) distribution in the ELz space to infer the origin of GCs can be misleading, requiring additional information and potentially the study of their temporal evolution. The results in Pagnini et al. (2023) are consistent with findings for field stars, for which it is also challenging to uncover their nature using kinematic spaces (Jean-Baptiste et al. 2017; Amarante et al. 2022), even when supplemented with abundance information (Khoperskov et al. 2023a,b; Mori et al. 2024). The difficulty encountered in establishing the origin of the stars and GCs in our Galaxy by making use of energy and angular momentum (eventually supplemented by other properties) is illustrated for example by the case of Omega Centauri (NGC 5139), which is likely the former nuclear star cluster of an accreted galaxy, as suggested – among other unusual properties – by its broad metallicity spread (Lee et al. 1999; Majewski et al. 2000; Carraro & Lia 2000; Bekki & Freeman 2003; Tsuchiya et al. 2004, 2003). While some classifications might place it among in situ GCs (Belokurov & Kravtsov 2024; Chen & Gnedin 2024), others support an ex situ origin for this system (Massari et al. 2019; Forbes & Bridges 2010; Pfeffer et al. 2021).

Studying the dynamics of GCs over cosmological timescales and within cosmological environments is crucial for understanding the formation and evolutionary history of galaxies such as the MW. However, due to the vast spatial and temporal scales involved, simulating these objects presents significant challenges, leading to the adoption of various approaches, each with their inherent limitations (for reviews, see Beasley (2020); Renaud (2020)). Three main strategies have emerged for modelling GC dynamics, incorporating increasingly sophisticated models of GC formation. A common approach relies on idealized galaxy simulations with pc resolution, which enables the study of clustered star formation at high redshift. However, most of these simulations do not robustly reproduce GCs in terms of their sizes, masses, or chemical properties (Lahén et al. 2020; Li et al. 2022; Deng et al. 2024; Andersson et al. 2024). Besides, these simulations can only model faint galaxies and in small numbers (typically one or two) and are limited in their evolutionary timescale (typically up to 1 Gyr) due to their high computational cost. Another strategy involves high-resolution cosmological zoom-in simulations, which enhance spatial resolution and allow for the study of GC formation in a more realistic cosmological setting (Kravtsov & Gnedin 2005; Li et al. 2017; Kim et al. 2018; Ma et al. 2020; Meng & Gnedin 2022; Sameie et al. 2023; Dubois et al. 2021). While these simulations achieve extremely high mass resolution (resolving dwarf galaxies) and spatial resolution (down to 10 pc), they are typically restricted to high redshifts (z = 3–5). To overcome these spatial and temporal limitations, one approach involves incorporating semi-analytical models within hydrodynamical simulations to form and track the evolution of GCs (Pfeffer et al. 2018; Reina-Campos et al. 2022; Grudić et al. 2023; Newton et al. 2025; Rodriguez et al. 2023). However, this does not reduce the computational cost of hydrodynamical simulations, which remain limited to a small number of galaxies (typically 1–21 MW-like galaxies). To achieve larger statistical samples, an alternative approach is to apply tagging techniques in post-processing (Bullock & Johnston 2005; Cooper et al. 2010; Laporte et al. 2013; Renaud et al. 2017; Ramos-Almendares et al. 2020; Halbesma et al. 2020; Park et al. 2022; Doppel et al. 2023; Chen & Gnedin 2023; Creasey et al. 2019). In these methods, GCs are “painted” onto simulation particles (either dark matter (DM) or stars) following specific prescriptions. These techniques are computationally inexpensive, as they do not require rerunning simulations to modify the GC formation and evolution model parameters, and they enable the study of a large number of galaxies (up to N = 8000) in diverse environments. Although various tagging techniques exist to study GCs, either by tagging GCs at multiple epochs to analyse their properties within galaxies, or by inferring their dynamics from star or DM particles within simulations, no study has applied these methods on a large statistical sample of MW–like galaxies in a self-consistent manner. Our approach stands out by applying it to nearly 200 MW–like galaxies from TNG50, enabling a statistically robust analysis of GC dynamics from formation to z = 0. Therefore, developing a complete picture of GC dynamics requires combining tagging techniques in cosmological hydrodynamical simulations with orbit integration methods. In principle, coupling these approaches could allow for the self-consistent tracking of GC dynamics within a cosmological environment over a Hubble time.

In this paper, we revisit the challenge of distinguishing between in situ and ex situ GC populations in MW-like galaxies, focusing on their identification within the ELz space. Our analysis is conducted in a full cosmological context, including satellite galaxy accretions and a time-evolving potential for the host galaxy, both of which affect GC dynamics. To achieve this, we follow the spatial evolution of a population of tens of thousands of GCs within a cosmological framework, leveraging orbital integration techniques in combination with the TNG50 simulation. The paper is structured as follows: Section 2 outlines our methodology, which describes our post-processing orbital integration method for GCs in cosmological simulations. In Section 3, we present and discuss our results, comparing them with Gaia data for the MW’s GC system. Finally, Section 5 provides our conclusions.

2 Methods: Combining orbital integration methods in post-processing with the TNG50 simulation

To track the dynamics of GCs over cosmological timescales in a galaxy like the MW, it is essential to consider its merger history, which plays a key role in constructing the observed GC population at z = 0. By providing access to the evolutionary history of 198 MW analogues, the hydrodynamical cosmological simulation TNG50 (Nelson et al. 2019a,b; Pillepich et al. 2019) enables us to reconstruct the time-evolving gravitational potentials that account for both the evolution of the MW and its environment through satellite galaxy accretion. TNG501 is a high-resolution cosmological hydrodynamical simulation with a 51.7 Mpc box and 2 × 21603 particles, reaching a baryonic mass resolution of 8.4 × 104 M and 4.5 × 105 M for DM. With a gravitational softening of 288 pc at z=0, it includes detailed models for star formation, chemical enrichment, black hole growth, AGN feedback, and galactic winds (Weinberger et al. 2017; Pillepich et al. 2018). Besides, Pillepich et al. (2024) provide masses and half-mass radii at different redshifts of these MW-analogues and their associated satellites. By assigning a GC population to each MW progenitor and its satellite population at high-z, we can follow their dynamical evolution through orbital integration using the publicly available code galpy2 (Bovy 2015), taking into account both dynamical friction and mass loss for these time evolving MW-like potentials from z = 3 to the present-day. In the following, we first describe the MW-like potentials, then how we defined initial conditions for our GCs systems at high-z, and finally how we performed the orbit integration.

thumbnail Fig. 1

Orbital radius as a function of time for two merging galaxies (IDs 383325 and 299509) with their associated GCs. The points mark the formation and merger times of the satellites from TNG50. During the initial phase, ex situ GCs orbit within the reference frame of their satellite galaxy until their orbital energy becomes positive (E > 0). Once this condition is met, GCs are transferred to the MW’s reference frame while maintaining the moving potential of the satellite galaxy if it has not yet merged.

2.1 TNG50 Milky Way-like potentials

The 198 MW analogues identified in the TNG50 simulation have global properties, such as a stellar mass between 1010.5 M and 1011.2 M and a large-scale environment, that is similar to those of our Galaxy. Up to 7 Gyr backwards in time, the mean stellar mass of the TNG 50 MW analogues is in good agreement with the mass evolution of our Galaxy derived by Snaith et al. (2015) (see Figure 2). We extract the structural parameters (masses and half-mass radii) of the MWs and their associated merging satellites across 75 snapshots from z = 3 to 0. Actually for TNG50, snapshots at all 100 available redshifts, galaxy catalogues at each snapshot and merger trees are released. We focus on identifying all the satellites that merged with our MW-like galaxies during this period, retaining only those with a stellar mass greater than 107 M, in line with the TNG50 mass resolution (4.5 × 105 M for DM particles and 8.5 × 104 M for the stellar component). We modelled the gravitational potentials of MW–like galaxies using a Hernquist profile (Hernquist 1990) for the stellar component and an NFW profile (Navarro et al. 1997) for the DM halo. The parameters were derived from the properties of galaxies in the TNG50 simulation, which provides both total masses and half-mass radii for the stellar and DM components. We then computed the corresponding scale radii of the Hernquist and NFW profiles from the given half-mass radii. These MW-like potentials evolve over time, and the evolution of the masses and scale radii is incorporated into our model. Specifically, we update the potential 75 times during the orbit integrations. Satellite galaxies are also modelled with the same potential components.

To summarize, from the masses and half-mass radii provided by TNG50, we can derive 198 × 75 MW-analogues composite stellar+DM gravitational potentials. Each of these MW analogues is surrounded by its processions of satellite galaxies (on average 4.94 merging satellites/MW-like progenitor are found at z = 3), which are also modelled by composite stellar+DM potentials.

thumbnail Fig. 2

Mass evolution of the stellar and DM components of the MW for our full sample of 198 MW analogues from TNG50. Up to 7 Gyr backwards in time, the mean stellar mass of the TNG50 MW analogues is in good agreement with the mass evolution of our Galaxy derived by Snaith et al. (2015).

2.2 Initial conditions for globular clusters

Once the gravitational potentials of the MW progenitor galaxies and their satellites are defined at high redshift, we describe how a GC system was assigned to each of these galaxies. The initial number of GCs in galaxies is defined by the relation from Burkert & Forbes (2020), which correlates NGC with the virial mass of the halo: logNGC=9.58±1.58+(0.99±0.13)logMvirM.$\[\left\langle\log N_{\mathrm{GC}}\right\rangle=-9.58 \pm 1.58+(0.99 \pm 0.13) \log \frac{M_{\mathrm{vir}}}{M_{\odot}}.\]$(1)

Although this relation does not explicitly include the star formation rate or GC formation efficiency, it likely reflects their cumulative effects as functions of halo mass. This relation depends on the DM mass of the galaxies at the redshift where the clusters are tagged. We included uncertainties as random noise in our modelling. We note that applying this relation, which is calibrated on nearby galaxies, to galaxies at z=2–3 introduces some uncertainty. Indeed, Choksi & Gnedin (2019) show that this relation can evolve by up to a factor of 10 from z = 3 to z = 0. However, such models depend on assumptions about GC formation physics and merger histories that remain unconstrained by direct observations at high redshift. Given the lack of observational data on GC populations at z > 1, we adopt the Burkert & Forbes (2020) relation as a pragmatic, observation-driven approximation. We generate a GC distribution at z = 2 in the progenitors of the MW for the in situ population, and at z = 3 in the progenitors of merging galaxies for the ex situ population. To estimate the number of in situ GCs in MW–like galaxies, we chose to tag them at z = 2, an epoch when the host galaxy’s mass approaches its present-day value. Indeed, the stellar halo was likely accreted between 9 and 11 Gyr ago (Di Matteo et al. 2019; Mackereth & Bovy 2020), and cosmological simulations predict that the MW’s mass reached its asymptotic value around 9–10 Gyr ago (Diemand et al. 2007; Lux et al. 2010; Diemer et al. 2013). The choice of z = 2 to assign GC systems is observationally motivated to reproduce the observed stellar mass–GC number relation at z = 0, based on two major arguments. First, the GC–halo mass relation becomes valid for a larger number of galaxies at this epoch. This relation is particularly reliable once the galaxy’s halo mass exceeds a certain threshold (MDM > 5 × 109 M). At z = 3, many MW–type galaxies have only just reached this mass threshold. They are still in a rapid growth phase through mergers and accretion. By z = 2, most of these galaxies have already accumulated a mass close to their final value. This means the GC–halo mass relation starts to apply more systematically at z = 2, whereas it is more noisy at z = 3. This leads to an underestimation of the total number of in situ GCs at this higher redshift. Second, in a parallel simulation where clusters are tagged earlier, at z = 3, we observe that not only is the initial number of clusters half as large, but their destruction rate by z = 0 is three times higher. This is explained by the longer evolution time in a growing potential, where tidal effects promote disruption. This biases not only their initial number but also the final observable number at z = 0. As a result, the dominance map in energy-angular momentum space is strongly biased in the z = 3 case, artificially overestimating the fraction of ex situ clusters due to the lack of surviving in situ clusters. Table A.1 shows that this model, which recovers only 23.5% of simulated in situ clusters, represents the worst-case scenario. Finally, tagging in situ GCs at z = 2 therefore ensures a more realistic estimate of the total number of in situ clusters observable at z = 0. In cases where our satellites were not yet formed at z = 3 in the simulation, we tag the GCs at the highest possible redshift.

Next, in each sufficiently massive galaxy (MDM > 5 × 109 M) that at least one GC is present, we derive the initial positions and velocities of in situ GCs from stellar distribution functions based on our sample of progenitor galaxies and ex situ GCs from those of the TNG50 stars. For in situ populations, we use the publicly available code AGAMA3 to construct equilibrium galaxy models composed of a stellar bulge and a DM halo for our z = 2 MW progenitor (Vasiliev 2019). For each system, a self-consistent potential was computed via iterative modelling, from which a phase-space distribution of stellar particles is sampled. We select a subset of star particles located within the stellar half-mass radius, forming an initially spherical distribution, to represent in situ GCs. This choice is motivated by the assumption that in situ GCs form preferentially in the central regions of galaxies, where the gas density and thus the likelihood of massive star cluster formation is highest. Within this spherical region, we then select only those particles with angular momentum Lz > 0.6 Lcirc, where Lcirc is the angular momentum of a circular orbit at the same energy. This kinematic selection impose a disk-like initial configuration for the in situ population. Indeed, the stellar sample at z = 2 in the TNG50 MW analogues does not contain enough star particles with Lz > 0.6 Lcirc within the stellar half-mass radius to match the number of in situ GCs expected from the empirical relation of Burkert & Forbes (2020). For ex situ GCs, we only randomly select NGC stars within the stellar half-mass radius of the galaxies. Therefore, our accreted GCs inherit the positions and velocities of the stars present in TNG50 satellites.

We assign a mass of 106 M and a half-mass radius of 10 pc to all GCs. This arbitrary choice is motivated by the fact that it allows us to recover the typical mass (~105 M) and size values of MW GCs today rather than modelling the full mass spectrum of MW clusters. In fact, a mass of 6 × 106 M corresponds to the upper limit of the stellar mass of Galactic GCs 12 Gyr ago (Baumgardt et al. 2019). In Table A.1, we explored the impact of varying the mass choice on our results, which only affects the calculation of dynamical friction and mass loss. It was found that our results remain unchanged within a reasonable initial mass range for GCs (0.5–5 × 106 M). Increasing or decreasing the mass affects only the in situ population, decreasing or increasing its GC number, respectively (see Table A.1). However, we did not explore even lower masses for the following reasons. Our objective is to tag progenitors of present-day MW GCs. These must be sufficiently massive at formation to survive tidal disruption over a Hubble time. Lowering the initial GC mass below 105 M would lead to enhanced disruption, especially in the central regions of galaxies, and very few GCs would survive until z = 0. In our lowest-mass test (5 × 105 M), the in situ GC population becomes significantly harder to identify using our SVM-based classification. The depletion of the central regions due to increased disruption reduces the number of surviving in situ clusters. Consequently, the classification accuracy drops to 41%, compared to 76.4% in our fiducial model. For these reasons, and because our goal is to model the surviving MW GC population at z = 0, we chose a fiducial mass of 106 M. Moreover, dynamical friction for in situ GCs is inefficient over our timescale (11 Gyr) due to the large mass ratio between the MW’s enclosed mass and the GC masses. While the GC half-mass radius is kept constant during our orbit integrations, we account for mass loss over time Kruijssen et al. (2011). For this mass loss model, we adopted the following parameters: γ = 0.62 and t0,⊙ = 21.3 Myr, which correspond to clusters with masses around ~106 M. Therefore, with these parameters, we aim to describe an average behavior over time. We also performed orbital integrations varying the parameters of this mass loss model. Table A.1 shows that the classification is only weakly affected, with the decision boundaries in our fiducial model identifying a higher percentage of in situ clusters than in the alternative model. We point out the limitations of the adopted mass loss model, particularly its inability to account for GC mass loss driven by rapidly varying tidal fields during disk crossings or interactions with molecular clouds. As highlighted by Renaud (2020), such processes are not captured in current cosmological simulations due to insufficient spatial resolution, and their omission could significantly affect predictions of GC mass evolution.

2.3 Orbital integrations

To integrate the orbits of GCs, we require not only the MW gravitational potentials at different epochs (as described in Section 2.1), but also the orbits of all merging satellites relative to the MW-like galaxy. This is essential because each satellite’s trajectory affects the evolution of the GC system in two key ways: by perturbing the spatial distribution of the in situ clusters, and by guiding the accreted clusters during their infall and subsequent evolution within the host galaxy.

The orbits of the satellite galaxies are also retrieved from TNG50. Hovewer, we re-integrate these merging satellite orbits into our MW potentials by taking into account dynamical friction, both to improve the time resolution of orbits that are sometimes highly jagged in the simulation due to non-linear time evolution, and to incorporate them as “moving potentials” in our full MW potential. Figure B.1 compares the orbits of several satellites in an MW and the orbits we re-integrated for the reasons mentioned above. Between 2 and 5 Gyr, the differences between the orbits from TNG50 and our re-integrated orbits stem primarily from the fact that our approach assumes an idealized spherical distribution for both the host galaxy and its satellites, combined with an analytically modelled dynamical friction (see Figure B.1). In contrast, the density field in TNG50 is non-spherical, exhibiting substructures, asymmetries, and more complex local gravitational interactions. Moreover, dynamical friction in TNG50 arises intrinsically from the gravitational interactions between particles.

TNG50 provides 100 snapshots from z = 20 and 0, distributed non-uniformly in time. In all our realizations, the satellite galaxy potentials evolve in both mass and size over time, as expected from cosmological evolution. As they orbit MW galaxies, satellites are subject to dynamical friction, which gradually alters their trajectories. The time-dependent masses and half-mass radii of the satellites are extracted directly from the TNG50 simulation and are used to compute the corresponding dynamical friction forces exerted by the MW-like host. We adopt the Chandrasekhar dynamical friction formula, as implemented in galpy, which closely follows the prescription of Petts et al. (2016).

For GCs, we include both dynamical friction and mass loss in the orbital integrations. At each update of the MW potential, GC mass loss is computed using the model of Kruijssen et al. (2011), which includes contributions from stellar evolution, two-body relaxation, and tidal shocks. In orbital integrations, mass loss impacts the dynamics of the GCs only in the presence of dynamical friction, which is updated at each change of potential and takes into account the mass loss of GCs. The evolution of mass and structural parameters of the host galaxy accounts for the influence of merging satellites, modelled as moving potentials, and includes dynamical friction exerted by the host galaxy on the GCs. We refer to this description as the “MW full potential + dynamical friction”. For comparison, we also consider two alternative potential models: one where the MW full potential evolves over time but without including dynamical friction, and another where the MW potential is fixed without dynamical friction and corresponds to the TNG50 snapshot at z = 0. We refer to these two configurations as “MW full potential” and “MW static potential”, respectively. In our approach, we distinguish between the orbital integrations of ex situ and in situ GCs. Specifically, ex situ GCs first evolve within the potential of their progenitor galaxy, which is a merging galaxy of the MW, and are later accreted by the MW, thus evolving in its potential (see Figure 1). During the initial phase, ex situ GCs orbit within the reference frame of their satellite galaxy until their orbital energy becomes positive (E > 0). Once this condition is met, GCs are transferred to the MW’s reference frame while maintaining the moving potential of the satellite galaxy if it has not yet merged. This procedure is necessary because the calculation of dynamical friction from a galaxy is only possible within that galaxy’s reference frame in galpy. Once accreted at a given redshift, ex situ GCs evolve like in situ GCs in the full MW potential. We have verified that once tagged, both ex situ and in situ GCs remain bound to their host galaxies (E < 0). We have checked that the distribution of in situ GCs are bound to the various progenitors of the MW by computing the normalized total energy at z = 2.

To compute the orbits, we used a fast C integrator, dop853c, implemented in galpy. This is an explicit Runge–Kutta method of order 8(5,3), which offers high accuracy and efficiency for integrating complex dynamical systems. Our orbital-time resolution is set to 2 Myr (500 time steps per Gyr), which is sufficient to obtain smooth and accurate orbits over these time scales. We apply this procedure to our full sample of 198 MW analogues from TNG50. Regarding the computational performance of our model, orbital integrations between z = 3 and 0 for our entire sample take between 1 and 440 CPU hours. This variation depends on the potential description, as dynamical friction significantly slows down the code.

3 Results

In the following, we only consider ex situ GCs with negative total energy at z = 0, i.e. those that remain gravitationally bound to the MW, since we identified a small fraction of ex situ GCs that are unbound at z = 0, representing 2% and 10% of the ex situ population in the static and full potential + dynamical friction models, respectively. In fact, during their escape from the satellite potential, either due to satellite evolution or merger, these GCs have sufficient energy to not be trapped within the MW potential. A smaller number of wandering GCs in the static potential is found, as expected, since it is described by the most massive MW potential at z = 0. Then, in order to fairly compare the populations of our 198 MW analogues, which have different mass distributions as shown in the Figure 2, we chose to normalize ELz space.

In our analysis, the energy E is normalized by the absolute value of the energy of a circular orbit at the stellar half-mass radius, Ecirc(rhm). rhm provides a physically meaningful scale for two main reasons: (i) it approximately marks the transition between the in situ and ex situ GC populations, as in situ clusters typically form within this radius; and (ii) this radius is directly measurable in observations, enabling robust comparisons with simulations. The angular momentum Lz is normalized by GMtotrhm$\[\sqrt{G M_{\mathrm{tot}} r_{\mathrm{hm}}}\]$, where Mtot is the total mass of the host galaxy, including DM. This normalization captures the influence of the global gravitational potential on the orbital dynamics of GCs, particularly for those on wide orbits. It also allows for meaningful comparisons between systems of different total mass while maintaining a common baryonic scale via rhm. Since the gravitational potentials used by McMillan (2017) and TNG50 also differ, normalizing energies and angular momenta at their stellar half-mass radius allows a fair comparison between them. In addition, varying the choice of potential model only slightly affects the classification results, provided the model remains consistent with the observational constraints, as noted by Chen & Gnedin (2024).

We followed the dynamics of 17 726 GCs, including 13 339 in situ and 4387 ex situ, across three different MW potential descriptions: MW static potential, MW full potential, and MW full potential + dynamical friction (see Section 2.1 for the definitions of potentials). Thanks to our mass-loss implementation from Kruijssen et al. (2011), we can determine the final mass of the GCs, especially those that have survived. The following plots (Figures 3, 4 and 5) show the population of surviving GCs. In these figures, we show the limit defined by Belokurov & Kravtsov (2024). Since we aim to compare our results in the energy-angular momentum space of the MW best-fitting potential from McMillan (2017), we computed the McMillan version of this limit by plotting the GCs in our normalized ELz space and adjusting the limit so that it separates the same clusters identified as in situ or ex situ by Belokurov & Kravtsov (2024). On average, only 0.01% of the ex situ GCs are destroyed, compared to 35% of the in situ ones. This directly echoes a similar result from Renaud et al. (2017), who found that in situ GCs experience significantly stronger tidal forces than their accreted counterparts. Assuming an initial GC mass of 106 M, we observe that, by z = 0, the mass of surviving clusters spans a wide range from 3.2 × 103 M up to 5 × 105 M. Interestingly, the ex situ population typically retains a mass close to its initial value, highlighting their relative resilience to tidal disruption in MW-like environments.

thumbnail Fig. 3

In situ populations at z = 0: normalized total energy as a function of the normalized z-component of the angular momentum at z = 0 for all our in situ GCs for the 198 MWs with three different descriptions for the MW environment (see Section 2.1 for the definitions of potentials). The hexagonal bins represent at least 1 GC. The red dashed line represents the Belokurov & Kravtsov (2024) limit.

thumbnail Fig. 4

Ex situ at z = 0: as in Fig. 3 but for our ex situ GCs for the 198 MWs.

3.1 In situ globular clusters

Figure 3 shows that the evolution of the potential and dynamical friction have a negligible effect on the distribution of in situ GCs in the ELz space. As a matter of fact, the evolution of the potential will stretch some GCs towards higher energies, but most clusters are concentrated at E<1.2|Ecirc(rhm)|$\[E<-1.2|E_{\text {circ}}(r_{\text {hm}}^{*})|\]$. We observe no variation in this dense cluster region across the three descriptions. Since the potential evolves gradually, GCs can simply adjust adiabatically without significant diffusion in the ELz plane. This is expected, as the mass growth of the MW in CDM framework occurs relatively smoothly. Regarding dynamical friction, its negligible impact is also expected, given that it is generally negligible in MW-like galaxies due to the large mass ratio between the enclosed MW mass and GC mass. Moreover, Figure 3 clearly shows that it is highly unlikely to find in situ GCs at E>0.7|Ecirc(rhm)|$\[E>-0.7|E_{\text {circ}}(r_{\mathrm{hm}}^{*})|\]$.

thumbnail Fig. 5

Comparison with Gaia MW GCs: normalized total energy as a function of the normalized z-component of the angular momentum at z = 0 for both in situ and ex situ GCs for the 198 MWs in the full MW potential + dynamical friction. The decision boundaries in dashed lines corresponding to probabilities of 0.05 (magenta), 0.5 (black), and 0.95 (green) are plotted to illustrate the separation between the two populations. Nephele GCs are the six GCs brought by the progenitor of Omega Centauri identified by Pagnini et al. (2025). The 0.5 boundary results in an error of 1.35% for in situ GCs and 11.2% for ex situ GCs. The extreme boundary at 0.95 (0.05) excludes 99.72% (97.3%) of in situ (ex situ) GCs while identifying 81.2% (76.4%) of ex situ (in situ) GCs. We establish that between these two limits, distinguishing the two populations is not feasible with previous methods due to the presence of a mixing zone. The orange dashed line represents the McMillan version of Belokurov & Kravtsov (2024) limit.

3.2 Ex situ globular clusters

Figure 4 depicts that the ex situ GC distribution is more spread out in energy in the full MW potential compared to the static potential, as the evolving potential is less massive and can sustain orbits at higher energies than the MW potential at z = 0. Dynamical friction slightly reduces the GC density distribution in the high-energy range. It also confirms the presence of an overlap between the two populations in MW-like galaxies, occurring below an energy threshold of E<0.7|Ecirc(rhm)|$\[E<-0.7|E_{\text {circ}}(r_{\text {hm}}^{*})|\]$, as ex situ clusters span the entire energy range (see Figure 4). We emphasize that ex situ GCs are present across the full energy range, invalidating the limit proposed by Belokurov & Kravtsov (2024); Massari et al. (2019); Callingham et al. (2022); Chen & Gnedin (2024). In practical terms, this threshold can at best be used to exclude the in situ population. Besides, we note that very few ex situ clusters are found below approximately 1.25|Ecirc(rhm)|$\[-1.25|E_{\text {circ}}(r_{\text {hm}}^{*})|\]$.

3.3 Comparison with Gaia MW GCs

Figure 5 compares our results in the full MW potential, including dynamical friction, with the distribution of GCs in the MW as provided by Gaia (Vasiliev & Baumgardt 2021) in the normalized space ELz. For the observed clusters, we assumed the potential from McMillan (2017), a multi-component analytic model of the MW calibrated to reproduce a wide range of observables. This MW potential has an stellar half-mass radius of 4.6 kpc. For our simulated clusters, we used a hexagonal density plot, where the colour represents the dominant origin of the GCs (ex situ or in situ). Dominance is defined as the ratio (NexsituNinsitu)/(Nexsitu + Ninsitu), where Nexsitu and Ninsitu are the number of ex situ and in situ GCs, respectively. Red indicates a predominance of ex situ clusters (+1), while blue corresponds to in situ clusters (–1). As highlighted in previous Figures 3 and 4, our dominance map confirms that ex situ clusters dominate for E>0.7|Ecirc(rhm)|$\[E>-0.7|E_{\text {circ}}(r_{\text {hm}}^{*})|\]$, while in situ clusters dominate for E<1.25|Ecirc(rhm)|$\[E<-1.25|E_{\mathrm{circ}}(r_{\mathrm{hm}}^{*})|\]$ for positive Lz. More specifically, our density map reveals that the dominance of ex situ clusters extends up to 0.9|Ecirc(rhm)|$\[-0.9|E_{\text {circ}}(r_{\text {hm}}^{*})|\]$, covering a larger fraction of the observed MW GCs distribution. Our results indicate that this boundary underestimates the number of ex situ clusters: in fact, 24% of our ex situ clusters fall below the Belokurov & Kravtsov (2024) threshold. Furthermore, if we were to adopt this boundary, our results suggest a misclassification of 10% of the clusters identified as in situ, which are actually ex situ. Finally, our results challenge simple energy threshold models, which we demonstrate to fail in clearly separating the two populations, and merely identifies a subset of ex situ GCs. According to Figure B.1, our model tends to overestimate dynamical friction during the early orbital evolution of satellites, leading to a faster loss of orbital energy. However, the figure also shows that at the time of satellite disruption, our model overestimates the satellites’ orbital energy and consequently that of the GCs, resulting in an underestimation of the number of low-energy ex situ objects. This implies that our boundaries should be lowered in Figure 5. This further calls into question the energy threshold proposed by previous authors, which appears to exclude too large a fraction of ex situ objects. Moreover, Figure 5 clearly highlights that the real challenge lies in distinguishing the two populations below this energy threshold, where an overlap of both populations persists up to |Ecirc(rhm)|$\[-|E_{\text {circ}}(r_{\text {hm}}^{*})|\]$. The existence of this mixing zone is further supported by Andersson & Davies (2019), who predict that 50% of the GCs located in the central region of M31 were accreted.

3.4 Our new approach

This motivates our proposal for new boundaries that allow for the identification of a larger fraction of the ex situ population while also capturing part of the in situ population. We employed a support vector machine (SVM)4, a supervised learning method, to classify the synthetic in situ and ex situ populations in the ELz space. This algorithm aims to find an optimal hyperplane that separates the data in such a way as to maximize the margin between the two GC populations. We employed a radial basis function (RBF) kernel modelled by a squared exponential function, in order to transform the input space so that the data becomes linearly separable in a higher-dimensional feature space. The SVM was trained using this RBF kernel, with parameters chosen to minimize the classification error rate for both populations. In Figure 5, the decision boundaries corresponding to probabilities of 0.05, 0.5, and 0.95 are plotted to illustrate the separation between the two populations. The 0.5 boundary results in an error of 1.35% for in situ GCs and 11.2% for ex situ GCs. The extreme boundary at 0.95 (0.05) excludes 99.72% (97.3%) of in situ (ex situ) GCs while identifying 81.2% (76.4%) of ex situ (in situ) GCs. We establish that between these two limits, distinguishing the two populations is not feasible with previous methods due to the presence of a mixing zone. The in situ boundary approximately corresponds to an energy thresholds of −1. This is why, in Figure 6, where clusters are sorted from the least to the most bound in energy, Gaia MW GCs within the region of overlap are represented by squares (either in situ (in blue) or ex situ (in red)) without any border. We provide a new classification based on the 0.5 boundary established from the dynamics of our synthetic clusters within the full gravitational potential of 198 MW realizations. We used this threshold to identify ex situ (in situ) GCs, shown in red (blue) (see Figure 6). We also compare our updated classification of the MW GCs with previous studies (Massari et al. 2019; Malhan et al. 2022; Sun et al. 2023; Belokurov & Kravtsov 2024; Chen & Gnedin 2024). We would like to stress that the classifications shown in Figure 6 are taken directly from the original studies and are not recalculated by us. Since our ex situ boundary lies at a lower energy threshold compared to Belokurov & Kravtsov (2024), Sagittarius GCs are classified as ex situ, as expected.

Among the 160 MW GCs, we identify 67 as ex situ, 38 as in situ, and 55 within the mixing zone. This why chemical abundance could provide additional constraints to disentangle these origins, as demonstrated by Pagnini et al. (2025). If we adopt our decision threshold (0.5 boundary), meaning that the probability of belonging to either population is equal, we find an ex situ to in situ number ratio close to 1.02 (79 in situ versus 81 ex situ). This contrasts with Belokurov & Kravtsov (2024) and Chen & Gnedin (2024), who obtain reversed ratios of approximately 0.52 (105 in situ versus 55 ex situ) and 0.6 (91 in situ versus 55 ex situ), respectively, while Sun et al. (2023) reports a more balanced ratio of 1.2 (70 in situ versus 83 ex situ).

Figure 6 clearly shows that the 33 most tightly bound GCs (with lower energy than VVV_CL001) identified as in situ are consistently found across all six studies. We also observe that our results for ex situ GCs are in good agreement with the classifications from Massari et al. (2019); Malhan et al. (2022); Sun et al. (2023), except for a few clusters that were designated as in situ in their classifications. Intriguingly, these discrepant GCs have all been identified as “Disk” GCs, meaning they exhibit properties such as low eccentricity (implying nearly circular orbits) and pericentre and apocentre radii similar to those of stars in the Galactic disk. This suggests that associating certain GCs with the disk based solely on these criteria may not be sufficiently constraining. Finally, all clusters located in the mixing zone (squares without any border in the left panel of Figure 6) have been classified as in situ in previous studies, except for Sun et al. (2023). In their study, they identify a group of 26 GCs (‘Pot’) that are not associated with the bulge, the disk, or known major merger events. According to Sun et al. (2023), these clusters may originate from small accretion events and are therefore classified as ex situ. Once again, the difficulty in distinguishing them from other GCs arises from their location in the region of overlap, where both populations overlap.

Concerning Omega Centauri (NGC 5139 in bold green), which is believed to be the nuclear star cluster of a galaxy accreted by the MW in the past, our analysis classifies it as ex situ in contrast to most results in the literature (see Figure 6). Furthermore, Pagnini et al. (2025) identified six GCs brought by the progenitor of NGC 5139 by combining Gaia EDR3 data with chemical abundances from APOGEE DR17. NGC 6752, NGC 6656, NGC 6809, NGC 6273, NGC 6205, and NGC 6254 have a fraction of stars compatible with Omega Centauri greater than 60%. Our results confirm this prediction by identifying three of these clusters as ex situ (with energies lower than or comparable to that of NGC 5139), while the remaining three lie in the mixing zone, with an uncertain but possibly ex situ origin only for NGC 6273. Finally, we note that two of GCs identified as ex situ (NGC 6752 and NGC 6656) are those with a compatibility greater than 80% with Omega Centauri.

Concerning another specific GC, NGC 6397, which is the second closest GC to the Sun, it is a low-mass and very compact cluster. Due to its proximity to the MW centre, it should have experienced numerous tidal interactions, making it a strong candidate for the development of tidal tails. Moreover, Balbinot & Gieles (2018) estimated that it should have lost 72% of its initial mass. Actually, simulations of NGC 6397 generally predict prominent and extended tidal tails (kpc scale) around this cluster (Boldrini & Vitral 2021; Arnold & Baumgardt 2025). However, the existence of tidal tails around NGC 6397 remains debated. Leon et al. (2000) reported overdensities that were classified as unreliable due to uncertainties in the dust distribution around the cluster. More recently, Ibata et al. (2024) detected a possible tail extending over 18 degrees on the sky using Gaia data, but these findings were challenged by Boldrini & Vitral (2021), who did not detect any tidal tail. Ultimately, identifying potential stream members of NGC 6397 in Gaia data with precision remains a complex challenge. This is crucial to either confirm the absence of a stream, the presence of a sub-kpc stream, or the existence of a prominent stream as predicted by simulations assuming an in situ evolution. Our results classify NGC 6397 (which has an energy level comparable to that of Omega Centauri) as in situ, but this classification remains highly uncertain given its position near the 0.5 boundary (see Figure 5). Therefore, this does not rule out scenarios where either no tidal stream is present or only a sub-kiloparsec stream exists. Indeed, this aligns with the hypothesis proposed by Boldrini & Vitral (2021) that one possible ex situ scenario for this cluster is that it was initially embedded in a DM minihalo. In this scenario, the minihalo could have acted as a protective envelope, preventing the tidal stripping of stars while being gradually disrupted and removed over the course of the cluster’s evolution. The combination of future releases from Gaia and Euclid will provide a better characterization of the tidal features of this cluster and further constrain its origin (Massari et al. 2025).

Finally, there is a group of 23 GCs whose origin remains unidentified but which have been classified as ‘low-energy’ GCs by Massari et al. (2019) due to their low energy and near-zero angular momentum. Our results indicate that this population has energies lying precisely in the mixing zone (see GCs in grey in Figure 6). The fact that these clusters fall within this energy range further complicates the determination of their origin and, more importantly, suggests the absence of a common progenitor. Besides, we find that 10 of them have an ex situ origin.

thumbnail Fig. 6

Origin of MW GCs: each point represents a GC and is colour-coded according to its origin. GCs are sorted from the most to the least bound in energy. in situ clusters (blue) with black edges are those associated with the MW disk by other studies. We also highlight the six GCs still associated with the Sagittarius dwarf galaxy (magenta), NGC 6397 (blue), and Omega Centauri (bold green), along with its six candidate clusters identified by Pagnini et al. (2025) (light green). Massari et al. (2019) GCs whose origin remains unidentified but which have been classified as ‘low-energy’ GCs are in grey.

3.5 E–Lz decoherence of ex situ GC groups

Our final investigation focuses on the distribution of ex situ GC groups originating from the same satellite, still in phase space ELz, as shown in Figure 8, for our three descriptions of the MW potential over time (see Section 2.1 for definitions of potentials). An important result is that GCs from the same satellite can have different energies, as well as both positive and negative z-components of angular momentum. Furthermore, the boundary proposed by Belokurov & Kravtsov (2024) can separate GCs originating from the same satellite. Upon visual inspection of Figure 8, we observe that the evolving potential tends to break the coherence of GC groups from the same satellite across five MW galaxies in our sample, whose orbits are provided in Figure 7. This decoherence manifests as a diffusion of GC groups towards higher energies, due to the changes in the potential experienced by the ex situ GCs. Unlike in situ GCs, ex situ GCs take longer to adapt to variations in the gravitational potential, resulting in an increase in their energy dispersion. Interestingly, the addition of dynamical friction tends to accelerate this adaptation process, as we observe a slight decrease in the energies of the GCs. A notable effect initiated by dynamical friction is its tendency to align the angular momentum of GCs from the same satellite. Indeed, dynamical friction is a process that removes angular momentum and orbital energy, driving the GCs towards more radial orbits.

To confirm these behaviors across our sample of 198 MWs, we calculate the energy and angular momentum dispersions for each ex situ GC group at z = 0, as shown in Figures 9 and 10, respectively. In Figure 9, for the static potential case (blue), we observe a pronounced peak towards small dispersions. When we account for an evolving potential (orange), the distribution shifts slightly towards higher values, with a flattening of the main peak. This suggests that the evolution of the galactic potential induces a greater energy dispersion for GCs originating from the same satellite. This confirms that variations in the potential well alter the orbits of accreted GCs, as visually observed in the example in Figure 7. The addition of dynamical friction (green) slightly reduces the energy dispersion. As seen in Figure 9, the progressive energy loss of the GCs due to dynamical friction slightly narrows the energy gap between them. In Figure 10, the inclusion of an evolving potential does not affect the Lz distribution of ex situ GCs at z = 0, whereas dynamical friction induces a more pronounced peak around a low Lz dispersion. This indicates that a portion of the GCs tends to remain grouped in Lz. Once again, this is expected, as dynamical friction removes angular momentum from the orbiting GCs.

thumbnail Fig. 7

Orbital evolution of merging satellites from five MW galaxies (IDs 342447, 372754, 372755, 388544 and 392277) in our sample, described by a full potential + dynamical friction. The points mark the formation and merger times of the satellites in the TNG50 simulation.

4 Caveats

In this work, as a first approximation, we tagged ex situ GCs at z = 3 in satellite galaxies and in situ GCs at z = 2 in the MW progenitors. To increase realism, future studies will need to tag clusters at all redshifts, based, for example, on a formation efficiency parameter derived from gas and stellar quantities resolved in the simulation (e.g. as in Pfeffer et al. 2018; Reina-Campos et al. 2022; Grudić et al. 2023; Newton et al. 2025; Rodriguez et al. 2023). However, high-resolution or zoom-in simulations, which typically generate a maximum of 20 MW-like galaxies, are required to calculate these parameters because a gas resolution of the order of 1–10 pc is necessary to obtain reliable information on gas pressure and star formation, which enhance the formation of GCs. Although TNG50 is not as precise, with an average cell size of 70–140 pc in the star-forming regions of galaxies, it allows us to access approximately 200 MW-like galaxies (Pillepich et al. 2019). Although our approach is statistically consistent with the estimated ages of all GCs in the MW, it is not consistent with younger clusters with lower masses (104–5 M). Moreover, tagging clusters at lower redshifts will increase the population of clusters that survive in environments like those of MW-type galaxies. In the same spirit, a tagging method based on a formation efficiency parameter using gas and stellar quantities would provide a more realistic mass for the GCs. However, the goal of our initial approach was to model the average behavior of clusters with masses of the order of 106 M. In our analysis, we only consider satellite galaxies with stellar masses above 107 M. However, lower-mass galaxies, which are not resolved in this study, may be highly efficient sites of GC formation, as indicated by recent results from the EDGE simulations (Taylor et al. 2025). This limitation clearly motivates future investigations using zoom-in simulations that can better resolve low-mass DM halos and their contribution to the GC population.

Another point to improve our approach is to include the presence of the MW’s disk, particularly to account for its dynamical impact on the in situ population. In this work, our MWs are only modelled with DM and stellar spherical components. Although these components have been studied in TNG50, they were not catalogued in the same way as the other components for our MW sample to be incorporated into our study (Sotillo-Ramos et al. 2022). To quantify the impact of the disk on our GC classification, we added a fixed Miyamoto–Nagai potential representing the observed disk at z = 0 to all our TNG50 MWs. This test does not aim to provide a realistic modelling of the disk’s evolution, but rather serves as an upper-limit case to assess how much our results could be affected by this underestimation of tidal disruption caused by the absence of disk in our modelling. The table A.1 shows that both the SVM boundaries and the classification remain largely unaffected by this extreme scenario. This can be explained by two main points: (1) the disk primarily affects the in situ population, and (2) our classification relies on GC survivability rather than their final mass. In fact, we found that in situ GCs lose more mass in the presence of the fixed disk. An alternative to avoid imposing fixed structures (Hernquist bulge, exponential disk, NFW DM halo) and breaking this idealized modelling of the MW via a spherical potential is to use methods like AGAMA (Vasiliev 2019). This would allow us to build a self-consistent galactic potential based on the galaxy’s full distribution function, including non-trivial effects such as local density variations.

5 Conclusion

In this work, we presented a new cosmological post-processing framework to study the hierarchical assembly of GCs in MW-like galaxies. By coupling the cosmological hydrodynamical simulation TNG50 with orbital integrations using galpy, and by including realistic treatments of dynamical friction and mass loss, we followed the evolution of 18 000 GCs – both in situ and ex situ – across 198 MW analogues from redshift z = 3 to z = 0. This approach allowed us to explore, for the first time at this statistical scale, how these two GC populations dynamically evolve and mix within time-evolving galactic potentials shaped by mergers and mass accretion.

We find that in situ GCs predominantly occupy a well-defined region of the normalized energy–angular momentum ELz space, remaining tightly bound and dynamically cold even in the presence of an evolving MW potential. In contrast, ex situ GCs span a broader range in energy and angular momentum, reflecting their varied accretion histories. Importantly, we confirm the existence of a significant mixing zone below the energy threshold E<0.7|Ecirc(rhm)|$\[E<-0.7|E_{\text {circ}}(r_{\text {hm}}^{*})|\]$, where both in situ and ex situ GCs overlap. This undermines previous attempts to cleanly separate the two populations using only the integral of motions via a single cut in energy, which based on our models will not be able to reproduce a large fraction of ex situ clusters and will misclassify many in situ ones.

We proposed a new classification scheme, which allows a probabilistic separation of the two populations and highlights the limitations of binary thresholds. Applying this scheme to the observed MW GC system using Gaia kinematics, we identify 81 GCs as ex situ, 79 as in situ (see Figures 5 and 6). This revised classification leads to a more balanced in situ to ex situ ratio. Notably, we confirm the ex situ nature of Omega Centauri and its associated group of GCs, as well as the Sagittarius GCs, and identify potential misclassifications among clusters previously associated with the Galactic disk.

Explicit modelling of mass loss during orbital integration, using the prescription of Kruijssen et al. (2011), allowed us to find that in situ GCs experience significantly stronger tidal disruption, leading to an average survival fraction of only ~65%, while ex situ GCs are more resilient, with a destruction rate below 1%. As a result, surviving ex situ clusters retain masses closer to their initial values, while many in situ clusters are heavily stripped, in agreement with Renaud et al. (2017). This asymmetry in mass loss between in situ and ex situ GCs may provide an additional observational handle to disentangle the two populations. It could also offer a possible explanation for the extremely metal-poor stellar streams recently identified by Ibata et al. (2024), which have not been linked to any known GC progenitor. Our results suggest a plausible scenario in which such streams may represent the remnants of a disrupted, ancient population of in situ GCs, destroyed during the early phases of the Galaxy’s evolution. We emphasize that this is not a firm prediction, but rather a possible interpretation consistent with the asymmetry in GC mass loss observed in our model.

Furthermore, we studied the loss of orbital coherence among GC groups accreted from the same satellite. We find that the evolving MW potential, combined with dynamical friction, progressively erases the initial correlations in ELz space. This challenges the assumption that GCs accreted from the same progenitor should remain kinematically coherent, even in action space, and calls for caution when associating groups of clusters with past accretion events.

Overall, our results show that the dynamical separation of GC populations requires modelling their full cosmological context, including satellite orbits, potential evolution, and external GC evolution. Our method bridges the gap between large-scale statistical studies and the dynamical modelling of individual clusters, offering new insights into the assembly history of the MW. Future improvements will involve tagging GCs based on more physically motivated formation efficiencies, including baryonic structures like the MW disk, and using higher-resolution MW simulations such as VINTERGATAN (Agertz et al. 2021) and HESTIA (Libeskind et al. 2020) to refine GC initial conditions. Such developments will further enhance our ability to disentangle the complex origin of the Galactic GC system in the Gaia era, as well as with Euclid (Voggel et al. 2025; Saifollahi et al. 2025a,b).

thumbnail Fig. 8

ELz decoherence of ex situ GC groups due to evolving MW potentials and dynamical friction: normalized total energy as a function of the normalized z-component of the angular momentum at z = 0, colour-coded according to their associations with satellites from the same five MW galaxies shown in Figure 7. The captions provide information on the mass ratio between the satellite galaxy and the MW at the time of the merger (left panel), the redshift (middle panel), and the merger time (right panel).

thumbnail Fig. 9

Coherence loss in energy: probability density P(σE) of the energy dispersion for our ex situ GCs in the 198 MW analogues at z=0. We calculate the energy dispersions for each ex situ GC groups. The evolution of the galactic potential induces a greater energy dispersion for GCs originating from the same satellite.

thumbnail Fig. 10

Coherence loss in angular momentum: as in Figure 9, but for the dispersion in angular momentum. Dynamical friction induces a more pronounced peak around a low Lz dispersion. This indicates that a portion of the GCs tends to remain grouped in Lz.

Data availability

The data underlying this article is available through reasonable request to the author. The code and GC data will be available at the following URL: https://github.com/Blackholan.

Acknowledgements

PB acknowledges funding from the CNES post-doctoral fellowship program. PB, PDM and GP are grateful to the "Action Thématique de Cosmologie et Galaxies (ATCG), Programme National ASTRO of the INSU (Institut National des Sciences de l’Univers) for supporting this research, in the framework of the project “Coevolution of globular clusters and dwarf galaxies, in the context of hierarchical galaxy formation: from the Milky Way to the nearby Universe” (PI: A. Lançon). PB thanks members of the Euclid working group, “Extragalactic GCs”, for contributing with useful comments in the early stages of this project. PB wants to express his gratitude to Misha Haywood for suggesting a more impactful title. The IllustrisTNG simulations were undertaken with compute time awarded by the Gauss Centre for Supercomputing (GCS) under GCS Large-Scale Projects GCS-ILLU and GCS-DWAR on the GCS share of the supercomputer Hazel Hen at the High Performance Computing Center Stuttgart (HLRS), as well as on the machines of the Max Planck Computing and Data Facility (MPCDF) in Garching, Germany.

References

  1. Agertz, O., Renaud, F., Feltzing, S., et al. 2021, MNRAS, 503, 5826 [NASA ADS] [CrossRef] [Google Scholar]
  2. Amarante, J. A. S., Debattista, V. P., Beraldo e Silva, L., Laporte, C. F. P., & Deg, N. 2022, ApJ, 937, 12 [NASA ADS] [CrossRef] [Google Scholar]
  3. Andersson, E. P., & Davies, M. B. 2019, MNRAS, 485, 4134 [Google Scholar]
  4. Andersson, E. P., Mac Low, M.-M., Agertz, O., Renaud, F., & Li, H. 2024, A&A, 681, A28 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  5. Arnold, A. D., & Baumgardt, H. 2025, MNRAS, 537, 1807 [Google Scholar]
  6. Balbinot, E., & Gieles, M. 2018, MNRAS, 474, 2479 [NASA ADS] [CrossRef] [Google Scholar]
  7. Baumgardt, H., Hilker, M., Sollima, A., & Bellini, A. 2019, MNRAS, 482, 5138 [Google Scholar]
  8. Beasley, M. A. 2020, in Reviews in Frontiers of Modern Astrophysics; From Space Debris to Cosmology, eds. P. Kabáth, D. Jones, & M. Skarka, 245 [Google Scholar]
  9. Bekki, K., & Freeman, K. C. 2003, MNRAS, 346, L11 [Google Scholar]
  10. Belokurov, V., & Kravtsov, A. 2023, MNRAS, 525, 4456 [NASA ADS] [CrossRef] [Google Scholar]
  11. Belokurov, V., & Kravtsov, A. 2024, MNRAS, 528, 3198 [CrossRef] [Google Scholar]
  12. Binney, J., & Tremaine, S. 2008, Galactic Dynamics, 2nd edn. [Google Scholar]
  13. Boldrini, P., & Vitral, E. 2021, MNRAS, 507, 1814 [NASA ADS] [CrossRef] [Google Scholar]
  14. Bovy, J. 2015, ApJS, 216, 29 [NASA ADS] [CrossRef] [Google Scholar]
  15. Bullock, J. S., & Johnston, K. V. 2005, ApJ, 635, 931 [Google Scholar]
  16. Burkert, A., & Forbes, D. A. 2020, AJ, 159, 56 [Google Scholar]
  17. Callingham, T. M., Cautun, M., Deason, A. J., et al. 2022, MNRAS, 513, 4107 [NASA ADS] [CrossRef] [Google Scholar]
  18. Carraro, G., & Lia, C. 2000, A&A, 357, 977 [NASA ADS] [Google Scholar]
  19. Chen, Y., & Gnedin, O. Y. 2023, MNRAS, 522, 5638 [NASA ADS] [CrossRef] [Google Scholar]
  20. Chen, Y., & Gnedin, O. Y. 2024, Open J. Astrophys., 7, 23 [NASA ADS] [Google Scholar]
  21. Choksi, N., & Gnedin, O. Y. 2019, MNRAS, 488, 5409 [Google Scholar]
  22. Cooper, A. P., Cole, S., Frenk, C. S., et al. 2010, MNRAS, 406, 744 [Google Scholar]
  23. Creasey, P., Sales, L. V., Peng, E. W., & Sameie, O. 2019, MNRAS, 482, 219 [Google Scholar]
  24. Deng, Y., Li, H., Liu, B., et al. 2024, A&A, 691, A231 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  25. Di Matteo, P., Haywood, M., Lehnert, M. D., et al. 2019, A&A, 632, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  26. Diemand, J., Kuhlen, M., & Madau, P. 2007, ApJ, 657, 262 [Google Scholar]
  27. Diemer, B., More, S., & Kravtsov, A. V. 2013, ApJ, 766, 25 [NASA ADS] [CrossRef] [Google Scholar]
  28. Doppel, J. E., Sales, L. V., Nelson, D., et al. 2023, MNRAS, 518, 2453 [Google Scholar]
  29. Dubois, Y., Beckmann, R., Bournaud, F., et al. 2021, A&A, 651, A109 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  30. Forbes, D. A., & Bridges, T. 2010, MNRAS, 404, 1203 [NASA ADS] [Google Scholar]
  31. Gaia Collaboration (Brown, A. G. A., et al.) 2021, A&A, 649, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  32. Grudić, M. Y., Hafen, Z., Rodriguez, C. L., et al. 2023, MNRAS, 519, 1366 [Google Scholar]
  33. Halbesma, T. L. R., Grand, R. J. J., Gómez, F. A., et al. 2020, MNRAS, 496, 638 [Google Scholar]
  34. Helmi, A. 2020, ARA&A, 58, 205 [Google Scholar]
  35. Hernquist, L. 1990, ApJ, 356, 359 [Google Scholar]
  36. Ibata, R., Malhan, K., Tenachi, W., et al. 2024, ApJ, 967, 89 [NASA ADS] [CrossRef] [Google Scholar]
  37. Jean-Baptiste, I., Di Matteo, P., Haywood, M., et al. 2017, A&A, 604, A106 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  38. Khoperskov, S., Minchev, I., Libeskind, N., et al. 2023a, A&A, 677, A89 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  39. Khoperskov, S., Minchev, I., Libeskind, N., et al. 2023b, A&A, 677, A90 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  40. Kim, J.-h., Ma, X., Grudić, M. Y., et al. 2018, MNRAS, 474, 4232 [Google Scholar]
  41. Kravtsov, A. V., & Gnedin, O. Y. 2005, ApJ, 623, 650 [Google Scholar]
  42. Kruijssen, J. M. D., Pelupessy, F. I., Lamers, H. J. G. L. M., Portegies Zwart, S. F., & Icke, V. 2011, MNRAS, 414, 1339 [Google Scholar]
  43. Kruijssen, J. M. D., Pfeffer, J. L., Crain, R. A., & Bastian, N. 2019a, MNRAS, 486, 3134 [NASA ADS] [CrossRef] [Google Scholar]
  44. Kruijssen, J. M. D., Pfeffer, J. L., Reina-Campos, M., Crain, R. A., & Bastian, N. 2019b, MNRAS, 486, 3180 [Google Scholar]
  45. Kruijssen, J. M. D., Pfeffer, J. L., Chevance, M., et al. 2020, MNRAS, 498, 2472 [NASA ADS] [CrossRef] [Google Scholar]
  46. Lahén, N., Naab, T., Johansson, P. H., et al. 2020, ApJ, 891, 2 [CrossRef] [Google Scholar]
  47. Laporte, C. F. P., White, S. D. M., Naab, T., & Gao, L. 2013, MNRAS, 435, 901 [Google Scholar]
  48. Leaman, R., VandenBerg, D. A., & Mendel, J. T. 2013, MNRAS, 436, 122 [Google Scholar]
  49. Lee, Y. W., Joo, J. M., Sohn, Y. J., et al. 1999, Nature, 402, 55 [Google Scholar]
  50. Leon, S., Meylan, G., & Combes, F. 2000, A&A, 359, 907 [NASA ADS] [Google Scholar]
  51. Li, H., Gnedin, O. Y., Gnedin, N. Y., et al. 2017, ApJ, 834, 69 [NASA ADS] [CrossRef] [Google Scholar]
  52. Li, H., Vogelsberger, M., Bryan, G. L., et al. 2022, MNRAS, 514, 265 [NASA ADS] [CrossRef] [Google Scholar]
  53. Libeskind, N. I., Carlesi, E., Grand, R. J. J., et al. 2020, MNRAS, 498, 2968 [NASA ADS] [CrossRef] [Google Scholar]
  54. Lux, H., Read, J. I., & Lake, G. 2010, MNRAS, 406, 2312 [Google Scholar]
  55. Ma, X., Grudić, M. Y., Quataert, E., et al. 2020, MNRAS, 493, 4315 [NASA ADS] [CrossRef] [Google Scholar]
  56. Mackereth, J. T., & Bovy, J. 2020, MNRAS, 492, 3631 [NASA ADS] [CrossRef] [Google Scholar]
  57. Mackey, A. D., & van den Bergh, S. 2005, MNRAS, 360, 631 [NASA ADS] [CrossRef] [Google Scholar]
  58. Majewski, S. R., Patterson, R. J., Dinescu, D. I., et al. 2000, in Liege International Astrophysical Colloquia, 35, Liege International Astrophysical Colloquia, eds. A. Noels, P. Magain, D. Caro, E. Jehin, G. Parmentier, & A. A. Thoul, 619 [Google Scholar]
  59. Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94 [NASA ADS] [CrossRef] [Google Scholar]
  60. Malhan, K., Ibata, R. A., Sharma, S., et al. 2022, ApJ, 926, 107 [NASA ADS] [CrossRef] [Google Scholar]
  61. Massari, D., Koppelman, H. H., & Helmi, A. 2019, A&A, 630, L4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  62. Massari, D., Dalessandro, E., Erkal, D., et al. 2025, A&A, 697, A8 [Google Scholar]
  63. McMillan, P. J. 2017, MNRAS, 465, 76 [NASA ADS] [CrossRef] [Google Scholar]
  64. Meng, X., & Gnedin, O. Y. 2022, MNRAS, 515, 1065 [NASA ADS] [CrossRef] [Google Scholar]
  65. Miyamoto, M., & Nagai, R. 1975, PASJ, 27, 533 [NASA ADS] [Google Scholar]
  66. Mori, A., Di Matteo, P., Salvadori, S., et al. 2024, A&A, 690, A136 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  67. Myeong, G. C., Evans, N. W., Belokurov, V., Sanders, J. L., & Koposov, S. E. 2018, ApJ, 863, L28 [NASA ADS] [CrossRef] [Google Scholar]
  68. Navarro, J. F., Frenk, C. S., & White, S. D. M. 1997, ApJ, 490, 493 [Google Scholar]
  69. Nelson, D., Pillepich, A., Springel, V., et al. 2019a, MNRAS, 490, 3234 [Google Scholar]
  70. Nelson, D., Springel, V., Pillepich, A., et al. 2019b, Computat. Astrophys. Cosmol., 6, 2 [NASA ADS] [CrossRef] [Google Scholar]
  71. Newton, O., Davies, J. J., Pfeffer, J., et al. 2025, MNRAS, 542, 591 [Google Scholar]
  72. Pagnini, G., Di Matteo, P., Khoperskov, S., et al. 2023, A&A, 673, A86 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  73. Pagnini, G., Di Matteo, P., Haywood, M., et al. 2025, A&A, 693, A155 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  74. Park, S.-M., Shin, J., Smith, R., & Chun, K. 2022, ApJ, 941, 91 [Google Scholar]
  75. Petts, J. A., Read, J. I., & Gualandris, A. 2016, MNRAS, 463, 858 [NASA ADS] [CrossRef] [Google Scholar]
  76. Pfeffer, J., Kruijssen, J. M. D., Crain, R. A., & Bastian, N. 2018, MNRAS, 475, 4309 [Google Scholar]
  77. Pfeffer, J. L., Trujillo-Gomez, S., Kruijssen, J. M. D., et al. 2020, MNRAS, 499, 4863 [CrossRef] [Google Scholar]
  78. Pfeffer, J., Lardo, C., Bastian, N., Saracino, S., & Kamann, S. 2021, MNRAS, 500, 2514 [Google Scholar]
  79. Pillepich, A., Springel, V., Nelson, D., et al. 2018, MNRAS, 473, 4077 [Google Scholar]
  80. Pillepich, A., Nelson, D., Springel, V., et al. 2019, MNRAS, 490, 3196 [Google Scholar]
  81. Pillepich, A., Sotillo-Ramos, D., Ramesh, R., et al. 2024, MNRAS, 535, 1721 [NASA ADS] [CrossRef] [Google Scholar]
  82. Ramos-Almendares, F., Sales, L. V., Abadi, M. G., et al. 2020, MNRAS, 493, 5357 [Google Scholar]
  83. Reina-Campos, M., Keller, B. W., Kruijssen, J. M. D., et al. 2022, MNRAS, 517, 3144 [NASA ADS] [CrossRef] [Google Scholar]
  84. Renaud, F. 2020, in IAU Symposium, 351, Star Clusters: From the Milky Way to the Early Universe, eds. A. Bragaglia, M. Davies, A. Sills, & E. Vesperini, 40 [Google Scholar]
  85. Renaud, F., Agertz, O., & Gieles, M. 2017, MNRAS, 465, 3622 [NASA ADS] [CrossRef] [Google Scholar]
  86. Rodriguez, C. L., Hafen, Z., Grudić, M. Y., et al. 2023, MNRAS, 521, 124 [CrossRef] [Google Scholar]
  87. Saifollahi, T., Lançon, A., Cantiello, M., et al. 2025a, A&A, 703, A184 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  88. Saifollahi, T., Voggel, K., Lançon, A., et al. 2025b, A&A, 697, A10 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  89. Sameie, O., Boylan-Kolchin, M., Hopkins, P. F., et al. 2023, MNRAS, 522, 1800 [Google Scholar]
  90. Snaith, O., Haywood, M., Di Matteo, P., et al. 2015, A&A, 578, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  91. Sotillo-Ramos, D., Pillepich, A., Donnari, M., et al. 2022, MNRAS, 516, 5404 [NASA ADS] [CrossRef] [Google Scholar]
  92. Sun, G., Wang, Y., Liu, C., et al. 2023, Res. Astron. Astrophys., 23, 015013 [CrossRef] [Google Scholar]
  93. Taylor, E. D., Read, J. I., Orkney, M. D. A., et al. 2025, Nature, accepted [arXiv:2509.09582] [Google Scholar]
  94. Tsuchiya, T., Dinescu, D. I., & Korchagin, V. I. 2003, ApJ, 589, L29 [NASA ADS] [CrossRef] [Google Scholar]
  95. Tsuchiya, T., Korchagin, V. I., & Dinescu, D. I. 2004, MNRAS, 350, 1141 [NASA ADS] [CrossRef] [Google Scholar]
  96. van den Bergh, S. 2012, ApJ, 746, 189 [Google Scholar]
  97. Vasiliev, E. 2019, MNRAS, 482, 1525 [Google Scholar]
  98. Vasiliev, E., & Baumgardt, H. 2021, MNRAS, 505, 5978 [NASA ADS] [CrossRef] [Google Scholar]
  99. Voggel, K., Lançon, A., Saifollahi, T., et al. 2025, A&A, 693, A251 [Google Scholar]
  100. Weinberger, R., Springel, V., Hernquist, L., et al. 2017, MNRAS, 465, 3291 [Google Scholar]

4

The implementation of the SVM was done using the scikit-learn library in python.

Appendix A Impact of free parameters of our model

Table A.1

Impact of free parameters in our modelling

Appendix B Orbital integrations of satellites

thumbnail Fig. B.1

Orbits of merging satellites from TNG50 and from our orbital integrations in our analytic MW potential. We have re-integrated satellite orbits both to improve the time resolution of orbits that are sometimes highly jagged in TNG50 due to non-linear time evolution, and to incorporate them as "moving potentials" in our full MW potential.

All Tables

Table A.1

Impact of free parameters in our modelling

All Figures

thumbnail Fig. 1

Orbital radius as a function of time for two merging galaxies (IDs 383325 and 299509) with their associated GCs. The points mark the formation and merger times of the satellites from TNG50. During the initial phase, ex situ GCs orbit within the reference frame of their satellite galaxy until their orbital energy becomes positive (E > 0). Once this condition is met, GCs are transferred to the MW’s reference frame while maintaining the moving potential of the satellite galaxy if it has not yet merged.

In the text
thumbnail Fig. 2

Mass evolution of the stellar and DM components of the MW for our full sample of 198 MW analogues from TNG50. Up to 7 Gyr backwards in time, the mean stellar mass of the TNG50 MW analogues is in good agreement with the mass evolution of our Galaxy derived by Snaith et al. (2015).

In the text
thumbnail Fig. 3

In situ populations at z = 0: normalized total energy as a function of the normalized z-component of the angular momentum at z = 0 for all our in situ GCs for the 198 MWs with three different descriptions for the MW environment (see Section 2.1 for the definitions of potentials). The hexagonal bins represent at least 1 GC. The red dashed line represents the Belokurov & Kravtsov (2024) limit.

In the text
thumbnail Fig. 4

Ex situ at z = 0: as in Fig. 3 but for our ex situ GCs for the 198 MWs.

In the text
thumbnail Fig. 5

Comparison with Gaia MW GCs: normalized total energy as a function of the normalized z-component of the angular momentum at z = 0 for both in situ and ex situ GCs for the 198 MWs in the full MW potential + dynamical friction. The decision boundaries in dashed lines corresponding to probabilities of 0.05 (magenta), 0.5 (black), and 0.95 (green) are plotted to illustrate the separation between the two populations. Nephele GCs are the six GCs brought by the progenitor of Omega Centauri identified by Pagnini et al. (2025). The 0.5 boundary results in an error of 1.35% for in situ GCs and 11.2% for ex situ GCs. The extreme boundary at 0.95 (0.05) excludes 99.72% (97.3%) of in situ (ex situ) GCs while identifying 81.2% (76.4%) of ex situ (in situ) GCs. We establish that between these two limits, distinguishing the two populations is not feasible with previous methods due to the presence of a mixing zone. The orange dashed line represents the McMillan version of Belokurov & Kravtsov (2024) limit.

In the text
thumbnail Fig. 6

Origin of MW GCs: each point represents a GC and is colour-coded according to its origin. GCs are sorted from the most to the least bound in energy. in situ clusters (blue) with black edges are those associated with the MW disk by other studies. We also highlight the six GCs still associated with the Sagittarius dwarf galaxy (magenta), NGC 6397 (blue), and Omega Centauri (bold green), along with its six candidate clusters identified by Pagnini et al. (2025) (light green). Massari et al. (2019) GCs whose origin remains unidentified but which have been classified as ‘low-energy’ GCs are in grey.

In the text
thumbnail Fig. 7

Orbital evolution of merging satellites from five MW galaxies (IDs 342447, 372754, 372755, 388544 and 392277) in our sample, described by a full potential + dynamical friction. The points mark the formation and merger times of the satellites in the TNG50 simulation.

In the text
thumbnail Fig. 8

ELz decoherence of ex situ GC groups due to evolving MW potentials and dynamical friction: normalized total energy as a function of the normalized z-component of the angular momentum at z = 0, colour-coded according to their associations with satellites from the same five MW galaxies shown in Figure 7. The captions provide information on the mass ratio between the satellite galaxy and the MW at the time of the merger (left panel), the redshift (middle panel), and the merger time (right panel).

In the text
thumbnail Fig. 9

Coherence loss in energy: probability density P(σE) of the energy dispersion for our ex situ GCs in the 198 MW analogues at z=0. We calculate the energy dispersions for each ex situ GC groups. The evolution of the galactic potential induces a greater energy dispersion for GCs originating from the same satellite.

In the text
thumbnail Fig. 10

Coherence loss in angular momentum: as in Figure 9, but for the dispersion in angular momentum. Dynamical friction induces a more pronounced peak around a low Lz dispersion. This indicates that a portion of the GCs tends to remain grouped in Lz.

In the text
thumbnail Fig. B.1

Orbits of merging satellites from TNG50 and from our orbital integrations in our analytic MW potential. We have re-integrated satellite orbits both to improve the time resolution of orbits that are sometimes highly jagged in TNG50 due to non-linear time evolution, and to incorporate them as "moving potentials" in our full MW potential.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.