Open Access
Issue
A&A
Volume 704, December 2025
Article Number A234
Number of page(s) 17
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361/202453228
Published online 12 December 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

In our current lambda cold dark matter paradigm, the dark matter halos in our Universe grow hierarchically through merging with one another. As the dark matter halos merge, the baryonic matter hosted within them also merges: the galaxies within the dark matter halos merge (e.g. Conselice 2014; Somerville & Davé 2015). These events, especially ones involving galaxies of a similar stellar mass (M), can be violent and cause large disruption to the merging galaxies. Tidal tails can form between the merging galaxies, the interacting systems can become more irregular and asymmetric, ring structures can be created around the galaxies, and galaxies can be transformed from spirals to ellipticals (e.g. Taranu et al. 2013; Rutherford et al. 2024).

These morphological disturbances can be used to identify galaxy mergers and are exploited for visual identification, by both citizen scientists and professional astronomers (e.g. Lintott et al. 2008; Darg et al. 2010a,b; Holwerda et al. 2019; Pearson et al. 2022). The parametric and non-parametric morphological parameters also change for merging galaxies, allowing parameters such as concentration, asymmetry, and smoothness (CAS; Conselice et al. 2000, 2003) as well as Gini and M20 (Lotz et al. 2004, 2008; Snyder et al. 2015; Rodriguez-Gomez et al. 2019) to be used to identify galaxy mergers. Morphological disturbances are not always needed: the close pairs method identifies galaxies that are close on the sky and moving slow enough, with respect to each other, to become gravitationally bound (e.g. Barton et al. 2000; De Propris et al. 2005; Robotham et al. 2014; Rodrigues et al. 2018; Duncan et al. 2019).

More recently, advances in computational power have allowed galaxy mergers to be identified using machine learning techniques. Convolutional neural networks (CNNs) have been used to replicate visual identification (e.g. Ackermann et al. 2018; Bottrell et al. 2019; Pearson et al. 2019a, 2022; Walmsley et al. 2019; Ćiprijanović et al. 2020b, 2021; Wang et al. 2020; Bickley et al. 2021, 2022). Much like traditional visual classification, this has relied on identifying morphological disturbances and faint features of the merger to produce reliable merger identifications (e.g. Pearson et al. 2019b, 2022). Machine learning methods can also be trained with morphological parameters of galaxies (Snyder et al. 2019; Pearson et al. 2022; Guzmán-Ortega et al. 2023; Margalef-Bentabol et al. 2024) and photometry (Suelves et al. 2023).

The different methods of identifying galaxy mergers identify different stages of mergers. The close pairs method typically identifies pre-mergers while morphological parameters are good at identifying post-mergers; galaxies that are identified with both methods are then typically found to be ongoing mergers (Desmons et al. 2023; Wilkinson et al. 2024). Machine learning methods trained with galaxy mergers identified with these methods are likely to similarly select pre- or post-mergers. Studies have also used machine learning techniques to specifically identify post-merging galaxies (Bickley et al. 2021, 2022; Wilkinson et al. 2024). Identifying non-mergers, pre-mergers, and post-mergers with machine learning has also been studied, although with less effectiveness than non-merger and merger identification (Ferreira et al. 2020; Margalef-Bentabol et al. 2024).

Further refinement to determine the precise time before or after a merger has also been conducted (Koppula et al. 2021; Pearson et al. 2024). Here, merging galaxies from cosmological simulations with known times before or after a merger event are used to train deep neural networks. Images of these galaxies have been combined with their physical properties (Koppula et al. 2021) or used on their own (Pearson et al. 2024, hereafter Paper I). These approaches do not attempt to identify galaxy mergers, only determine the merger time for pre-identified merging galaxies. The merger time prediction only for post-merger galaxies has also been conducted with coarse time resolution, again using simulations to train deep learning networks (Ferreira et al. in prep.).

The interactions of the galaxies, and their morphological changes, drive material around within the galaxies themselves. If this movement of dust and gas is driven into the centre of a galaxy, it can trigger a merger-induced active galactic nucleus (AGN; Keel et al. 1985; Silverman et al. 2011; Hwang et al. 2012; Weston et al. 2017; Ellison et al. 2019; Gao et al. 2020; Silva et al. 2021; Bickley et al. 2023; La Marca et al. 2024). It is also seen to decrease the metallicity of the gas in the centre of galaxies (Ellison et al. 2008; Rupke et al. 2010; Scudder et al. 2012; Bustamante et al. 2018; Cortijo-Ferrero et al. 2017; Sparre et al. 2022).

Tidal forces created by galaxy mergers can also cause the gas in the interacting galaxies to be compressed and shocked. This can result in an increased star formation rate (SFR) in merging galaxies, in some cases to such a degree that they be considered to be starbursts (e.g. Ellison et al. 2008; Hwang et al. 2011; Scudder et al. 2012; Ellison et al. 2013a; Patton et al. 2013; Renaud et al. 2022; Asada et al. 2024). It has been shown that the higher a galaxy’s SFR is, at fixed M, the more likely it is to be undergoing a merger (Pearson et al. 2019a). However, the typical increase in the SFR caused by mergers is low, approximately a factor of two, and some works argue that there is no increase in the SFR during merger events (Knapen et al. 2015; Shah et al. 2022). This low, or lack of, SFR enhancement may be a result of the environment in which the merging galaxies lie, with galaxy mergers in higher-density environments found to have lower SFR enhancement (He et al. 2024).

This difference in SFR enhancement found in different studies may be a result of different research using different selections of galaxy mergers. Studies using pair-selected galaxy mergers, which are pre-mergers, find correlations between the SFR and the separation between the galaxies: smaller separations have a higher SFR (Patton et al. 2013; Shah et al. 2022; Brown et al. 2023). Although, again, there is evidence of a reduced SFR in close pairs where one galaxy is elliptical with this suppression stronger for smaller galaxy separations (Feng et al. 2024). The SFR enhancement in post-mergers is consistent with SFR enhancement of the closest pre-mergers (Bickley et al. 2022). Bickley et al. (2022) also found that if their neural-network-selected post-mergers have a larger SFR enhancement, the galaxy is more likely to be a recent post-merger, possibly indicating a reduction in SFR enhancement with time for post-merger galaxies. Post-merger galaxies are found to have younger stellar populations compared to non-mergers (Reeves & Hudson 2024). There is also evidence that post-mergers undergo a rapid SFR decline after their recent burst of star formation (Ellison et al. 2022).

This apparent link between SFR enhancement and separation distance for pre-mergers as well as SFR enhancement and confidence in identifying post-mergers suggests there is a temporal link between SFR enhancement and the time before or after a merger event. Indeed, this is seen in simulations. Post-merger galaxies in the IllustrisTNG simulations (Weinberger et al. 2017; Marinacci et al. 2018; Naiman et al. 2018; Nelson et al. 2018; Pillepich et al. 2018a,b; Springel et al. 2018) have a decreasing SFR as the time since merger increases (Hani et al. 2020). Merging galaxies in the FIRE-2 zoom-in simulations (Hopkins et al. 2018) have increased SFR at first and second peri-centric passages, increased SFR enhancement as the time to coalescence decreases, followed by a decrease in the SFR as the time after the merger event increases (Moreno et al. 2019). Such a study is not trivial with observations when considering both pre-mergers and post-mergers. Due to the merger timescale of over a billion years, we cannot watch real galaxies merge and trace their SFR enhancement through the pre-merger and post-merger phases. The time before or after a merger event is difficult to determine and has only been done twice (Koppula et al. 2021; Pearson et al. 2024), and primarily for simulated galaxies.

In this work we aim to produce a statistical understanding of how the SFR changes as a merger progresses. We trained a neural network to determine the time before or after a merger event for merging galaxies using images and merger times from IllustrisTNG simulations. This network was trained such that is is applicable to data from the Kilo Degree Survey (KiDS; de Jong et al. 2013a,b). From this, generated a statistical SFR enhancement as a function of time before and after a merger event. This was done by comparing merging galaxies to non-merging galaxies where the primary difference was if a galaxy was merging or not: we assumed that the merging galaxies were randomly sampled from similar non-merging galaxies.

The paper is structured as follows. Section 2 describes the IllustrisTNG and KiDS data and Sect. 3 describes our neural network. Our results are presented and discussed in Sects. 4 and 5, respectively. We conclude in Sect. 6. Where necessary, we follow the Planck2015 cosmology (Planck Collaboration XIII 2016).

2. Data

2.1. IllustrisTNG

The time before or after a merger event is difficult to determine for real galaxies. However, these times are better known for galaxies in simulations, although the precision is subject to the temporal resolution of the simulation. In this work, we use galaxy mergers from IllustrisTNG 100-1, with known times before or after a merger event (merger time), to train our neural network.

2.1.1. Simulation details

IllustrisTNG’s TNG100-1 is the high-resolution run of the TNG100 series of simulations and simulates both dark and baryonic matter. TNG100-1 simulates 18203 dark matter particles within a box of 75 000 ckpc/h per side, and uses Planck2015 cosmology. The dark matter particles in this simulation have a mass of ≈5 × 106 M/h and the gas cells have an average mass of ≈9 × 105 M/h.

The simulation begins at z = 127 and end at z = 0, with the time between snapshots varying between 30 Myr and 234 Myr; larger times between snapshots are typically found at lower redshifts. The galaxies found in IllustrisTNG are found to be morphologically similar to observed galaxies; relations between M and morphologies, shapes, and sizes are found be be within 1σ of the observed trends (Rodriguez-Gomez et al. 2019). On the other hand, at fixed M, spiral galaxies are not found to be larger than elliptical galaxies, unlike observed galaxies, and the morphology-colour relation is not strong (Rodriguez-Gomez et al. 2019).

2.1.2. Mergers

To select the major merging galaxies from this simulation, we followed Paper I. A merging galaxy is initially defined as a galaxy that has merged in the last 500 Myr (post-merger) or will merge in the next 1 Gyr (pre-merger). Throughout this paper, the ‘merger time’ can be considered to be the time after the merger event. Thus, a merger time of 0 Myr is the merger event, positive merger times are post-mergers, and negative merger times are pre-mergers. These times are initially derived from the time between the snapshot that a galaxy is ‘observed’ in the simulation and the ‘snapshot of merger’. The snapshot of merger is the snapshot where multiple galaxies in earlier snapshots are tracked as a single galaxy for the first time using the Sublink_gal merger trees (Springel et al. 2005; Boylan-Kolchin et al. 2009; Rodriguez-Gomez et al. 2015). We define a major merger in the simulation to be a merger between at least two galaxies where the ratio between the stellar masses of the two most massive galaxies is < 1 : 4 (i.e. the larger galaxy has M no more than four times larger than the M of the smaller galaxy) in the snapshot where the second most massive galaxy reached its maximum M (Rodriguez-Gomez et al. 2015). We also required that the baryonic component (stars + star-forming gas) of the merging galaxy’s subhalo be at least 1 × 109 M/h.

2.1.3. Merger sample and times

As with Paper I, we selected merging galaxies from snapshots 87 to 93, inclusive, which correspond to redshifts between 0.07 and 0.15. The upper limit was selected to match the upper redshift limit of our KiDS merging galaxy catalogue (Pearson et al. 2019a). The lower limit allows the full 1 Gyr pre-merger requirement to elapse before z = 0 in the IllustrisTNG simulations, allowing a more complete pre-merger sample. If a galaxy is found to be a merger at any merger stage in more than one snapshot, the same galaxy will be selected in each snapshot that it is found to be a merger. This sample selection resulted in 6139 major mergers from TNG100-1.

To increase the precision of the merger times, simple gravity simulations were applied to each major merger selected, again following Paper I. Here, each galaxy with a mass ratio < 1 : 10 of the most massive galaxy in the merger was treated as a point mass in 3D space and allowed to move only under the gravity of the other galaxies in the merger. Galaxies with a mass ratio < 1 : 10 were included to allow the smaller galaxies to influence the merger time. A softening length of 0.5 ckpc/h was used to match TNG100-1. The simulation was run between the snapshot of merger and the previous snapshot with a time-step of 0.001 Myr. The merger was considered complete when the major galaxies were first at their closest approach. If a merger contains more than two major galaxies, the merger was considered complete the first time the first two major galaxies are at their closest for pre-mergers and the first time the last two major galaxies are at their closest for post-mergers. The resulting merger times were rounded to the nearest million years. We note that as these simple simulations are only considering gravity, they do not accurately reproduce the final positions of the galaxies in the IllustrisTNG simulations. If the simple gravity simulations did not produce a closest approach, the merger was removed from further study. We also removed galaxies that are classified as pre-mergers and as post-mergers to avoid ambiguity. This further refinement of our sample selection left 3321 major mergers from TNG100-1.

After initial testing, we found that using the full 1 Gyr pre-merger time range produces poor results. This is consistent with literature in which mergers more than 500 Myr before a merger event are shown to be difficult to identify and classify (de Graaff et al. 2025). Instead, we reduced the pre-merger time scale to 500 Myr. The left a sample of 2213 major mergers from TNG100-1 that we used for further analysis.

2.1.4. Merger images

Four channel images were created of the 2213 major mergers from the IllustrisTNG simulation. To do this, each stellar particle had a spectral energy distribution derived from the stellar properties of the particle and its corresponding Bruzual & Charlot (2003) stellar population model. The resulting spectra were passed through the KiDS u, g, r, and i filters to create a smoothed 2D image in each band (Rodriguez-Gomez et al. 2019). We matched the pixel resolution of the IllustrisTNG images to that of KiDS, 0.2 arcsec/pixel, and create four channel images of 128 × 128 pixels centred on the merging galaxy. The images of the galaxies were then made such that the galaxies appears to lie at the redshift of the snapshot the galaxy was selected from. This means that the physical to angular size relation and dimming are correct for the redshift of the snapshot of each galaxy. As with Paper I, we do not require the companion galaxy of the pre-mergers to be within the image. The application of radiative transfer to simulated galaxy images has been shown to not cause large changes in galaxy morphology (Rodriguez-Gomez et al. 2019) or notably change the performance of neural networks in classifying galaxy mergers (Bottrell et al. 2019). Therefore, we elect not to calculate radiative transfer on our sample of IllustrisTNG galaxies, saving computational time. Three images of each merging galaxy were created, viewed from along the x, y, and z axes of the IllustrisTNG simulations. This increases the number of merging galaxy images by a factor of three, providing 6639 images for further use.

To train a neural network to be applied to real KiDS images, the IllustrisTNG images of simulated galaxies need to be processed to resemble real KiDS observations as closely as possible (e.g. Bottrell et al. 2019; Pearson et al. 2019b; Ćiprijanović et al. 2020a; Domínguez Sánchez et al. 2023). Within each KiDS survey tile that has a source catalogue, we determined the average separation between the bright KiDS sources (r-mag < 20; Bilicki et al. 2021) at z < 0.3. Random positions that are at least the average separation away from the bright sources at z < 0.3 were found. Using these random positions, 500 cut-outs that were 128 × 128 pixels in size and did not contain masked pixels in any KiDS band were made in each of the 1001 KiDS tiles. This resulted in 500 500, ugri cut-outs being created. This allows background galaxies in the cut-outs but avoids bright sources at similar redshifts to our IllustrisTNG galaxies. For each IllustrisTNG image, we randomly select a cut-out, convolve each band of the IllustrisTNG images with their respective point spread function (PSF) of the tile the cut-out was made in, and place the IllustrisTNG galaxy into the KiDS cut-out to create the mock ugri KiDS images. As we require the PSF, we only use cut-outs that have a well defined PSF in all four u, g, r, and i KiDS bands. We then created segmentation maps for each band of the mock-KiDS images using photutils (Bradley et al. 2024). If sources could not be found in the segmentation map with photutils, the pixel values of the IllustrisTNG images, without KiDS noise or PSF convolution, were increased by a factor of two, and the process to create a KiDS-like image repeated. If this IllustrisTNG image scaling was done four times and creation of the segmentation map still failed, the merger image was rejected. Scaling the images in this way effectively increases the brightness of the galaxies, increasing the signal-to-noise, but does not bias the training. This left all 6639 KiDS-like IllustrisTNG images. An example of a merging IllustrisTNG galaxy, and its segmentation maps, can be found in Fig. 1.

thumbnail Fig. 1.

Randomly selected example IllustrisTNG image after processing to appear like a KiDS image. The target galaxy is in the centre of the image. The left column shows the u, g, r, and i band images, as labelled, with arcsinh scaling applied twice. The right column shows the associated segmentation maps for each band, with each colour representing a different segment. Each panel is 25.6 × 25.6 arcsec (128 × 128 pixels), corresponding to a physical size of 53 × 53 kpc at the redshift of the IllustrisTNG galaxy (z = 0.11).

2.2. KiDS

KiDS is a wide-field, multi-band survey performed with the OmegaCAM of the VLT Survey Telescope (Capaccioli & Schipani 2011; Kuijken 2011; Capaccioli et al. 2012). It covers over 1000 square degrees in the u, g, r, and i bands and covers the same survey area as the VISTA Kilo-degree Infrared Galaxy survey (VIKING; Edge et al. 2013). In this work, we use the latest KiDS data release 4 (DR4; Kuijken et al. 2019). These data have average 5σ limiting magnitudes of 24.13, 25.12, 25.02, and 23.68 Mag and average PSF full-width at half-maximum of 0.58, 0.54, 0.46, and 0.49 arcsec for the u, g, r, and i bands, respectively.

2.2.1. Merger and control selection

We identified galaxies in KiDS by using the star-galaxy flags provided in DR4. We required that galaxies have a CLASS_STAR value less than or equal to 0.05, SG2DPHOT value of 0, and a SG_FLAG of 1. We also required that all galaxies have 5σ detections in the u, g, r, and i bands. We selected galaxy mergers from these KiDS galaxies by selecting all KiDS galaxies where both the peak of the photometric redshift posterior (zB) and the maximum likelihood redshift (zML) are less than 0.15. We also require that a 256 × 256 pixel, r-band cut-out can be made without filling empty space (i.e. the galaxy’s position must be more than 128 pixels from the edge of the r-band image). The resulting 409 934 galaxies were then passed through the CNN of Pearson et al. (2019a). Briefly, this network was trained to identify galaxy mergers in KiDS r-band images using galaxies identified as merging by the GAMA-KiDS Galaxy Zoo project (Holwerda et al. 2024) as well as being loosely identified as merging via a loose r-band asymmetry-smoothness cut where mergers lie above A = 0.35S + 0.02 (Conselice et al. 2003). The Pearson et al. (2019a) network finds 97 891 galaxy merger candidates.

The galaxy merger sample was likely to be highly contaminated (e.g. Bickley et al. 2021; Pearson et al. 2022). As the size of the merger sample is too large to be able to be visually checked, we elected to select merging galaxies where the Pearson et al. (2019a) catalogue’s merger probability is greater than 0.99; that is, the galaxies that the neural network is the most confident are merging. We also limited the sample to redshifts between 0.07 and 0.15 and log(M) > 9, to ensure our sample is above the lower mass limit of the IllustrisTNG training data. This M limit is also approximately 0.8 dex above the expected KiDS-VIKING mass completeness limit of log(M) ≈ 8.2 at z = 0.15 (Wright et al. 2019). This results in a sample of 10 448 merging KiDS galaxies.

To be able to determine the change in the SFR due to galaxy mergers, we required a control sample of non-merging galaxies. We selected non-merging galaxies to be those where the Pearson et al. (2019a) catalogue’s merger probability is less than 0.05, providing 240 129 non-merging KiDS galaxies. Each merging galaxy had ten non-merging control galaxies selected, with replacement, which match closely in redshift, local density, and M (Ellison et al. 2013b). We assume the merging galaxies are randomly sampled from the entire galaxy population where these physical properties are similar. These non-merging galaxies were required to have redshifts within 0.02, environmental density (Sect. 2.2.2) within 0.02, and log(M) within 0.02 of the merging galaxies. Of our merger sample, 10 029 merging galaxies could be matched to 10 unique non-merging galaxies, and were used for further study. If more than 10 non-mergers matched to a merging galaxy, 10 matching non-mergers were randomly selected.

The merger sample was further reduced to a subsample of galaxies for which 128×128 pixel cut-outs can be made in all four KiDS bands without filling empty space. These cut-outs could not contain any masked r-band pixels. We created segmentation maps of these images, again using photutils, and rejected galaxies that could not be detected in the segmentation map or whose segmentation map caused errors in photutils when we attempted to deblend it into its components. This leaves 5897 merging galaxies whose sources can be detected and who have at least 10 non-merging control matches. An example of a merging KiDS galaxy, and its segmentation maps, can be found in Fig. 2.

thumbnail Fig. 2.

Randomly selected example KiDS image of a merging galaxy. The target galaxy is in the centre of the image. The left column shows the u, g, r, and i band images, as labelled, with arcsinh scaling applied twice. The right column shows the associated segmentation maps for each band, with each colour representing a different segment. Each panel is 25.6 × 25.6 arcsec (128 × 128 pixels), corresponding to a physical size of 44 × 44 kpc at the redshift of the galaxy (z = 0.09).

2.2.2. Environmental density

To determine the environmental density, we followed Santos et al. (2021) and used the density relative to the distance of the tenth nearest neighbour:

Σ 10 = 10 θ 10 2 , $$ \begin{aligned} \Sigma _{10} = \frac{10}{\theta _{10}^{2}}, \end{aligned} $$(1)

where θ10 is the angular projected distance to the tenth nearest neighbour, in megaparcecs. The tenth nearest neighbour was determined within a redshift bin centred on the maximum likelihood redshift (zML) of the object being studied and with half width (σbin) determined by the uncertainty in the photo-z estimate. That is:

σ bin = ( z B , max z B , min ) ( 1 + z ML ) 2 , $$ \begin{aligned} \sigma _{bin} = \frac{(z_{B, \mathrm{max}} - z_{B, \mathrm{min}})*(1+z_{ML})}{2}, \end{aligned} $$(2)

where zB, max and zB, min are the upper and lower bounds of the 68% confidence interval of zB.

Galaxies close to the edges of a survey will have an underestimated local density due to the missed galaxies outside of the survey area. To correct for this, we again followed Santos et al. (2021). Edge galaxies were selected where θ10 is larger than the distance to the edge of the survey. A circular area with radius θ10 was calculated (A10) and the fraction of this area that is outside the survey (x) determined. Thus, the area (1-x)A10 contains ten galaxies. If uniform density is then assumed, we can calculate the number of galaxies (n) that should be in the survey coverage of A10 if θ10 was the distance to the true tenth nearest neighbour as:

n = 10 ( 1 x ) . $$ \begin{aligned} n = 10(1-x). \end{aligned} $$(3)

The true θ10 can then be estimated by calculating n, rounding to the nearest integer, and taking the nth nearest neighbour projected distance to be θ10. The robustness of this method is discussed in Santos et al. (2021, Appendix B).

2.2.3. Star formation rates

The SFR (and M) of the KiDS galaxies were derived using Code Investigating GALaxy Emission (CIGALE; Boquien et al. 2019). For this, we used an exponentially declining star formation history with an exponential burst and Bruzual & Charlot (2003) stellar population model with the Chabrier (2003) initial mass function. For the dust, we used a modified Charlot & Fall (2000) attenuation model and the Draine et al. (2014) emission model. We also include the Fritz et al. (2006) AGN model. The full setup can be found in Appendix A.

We used CIGALE to fit these models to the four KiDS optical bands (u, g, r, and i) and the five VISTA Kilo-degree Infrared Galaxy Public Survey (VIKING; Edge et al. 2013) infrared bands (z, y, J, H, and Ks). While far-infrared data are useful to constrain the SFR, these data are only available in parts of the Northern KiDS field from the Herschel atlas (Eales et al. 2010; Shirley et al. 2019). To not limit our sample, we elect to not require far-infrared data with the understanding that the SFRs of dusty galaxies could be particularly unreliable. The mean (median) error on our log(SFR/M yr−1) estimates is 0.61 (0.61). Further discussion on our SFR (and M) values can be found in Appendix B.

3. Neural network

In this work, we trained a neural network with IllustrisTNG data to predict the merger time that will be applied to KiDS images. Generally, this domain switching cannot be done with machine learning unless the tool has been trained to work in the target domain. Hence the work performed to make the IllustrisTNG images look like KiDS images. Our neural network uses the best performing architecture from Paper I, a CNN. In short, the network begins with six convolutional layers, each using a stride of 1 and ‘same’ padding, with each convolutional layer followed by batch normalisation, dropout with a rate of 0.2, and 2 × 2 max pooling. These convolutional layers have 32, 64, 128, 256, and 1024 square filters with 6, 5, 3, 3, and 2 pixels on a side. The convolutional layers are followed by fully connected layers of 2048, 512, and 128 neurons. Each fully connected layer is followed by batch normalisation and dropout with a rate of 0.1. All layers in the feature extractor and classifier use Rectified Linear Units (ReLU; Nair & Hinton 2010) activation.

The input to the network is an eight channel, 128 × 128 pixel image. Four channels are the u, g, r, and i band images while the other four are segmentation maps, one for each of the bands. Tests were performed training the network with all 15 combinations of the four bands (image plus segmentation map), and using u, g, r, and i proved to be the most reliable. Following Paper I, the image channels are arcsinh scaled twice and then normalised between 0 and 1. The segmentation maps are linearly scaled between 0 and 1. The output is a single neuron with sigmoid activation, where the output is a linear scaling of the merger time: 0 is 500 Myr and 1 is −500 Myr.

The IllustrisTNG merger sample was split into three groups: training with 5286 images (1762 galaxies), validation with 672 images (224 galaxies), and test with 681 images (227 galaxies). The training data was used to train the CNN while the validation was used to trace how well the CNN was functioning during training. The test set was used once, and only once, after training was completed to provide information on how well the CNN was performing on unseen data. We enforced that a merger tree cannot be split between the training, validation, and test sets so galaxies from one merger tree are only found in one of the three sets. This prevents the quality of the CNN from being overestimated due to the network learning to interpolate along a merger tree in a way that does not apply to unseen data.

The training was performed using the normalised merger time, calculated as

normalised merger time = 1 × ( merger time / Myr ) + 500 1000 , $$ \begin{aligned} \mathrm{normalised~merger~time} = -1\times \frac{(\mathrm{merger~time / Myr} )+500}{1000}, \end{aligned} $$(4)

meaning normalised merger tim es of 0 and 1 correspond to merger times of 500 Myr and −500 Myr, respectively. The true normalised merger times were compared to the predicted merger times using mean squared error (MSE) loss. We used the Adam optimiser (Kingma & Ba 2015) with an initial learning rate of 5e-5 and ran training for 1000 epochs. The code to train the network and the trained model are available on GitHub1.

4. Results

4.1. Neural network

The final CNN, chosen as the epoch with the lowest validation loss, has a MSE of 0.067. When then applied to the previously unseen test data, the MSE reduced slightly to 0.058. The predicted merger times against the true merger times of the test data set can be found in Fig. 3. The test result merger time had a mean difference between the true and predicted merger times (mean error) of 182 Myr and a median difference between the true and predicted merger times (median error) of 145 Myr.

thumbnail Fig. 3.

Predicted merger times of the test data against the true merger times. The colour corresponds to the number density from low (purple) to high (red). The red line indicates a one-to-one relation and the MSE for the normalised times is shown in the bottom right. Dark brown diamonds show the median true merger times and median predicted merger times in predicted merger time bins with widths of 200 Myr; x errors are the median absolute deviation of the true times in the bin and y errors are the median absolute deviation of the predicted times.

We binned the predicted merger times into bins of width 200 Myr and present the median true merger times and median predicted merger times in Fig. 3 as dark brown diamonds. The true time for the mergers with predicted merger time > 0 Myr tend towards 240 Myr, close to the mid of the post-merger time range (250 Myr). The mergers with a true merger time < 0 Myr appear to be reasonably well reproduced. Therefore, the CNN is able to reasonably assign merger times to the pre-merger galaxies, but struggles with post-merger and assigns them merger times centred close to the median post-merger time. We also calculated the mean and median error of the predicted merger times in each bin. These errors are presented in Table 1.

Table 1.

Mean and median predicted merger time errors within predicted merger time bins.

4.2. KiDS merger time

We applied our neural network to our sample of 5897 merging galaxies described in Sect. 2.2.1. We present these merger times in Table 2 and their distribution in Fig. 4. We note that the merger times can be calculated from the normalised merger times predicted by the neural network by inverting Eq. 4; that is to say,

merger time / Myr = 1 × ( normalised merger time × 1000 ) 500 . $$ \begin{aligned} \mathrm{merger~time} / \mathrm{Myr} = -1\times (\mathrm{normalised~merger~time} \times 1000)-500. \end{aligned} $$(5)

Table 2.

Predicted merger times for KiDS merging galaxies

thumbnail Fig. 4.

Distribution of predicted merger times for KiDS merging galaxies. Most of the KiDS mergers are pre-mergers, with merger times less than 0 Myr.

The majority of our mergers have merger times less than zero, implying our KiDS mergers are primarily pre-mergers. Examples of KiDS mergers with their predicted merger times can be found in Fig. 5.

thumbnail Fig. 5.

Sixteen KiDS merging galaxies with their associated predicted merger times (time after a merger). The shown images are r-band images with arcsinh scaling applied twice. The galaxy in the centre of the image is the galaxy whose merger time is shown. Each panel is 25.6 × 25.6 arcsec (128 × 128 pixels), corresponding to a physical size of 53 × 53 kpc at the median redshift of the KiDS merger sample (z = 0.11).

In Fig. 5, panels a to i show mergers with predicted merger times < 0 Myr. Panel i is the closest to coalescence, only 4 Myr before, has large disturbances but does not show a secondary nuclei. This may suggest that this galaxy is actually recent post merger. All the remaining panels, except panel b, show evidence of a double nuclei or a close companion indicating that these are all pre-mergers. Where a secondary galaxy is present, in panels a, c, e, f, and g, there is no clear trend of decreasing separation with decreasing merger time. This would indicate that we see mergers with different approach speeds or galaxies potentially before and after a close passage. However, the lack of large disturbances for galaxies with smaller merger times but larger separations suggests that we are not primarily seeing galaxies after a close passage in this figure. Panel b has no clear secondary galaxy but show clear distortions in the spiral arms. It is likely that this merger has a predicted merger time that is under-estimated, as it appears similar to what we would expect a merger with positive merger time to look like. It is also possible that the secondary galaxy is outside the cut-out or hidden directly behind the primary galaxy.

The remaining panels in Fig. 5 show mergers with predicted merger times > 0 Myr. These galaxies all show signs of disturbance but lack a companion or obvious, separated double-nuclei. All seven galaxies, however, show signs of disturbance which appears to be reduced as the time since merger increases. The exception is panel j, that while has disturbances does not have strong disturbances as would be expected so close to coalescence.

4.3. SFR as a function of merger time

We determined the change in the SFR of the mergers compared to the non-mergers by subtracting the median log(SFR/M yr−1) of the ten non-merging control galaxies from the log(SFR/M yr−1) of the merging system. That is to say,

Δ SFR = log ( SFR mgr M yr 1 ) median [ log ( SFR nmg M yr 1 ) ] , $$ \begin{aligned} \Delta \mathrm{SFR} = \log \bigg (\frac{\mathrm{SFR}_{mgr}}{M_{\odot }\,\mathrm{yr}^{-1}}\bigg ) - \mathrm{median}\bigg [ \log \bigg (\frac{\mathrm{SFR}_{nmg}}{M_{\odot }\,\mathrm{yr}^{-1}}\bigg ) \bigg ], \end{aligned} $$(6)

where ΔSFR is SFR enhancement, SFRmgr is the SFR of the merging galaxy and SFRnmg is the SFR of a control galaxy. The mean (median) uncertainty on ΔSFR, derived from the uncertainties in the SFR, is 0.60 (0.59). We can then plot the ΔSFR as a function of merger time in Fig. 6. As can be seen, the merging galaxies appear to primarily have an enhanced star formation rate (ΔSFR > 0).

thumbnail Fig. 6.

ΔSFR as a function of merger time. The green circles show the μ from the Gaussian fitting of the mergers. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty (y axis). The colour corresponds to the number density from low (purple) to high (red). The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie. The yellow horizontal line indicates ΔSFR = 0.

To trace the change in ΔSFR with merger time, we binned the galaxies by merger times with bins with a width of 200 Myr. For bins with more than ten galaxies, we attempted to fit a Gaussian distribution to their ΔSFR with Scipy curve_fit (Virtanen et al. 2020). An example of the Gaussian distribution can be found in Fig. 7 (upper panel). We over-plot the mean of these Gaussian distributions on Fig. 6 as green circles.

thumbnail Fig. 7.

Upper panel: Example of Gaussian fitting for the −100 Myr ≤ merger time < 0 Myr bin. Lower panel: Gaussian fitting for all five merger time bins where the fit succeeded. The Gaussian distributions for the bins centred on −400 Myr (solid purple), −200 Myr (dotted dark blue), 0 Myr (solid light blue), 200 Myr (dash-dotted green), and 400 Myr (dashed red) are shown.

The lower panel of Fig. 7 shows the fitted Gaussian distributions for all merger time bins. As can be seen, the μ of the Gaussian distributions increases as the merger time increases from −200 Myr to 200 Myr, before decreasing as it moves to 400 Myr. The μΔSFR take the values of 0.30 ± 0.02, 0.35 ± 0.02, 0.38 ± 0.01, and 0.30 ± 0.01, for the bins centred on −200 Myr, 0 Myr, 200 Myr, and 400 Myr, respectively. The μΔSFR is highest in the bin centred on −400 Myr, at 0.41 ± 0.01. Unsurprisingly, the amplitude of these distributions follows the distribution of merger times in Fig. 4.

In short, from 300 Myr before a merger we see an increase in ΔSFR as the merger approaches coalescence. This increase continues to increase into the post-merger regime before reducing again in late-stage mergers.

4.4. Dependence on galaxy properties

4.4.1. Stellar mass

We binned the merging galaxies by M from log(M/M) = 9.0 to log(M/M) = 10.0 in steps of 0.5 log(M/M) and one bin for log(M/M)≥10.0. For each mass bin, we fitted a Gaussian distribution, as is described in Sect. 4.3, and show these results in Fig. 8.

thumbnail Fig. 8.

ΔSFR as a function of merger time for mergers binned by M. The purple circles, blue triangles, and green squares show the μ from the Gaussian fitting of the binned merger times with masses of 9.0 ≤ log(M/M) < 9.5, 9.5 ≤ log(M/M) < 10.0, log(M/M)≥10.0, respectively. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty (y axis). Empty markers show the results using re-sampled merger times (see text), and the y axis errors are the standard deviation of ΔSFR from these resamplings. The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie.

Due to the large uncertainties of the merger times possibly impacting our results, we re-sampled the merger times of the KiDS galaxies. This was done by randomly selecting merger times from a Gaussian distribution centred on the predicted merger time from the CNN and with a standard deviation of 145 Myr (the median error) or the median merger time error in the merger time bin containing the galaxy (see Sect. 4.1), whichever is larger. Any galaxy whose new merger time is greater than 500 Myr or less than −500 Myr is then removed from further analysis. As with the CNN merger times, we fitted a Gaussian distribution, in the manner described in Sect. 4.3. This resampling is continued until the average value of the means of the Gaussians for each merger time bin in every M bin does not change by more than 0.001. When this criteria is met, the ΔSFR in each bin is assumed to have converged. These averages are presented in Fig. 8 as empty markers.

There is a general trend of increasing ΔSFR with increasing M, with the exception of the high-mass mergers (log(M/M)≥10.0) at merger times of greater than 300 Myr. The ΔSFR of the high-mass mergers decreases as the merger time increases from −500 Myr to 500 Myr. The decrease in ΔSFR is gentler for mergers with merger times of less than 0 Myr, with the changes between time bins within error, while there is a large drop between the bins centred on 200 Myr and 400 Myr. In this last merger time bin, centred on 400 Myr, the ΔSFR of the high-mass mergers is consistent with the ΔSFR of the intermediate-mass mergers (9.5 ≤ log(M/M) < 10.0). The intermediate-mass mergers’ ΔSFR also decreases from −500 Myr to −200 Myr but, unlike the high-mass mergers, the intermediate-mass ones show an increase in ΔSFR as the merger time increases to 100 Myr. Again, this mass range shows a decrease in ΔSFR as the merger time continues to increase. The re-sampled merger times, on the other hand, continue to increase from 200 Myr to the end of the merger time range. The low-mass mergers (9.0 ≤ log(M/M) < 9.5) also see a decrease in ΔSFR between −500 Myr and −200 Myr before remaining approximately constant to 0 Myr. This is followed by an increase to 200 Myr before falling again to the 400 Myr merger time bin. The re-sampled low-mass mergers, however, have ΔSFR that continues to rise from 200 Myr to 500 Myr.

4.4.2. Local density

We binned the merging galaxies by Σ10 into 0.00 ≤ Σ10 < 0.04, 0.04 ≤ Σ10 < 0.08, and Σ10 ≥ 0.08. For each Σ10 sample, we fitted a Gaussian distribution, as is described in Sect. 4.3, and show these results in Fig. 9. As with M above, we also resample the merger times until the average means of the Gaussians converges. These re-sampled averages are presented in Fig. 9 as empty markers.

thumbnail Fig. 9.

ΔSFR as a function of merger time for mergers binned by local density. The purple circles, blue triangles, and green squares show the μ from the Gaussian fitting of the binned merger times with 0.00 ≤ Σ10 < 0.04, 0.04 ≤ Σ10 < 0.08, and Σ10 ≥ 0.08, respectively. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty (y axis). Empty markers show the results using re-sampled merger times (see text), and the y axis errors are the standard deviation of ΔSFR from these resamplings. The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie.

The low-density (0.00 ≤ Σ10 < 0.04) and intermediate-density (0.04 ≤ Σ10 < 0.08) environments all show an increase in ΔSFR as the galaxies move from merger times of −200 Myr to 200 Myr, and then decrease again afterwards. The high-density (Σ10 ≥ 0.08) environment has ΔSFR increase between the −200 Myr and 0 Myr merger time bins followed by a decrease as the merger time becomes more positive, although these changes are within error. For all but the lowest densities, the ΔSFR in the −400 Myr merger time bin is larger than at any other merger time, but again is consistent with later times within error. For the re-sampled merger times, this continues to hold. The intermediate-density environment typically has higher ΔSFR than the other environments, being consistent with the high-density environment at merger times of ≈ − 200 Myr and ≈400 Myr.

5. Discussion

5.1. Neural network

As expected, the CNN performs worse than the CNN in Paper I, as well as all the other architectures used in Paper I (Swin Transformer (Liu et al. 2021), and autoencoder Residual Network 50 (ResNet50; He et al. 2016)). This is likely due to a combination of the noise added to the IllustrisTNG images as well as the convolution with the PSF. The convolution will reduce or erase the smaller scale structures that may be useful for determining the merger time. The inclusion of observational noise will hide fainter features that were found to be useful for the CNN in Paper I. Our results are better than a number of the autoencoder latent space explorations made in Paper I, specifically Isomap (Tenenbaum et al. 2000), neighbourhood components analysis (Goldberger et al. 2004), sparse random projection (Li et al. 2006), and truncated singular value decomposition (Halko et al. 2011).

While our MSE in this work is similar to the ResNet50 used in Paper I, 0.067 with the validation set here compared to 0.065 at validation with ResNet50, these values are misleading. The results with ResNet50 in Paper I tended to a merger time of −250 Myr. As can be seen in Fig. 3, we do not suffer the same issue. We do see that the mergers with true merger times greater than 0 Myr have predicted merger times that tend towards 240 Myr, but mergers with true merger times of less than 0 Myr have predicted merger times that are reasonably well reproduced. We find a Pearson correlation coefficient of 0.57 with a p-value of 1.87 × 10−59. Thus there is correlation between the predicted and true merger times but this correlation is weak.

In the test data, our network find 78 ± 2% of the galaxies to be correctly predicted as having merger times greater than, or of less than 0 Myr, lower than the 81 ± 1% found in Paper I. This is primarily driven by the galaxies with true merger times of less than 0 Myr, with 81 ± 2% of these galaxies predicted to have merger times of less than 0 Myr. The galaxies with true merger times greater than 0 Myr have a slightly worse reproduction with 77 ± 2% having predicted merger times greater than 0 Myr. This is qualitatively in line with Paper I.

5.2. KiDS merger time

The merger times of the KiDS galaxies predominantly have merger times of less than 0 Myr. This is not overly surprising. The training data used to identify the KiDS merging galaxies in this work was from the GAMA-KiDS Galaxy Zoo citizen science project (Lintott et al. 2008; Driver et al. 2009; Holwerda et al. 2019; Pearson et al. 2019a; Holwerda et al. 2024). Galaxies with negative merger times (pre-merger galaxies) can be easier to visually identify than galaxies with positive merger times (post-merger galaxies), as two close galaxies help in identification. Thus, if the training sample in Pearson et al. (2019a) was biased towards pre-mergers, we would expect the merger sample to be biased towards mergers with positive merger times. Indeed, early Galaxy Zoo merger samples were biased towards pre-merger galaxies (Darg et al. 2010b).

In this work, we identify 2373 mergers with positive merger times and 3524 mergers with negative merger times. With our network assigning 23 ± 1% of galaxies with true merger times greater than 0 Myr negative merger times, we would expect approximately 709 of the galaxies with negative merger time to actually have positive merger times, assuming no true positive merger time galaxies were assigned negative merger times. This would agree with the idea that our sample of KiDS merging galaxies is biased towards pre-merger galaxies. On the other hand, if we incorrectly assign 19 ± 2% of galaxies with true negative merger times to have positive merger times, we would expect approximately 827 of the mergers with negative merger times to actually have positive merger times. This again assumes that no galaxies were assigned positive merger times that actually had negative merger times.

For all mergers, the mean and mode merger times of the IllustrisTNG test set are higher than those of the KiDS galaxies. The median of the test set is −14 ± 188 Myr compared to −48 ± 157 Myr for the KiDS sample (where the errors are the median absolute deviation), while the means are 18 ± 9 Myr and −34 ± 2 Myr (where errors are the standard error on the mean) for the test and KiDS galaxies. This would suggest that the merger times for the KiDS galaxies are not just a reproduction of the distribution found for the test data. The Kolmogorov–Smirnov test also rejects the hypothesis that the test and KiDS merger times, for the pre-mergers, are drawn from the same parent sample at level 0.001. As a result, we are confident that the merger times are determined individually for each KiDS galaxy and are not drawn from an underlying, unphysical distribution that may have been learnt by the CNN.

5.3. SFR as a function of merger time

The merger galaxies, traced by green circles in Fig 6, see an increasing enhancement for mergers with negative merger times from −200 Myr before coalescence to the merger event at a merger time of 0 Myr. This period is likely to contain close passages of the merging galaxies. Close passages are known, in simulations, to cause an increase in the SFR in the interacting galaxies (e.g. Moreno et al. 2019; Petersson et al. 2023). As merging galaxies will not have the close passages at exactly the same time, the ΔSFR in individual galaxies due to a close pass will occur at slightly different times. Between passages, we would expect the SFR of the merging galaxies to decrease (e.g. Moreno et al. 2019). We do not see any decrease in ΔSFR, or a constant ΔSFR, as the merger time approached 0 Myr. This is possibly due to some merging systems’ SFR reducing slightly after the passage while other galaxies experience another passage or the beginning of coalescence which stimulated further ΔSFR. As a result, the average SFR increase becomes larger in this period with some galaxies experiencing a slight decline in SFR while others experience another burst. However, our merger time uncertainty is large. Thus, if galaxy mergers all had a close passage at approximately the same time, any decrease in ΔSFR after a close passage can be lost due to the incorrect merger time. Similarly, the closer pre-merging galaxies are together, the higher the SFR of these galaxies (Ellison et al. 2008; Li et al. 2008; Hwang et al. 2011; Patton et al. 2013; Ellison et al. 2024; Feng et al. 2024; Ferreira et al. 2025), which supports our observed increase in ΔSFR as the merger time approached 0 Myr.

In short, the trends that we see in this work see an increase in SFR from 200 Myr before a merger until at least 200 Myr after coalescence. This is followed by a drop in SFR. This trend is broadly consistent with what is expected from simulations (e.g. Hopkins et al. 2013; Moreno et al. 2019; Rodríguez Montero et al. 2019; Li et al. 2025).

The higher ΔSFR for merger times of less than −300 Myr was unexpected: we would expect this merger time bin to have one of the lowest ΔSFR values (e.g. Moreno et al. 2019; Ellison et al. 2024). This may be a period where a majority of mergers are experiencing a close passage, which are known to increase the SFR in merging galaxies. However, looking at the separation between merging galaxies in IllustrisTNG, we see a median or mean separation for mergers between −300 Myr and −500 Myr before the merger that is larger than the median or mean separation for mergers closer to coalescence. We also see very few merging galaxies with ΔSFR of less than 0 with merger times between −500 Myr and −300 Myr compared to between −300 Myr and 0 Myr. As ΔSFR < 0 suggests a reduction in SFR, it may be that mergers closer to coalescence can suppress star formation for a minority of merging galaxies.

The ΔSFR continues to increase moving from 0 Myr to 200 Myr. This may be a lingering effect of a SFR burst triggered by the coalescence or this burst takes up to 300 Myr to manifest. Also likely, the increase in ΔSFR may be caused by high SFR mergers with a true merger time of ≈0 Myr, or with negative merger times close to 0 Myr, being scattered to more positive merger times as the CNN miss-assigns their merger time. We note that post-merger galaxies have previously been found to show evidence of increased SFR comparable to pre-mergers (Bickley et al. 2022; Petersson et al. 2023). It is expected that the SFR decreases with time after the merger event (e.g. Moreno et al. 2019; Bickley et al. 2022; Ellison et al. 2022; Petersson et al. 2023), although the SFR enhancement can remain for up to 500 Myr after coalescence (Ferreira et al. 2025). Our results also show evidence of this, with the merger population showing a lower ΔSFR for galaxies 300 Myr to 500 Myr after the merger event.

If the merging galaxies are plotted in the SFR-M plane, such as in Fig. 10a, we find the main sequence of star-forming galaxies (MS; e.g. Noeske et al. 2007; Speagle et al. 2014; Popesso et al. 2023) and a small quiescent population. The control galaxies also lie across the entire SFR-M plane, as shown in Fig. 10b, as well as our entire KiDS galaxy sample, shown in Fig. 10c. We over-plot the Speagle et al. (2014) compilation MS in Fig. 10 to help identify the location of the MS. We note that the slope of the Speagle et al. (2014) MS appears to be lower than the slope of the galaxies in this work. However, the slope of the MS is known to be sensitive to the star-forming galaxy selection method (Pearson et al. 2023), and we make no selection for star-forming galaxies in Fig. 10. The low redshift (z < 0.2) slopes used in Speagle et al. (2014) to derive the compilation MS are also highly scattered, ranging from 0.35 to 1. Thus a disagreement in the slope of the MS is not unexpected.

thumbnail Fig. 10.

SFR as a function of M for KiDS galaxies. Panel a shows the position of the merger sample, panel b shows the controls, and panel c shows our entire sample of KiDS galaxies. The mergers lie on the MS, with a small number below.

We find the enhancement of the merger population to be a factor of 1.97 ± 0.04 to 2.56 ± 0.04 larger than the control sample, depending on the merger time. This is consistent with or slightly higher than the enhancement of a factor of two found in Patton et al. (2013) and Ferreira et al. (2025). However, Patton et al. (2013) and Ferreira et al. (2025) both select their merging galaxies and their control galaxies to be star-forming. We do not make such a selection and so we compare the SFR of our mergers to control samples that can contain star-forming and quiescent galaxies. As a result, our enhancement may be larger due to the median SFR of our matched control galaxies being lower than the average SFR of the matched control galaxies in Patton et al. (2013) and Ferreira et al. (2025).

The SFRs used in this work estimate the instantaneous SFR, so should be sensitive to star formation over the last 1 Myr (Boquien et al. 2019). However, spectral energy distribution (SED) derived SFRs are typically assumed to be sensitive to stars formed in the last 100 Myr (Kurczynski et al. 2016). Using this more conservative estimate, the SFR merger time sensitivity is smaller than our mean error at all merger times, and is smaller than the median error of true merger times greater than −300 Myr.

5.4. Confirmation that mergers cause increase in ΔSFR

To ensure that the enhancement in the SFR seen is caused by mergers, we repeated the calculation of SFR enhancement using only our non-merging control galaxies. For each sample of ten non-merging control galaxies, we randomly selected one and compared its SFR to the other nine control galaxies. This was done in the same way as we did for the merging galaxies: the log(SFR/Myr−1) of one control galaxy was subtracted from the median log(SFR/Myr−1) of the other nine control galaxies. That is to say,

Δ SFR ctrl = log ( SFR r M yr 1 ) median [ log ( SFR c M yr 1 ) ] , $$ \begin{aligned} \Delta \mathrm{SFR}_{\rm ctrl} = \log \bigg (\frac{\mathrm{SFR}_{r}}{M_{\odot }\,\mathrm{yr}^{-1}}\bigg ) - \mathrm{median} \bigg [ \log \bigg (\frac{\mathrm{SFR}_{c}}{M_{\odot }\,\mathrm{yr}^{-1}}\bigg ) \bigg ], \end{aligned} $$(7)

where ΔSFRctrl is the SFR enhancement of the randomly selected control galaxy, SFRr is the SFR of the randomly selected control galaxy and SFRc is the SFR of a the other nine control galaxies. The mean (median) uncertainty on ΔSFRctrl, derived from the uncertainties in SFR, is 0.65 (0.66) We then use the merger time of the merging galaxy associated with each control sample to create Fig. 11, over-plotting the μ of fitting a Gaussian distribution to the ΔSFR in merger time bins as in Sect. 4.3 and in Fig. 6. As can be seen, there is no indication of ΔSFR being larger than 0. Indeed, we find that the mean ΔSFRctrl in Fig. 11 is −0.02 ± 0.47, the median is 0.00 ± 0.36, and the Gaussian μ (equivalent to the mode) is 0.00 ± 0.02. All these values are consistent with zero and so the enhancement seen in Fig. 6 for the merging galaxies is a result of the mergers causing changes in the SFR.

thumbnail Fig. 11.

SFR enhancement as a function of merger time for the control galaxies. The green circles show the μ from the Gaussian fitting of the mergers. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty. The colour corresponds to the number density from low (purple) to high (red). The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie. The yellow horizontal line indicates ΔSFRctrl = 0

5.5. Merger time uncertainty

The merger times found in this work are highly uncertain. To determine how this uncertainty impacts our results, we resample the merger times of the KiDS galaxies from a Gaussian distribution centred on the merger time from the CNN and with a standard deviation of 145 Myr (the median error) or the median merger time error in the merger time bin containing the galaxy (see Sect. 4.1), whichever is larger. Any galaxy whose new merger time is greater than 500 Myr or less than −500 Myr is then removed from further analysis as our network is not trained to produce such merger times. We also do not recover the merging galaxies who would have merger time estimates of less than −500 Myr or greater than 500 Myr but whose resampling would scatter them back into the −500 Myr to 500 Myr range. We binned the merging galaxies into bins of width 200 Myr and fitted a Gaussian distribution, as is described in Sect. 4.3. This allowed us to trace the merger population with re-sampled merger times.

We continued to resample the merger times until the mean ΔSFR converged. This was done by continuing to resample the merger time for each merger galaxy from a Gaussian distribution centred on the merger time from the CNN and with a standard deviation of 145 Myr (the median error) or the median merger time error in the merger time bin containing the galaxy (see Sect. 4.1 and Table 1), whichever is larger. Again we removed any galaxy whose re-sampled merger time is less than −500 Myr or greater than 500 Myr. After each resampling, we again binned the merging galaxies into bins of width 200 Myr and fitted a Gaussian distribution. The five merger time bins provide five averages. If each of these five average values have not changed by more than 0.001 and the sum of these changes is not more than 0.005, the average is assumed to have converged. This was achieved after 13 re-samplings. These converged average ΔSFR are hereafter referred to as conv and are presented in Fig. 12 as orange pluses. The re-samplings used to derive conv are shown as grey lines in Fig. 12.

thumbnail Fig. 12.

ΔSFR as a function of merger time. The green circles and orange pluses show the μ from the Gaussian fitting of the mergers using the CNN merger times and conv merger times, respectively. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty, for CNN merger time, or the standard deviation of ΔSFR, for conv (y axis). The grey lines show ΔSFR from the merger time re-samplings used to determine conv. Errors for these lines are omitted for clarity. The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie.

For the conv resampling, ΔSFR increases as merger times move from −200 Myr to 200 Myr, meaning that mergers that should have recently completed a merger have higher ΔSFR than mergers that are coalescing or approaching coalescence. Galaxies with merger times greater than 300 Myr appear to have lower ΔSFR than mergers with smaller merger times, but this drop is within uncertainty of the bin centred on 400 Myr so is not robust. At the other end of the time range, we again see the conv resampling has higher ΔSFR for merger times of less than −300 Myr when compared to galaxies with merger times between −300 Myr and 0 Myr.

With the increase in ΔSFR for the conv and CNN merger times, we can be cautiously optimistic that we are seeing in enhancement in the SFR as the mergers approach coalescence. As all the re-samplings and the CNN see a continued ΔSFR enhancement into the early post-merger phase, it is likely that early post-mergers continue to have increasing SFR compared to mergers with merger times of less than 0 Myr.

The large decrease in ΔSFR moving from the 200 Myr to the 400 Myr bins with the CNN merger times is not strongly seen in the conv data and only seen in one of the re-samplings. Therefore, we believe we are only seeing weak evidence of this decrease in ΔSFR. This weak evidence of a decreasing ΔSFR is likely due to the poor merger time accuracy for galaxies with merger times greater than 0 Myr, with galaxies in the 200 Myr and 400 Myr bins being easily confused by the neural network (Fig. 3).

One of the re-samplings in Fig. 12 only shows a very slight drop in ΔSFR between the bins centred on −400 Myr and −200 Myr. This drop is within the errors of these points for this resampling (not shown in Fig. 12). This hints that the unexpectedly large ΔSFR for merger times of less than −300 Myr may also be a result of our merger time uncertainty. However, as this only appears in a single resampling, this is not a very strong argument for why we see this increase with the CNN merger times.

5.6. Dependence on galaxy properties

5.6.1. Stellar mass

High-mass galaxies are more likely to be gas-poor, and thus high mass major mergers are more likely to have less gas available to form new stars (Robotham et al. 2014). This may explain why the ΔSFR of our high-mass galaxies (log(M/M)≥10.0) drops to become consistent with the intermediate-mass mergers (9.5 ≤ log(M/M) < 10.0) for the most positive merger times in Fig. 8. However, this does not explain why the ΔSFR for the high-mass galaxies is otherwise higher than the other mass bins. It is possible, instead, that the gas is depleted faster in high-mass galaxy mergers compared to the other mass bins: higher ΔSFR implies a higher specific SFR and hence faster gas depletion. If the SFR is triggered sooner in a high-mass merger, this would explain why ΔSFR decreases slowly for mergers with merger times of less than 0 Myr. This rapid depletion of gas would then result in a quicker decrease in ΔSFR for high-mass galaxies than intermediate- or low-mass galaxies. If this is what is happening, high-mass mergers are not initially gas-poor (i.e. the high-mass mergers were star-forming before they merged) but the gas is depleted quickly compared to lower mass mergers. ΔSFR for the intermediate-mass galaxies also reduces faster as the merger time increases from 200 Myr to 500 Myr compared to the low-mass galaxies, supporting the idea that the higher the mass of a merger, the faster it depletes the gas available for star formation.

ΔSFR increases with M, as seen in Fig. 8, at least for the population with merger times of less than 300 Myr. We note that the ΔSFR compares galaxies of similar M, so this increase with stellar mass is not caused by the correlation between M and SFR. This would suggest that there is more gas available to form stars as the galaxies undergoing a merger become larger.

However, the control sample for high-mass mergers mainly consists of quiescent galaxies (Fig. 10). Here, we define a star-forming galaxy to have a specific SFR (SFR/M) greater than −10.5 yr−1, determined through visual inspection of Fig 10. With this definition, only 23 ± 1% of the high-mass control galaxies are star-forming while 72 ± 1% of the high-mass mergers are star-forming. This compares to the low-mass sample having 78 ± 1% star-forming controls and 85 ± 1% star-forming mergers while the intermediate-mass sample has 52 ± 1% of the controls being star-forming and 85 ± 1% of the mergers being star-forming. As a result, the ΔSFR increase with M may predominantly be a result of the higher fraction of quiescent control galaxies as the merging galaxies’ M increases. If we compare this to the entire sample of KiDS galaxies, 69 ± 1% of low-mass galaxies, 51 ± 1% of intermediate-mass galaxies, and 24 ± 1% of high-mass galaxies are star-forming. This increasing fraction of quiescent galaxies with M in the control sample is therefore a result of the increasing fraction of quiescent galaxies with M in the entire sample of KiDS galaxies. This increase in the quiescent fraction with M is often seen in the literature (e.g. Peng et al. 2010; Weaver et al. 2023).

5.6.2. Local density

High-density environments contain more gas-poor galaxies than lower-density environments. Thus, we expect more dry mergers, and hence lower ΔSFR mergers (Lin et al. 2010; He et al. 2024). In Fig. 9 we typically see the high-density environments (Σ10 ≥ 0.08) have lower ΔSFR than the intermediate-density environments (0.04 ≤ Σ10 < 0.08), which supports this notion. However, the low-density environments (0.00 ≤ Σ10 < 0.04) are typically found to have lower ΔSFR than intermediate- and high-density environments. This is inconsistent, or at best marginally consistent, with studies finding higher-density environments containing more gas-poor galaxies, or in general contain less gas (Lin et al. 2010). It is not clear why this should be the case as we would expect the lower-density environments to have more gas available to form stars.

The general trend for low- and intermediate-density environments are the same as the entire population: ΔSFR drops from the merger time bin centred on −400 Myr to the −200 Myr before increasing to 200 Myr and then falling into the final merger time bin. The high-density environments differ from this trend after 0 Myr, as this environment only sees a decrease in ΔSFR from this merger time. As is discussed in Sect. 5.5, this difference may be due to merger time uncertainty.

However, the disagreement between environments is different in the converged, re-sampled merger times in Fig. 9 (empty markers). All environments remain approximately constant in ΔSFR between the 200 Myr and 400 Myr merger time bins. These differences suggest that the strength of the trends seen with the CNN merger times are at least partially a result of the merger time uncertainty.

6. Conclusions

In this work, we trained a CNN on images of merging galaxies from IllustrisTNG with known merger times with the intent to apply this CNN to observed merging galaxies from KiDS. The images of the IllustrisTNG galaxies were edited to appear like KiDS observations. The merging galaxies were selected from TNG100-1 to have undergone a merger in the last 500 Myr or will undergo a merger in the next 500 Myr. The trained CNN has a mean error of 182 Myr and a median error of 145 Myr.

The CNN was applied to 5897 merging galaxies from KiDS, providing merger times for a large sample of observed galaxies for the first time. The majority of the KiDS mergers (3524) were found to have merger time consistent with pre-mergers. This was consistent with our expectation that the KiDS sample is biased towards pre-mergers, as the KiDS mergers were selected with a CNN from Pearson et al. (2019a) that was trained on a merger sample that was likely biased towards pre-mergers. The merging galaxies were matched to non-merging counterparts with similar redshift, M, and local density (Σ10). The SFR of the merging systems were compared to the average SFR of their matched counterparts.

Our merging galaxies have increasing SFR activity between 300 Myr before the merger event and coalescence (merger times of 0 Myr). We suggest that this increase is due to closer proximity of the merging galaxies as well as close passages as the merger time approached 0 Myr. This is followed by a further increase in the SFR remaining until at least 200 Myr after the merger event. We also see an indication that this is followed by a decrease in the SFR activity as the time since a merger increases further. This decrease in ΔSFR for late-stage mergers is difficult to be confident in due to the larger merger time uncertainties for late-stage mergers. There is also a decrease in ΔSFR from ≈400 Myr before the merger event and ≈200 Myr before. The reason for this is unclear but there is evidence that it may be a result of the low accuracy in merger time estimation.

When binning by M, we see that the larger the merging galaxy’s M, the greater the SFR enhancement. We generally find that pre-mergers with higher M have higher ΔSFR. However, the most massive enhanced merging galaxies have ΔSFR that decreases faster after the merger is complete compared to the lower mass galaxies. When binned by Σ10, we find that the lowest density environments have lower ΔSFR mergers than higher-density environments and intermediate density environments have higher ΔSFR mergers than other environments. However, this trend is inconsistent, suggesting that density does not play a major roll in the change in SFR in galaxies due to mergers.

The merger times produced in this work are not highly accurate. However, they show that it is possible to determine the time before or after a merger event for real galaxies, something not previously managed on such a large scale. With these times, we can then study how galaxies change during the merger sequence from a statistical standpoint. Future work will use a target observed data set with deeper observations, to better show faint structures, with higher spatial resolution, to better show fine structures. For example, targeting Hyper-Suprime Cam – Subaru Strategic Program (Aihara et al. 2018) data will provide deeper imaging than KiDS at a similar spatial resolution, Vera C. Rubin (Ivezić et al. 2019) data will provide deeper imaging at a similar spatial resolution to KiDS but cover most of the Southern sky, while targeting James Web Space Telescope (Gardner et al. 2006) data will provide both deeper images and greater resolution at the expense of a smaller area of sky coverage.

Data availability

Full Table 2 is available at the CDS via https://cdsarc.cds.unistra.fr/viz-bin/cat/J/A+A/704/A234

Acknowledgments

We would like to thank the referee for their thorough and thoughtful comments that helped improve the quality and clarity of this paper. W.J.P. has been supported by the Polish National Science Center projects UMO-2020/37/B/ST9/00466 and UMO-2023/51/D/ST9/00147. L.W. and B.M-B acknowledge funding from the project ‘Clash of the Titans: de-ciphering the enigmatic role of cosmic collisions’ (with project number VI.Vidi.193.113 of the research programme Vidi which is (partly) financed by the Dutch Research Council (NWO). L.E.S. was supported by the Estonian Ministry of Education and Research (grant TK202), Estonian Research Council grant (PRG1006), and the European Union’s Horizon Europe research and innovation programme (EXCOSM, grant No. 101159513). The IllustrisTNG simulations were undertaken with compute time awarded by the Gauss Centre for Supercomputing (GCS) under GCS Large-Scale Projects GCS-ILLU and GCS-DWAR on the GCS share of the supercomputer Hazel Hen at the High Performance Computing Center Stuttgart (HLRS), as well as on the machines of the Max Planck Computing and Data Facility (MPCDF) in Garching, Germany. Based on observations made with ESO Telescopes at the La Silla Paranal Observatory under programme IDs 177.A-3016, 177.A-3017, 177.A-3018 and 179.A-2004, and on data products produced by the KiDS consortium. The KiDS production team acknowledges support from: Deutsche Forschungsgemeinschaft, ERC, NOVA and NWO-M grants; Target; the University of Padova, and the University Federico II (Naples). This work made use of Astropy: a community-developed core Python package and an ecosystem of tools and resources for astronomy (Astropy Collaboration 2013, 2018, 2022). This research made use of Photutils, an Astropy package for detection and photometry of astronomical sources (Bradley et al. 2024).

References

  1. Ackermann, S., Schawinski, K., Zhang, C., Weigel, A. K., & Turp, M. D. 2018, MNRAS, 479, 415 [NASA ADS] [CrossRef] [Google Scholar]
  2. Aihara, H., Arimoto, N., Armstrong, R., et al. 2018, PASJ, 70, S4 [NASA ADS] [Google Scholar]
  3. Asada, Y., Sawicki, M., Abraham, R., et al. 2024, MNRAS, 527, 11372 [Google Scholar]
  4. Astropy Collaboration (Robitaille, T. P., et al.) 2013, A&A, 558, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  5. Astropy Collaboration (Price-Whelan, A. M., et al.) 2018, AJ, 156, 123 [Google Scholar]
  6. Astropy Collaboration (Price-Whelan, A. M., et al.) 2022, ApJ, 935, 167 [NASA ADS] [CrossRef] [Google Scholar]
  7. Barton, E. J., Geller, M. J., & Kenyon, S. J. 2000, ApJ, 530, 660 [NASA ADS] [CrossRef] [Google Scholar]
  8. Bickley, R. W., Bottrell, C., Hani, M. H., et al. 2021, MNRAS, 504, 372 [NASA ADS] [CrossRef] [Google Scholar]
  9. Bickley, R. W., Ellison, S. L., Patton, D. R., et al. 2022, MNRAS, 514, 3294 [NASA ADS] [CrossRef] [Google Scholar]
  10. Bickley, R. W., Ellison, S. L., Patton, D. R., & Wilkinson, S. 2023, MNRAS, 519, 6149 [CrossRef] [Google Scholar]
  11. Bilicki, M., Dvornik, A., Hoekstra, H., et al. 2021, A&A, 653, A82 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  12. Boquien, M., Burgarella, D., Roehlly, Y., et al. 2019, A&A, 622, A103 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  13. Bottrell, C., Hani, M. H., Teimoorinia, H., et al. 2019, MNRAS, 490, 5390 [NASA ADS] [CrossRef] [Google Scholar]
  14. Boylan-Kolchin, M., Springel, V., White, S. D. M., Jenkins, A., & Lemson, G. 2009, MNRAS, 398, 1150 [Google Scholar]
  15. Bradley, L., Sipocz, B., Robitaille, T., et al. 2024, https://doi.org/10.5281/zenodo.10671725 [Google Scholar]
  16. Brown, W., Patton, D. R., Ellison, S. L., & Faria, L. 2023, MNRAS, 522, 5107 [NASA ADS] [CrossRef] [Google Scholar]
  17. Bruzual, G., & Charlot, S. 2003, MNRAS, 344, 1000 [NASA ADS] [CrossRef] [Google Scholar]
  18. Bustamante, S., Sparre, M., Springel, V., & Grand, R. J. J. 2018, MNRAS, 479, 3381 [Google Scholar]
  19. Capaccioli, M., & Schipani, P. 2011, The Messenger, 146, 2 [NASA ADS] [Google Scholar]
  20. Capaccioli, M., Schipani, P., de Paris, G., et al. 2012, in Science from the Next Generation Imaging and Spectroscopic Surveys, 1 [Google Scholar]
  21. Chabrier, G. 2003, PASP, 115, 763 [Google Scholar]
  22. Charlot, S., & Fall, S. M. 2000, ApJ, 539, 718 [Google Scholar]
  23. 6 Ćiprijanović, A., Kafkes, D., Jenkins, S., et al. 2020a, ArXiv e-prints [arXiv:2011.03591] [Google Scholar]
  24. Ćiprijanović, A., Snyder, G. F., Nord, B., & Peek, J. E. G. 2020b, Astron. Comput., 32, 100390 [CrossRef] [Google Scholar]
  25. Ćiprijanović, A., Kafkes, D., Downey, K., et al. 2021, MNRAS, 506, 677 [CrossRef] [Google Scholar]
  26. Conselice, C. J. 2014, ARA&A, 52, 291 [CrossRef] [Google Scholar]
  27. Conselice, C. J., Bershady, M. A., & Jangren, A. 2000, ApJ, 529, 886 [NASA ADS] [CrossRef] [Google Scholar]
  28. Conselice, C. J., Bershady, M. A., Dickinson, M., & Papovich, C. 2003, AJ, 126, 1183 [CrossRef] [Google Scholar]
  29. Cortijo-Ferrero, C., González Delgado, R. M., Pérez, E., et al. 2017, MNRAS, 467, 3898 [NASA ADS] [CrossRef] [Google Scholar]
  30. Darg, D. W., Kaviraj, S., Lintott, C. J., et al. 2010a, MNRAS, 401, 1552 [NASA ADS] [CrossRef] [Google Scholar]
  31. Darg, D. W., Kaviraj, S., Lintott, C. J., et al. 2010b, MNRAS, 401, 1043 [NASA ADS] [CrossRef] [Google Scholar]
  32. de Graaff, R., Margalef-Bentabol, B., Wang, L., et al. 2025, A&A, 697, A207 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  33. de Jong, J. T. A., Kuijken, K., Applegate, D., et al. 2013a, The Messenger, 154, 44 [NASA ADS] [Google Scholar]
  34. de Jong, J. T. A., Verdoes Kleijn, G. A., Kuijken, K. H., & Valentijn, E. A. 2013b, Exp. Astron., 35, 25 [Google Scholar]
  35. De Propris, R., Liske, J., Driver, S. P., Allen, P. D., & Cross, N. J. G. 2005, AJ, 130, 1516 [NASA ADS] [CrossRef] [Google Scholar]
  36. Desmons, A., Brough, S., Martínez-Lombilla, C., et al. 2023, MNRAS, 523, 4381 [CrossRef] [Google Scholar]
  37. Domínguez Sánchez, H., Martin, G., Damjanov, I., et al. 2023, MNRAS, 521, 3861 [CrossRef] [Google Scholar]
  38. Draine, B. T., Aniano, G., Krause, O., et al. 2014, ApJ, 780, 172 [Google Scholar]
  39. Driver, S. P., Norberg, P., Baldry, I. K., et al. 2009, Astron. Geophys., 50, 12 [Google Scholar]
  40. Driver, S. P., Bellstedt, S., Robotham, A. S. G., et al. 2022, MNRAS, 513, 439 [NASA ADS] [CrossRef] [Google Scholar]
  41. Duncan, K., Conselice, C. J., Mundy, C., et al. 2019, ApJ, 876, 110 [NASA ADS] [CrossRef] [Google Scholar]
  42. Eales, S., Dunne, L., Clements, D., et al. 2010, PASP, 122, 499 [NASA ADS] [CrossRef] [Google Scholar]
  43. Edge, A., Sutherland, W., Kuijken, K., et al. 2013, The Messenger, 154, 32 [NASA ADS] [Google Scholar]
  44. Ellison, S. L., Patton, D. R., Simard, L., & McConnachie, A. W. 2008, AJ, 135, 1877 [NASA ADS] [CrossRef] [Google Scholar]
  45. Ellison, S. L., Mendel, J. T., Patton, D. R., & Scudder, J. M. 2013a, MNRAS, 435, 3627 [CrossRef] [Google Scholar]
  46. Ellison, S. L., Mendel, J. T., Scudder, J. M., Patton, D. R., & Palmer, M. J. D. 2013b, MNRAS, 430, 3128 [NASA ADS] [CrossRef] [Google Scholar]
  47. Ellison, S. L., Viswanathan, A., Patton, D. R., et al. 2019, MNRAS, 487, 2491 [NASA ADS] [CrossRef] [Google Scholar]
  48. Ellison, S. L., Wilkinson, S., Woo, J., et al. 2022, MNRAS, 517, L92 [NASA ADS] [CrossRef] [Google Scholar]
  49. Ellison, S., Ferreira, L., Wild, V., et al. 2024, Open J. Astrophys., 7, 121 [Google Scholar]
  50. Feng, S., Shen, S.-Y., Yuan, F.-T., et al. 2024, ApJ, 965, 60 [Google Scholar]
  51. Ferreira, L., Conselice, C. J., Duncan, K., et al. 2020, ApJ, 895, 115 [NASA ADS] [CrossRef] [Google Scholar]
  52. Ferreira, L., Ellison, S. L., Patton, D. R., et al. 2025, MNRAS, 538, L31 [Google Scholar]
  53. Fritz, J., Franceschini, A., & Hatziminaoglou, E. 2006, MNRAS, 366, 767 [Google Scholar]
  54. Gao, F., Wang, L., Pearson, W. J., et al. 2020, A&A, 637, A94 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  55. Gardner, J. P., Mather, J. C., Clampin, M., et al. 2006, Space Sci. Rev., 123, 485 [Google Scholar]
  56. Goldberger, J., Hinton, G. E., Roweis, S., & Salakhutdinov, R. R. 2004, in Advances in Neural Information Processing Systems, eds. L. Saul, Y. Weiss, & L. Bottou, (MIT Press), 17 [Google Scholar]
  57. Guzmán-Ortega, A., Rodriguez-Gomez, V., Snyder, G. F., Chamberlain, K., & Hernquist, L. 2023, MNRAS, 519, 4920 [CrossRef] [Google Scholar]
  58. Halko, N., Martinsson, P. G., & Tropp, J. A. 2011, SIAM Rev., 53, 217 [CrossRef] [Google Scholar]
  59. Hani, M. H., Gosain, H., Ellison, S. L., Patton, D. R., & Torrey, P. 2020, MNRAS, 493, 3716 [Google Scholar]
  60. He, K., Zhang, X., Ren, S., & Sun, J. 2016, in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770 [Google Scholar]
  61. He, C., Xu, C., Lisenfeld, U., et al. 2024, Res. Astron. Astrophys., 24, 055005 [Google Scholar]
  62. Holwerda, B. W., Kelvin, L., Baldry, I., et al. 2019, AJ, 158, 103 [NASA ADS] [CrossRef] [Google Scholar]
  63. Holwerda, B. W., Robertson, C., Cook, K., et al. 2024, PASA, 41, e115 [Google Scholar]
  64. Hopkins, P. F., Cox, T. J., Hernquist, L., et al. 2013, MNRAS, 430, 1901 [Google Scholar]
  65. Hopkins, P. F., Wetzel, A., Kereš, D., et al. 2018, MNRAS, 480, 800 [NASA ADS] [CrossRef] [Google Scholar]
  66. Hwang, H. S., Elbaz, D., Dickinson, M., et al. 2011, A&A, 535, A60 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  67. Hwang, H. S., Park, C., Elbaz, D., & Choi, Y. Y. 2012, A&A, 538, A15 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  68. Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111 [Google Scholar]
  69. Keel, W. C., Kennicutt, R. C., Jr, Hummel, E., & van der Hulst, J. M. 1985, AJ, 90, 708 [NASA ADS] [CrossRef] [Google Scholar]
  70. Kennicutt, R. C., Jr 1983, ApJ, 272, 54 [NASA ADS] [CrossRef] [Google Scholar]
  71. Kingma, D. P., & Ba, J. 2015, in 3rd InternationalConference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, Conference Track Proceedings, eds. Y. Bengio, & Y. LeCun [Google Scholar]
  72. Knapen, J. H., Cisternas, M., & Querejeta, M. 2015, MNRAS, 454, 1742 [NASA ADS] [CrossRef] [Google Scholar]
  73. Koppula, S., Bapst, V., Huertas-Company, M., et al. 2021, ArXiv e-prints [arXiv:2102.05182] [Google Scholar]
  74. Kuijken, K. 2011, The Messenger, 146, 8 [NASA ADS] [Google Scholar]
  75. Kuijken, K., Heymans, C., Dvornik, A., et al. 2019, A&A, 625, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  76. Kurczynski, P., Gawiser, E., Acquaviva, V., et al. 2016, ApJ, 820, L1 [NASA ADS] [CrossRef] [Google Scholar]
  77. La Marca, A., Margalef-Bentabol, B., Wang, L., et al. 2024, A&A, 690, A326 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  78. Li, P., Hastie, T. J., & Church, K. W. 2006, in Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’06 (New York, NY, USA: Association for Computing Machinery), 287 [Google Scholar]
  79. Li, C., Kauffmann, G., Heckman, T. M., Jing, Y. P., & White, S. D. M. 2008, MNRAS, 385, 1903 [NASA ADS] [CrossRef] [Google Scholar]
  80. Li, F., Rahman, M., Murray, N., et al. 2025, ApJ, 979, 7 [Google Scholar]
  81. Lin, L., Cooper, M. C., Jian, H.-Y., et al. 2010, ApJ, 718, 1158 [NASA ADS] [CrossRef] [Google Scholar]
  82. Lintott, C. J., Schawinski, K., Slosar, A., et al. 2008, MNRAS, 389, 1179 [NASA ADS] [CrossRef] [Google Scholar]
  83. Liu, Z., Lin, Y., Cao, Y., et al. 2021, in Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) [Google Scholar]
  84. Lotz, J. M., Primack, J., & Madau, P. 2004, AJ, 128, 163 [NASA ADS] [CrossRef] [Google Scholar]
  85. Lotz, J. M., Davis, M., Faber, S. M., et al. 2008, ApJ, 672, 177 [NASA ADS] [CrossRef] [Google Scholar]
  86. Margalef-Bentabol, B., Wang, L., La Marca, A., et al. 2024, A&A, 687, A24 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  87. Marinacci, F., Vogelsberger, M., Pakmor, R., et al. 2018, MNRAS, 480, 5113 [NASA ADS] [Google Scholar]
  88. Moreno, J., Torrey, P., Ellison, S. L., et al. 2019, MNRAS, 485, 1320 [NASA ADS] [CrossRef] [Google Scholar]
  89. Naiman, J. P., Pillepich, A., Springel, V., et al. 2018, MNRAS, 477, 1206 [Google Scholar]
  90. Nair, V., & Hinton, G. E. 2010, Proceedings of the 27th International Conference on International Conference on Machine Learning, ICML’10 (Madison, WI, USA: Omnipress), 807 [Google Scholar]
  91. Nelson, D., Pillepich, A., Springel, V., et al. 2018, MNRAS, 475, 624 [Google Scholar]
  92. Noeske, K. G., Weiner, B. J., Faber, S. M., et al. 2007, ApJ, 660, L43 [CrossRef] [Google Scholar]
  93. Patton, D. R., Torrey, P., Ellison, S. L., Mendel, J. T., & Scudder, J. M. 2013, MNRAS, 433, L59 [CrossRef] [Google Scholar]
  94. Pearson, W. J., Wang, L., Alpaslan, M., et al. 2019a, A&A, 631, A51 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  95. Pearson, W. J., Wang, L., Trayford, J. W., Petrillo, C. E., & van der Tak, F. F. S. 2019b, A&A, 626, A49 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  96. Pearson, W. J., Suelves, L. E., Ho, S. C. C., et al. 2022, A&A, 661, A52 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  97. Pearson, W. J., Pistis, F., Figueira, M., et al. 2023, A&A, 679, A35 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  98. Pearson, W. J., Rodriguez-Gomez, V., Kruk, S., & Margalef-Bentabol, B. 2024, A&A, 687, A45 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  99. Peng, Y.-J., Lilly, S. J., Kovač, K., et al. 2010, ApJ, 721, 193 [Google Scholar]
  100. Petersson, J., Renaud, F., Agertz, O., Dekel, A., & Duc, P.-A. 2023, MNRAS, 518, 3261 [Google Scholar]
  101. Pillepich, A., Nelson, D., Hernquist, L., et al. 2018a, MNRAS, 475, 648 [Google Scholar]
  102. Pillepich, A., Springel, V., Nelson, D., et al. 2018b, MNRAS, 473, 4077 [Google Scholar]
  103. Planck Collaboration XIII. 2016, A&A, 594, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  104. Popesso, P., Concas, A., Cresci, G., et al. 2023, MNRAS, 519, 1526 [Google Scholar]
  105. Reeves, A. M. M., & Hudson, M. J. 2024, MNRAS, 527, 2037 [Google Scholar]
  106. Renaud, F., Segovia Otero, Á., & Agertz, O. 2022, MNRAS, 516, 4922 [NASA ADS] [CrossRef] [Google Scholar]
  107. Robotham, A. S. G., Driver, S. P., Davies, L. J. M., et al. 2014, MNRAS, 444, 3986 [NASA ADS] [CrossRef] [Google Scholar]
  108. Rodrigues, M., Puech, M., Flores, H., Hammer, F., & Pirzkal, N. 2018, MNRAS, 475, 5133 [NASA ADS] [CrossRef] [Google Scholar]
  109. Rodríguez Montero, F., Davé, R., Wild, V., Anglés-Alcázar, D., & Narayanan, D. 2019, MNRAS, 490, 2139 [CrossRef] [Google Scholar]
  110. Rodriguez-Gomez, V., Genel, S., Vogelsberger, M., et al. 2015, MNRAS, 449, 49 [Google Scholar]
  111. Rodriguez-Gomez, V., Snyder, G. F., Lotz, J. M., et al. 2019, MNRAS, 483, 4140 [NASA ADS] [CrossRef] [Google Scholar]
  112. Rupke, D. S. N., Kewley, L. J., & Barnes, J. E. 2010, ApJ, 710, L156 [NASA ADS] [CrossRef] [Google Scholar]
  113. Rutherford, T. H., van de Sande, J., Croom, S. M., et al. 2024, MNRAS, 529, 810 [NASA ADS] [CrossRef] [Google Scholar]
  114. Santos, D. J. D., Goto, T., Kim, S. J., et al. 2021, MNRAS, 507, 3070 [NASA ADS] [CrossRef] [Google Scholar]
  115. Schartmann, M., Meisenheimer, K., Camenzind, M., Wolf, S., & Henning, T. 2005, A&A, 437, 861 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  116. Scudder, J. M., Ellison, S. L., Torrey, P., Patton, D. R., & Mendel, J. T. 2012, MNRAS, 426, 549 [NASA ADS] [CrossRef] [Google Scholar]
  117. Shah, E. A., Kartaltepe, J. S., Magagnoli, C. T., et al. 2022, ApJ, 940, 4 [NASA ADS] [CrossRef] [Google Scholar]
  118. Shirley, R., Roehlly, Y., Hurley, P. D., et al. 2019, MNRAS, 490, 634 [Google Scholar]
  119. Silva, A., Marchesini, D., Silverman, J. D., et al. 2021, ApJ, 909, 124 [Google Scholar]
  120. Silverman, J. D., Kampczyk, P., Jahnke, K., et al. 2011, ApJ, 743, 2 [NASA ADS] [CrossRef] [Google Scholar]
  121. Snyder, G. F., Lotz, J., Moody, C., et al. 2015, MNRAS, 451, 4290 [NASA ADS] [CrossRef] [Google Scholar]
  122. Snyder, G. F., Rodriguez-Gomez, V., Lotz, J. M., et al. 2019, MNRAS, 486, 3702 [NASA ADS] [CrossRef] [Google Scholar]
  123. Somerville, R. S., & Davé, R. 2015, ARA&A, 53, 51 [Google Scholar]
  124. Sparre, M., Whittingham, J., Damle, M., et al. 2022, MNRAS, 509, 2720 [Google Scholar]
  125. Speagle, J. S., Steinhardt, C. L., Capak, P. L., & Silverman, J. D. 2014, ApJS, 214, 15 [Google Scholar]
  126. Springel, V., White, S. D. M., Jenkins, A., et al. 2005, Nature, 435, 629 [Google Scholar]
  127. Springel, V., Pakmor, R., Pillepich, A., et al. 2018, MNRAS, 475, 676 [Google Scholar]
  128. Suelves, L. E., Pearson, W. J., & Pollo, A. 2023, A&A, 669, A141 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  129. Taranu, D. S., Dubinski, J., & Yee, H. K. C. 2013, ApJ, 778, 61 [NASA ADS] [CrossRef] [Google Scholar]
  130. Taylor, E. N., Hopkins, A. M., Baldry, I. K., et al. 2011, MNRAS, 418, 1587 [Google Scholar]
  131. Tenenbaum, J. B., de Silva, V., & Langford, J. C. 2000, Science, 290, 2319 [CrossRef] [PubMed] [Google Scholar]
  132. Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Methods, 17, 261 [Google Scholar]
  133. Walmsley, M., Ferguson, A. M. N., Mann, R. G., & Lintott, C. J. 2019, MNRAS, 483, 2968 [NASA ADS] [CrossRef] [Google Scholar]
  134. Wang, L., Pearson, W. J., & Rodriguez-Gomez, V. 2020, A&A, 644, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  135. Weaver, J. R., Davidzon, I., Toft, S., et al. 2023, A&A, 677, A184 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  136. Weinberger, R., Springel, V., Hernquist, L., et al. 2017, MNRAS, 465, 3291 [Google Scholar]
  137. Weston, M. E., McIntosh, D. H., Brodwin, M., et al. 2017, MNRAS, 464, 3882 [NASA ADS] [CrossRef] [Google Scholar]
  138. Wilkinson, S., Ellison, S. L., Bottrell, C., et al. 2024, MNRAS, 528, 5558 [Google Scholar]
  139. Wright, A. H., Robotham, A. S. G., Bourne, N., et al. 2016, MNRAS, 460, 765 [Google Scholar]
  140. Wright, A. H., Hildebrandt, H., Kuijken, K., et al. 2019, A&A, 632, A34 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

Appendix A: CIGALE setup

Here we present our CIGALE setup used in this work in Table A.1.

Table A.1.

Parameters used for the CIGALE model SEDs used to estimate M and SFR. All ages and times are in Myr.

Appendix B: Comparison of SFR and M to GAMA

Here we compare our SFR and M values derived through SED fitting described in Sect. 2.2.3. We match KiDS galaxies to galaxies found in the fourth data release of the Galaxy and Mass Assembly survey (GAMA; Driver et al. 2009, 2022). We match all KiDS galaxies with z < 0.15 to GAMA galaxies that are within 2 arcsec and have redshifts within 0.01. In total, we match to 5212 GAMA galaxies.

For M, we compared our values to the values derived by GAMA using SED fitting to the Lambdar photometry (Taylor et al. 2011; Wright et al. 2016), presenting the distributions in Fig. B.1. We fitted a straight line to the relation between the GAMA log(M/M) and our KiDS log(M/M). We found a slope of 1.05 ± 0.01 and an offset of −0.20 ± 0.01. This suggests that our M are systematically lower than the GAMA M by 0.2 dex. Using our CIGALE setup with the Lambdar data from GAMA, we find a very small offset. Again fitting a straight line to the relation between the GAMA log(M/M) and the log(M/M) found using our CIGALE setup and the GAMA Lambdar photometry, we find a slope of 0.99 ± 0.01 and an offset of −0.03 ± 0.01. This suggests that the offset we observe is a result of the limited data used for fitting: here the 9 KiDS+VIKINGS optical and NIR bands compared to the 21 ultraviolet (UV) to far-infrared (FIR) bands used in GAMA.

thumbnail Fig. B.1.

Our CIGALE derived M for KiDS galaxies as a function of M for matched GAMA galaxies. The colour corresponds to the number density from low (purple) to high (red). The solid red line indicates the one-to-one relation, while the dashed blue line indicates our fitted relation.

For SFR, we compared our values to values derived from GAMA Hα observations using the Kennicutt relation (Kennicutt 1983). We present this comparison in Fig. B.2. Here we fit a straight line with a slope of unity to the galaxies where both log(SFR/Myr−1) are between −3 and 1. We find a systematic offset with our CIGALE derived SFRs being 0.45 ± 0.01 dex below the Hα SFRs. A disagreement was not unexpected as we do not have any UV or FIR data in our SED fitting, both of which improve SFR estimation. Again using our CIGALE setup to estimate the SFR with the GAMA Lambdar photometry, we find a smaller underestimation of −0.13 ± 0.01 dex. This smaller offset again suggests that our underestimation is primarily due to the limited photometry available across the entire KiDS fields.

thumbnail Fig. B.2.

Our CIGALE derived SFR for KiDS galaxies as a function of SFR for matched GAMA galaxies derived from Hα observations. The colour corresponds to the number density from low (purple) to high (red). The solid red line indicates the one-to-one relation, while the dashed blue line indicates our fitted relation with unity slope.

We also compared our KiDS results and the GAMA results for only the merging galaxies and only the non-merging galaxies. Fitting the log(SFR) of the 308 matched merging galaxies with a straight line with unity slope, we find an offset of −0.64±0.04. The 2787 matched non-mergers have an offset of −0.35±0.01. Thus, the mergers have a larger underestimation compared to the non-mergers, assuming the GAMA SFRs are correct. We note that the missing 2117 galaxies are not classified as mergers or non-mergers using our criteria in Sect. 2.2.1.

Both M and SFR used in this work have a systematic underestimation compared to the results from GAMA. However, M is only used to select control galaxies while the SFRs are used in comparison to other SFRs which should alleviate the impact of constant systematic offsets seen here. We see larger underestimation for the merging galaxies compared to the non-merger galaxies, meaning ΔSFR may be underestimated. As a result, we elect to not correct for these offsets in this work.

All Tables

Table 1.

Mean and median predicted merger time errors within predicted merger time bins.

Table 2.

Predicted merger times for KiDS merging galaxies

Table A.1.

Parameters used for the CIGALE model SEDs used to estimate M and SFR. All ages and times are in Myr.

All Figures

thumbnail Fig. 1.

Randomly selected example IllustrisTNG image after processing to appear like a KiDS image. The target galaxy is in the centre of the image. The left column shows the u, g, r, and i band images, as labelled, with arcsinh scaling applied twice. The right column shows the associated segmentation maps for each band, with each colour representing a different segment. Each panel is 25.6 × 25.6 arcsec (128 × 128 pixels), corresponding to a physical size of 53 × 53 kpc at the redshift of the IllustrisTNG galaxy (z = 0.11).

In the text
thumbnail Fig. 2.

Randomly selected example KiDS image of a merging galaxy. The target galaxy is in the centre of the image. The left column shows the u, g, r, and i band images, as labelled, with arcsinh scaling applied twice. The right column shows the associated segmentation maps for each band, with each colour representing a different segment. Each panel is 25.6 × 25.6 arcsec (128 × 128 pixels), corresponding to a physical size of 44 × 44 kpc at the redshift of the galaxy (z = 0.09).

In the text
thumbnail Fig. 3.

Predicted merger times of the test data against the true merger times. The colour corresponds to the number density from low (purple) to high (red). The red line indicates a one-to-one relation and the MSE for the normalised times is shown in the bottom right. Dark brown diamonds show the median true merger times and median predicted merger times in predicted merger time bins with widths of 200 Myr; x errors are the median absolute deviation of the true times in the bin and y errors are the median absolute deviation of the predicted times.

In the text
thumbnail Fig. 4.

Distribution of predicted merger times for KiDS merging galaxies. Most of the KiDS mergers are pre-mergers, with merger times less than 0 Myr.

In the text
thumbnail Fig. 5.

Sixteen KiDS merging galaxies with their associated predicted merger times (time after a merger). The shown images are r-band images with arcsinh scaling applied twice. The galaxy in the centre of the image is the galaxy whose merger time is shown. Each panel is 25.6 × 25.6 arcsec (128 × 128 pixels), corresponding to a physical size of 53 × 53 kpc at the median redshift of the KiDS merger sample (z = 0.11).

In the text
thumbnail Fig. 6.

ΔSFR as a function of merger time. The green circles show the μ from the Gaussian fitting of the mergers. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty (y axis). The colour corresponds to the number density from low (purple) to high (red). The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie. The yellow horizontal line indicates ΔSFR = 0.

In the text
thumbnail Fig. 7.

Upper panel: Example of Gaussian fitting for the −100 Myr ≤ merger time < 0 Myr bin. Lower panel: Gaussian fitting for all five merger time bins where the fit succeeded. The Gaussian distributions for the bins centred on −400 Myr (solid purple), −200 Myr (dotted dark blue), 0 Myr (solid light blue), 200 Myr (dash-dotted green), and 400 Myr (dashed red) are shown.

In the text
thumbnail Fig. 8.

ΔSFR as a function of merger time for mergers binned by M. The purple circles, blue triangles, and green squares show the μ from the Gaussian fitting of the binned merger times with masses of 9.0 ≤ log(M/M) < 9.5, 9.5 ≤ log(M/M) < 10.0, log(M/M)≥10.0, respectively. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty (y axis). Empty markers show the results using re-sampled merger times (see text), and the y axis errors are the standard deviation of ΔSFR from these resamplings. The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie.

In the text
thumbnail Fig. 9.

ΔSFR as a function of merger time for mergers binned by local density. The purple circles, blue triangles, and green squares show the μ from the Gaussian fitting of the binned merger times with 0.00 ≤ Σ10 < 0.04, 0.04 ≤ Σ10 < 0.08, and Σ10 ≥ 0.08, respectively. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty (y axis). Empty markers show the results using re-sampled merger times (see text), and the y axis errors are the standard deviation of ΔSFR from these resamplings. The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie.

In the text
thumbnail Fig. 10.

SFR as a function of M for KiDS galaxies. Panel a shows the position of the merger sample, panel b shows the controls, and panel c shows our entire sample of KiDS galaxies. The mergers lie on the MS, with a small number below.

In the text
thumbnail Fig. 11.

SFR enhancement as a function of merger time for the control galaxies. The green circles show the μ from the Gaussian fitting of the mergers. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty. The colour corresponds to the number density from low (purple) to high (red). The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie. The yellow horizontal line indicates ΔSFRctrl = 0

In the text
thumbnail Fig. 12.

ΔSFR as a function of merger time. The green circles and orange pluses show the μ from the Gaussian fitting of the mergers using the CNN merger times and conv merger times, respectively. The errors on these points are the standard deviation of the merger times in the merger time bin (x axis) and the error in μ from the Gaussian fitting combined with bootstrapped ΔSFR uncertainty, for CNN merger time, or the standard deviation of ΔSFR, for conv (y axis). The grey lines show ΔSFR from the merger time re-samplings used to determine conv. Errors for these lines are omitted for clarity. The light shaded region indicates where pre-mergers should lie and the dark shaded region indicates where post-mergers should lie.

In the text
thumbnail Fig. B.1.

Our CIGALE derived M for KiDS galaxies as a function of M for matched GAMA galaxies. The colour corresponds to the number density from low (purple) to high (red). The solid red line indicates the one-to-one relation, while the dashed blue line indicates our fitted relation.

In the text
thumbnail Fig. B.2.

Our CIGALE derived SFR for KiDS galaxies as a function of SFR for matched GAMA galaxies derived from Hα observations. The colour corresponds to the number density from low (purple) to high (red). The solid red line indicates the one-to-one relation, while the dashed blue line indicates our fitted relation with unity slope.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.