Open Access
Issue
A&A
Volume 702, October 2025
Article Number A270
Number of page(s) 19
Section Extragalactic astronomy
DOI https://doi.org/10.1051/0004-6361/202555829
Published online 29 October 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

Massive quiescent galaxies represent crucial tests of our understanding of early galaxy formation and evolutionary mechanisms. Simply ‘counting’ their number within a given volume (i.e. computing their cosmic number densities) provides strong constraints on the strength and effect of feedback processes on galaxy evolution (e.g. Somerville & Davé 2015; Beckmann et al. 2017; Merlin et al. 2019; Donnari et al. 2021a,b; Baker et al. 2025a; De Lucia et al. 2024; Lagos et al. 2025). These massive quiescent galaxies are the descendents of earlier star-forming galaxies that have undergone the process of quenching, after which galaxies cease forming new stars (at least in any significant number; for a review see Man & Belli 2018; Curtis-Lake et al. 2023; De Lucia et al. 2025).

Many possible quenching mechanisms have been proposed, including stellar feedback (Larson 1974; Dekel & Silk 1986; Veilleux et al. 2005), AGN feedback (Silk & Rees 1998; Croton et al. 2006; Fabian 2012), starvation (Peng et al. 2015; Trussler et al. 2020; Baker et al. 2024), morphological quenching (Martig et al. 2009), cosmic rays (Ruszkowski & Pfrommer 2023), environmental quenching (Alberts et al. 2024), and many more (Man & Belli 2018). It can be helpful to split quenching mechanisms based on the stellar masses of the galaxies in question as it is believed that different mechanisms are responsible for various galaxy subpopulations. This becomes even more important when investigating high-redshift galaxy quenching as it necessarily places more constraints on both the masses of galaxies and the timescales for quenching because of the younger age of the Universe at these redshifts.

At least more locally, galaxy quenching has been shown to be roughly split into two categories based on stellar mass and environment (Peng et al. 2010), mostly corresponding to internal versus external quenching mechanisms (which is also seen in simulations, e.g. Lim et al. 2025). The external mechanisms (under the broad name of environmental quenching) correspond to the region in which the galaxy is present, with more overdense regions containing a greater proportion of passive galaxies (Dressler 1980). This is traditionally thought to affect lower-mass galaxies ( M < 10 10 M $ M_\star < 10^{10}\,\rm{M_\odot} $) more than high-mass galaxies ( M > 10 10 M $ M_\star > 10^{10}\,\rm{M_\odot} $) due to their shallower gravitational potentials, making them less able to hold on to their gas reservoirs, whilst dealing with processes such as ram-pressure stripping or harassment (Gunn & Gott 1972; Peng et al. 2010). For more massive galaxies, generally internal quenching mechanisms are thought to be the most important, especially in cosmological simulations (see for a review see Somerville & Davé 2015; Man & Belli 2018; Vogelsberger et al. 2020, but mergers and environment can still play a role in certain cases).

Prior to JWST, spectroscopically confirmed massive quiescent galaxies were limited to a handful of studies, with the redshift frontiers being continually pushed higher (e.g. Dunlop et al. 1996; Cimatti et al. 2004; Daddi et al. 2005; Glazebrook et al. 2017; Belli et al. 2019; Valentino et al. 2020). However, since the advent of the JWST we have been finding many more high-z massive quiescent galaxies than were previously found, and at even higher redshifts (e.g. Carnall et al. 2024; Nanayakkara et al. 2025, 2024; Baker et al. 2025a; de Graaff et al. 2025a; Weibel et al. 2025; Wu 2025). This has resulted in high number densities that disagree with many aspects of theory (Schreiber et al. 2018; Merlin et al. 2019; Carnall et al. 2023a; Valentino et al. 2023; Baker et al. 2025a). This was shown in some of the first data taken with JWST (e.g. Carnall et al. 2023a; Long et al. 2024; Russell et al. 2024; Alberts et al. 2024), but, due to the small regions probed, cosmic variance was shown to be an increasingly important factor (Steinhardt et al. 2021; Valentino et al. 2023). Compared with theory, it is also important to take into account the effects of cosmic variance within cosmological simulations, as in Baker et al. (2025a), in which they found that although this affects number densities, there is still a 3σ disagreement at z>3. The observational overabundance remains even when using spectroscopy to better ensure the fidelity of the quiescent galaxy sample (Baker et al. 2025a). This has confirmed that we are observing more of these high-z quiescent galaxies than we find in our best cosmological simulations (Valentino et al. 2023; Lagos et al. 2025; Baker et al. 2025a), i.e. more massive quiescent galaxies exist at high z than are predicted by our current models and theory.

If we classify high-z massive quiescent galaxies as those with stellar masses above M > 10 9.5 M $ \rm M_\star > 10^{9.5}\,M_\odot $ 1, then when comparing to cosmological hydrodynamical simulations and semi-analytic models (SAMs), another key aspect to probe is not only the number densities, but also the number of quiescent galaxies of a given stellar mass within a given area. This gives the well-studied stellar mass function (SMF, e.g. Grazian et al. 2015; Davidzon et al. 2017; Leja et al. 2020; Weaver et al. 2023; Weibel et al. 2024), only for high-z quiescent galaxies. Stellar mass functions are crucial for understanding many aspects of galaxy evolution, particularly the separation into separate star-forming and quiescent galaxy subpopulations. However, high-z exploration of the SMFs has previously been conducted based solely on colour selection (e.g. Weaver et al. 2023), rather than full Bayesian spectral energy distribution (SED) modelling. As has previously been shown in multiple works from both observations and simulations (Antwi-Danso et al. 2023; Lovell et al. 2023; Baker et al. 2025a), this is likely to miss a significant number of quiescent galaxies at high-z.

Generally cosmological hydrodynamical simulations and SAMs are designed to reproduce the SMF at low redshifts, but at high-z predictions deviate substantially for massive quiescent galaxies (Lagos et al. 2025) compared to previous observations (Weaver et al. 2023). The different predictions for high-z number densities and SMFs from various cosmological simulations and SAMs were explored in depth in Lagos et al. (2025). They showed that there were significant differences between the observations and simulations, as well as significant differences between the simulations and the SAMs themselves. This opens up an exciting new avenue for testing our understanding of galaxy evolutionary physics using the abundances and masses of high-z quiescent galaxies.

However, on the observational side, we have yet to fully exploit the wide area of publicly available JWST imaging, the largest area explored so far being 145 arcmin2 in Valentino et al. (2023). Currently we have one spectroscopically confirmed quiescent galaxy above z = 5 (Weibel et al. 2025; Valentino et al. 2025) and a handful above z = 4 (Tanaka et al. 2019; Carnall et al. 2023b, 2024; Kakimoto et al. 2024; Urbano Stawinski et al. 2024; de Graaff et al. 2025a; Baker et al. 2025a; Barrufet et al. 2025; Wu 2025; Nanayakkara et al. 2025; Antwi-Danso et al. 2025). To properly compute number densities, we require a comprehensive understanding of the area probed, which is complex in the case of spectroscopy due to target selection. For this we require a photometric approach that also brings the benefit of greatly enhanced sample sizes, albeit without the exquisite detail of a spectrum.

In this paper, we explore a comprehensive photometric sample of z >  2 quiescent galaxies. We compute their cosmological number densities, SMFs, and report high-z candidates. Our combined area consists of 816 arcmin2 of publicly available JWST imaging, which is more than 5× larger than Valentino et al. (2023).

2. Data

As we want to compute number densities and SMFs, it is crucial that we probe a large area of JWST imaging. To do this, we use a combination of publicly available JWST fields. These correspond to the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS, Grogin et al. 2011; Koekemoer et al. 2011) fields. We use the Cosmic Epochs Early Release Survey (CEERS, DD-ERS 1345 Finkelstein et al. 2025), Public Release IMaging for Extragalactic Research (PRIMER, GO 1837 Dunlop et al. 2021; Donnan et al. 2024), and the JWST Advanced Deep Extragalactic Survey (JADES, GTO 1180, 1181, 1210, 1287, 1215 Eisenstein et al. 2023a; Rieke et al. 2023). This consists of multiband imaging in EGS, UDS, COSMOS, GOODS-S and GOODS-N (Giavalisco et al. 2004; Grogin et al. 2011; Koekemoer et al. 2011). The idea behind this is that we are more robust to cosmic variance using a combination of different fields from different regions of the sky. This culminates in a combined total area of 816.9 arcmin 2 $ \rm 816.9\;arcmin^2 $.

We use the photometric data from the DAWN JWST Archive (DJA Valentino et al. 2023; Heintz et al. 2025)2 a publicly available repository of reduced JWST imaging and spectroscopy. The photometric data were processed with the GRIZLI (Brammer 2023) code. See Valentino et al. (2023) for more details about the data reduction and photometric extraction process.

For our colour selection with EAZY (Brammer et al. 2008) we use all available photometric data including HST, NIRCam, and MIRI data. For SED modelling with BAGPIPES (Carnall et al. 2018) we use all available JWST data (both NIRCam and, where available, MIRI). We use the corrected second aperture flux values, corresponding to an aperture of 0.7 $ {\prime \prime } $ in diameter. These follow the standard DJA format (see Valentino et al. 2023, for details) hence are not PSF-matched, but have been shown to provide accurate stellar population properties for high-z quiescent galaxies in Ito et al. (2025a). In this work, we use a Kroupa (2001) IMF3 and Planck Collaboration VI (2020) cosmology throughout.

3. Methods and sample selection

Our selection criteria consists of three steps. Step 1 is an expanded UVJ colour selection via the use of the template fitting code EAZY (Brammer et al. 2008). Step 2 is a specific star-formation rate (sSFR) cut after more detailed SED modelling with BAGPIPES (Carnall et al. 2018, 2019a). Finally, step 3 is a visual inspection of the SED fits in order to remove poorly fitted galaxies or clear contaminants. In addition, for z >  5 targets we also implement a brown dwarf (BD) removal cut based upon colour-colour selection (Langeroodi & Hjorth 2023; Kokorev et al. 2024).

3.1. Template fitting with EAZY

We started by using the template fitting code EAZY (Brammer et al. 2008) to compute redshifts alongside rest-frame UVJ colours for all the sources within our fields. We used the standard templates included in the DJA analysis pipeline (Valentino et al. 2023), alongside the standard eazy parameters. EAZY does template fitting across a redshift grid by finding the best-fit linear combination of templates at each redshift, then from the χ2s of the best-fit SEDs, it finds the best-fit redshift. It then extracts rest-frame colours and masses from the best-fit SEDs. We selected galaxies with a photometric redshift of z >  2 and with a stellar mass of M >  109.3M. We adopt this lower mass cut to account for EAZY potentially underestimating stellar masses.

For our initial colour selection, we used the criteria of Baker et al. (2025b). This selection recovers many high-redshift quiescent galaxies that fall outside a standard colour selection box (such as Schreiber et al. 2018). Our selection corresponds to the UVJ colours

U V > 1.30 ( V J ) 0.20 . $$ \begin{aligned} U-V\ > \ 1.30 \ (V-J) - 0.20. \end{aligned} $$(1)

This has been shown to fully recover 18 massive high-z spectroscopically confirmed quiescent galaxies (Baker et al. 2025a). We reiterate that our key cut will be based on sSFR which will remove any extra contaminants introduced by this more relaxed approach. This cut reduces our sample to 15 560 galaxies.

3.2. SED modelling with BAGPIPES

To model our candidate quiescent galaxies we employed the Bayesian SED modelling code BAGPIPES (Carnall et al. 2018, 2019a). See Appendix A for the exact details of our set-up.

To select quiescent galaxies from those fit we utilise a traditional cut based on the sSFR (e.g. Franx et al. 2008; Gallazzi et al. 2014; Schreiber et al. 2018; Carnall et al. 2023a) that corresponds to

sSFR 0.2 / t age , $$ \begin{aligned} \mathrm {sSFR}\ \le 0.2/t_{age}, \end{aligned} $$(2)

where t age $ \rm t_{age} $ is the age of the Universe at the observed redshift. In addition, as we are after massive quiescent galaxies, we select those with M >  109.5M (except for our analysis of the SMFs in Sect. 5). This cut reduces our sample from 15 560 galaxies to 882 galaxies.

3.3. Visual inspection and brown dwarf cut

For candidate z >  5 quiescent galaxies with JWST, BDs become interlopers (e.g. Kauffmann et al. 2020). To deal with this problem, we used the selection criteria of Langeroodi & Hjorth (2023) based on BD templates. This corresponds to a cut of F200W−F150W > 0.20. We combined this with a cut from Kokorev et al. (2024) that was based on selecting for the so called ‘little red dots’ (LRDs, e.g. Matthee et al. 2024, and many more works). There is still much debate on the nature of LRDs as to whether they are AGN (e.g. Inayoshi & Maiolino 2025) or quiescent galaxies (e.g. Baggen et al. 2024).

In light of this, we settled for removing galaxies with clear V-shaped SEDs via visual inspection as possible LRD or BD contaminants. Most high-z quiescent galaxies (e.g. those seen in Weibel et al. 2024; de Graaff et al. 2025a; Baker et al. 2025a; Nanayakkara et al. 2025) show blue slopes redwards of the Balmer break so the removal of V-shaped SEDs should not affect our selection of these types of massive quiescent galaxies. We also remove candidate quiescent galaxies with insufficient filter coverage to properly probe the Balmer break, as we cannot be certain of those galaxies quiescent nature (this is, however, a negligible number of galaxies)4. These cuts reduce our sample from 882 galaxies to 743 galaxies.

Our final full sample consists of 743 massive quiescent galaxies with M >  109.5M and with EAZY photometric redshifts of z >  2. Fig. 1 shows the distribution of our final sample in UVJ colour space, colour-coded by redshift. We can see that our quiescent galaxy sample primarily occupies the traditional criteria, but with significant numbers within the new Baker et al. (2025a) criteria. These would have been missed by strict UVJ cuts, but have been selected by our new colour selection with confirmation from the sSFR cut. We explore F356W magnitude versus stellar mass binned in redshift for the full sample in Appendix B.

thumbnail Fig. 1.

Upper panel: U–V and V–J colour selection colour-coded by redshift for the massive quiescent galaxies in our full sample. The black line corresponds to the Schreiber et al. (2015) quenching criterion with the grey line addition of the Belli et al. (2019) fast quenching criterion. The dotted red line corresponds to the selection criteria from Baker et al. (2025a) which we use as our initial selection criteria in this work.

4. Number densities

The first aspect to explore for the full sample of massive quiescent galaxies is their number densities. In Fig. 2 we show the massive (M >  109.5 M) quiescent galaxy number density versus redshift for our study (red markers) and a compilation of other equivalent studies from the literature. The number density was computed as the number of massive quiescent galaxies within a given redshift range divided by the total co-moving volume within that redshift range. The area of each field was computed using the total area of the F444W mosaic for each field, as this is used in computing the detection image in the DJA pipeline (Valentino et al. 2023). The volume was then computed assuming

thumbnail Fig. 2.

Quiescent galaxy number densities of those with M >  109.5 M from our sample (red points). We compare to Carnall et al. (2023a) green points, Valentino et al. (2023) blue points, and the single galaxy from Weibel et al. (2025) as the grey point. We offset the points ever so slightly in redshift for the Valentino et al. (2023) and Carnall et al. (2023a) number densities so as to make their errors visible within the figure.

V = Ω 3 × ( d 1 3 d 2 3 ) . $$ \begin{aligned} \mathrm V=\frac{\Omega }{3}\times (d_1^3-d_2^3). \end{aligned} $$(3)

For more details on this method see Baker et al. (2025a) where it is described in detail. As can be seen in Fig. B.1, even our faintest massive quiescent galaxy is significantly brighter than our shallowest survey limiting magnitude5 so we do not have to worry about Malmquist bias due to our mass cut of M > 10 9.5 M $ \mathrm{M}_\star > \rm 10^{9.5}\,M_\odot $.

Our errors for the number densities are a combination of three different causes. First we bootstrapped the number density calculation for variations in the EAZY redshifts. This enables galaxies to shift redshift bin. This has an almost negligible effect on our number densities. Secondly, we incorporated the effects of cosmic variance. We followed the prescription of Jespersen et al. (2025a), calibrating the cosmic variance based on a combination of the cosmic variance calculator by Moster et al. (2011) and added constraints from the UniverseMachine simulations (Behroozi et al. 2019) at the highest redshifts. This directly incorporates realistic effects of scatter due to assembly bias (Jespersen et al. 2022; Chuang et al. 2024). This has strong effects on individual fields, but it is less of an issue here due to the combination of separate sightlines and our (comparatively) large area. Thirdly, we included a Poisson counting error, which became the main source of error in the high-z bins. All of these errors are combined in quadrature to produce our overall errors. We follow the same approach when computing errors throughout this work.

The results of this are shown in Fig. 2. We see that we obtain a rapid decline with redshift as would initially be expected. We limit the amount of comparisons here as we wish to (at least) match the exact mass cut used in the literature (see Fig. 3 for many more comparisons to observations and simulations). The most comparable observational study is that of Valentino et al. (2023) as they also select massive quiescent galaxies with M >  109.5 M enabling a more like-for-like comparison. Valentino et al. (2023) had a field size of 145 arcmin2, whereas in this work we have an area of 816 arcmin2 which is approximately 5× the size. This gives us significantly greater number statistics. They used UVJ selection, but included everything within 1σ, in effect expanding the criteria.

thumbnail Fig. 3.

Quiescent galaxy number densities for M >  1010 M from our sample (red points). Upper panel: Comparison to other observational studies, Carnall et al. (2023a), green points, a spectroscopically corrected sample, Baker et al. (2025a), orange points, a spectroscopic sample, Nanayakkara et al. (2025), light green point, and the Weibel et al. (2025) single galaxy point. Middle panel: Comparison to five cosmological hydrodynamical simulations, EAGLE, ILLUSTRIS-TNG100, SIMBA, FLAMINGO, and MAGNETICUM (Lagos et al. 2025; Baker et al. 2025a; Remus & Kimmig 2025) and two SAMs, SHARK and GALFORM. Bottom panel: Four cosmological hydrodynamical simulations and two SAMs, but with Gaussian scatter included in the stellar masses and star-formation rates. We see that simulations struggle to reproduce the observed number densities at most redshifts.

Our number densities remain mostly consistent with this work despite our much larger field size and redshift range containing ∼9× as many quiescent galaxies as in that work. This shows the benefit of selecting multiple different fields in trying to deal with the effects of cosmic variance. We also show the results of Carnall et al. (2023a), which shows an excess in the higher redshift bin due to an overdensity of massive quiescent galaxies in CEERS (Jin et al. 2024; Valentino et al. 2023).

We also compare the number density obtained in Weibel et al. (2025). This is based on a single z = 7.3 massive quiescent galaxy obtained as part of the RUBIES program (de Graaff et al. 2025b)6. A key difficulty for that work is that they found a single galaxy and it is not entirely apparent what volume should be considered for it. We find a number density slightly lower than was found in Weibel et al. (2025) but it remains consistent with our M >  109.5M result.

4.1. Number densities above M >  1010 M

Due to the number of simulations with limited resolution, and because at lower redshift it is often used as a benchmark, it is also useful to explore the number density of massive quiescent galaxies with M >  1010 M. This naturally reduces our sample size to 614 massive quiescent galaxies. We explore their number densities in Fig. 3. We compare them to a selection of other comparable observations and simulations.

Fig. 3, upper panel, shows our observed number density versus several comparable studies. Observationally, due to the large volume area probed by our study, we are able to use more finely grained binning enabling a greater understanding of the evolution of the number density. Comparing with the literature, there are few JWST-era studies on number densities in fields with large volumes, and most are based on single fields that are likely to be significantly affected by cosmic variance (e.g. Steinhardt et al. 2021; Carnall et al. 2023a; Long et al. 2024; Valentino et al. 2023; Jespersen et al. 2025a). In spite of this, we find good agreement with the results of Baker et al. (2025a), which was based on a small survey area of GOODS-S (also included as a field within this work) but with a correction to the photometric number densities based on their spectroscopic sample. They used a traditional UVJ selected photometric sample (selected via Ji et al. 2024) as the baseline, but rescaled the quiescent galaxy number densities based on the confirmation/accuracy rate when compared to a spectroscopic sample, which itself was selected primarily by an evolving sSFR cut. We find significantly more massive quiescent galaxies than those found by Nanayakkara et al. (2025), but their result was based on spectroscopy in a smaller area. Again we see the effect of the overdensity in CEERS on the Carnall et al. (2023a) number densities which are significantly higher than ours at z = 4 − 5 (but based on only a handful of galaxies in a small area). As shown in the previous section, we again find a number density lower than was found in Weibel et al. (2025) for our M >  1010M number densities. Our observed number densities and errors are included in Table 1.

Table 1.

Number densities.

One of the key benefits of the M >  1010M selection is we can more cleanly compare with simulations and SAMs, as most predictions presented in the literature adopt that mass threshold. We first compare to the FLAMINGO simulations (Schaye et al. 2023; Kugel et al. 2023) using the number densities computed in Baker et al. (2025a). These FLAMINGO number densities from Baker et al. (2025a) were computed from the higher resolution 1 Gpc3 FLAMINGO simulation box based on the same evolving sSFR criteria adopted in this work (e.g. Eq. (2)). This enabled Baker et al. (2025a) to calculate the effect of cosmic variance on the simulations to up to 3σ. We do not include the errors from that work here, as they were computed to match the observed area within that work. However, we do include the number density from the simulation as the blue line in the middle panel of Fig. 3. We see that we obtain significantly higher number densities than are found in FLAMINGO at all redshifts probed, which is consistent with the findings in Baker et al. (2025a).

The next stage is to compare with other cosmological simulations with different set-ups, feedback prescriptions, box-sizes, and sub-grid physics. We use the results of Lagos et al. (2025), which consists of three cosmological hydrodynamical simulations and two SAMs7. The cosmological simulations are EAGLE (Schaye et al. 2015; Crain et al. 2015), ILLUSTRIS-TNG100 (Pillepich et al. 2018; Springel et al. 2018), and SIMBA (Davé et al. 2019). The SAMs consist of SHARK (Lagos et al. 2018, 2024) and GALFORM (Lacey et al. 2016). For full details on the simulations, see Lagos et al. (2025). Here, we modified the default sSFR selection in Lagos et al. (2025) – based on fixed sSFR thresholds – to match our evolving time-dependent selection of Eq. (2).

The comparison is shown in the middle panel of Fig. 3, where the simulations are denoted by dashed lines. Upper limits from the simulations are not plotted, and hence some tracks end abruptly (these can be envisaged as horizontal lines from the end of each track).

As was shown in Lagos et al. (2025), ILLUSTRIS-TNG100 (Pillepich et al. 2018; Springel et al. 2018) shows a significant lack of massive quiescent galaxies above z = 4 (in fact there are none in the ( 100 Mpc ) 3 $ (100\,\rm Mpc)^3 $ volume, hence it becomes an upper limit of around ∼10−6), which totally disagrees with the observations. However, note that ILLUSTRIS-TNG100 does manage to reproduce the number density of massive quiescent galaxies at z = 2 − 3 with M >  1010M. EAGLE fails to match any of our observed number densities at every redshift whilst SIMBA also fails at almost all redshifts. The same is found for SAM GALFORM, although it does reproduce the number densities at z = 2 − 3 (like ILLUSTRIS-TNG100).

On the other hand, the SAM SHARK (Lagos et al. 2018, 2024) appears to agree better with our observed number densities. It remains mostly consistent with our observations up to z ∼ 6 and only substantially starts to deviate at z ≥ 6 which is the most unconstrained observationally. This is likely a result of the AGN feedback prescriptions within SHARK, consisting of a radiative and jet mode feedback (which can be triggered by mergers or disc instabilities, enabling more possible triggers, Lagos et al. 2025).

In addition to the aforementioned simulations, recent studies (e.g. Remus & Kimmig 2025; Kimmig et al. 2025; Chittenden et al. 2025) have shown that the cosmological hydrodynamical simulation MAGNETICUM appears to be able to produce many more high-z quenched galaxies than other simulations. Due to this, we also include a matched comparison in this work, whereby the MAGNETICUM number densities are given by the light blue/turquoise lines in Fig. 3. We see that whilst it is able to produce enough quiescent galaxies at z = 5 − 7, it significantly overproduces them at lower redshifts (e.g. z = 2 − 5), where our constraints are most stringent. This is likely due to the different black hole feeding growth caused by the simulation treating hot and code accretion modes separately, with a boost to the cold mode by around a factor of 100 (Gaspari et al. 2013). This ensures high accretion onto the BH at early times, helping boost quenching above z = 5 − 7, but likely results in there being so much cold gas at z = 2 − 5 that the BH is still fed very efficiently, driving overly strong quenching at these lower redshifts which is in significant disagreement with our observations.

The next stage is to try and introduce realistic observational errors in the simulations to attempt to perform a fair comparison. We follow the approach of Lagos et al. (2025), explained below. Observationally, we have errors on our measured SFRs and stellar masses. These can scatter to both higher or lower values. However, at high-z these will not scatter equally as we have significantly fewer quiescent galaxies than SF galaxies, and similarly fewer massive galaxies than lower-mass ones. This means that by including an extra observational error on SFRs and stellar masses from the simulations, we are likely to increase the observed number densities. In practice, we add Gaussian-distributed errors centred on 0 (meaning we assume no bias in our measurements) with a σ of 0.3 dex to all stellar masses and SFRs. The sSFR selection criteria is then applied to the resulting quantities. This yields more galaxies making the quiescent cut. The number densities computed from this approach are shown in the lower panel of Fig. 3.

We can see that adding random errors to stellar masses and SFRs changes the number densities from simulations significantly, overall leading to an increase. This increase for the most part is still insufficient for most simulations, which remain below our measurements. In the case of SHARK, adding errors leads to better agreement with the observations.

5. Stellar mass functions

In addition to simply counting the number density of massive quiescent galaxies, we can also explore their mass distribution. To do this, we compute the stellar mass function (SMF) in various bins of redshift. A limitation of previous studies using JWST at these redshifts is their respective survey volume; they have not been large enough to compute a SMF for just the quiescent galaxies. Even with our volume size, we cannot go above z = 5, but we can explore the SMF from z = 2 − 5. This enables a direct comparison to cosmological simulations and SAMs, albeit with the usual observational caveats (see Appendix C).

In our case a key advantage over similar SMF works (e.g. McLeod et al. 2021; Weaver et al. 2023), is that our entire sample is based upon a more stringent selection criteria for obtaining massive quiescent galaxies. We also benefit from JWST-based data that enables us to fully probe the rest optical to high sensitivity. This means that we have more flexibility in our selection and, due to fewer targets, we can use Bayesian-based SED modelling and visual inspection to remove contaminants.

Fig. 4 shows the observed SMF for our sample of high-z massive quiescent galaxies. We computed the SMF for ranges of z = 2.0 − 2.5, z = 2.5 − 3.0, z = 3.0 − 3.5, z = 3.5 − 4.0 and finally z = 4.0 − 5.0. We bin the masses in bins from 109.3M <  M <  1011.5M. We reduce the number of mass bins for the highest-z ranges due to a lack of very massive galaxies at those redshifts.

thumbnail Fig. 4.

Upper: observed stellar mass function (SMF, Φ) versus stellar mass for massive quiescent galaxies in bins of redshift. Lower: Fitted SMF (Φ) versus stellar mass for massive quiescent galaxies in bins of redshift. The solid line is our best-fit SMFs including the correction for Eddington bias, the dotted lines exclude the Eddington bias correction. The dotted lines show that we would significantly overestimate the number of most massive quiescent galaxies at z = 2 − 2.5 without the correction.

We see from Fig. 4 that as z increases we obtain fewer massive quiescent galaxies of all masses (as expected from our number densities results), but also that this is more pronounced for masses closest to the turn over in the SMF (i.e. in the range 1010.3M <  M* <  1011M). The least amount of evolution is seen for the highest and lowest mass bins. We find that, as has been seen before (e.g. McLeod et al. 2021; Weaver et al. 2023), there are fewer low-mass and most-massive quiescent galaxies.

In addition, we find a sharp jump in the growth at all masses from z = 4.0 − 5.0 to z = 3.5 − 4.0, mirroring what has been found previously for the overall galaxy population (Weibel et al. 2024) and for the dusty star-forming galaxy sub-population (Gottumukkala et al. 2024). This suggests that during this epoch galaxies could be growing particularly efficiently. We see that the growth in number densities with redshift does appear to have a mass-dependence, with the lower-mass quiescent galaxies experiencing less of an increase with redshift compared to other masses. The next stage is to fit a functional form to the SMF. This will enable a better understanding of its evolution and physics.

5.1. Fitting the SMF

The functional form we used to fit the SMF is the single Schechter function (Schechter 1976). This function has been shown to effectively reproduce SMFs of single galaxy populations (e.g. Davidzon et al. 2017; Adams et al. 2021). The Schechter function is defined as follows:

Φ ( M ) d M = Φ ( M M ) α exp ( M M ) d ( M M ) , $$ \begin{aligned} \Phi (M)\ \mathrm{d}M = \Phi ^*\left(\frac{M}{M^*}\right)^\alpha \exp \left(-\frac{M}{M^*}\right)\mathrm{d}\left(\frac{M}{M^*}\right), \end{aligned} $$(4)

where M $ \rm M^* $ is the characteristic stellar mass that corresponds to the ‘knee’ of the SMF, the point at which the exponential cutoff and powerlaw slope start. Φ* is the normalisation and corresponds to the number density at M $ \rm M^* $, and α is the strength of the powerlaw slope. Fitting the SMF with a common functional form enables easy comparison between different works and their parameters. It also provides a simple model that can be linked to theory.

However, before we fit the functional form, we should explore the relative uncertainties involved in fitting a SMF. This is explored in depth in Appendix C. For now we note that as we are probing massive quiescent galaxies we can detect down to our mass cut. We take into account cosmic variance following the methodology of Jespersen et al. (2025a). Finally, we correct for Eddington (1913) bias via convolving the SMF with a Voigt profile.

In order to fit the SMF we used the code DYNESTY (Speagle 2020), which uses dynamic nested sampling (Skilling 2004; Higson et al. 2019) in order to explore the posterior space for the SMF. This approach better enables degeneracies between various quantities to be explored, whilst also being able to deal with a large number of dimensions within the parameter space. We fit for ϕ*, M $ \rm M^* $ and α with the log likelihood calculated for a simple Gaussian error model (e.g. χ2, following Hogg et al. 2010). We use uniform priors on all parameters.

The resulting best-fit SMFs are shown in Fig. 4. The errors are obtained from the 16th and 84th percentiles of the resulting distribution of the SMFs computed using the posteriors of the three parameters. This has the effect of propagating through the uncertainties on ϕ*, M $ \rm M^* $ and α.

The exact values for the best-fit SMFs are reported in Table 2. These show that the ‘knee’ of the SMF (M*) remains around 1010.3 − 10.5 regardless of redshift, hence it does not show signs of evolution from z = 2 − 5. The ‘knee’ of the SMF is classically used to explain the switch between high and low-mass quenching mechanisms (longside the switch from a single Schechter to double Schechter function required by environmental quenching Peng et al. 2010; Shuntov et al. 2025). The idea is that the powerlaw slope set by α describes primarily the lower mass quenching mechanism, whilst the exponential cutoff describes the higher-mass quenching mechanism (typically thought to be AGN feedback). Therefore, the knee describes the mass range at which this switchover occurs. This suggests that the switch between high and low-mass quenching mechanisms in the quiescent galaxy SMF does not appear to vary significantly with redshift.

Table 2.

Best-fit stellar mass function parameters.

We see an expected decrease in the normalisation of the Schechter function which reiterates our findings with the number densities. We see fewer massive quiescent galaxies of every mass at higher redshift rather than simply seeing a different distribution. However, we do see some tentative evidence for a flattening of the power-law slope α (although this is particularly difficult to constrain). This could suggest that the lower mass quenching mechanism is evolving less with redshift compared to the higher mass mechanism. Alternatively, the greater high-mass build-up with redshift may correspond to growth via major or minor mergers (Puskás et al. 2025); however, recent works have suggested a lack of major mergers among high-z massive quiescent galaxies (Baker et al. 2025a; D’Eugenio et al. 2024; Pascalau et al. 2025; Chittenden et al. 2025).

5.2. Comparison to simulations

The real power of exploring SMFs for quiescent galaxies comes from comparing them to cosmological simulations and SAMs. This is more helpful in our case as we have a larger more robust observational sample that has been selected primarily based on the same sSFR criteria as from the simulations enabling a much more like-for-like comparison.

We compare with the SMFs from EAGLE, ILLUSTRIS-TNG100, SIMBA, SHARK, GALFORM, and MAGNETICUM in various bins from z = 2 − 5. In order to match the observed bins of our observation we average over the available simulation snapshots’ SMF within that bin. The results are shown in the left panel of Fig. 5. Our observed SMF is given by the solid lines and shaded region, whilst the results from the simulations is given by the dashed line. We can see immediately that none of the cosmological simulations or SAMs fully reproduce the observed SMF for the massive quiescent galaxies at any redshift probed in this work. As was found by Lagos et al. (2025) for just z = 3, some simulations and SAMs over produce lower mass quiescent galaxies and under produce higher mass, whilst others fail at a variety of different mass scales.

thumbnail Fig. 5.

Comparison between our observed stellar mass function (solid lines) at z = 2 − 2.5 (upper panel) z = 2.5 − 3 (upper middle panel), z = 3 − 3.5 (lower middle panel) and z = 4 − 5 (lower panel), alongside six cosmological simulations denoted by dashed lines (left hand panels, Lagos et al. 2025). The right hand panel shows the same, but with Gaussian scatter added to the stellar masses and SFRs for the six simulations. We see that the simulations cannot accurately reproduce the observed SMFs.

There is a large disagreement between the simulations themselves on the shapes of the quiescent galaxy SMFs. These shapes are a convolution of the total SMFs in the simulations (which tend to be more power-law-like) with the passive fractions as functions of stellar mass (which rarely look to be smooth functions, e.g. Weaver et al. 2023). This makes them complicated to understand physically. However, a significant cause of the disagreement is likely due to the various feedback implementations within the simulations (Lagos et al. 2025). This shows that even SAMs such as SHARK, which accurately reproduce the observed number densities for M >  1010M, cannot reproduce the observed mass distribution.

In the right-hand panel of Fig. 5 we compare to the simulations with the addition of the Gaussian errors on the observed quantities (stellar mass and SFR). These are denoted with the dot-dash lines. Adding errors has the effect of broadening the SMF, particularly helping to reproduce the higher-mass end, in the cases where the models struggle with producing enough high-mass galaxies, but there are still significant differences with the observations. This is again most apparent at the lower mass end, with EAGLE being the closest to reproducing the SMF there, but largely failing at the high mass end. It is also worth recalling that all these simulations failed to reproduce the observed number densities in Sect. 4.1.

5.3. Field-to-field and colour selection variations

Due to our use of several different fields in different regions of the sky, we can investigate possible field-to-field variations in the SMF. We only compute this from redshift z = 2.0 − 2.5, z = 2.5 − 3.0, and z = 3 − 3.5 in order to ensure we have enough galaxies for our bins. The results of this are shown in Fig. 6. The upper panel shows the SMF computed on individual fields, in this case PRIMER-UDS, PRIMER-COSMOS, CEERS-EGS, and GOODS. Also overplotted is the equivalent SMF from Weaver et al. (2023) 8.

thumbnail Fig. 6.

Upper panels: Field-to-field variations in the SMF with the same selection criteria as previously. We also plot the fitted SMF from Weaver et al. (2023) in blue for their quiescent galaxy sample. Left for z = 2 − 2.5, middle for z = 2.5 − 3.0 and right for z = 3.0 − 3.5. We see significant field-to-field variations underlining the effects of cosmic variance in small field observations. This increases at higher redshifts. Lower panels: Same, but using a stricter UVJ cut to better replicate the selection criteria of previous works McLeod et al. (2021), Weaver et al. (2023). The observed quiescent galaxy SMFs from this strict UVJ selection are denoted by the green line. We find greater consistency with the Weaver et al. (2023) and McLeod et al. (2021) results, highlighting the importance of selection criteria in obtaining as many quiescent galaxies as possible. We plot the SMF for the sample with our main selection criteria as the black line, showing how the largest difference between the selection criteria is for low-mass quiescent galaxies and at higher redshifts.

In Fig. 6, we can immediately see that there is a natural variation in the SMFs for each field, particularly at the low-mass end. CEERS clearly has the greatest number of quiescent galaxies for its area at the low-mass end, which mirrors previous results finding an overabundance of massive quiescent galaxies within this field (Jin et al. 2024; Ito et al. 2025b; Valentino et al. 2023). Meanwhile, GOODS-S varies significantly depending on masses. This is showing that cosmic variance does have an effect, particularly on small survey areas, hence one should take into account multiple different fields with different positions on the sky when computing number densities and SMFs for massive quiescent galaxies.

For the z = 2.5 − 3.0 bin we see much more variation, showing the increasing impact of cosmic variance at higher-z when sources become rarer. This becomes even more pronounced at z = 3.0 − 3.5, where we can see clear differences between the shapes and values of the SMFs between the various fields. It should also be noted that we are probing the redshift ranges that should be least affected by cosmic variance or Poisson counting statistics (i.e. z <  4) and that the effect is only likely to be more significant at higher redshifts (z >  4). Therefore, this provides strong evidence that cosmic variance will play a role for individual fields, as we clearly see this effect at z = 2.5 − 3.5. We also see that for the z = 2 − 2.5 bin in particular, we see many more lower-mass massive quiescent galaxies than were found in Weaver et al. (2023). We now explore what is driving this offset.

In the lower panel of Fig. 6 we show SMFs computed with a much stricter UVJ selection. We used the Schreiber et al. (2015) criteria. Whilst not the same selection criteria as Weaver et al. (2023), it is more strict than our sSFR criteria and should enable any variation to be seen. It is also very close to the UVJ criteria used in McLeod et al. (2021). There is a clear drop in the number of low-mass massive quiescent galaxies seen at z = 2 − 2.5, z = 2.5 − 3 and a clear drop in almost all masses at z = 3 − 3.5. This can be seen by the offset between the SMFs for our full sample (black line) and the purely UVJ selected sample (green line). At z = 2 − 2.5 we become more consistent with the results of McLeod et al. (2021), whilst at higher redshifts we become more consistent with the results of Weaver et al. (2023). This once again highlights the importance of a broad selection criteria (i.e. Bayesian based SED modelling, in addition to colours) in order to accurate uncover quiescent galaxies, particularly at lower masses and/or higher redshifts. This suggests that previous studies (e.g. McLeod et al. 2021; Weaver et al. 2023), are missing significant numbers of lower-mass or higher-z quiescent galaxies through their selections.

5.4. Stellar mass density of massive quiescent galaxies

Another area of interest is exploring the stellar mass density (SMD) of a particular population of galaxies in the Universe (e.g. Muzzin et al. 2013; Madau & Dickinson 2014). This can be computed a number of ways (e.g. from the cosmic star-formation rate density), but in our case this is a straightforward matter of integrating the obtained SMFs at each redshift. We use our fitted form of the SMF with a commonly chosen lower mass limit of M* >  108M 9. We integrate using our best-fit Schechter function for each redshift, extrapolating to lower masses than those explored in our SMFs.

Fig. 7 shows our stellar mass density versus redshift compared to other studies. It shows that we are seeing a steeper evolution in our SMD compared to other studies such as Weaver et al. (2023). We find less mass per unit volume at higher redshifts and more mass at lower z. We note that we fully propagate through the errors from our fitted SMF. The increase in ρ* at z = 2.0 − 2.5 is in keeping with our findings for the number densities and SMFs where we see an overabundance compared to other works. Interestingly, this is not the case at z = 2.5 − 3.0 or z = 3.0 − 3.5, which match the results of Weaver et al. (2023) (as can be seen in Fig. 5). We quantify the strength of the increase we see with a simple parametric model of the form ρ* = ρ0(1 + z)α, where ρ0 is a normalisation constant and α is the power-law index. We find that our best-fit model has ρ 0 = 4.1 × 10 10 M Mpc 3 $ \rm \rho_0=4.1\times 10^{10}\ M_\odot\ Mpc^{-3} $ and α = −7.2 ± 0.3. This means that we are seeing an increase of the form ρ* ∝ (1 + z)−7.2 ± 0.3 from z = 5 to z = 2. This is much higher than has been found in previous works, such as Muzzin et al. (2013), who found ρ* ∝ (1 + z)−4.7 ± 0.4.

thumbnail Fig. 7.

Stellar mass density (SMD) of quiescent galaxies versus redshift based upon our best-fit SMFs (red points) with our best-fit overplotted (red line and shaded region). Also overplotted is a comparison for quiescent galaxies from Weaver et al. (2023), blue points, Santini et al. (2021), grey points, McLeod et al. (2021), light blue points, and theoretical predictions from UNIVERSEMACHINE (Behroozi et al. 2019), dashed black line. We slightly offset the z = 3.25 points for the McLeod et al. (2021) and Weaver et al. (2023) stellar mass densities in order to aid visibility within this figure. In addition, we plot SMDs from SIMBA, SHARK, ILLUSTRIS-TNG100, GALFORM, EAGLE and MAGNETICUM obtained by integrating their SMFs. We see a steeper increase in the observed SMD from z = 5 to z = 2 corresponding to ρ* ∝ (1 + z)−7.2 ± 0.3, highlighting the increasing importance of galaxy quenching over this epoch.

We also compare our results to two other pre-JWST works, Santini et al. (2021) and McLeod et al. (2021). We find reasonable agreement with McLeod et al. (2021), particularly at z = 3.0 − 3.5, but find lower mass per unit volume than Santini et al. (2021) (after correcting for IMF assumptions). We again also compare with cosmological simulations and SAMs. We find that both EAGLE and MAGNETICUM dramatically fail to reproduce the observed SMD, with one severely underestimating and the other severely overestimating. ILLUSTRIS-TNG100 matches it well up to z = 3.5 where it dramatically falls (due to having no high-z massive quiescent galaxies). SHARK overpredicts and GALFORM slightly underpredicts the SMD at z >  3.0.

Interestingly, SIMBA well reproduces the SMD. This is surprising as it failed to reproduce the observed number densities or SMF, but goes to show how difficult it is to accurately reproduce all of these within the same simulation. Clearly the shape of the SMFs in SIMBA is dramatically different to observations, but the area underneath remains the same. We also overplot predictions from UNIVERSEMACHINE (Behroozi et al. 2019; Weaver et al. 2023) which slightly over predicts the SMD at each redshift, i.e. the normalisation is too high, but it matches the shape well.

As is shown by the fit, the increase in the stellar mass density with redshift appears much larger in our work with ρ* ∝ (1 + z)−7.2 ± 0.3. This suggests that galaxy quenching activity was significantly higher in those epochs than previously thought.

6. High-z candidates

Finally, we explore the high-redshift end of our massive quiescent galaxy sample. Fig. 8 shows SEDs and RGB images of our two z >  6 quiescent galaxies as identified by our selection criteria. For more details on our selection criteria for the highest redshift candidates, see Appendix D.

thumbnail Fig. 8.

BAGPIPES SED fits for our two highest redshift candidates (the highest-z of which is confirmed by spectroscopy Weibel et al. 2025). The yellow points are the observed photometry and the grey points correspond to non-detections with S/N < 3. The red-squares correspond to the best-fit model photometry and the red line corresponds to the best-fit model spectrum. Uncertainties in the best-fit model spectrum are denoted by the shaded regions which correspond to the 16th and 84th percentiles of the resulting distribution. An RGB image of the galaxy corresponding to F115W-F200W-F444W is included in the upper right of the figure. Both galaxies have MIRI coverage constraining the NIR part of the SED.

The first key aspect is the recovery of 66283 which has previously been identified and published in Weibel et al. (2025) and is a spectroscopically confirmed quiescent galaxy that lies at z = 7.3. We note that our best fit EAZY photometric redshift is off by around 0.3, but that this is due to the difficulty of constraining a precise redshift from the Balmer break alone. Due to our wide redshift bins for the highest-z massive quiescent galaxies, this does not significantly affect our results. Our other z >  6 target is 30744 at z = 6.1 which shows clear Balmer break in the photometry and has the benefit of a MIRI detection at 770 μm. Our-best fit BAGPIPES model gives it a stellar mass of M* = 109.7M. This makes it a robust candidate that would be worth following up in the future with spectroscopy.

7. Discussion

7.1. Observations

In this work we have explored a large sample of massive quiescent galaxies from z = 2 − 7, investigating their number densities, SMFs and cosmic stellar mass density. Previous JWST studies have reported significantly larger number densities compared to previous works and simulations (e.g. Carnall et al. 2023a; Valentino et al. 2023; Alberts et al. 2024; Long et al. 2024; Baker et al. 2025a). It is complicated to compare like with like due to the number of differences between selection and SED modelling, and how these affect the number density values.

As has been shown previously (e.g. Antwi-Danso et al. 2023; Baker et al. 2025a), and comprehensively in this work, selections based on colours and sSFR give different results, especially at high-z. By extending our colour selection and then using an sSFR selection, we showed that we obtain a larger population of lower-mass and higher-redshift quiescent galaxies than were found in previous works (e.g. McLeod et al. 2021; Weaver et al. 2023). We also showed how galaxies that fall in an expanded region of the UVJ diagram occur at all redshifts (see Fig. 1), so this is not simply a problem at the highest redshifts where we lose J-band coverage. This indicates that previous works based on UVJ selection were missing this population of galaxies, explaining the discrepancy between colour selected and sSFR works (see Fig. 6). In light of this, we highlight the importance of switching from simple colour selection to other selection criteria, such as sSFR cuts, or spectral information (where available) such as the strength of the Dn4000 break.

In addition to this, the actual mass cut for selecting massive quiescent galaxies varies between works with some studies using M > 10 10.6 M $ \mathrm{M}_\star > 10^{10.6}\,\rm M_\odot $ and others M > 10 10.0 M $ \mathrm{M}_\star > 10^{10.0}\,\rm M_\odot $. The specific star-formation rate selection (if used) also varies with some works using a flat cut (e.g. Lagos et al. 2025; Chittenden et al. 2025) or others using a threshold that evolves with redshift (e.g. Schreiber et al. 2018; Carnall et al. 2023a; Baker et al. 2025a; Lim et al. 2025). In this work, we have tried to homogenise as much as possible by providing two different mass cuts, a M >  109.5M cut and an M >  1010M cut (the latter providing more literature comparisons) alongside the evolving sSFR criteria (Franx et al. 2008; Carnall et al. 2018).

Regarding the number densities measured, we find strong agreement with previous works such as Valentino et al. (2023) and Baker et al. (2025a). This is likely due to the expanded quiescent galaxy selection criteria within these works. We find a greater number of massive quiescent galaxies than Nanayakkara et al. (2025), but fewer than Carnall et al. (2023a) found in the CEERS field. At the high-z end, we find roughly the same number density as that reported in Weibel et al. (2025), but again caution that their reported value is based on a single source.

What we are learning from this is the importance in both probing large volumes to combat cosmic variance, but also that we need a combination of different fields which are better able to sample the underlying large-scale structure, making the number densities more robust to the effects of over- or under-densities (Valentino et al. 2023; Jespersen et al. 2025a,b).

This is seen again in the SMFs. We test the effects of cosmic variance and colour selection on our results, finding field-to-field variations (again reiterating the result of Valentino et al. 2023, but this time for the high-z quiescent galaxy SMFs) that become increasingly pronounced at higher z. We also find significant differences based on our selection criteria, showing how just using colour selection can lead to significantly reduced SMFs on the low-mass or highest-z end (compared to an expanded colour selection followed by detailed SED modelling).

The ‘knee’ of the fitted SMFs remains within 1010.3 − 10.5 regardless of redshift, suggesting that the quenching processes are operating consistently at the same mass ranges, i.e. the turnover between quenching regimes is redshift independent. We see tentative signs of upturns at the low-mass end for our observed SMFs, particularly in the individual fields (see Fig. 6, particularly COSMOS at z = 2.5 − 3.0 and the combined SMF in the lower redshift bins). Whilst we caution that we may be affected by bin edge effects and low-number statistics, this is possibly a sign of environmental effects playing a role in quenching up to z ∼ 3. Indeed, more local galaxies with stellar masses 10 9 10 10 M $ 10^9{-}10^{10}\,\rm M_\odot $ would be prime targets for environmental quenching (Peng et al. 2010; Hamadouche et al. 2025). This would also explain why we see more field-to-field variation in the lower-mass end of the SMFs – this is simply due to the varying sizes and environments of the individual fields.

This further motivates combining multiple wide-area JWST surveys with many different filters that probe various fields to better sample the underlying large-scale structure. In addition, shallower, wide-field surveys such as COSMOS-Web (Casey et al. 2023) will be important in understanding variations within particular fields (albeit with a reduction in available bands and depth). In addition to JWST, instruments such as NISP and VIS on EUCLID (Euclid Collaboration: Mellier et al. 2025; Weaver et al. 2025), while significantly more shallow and with fewer bands, will also be vital in studying the effects of cosmic variance on the SMF.

We integrated our SMFs in order to compute the cosmic SMD of massive quiescent galaxies and compare to other observational results (Santini et al. 2021; McLeod et al. 2021; Weaver et al. 2023). We find a sharper rise in the mass density from z = 4 to 2 than has been seen previously (Weaver et al. 2023), highlighting that we see a steep increase in the amount of mass locked up in these quiescent systems in the period up towards cosmic noon. We find that it increases by a factor of around ×60 from z = 4.5 to z = 2, such that ρ* ∝ (1 + z)−7.2 ± 0.3. This increase could be due to a dramatic rise in the amount of galaxies being quenched, perhaps via stronger quenching mechanisms. Another alternative is that it may be a product of there being more massive galaxies available to quench due to the steep rise in the cosmic SFRD within this epoch (Madau & Dickinson 2014).

In accurately determining quiescent galaxies from photometry, increased band coverage is crucial for helping remove interlopers such as BDs and LRDs at the highest redshifts (e.g. above z = 5). Dedicated medium bands surveys among pre-existing fields are incredibly useful for this (e.g. Williams et al. 2023; Eisenstein et al. 2023b; Suess et al. 2024). Greater understanding of the interloper populations will also be vital to better understand and remove them.

7.2. Simulations

Predictions differ wildly between simulations. With a mass cut of M >  1010M we find poor agreement with most cosmological hydrodynamical simulations and SAMs. FLAMINGO (Schaye et al. 2023) and ILLUSTRIS-TNG100 (Pillepich et al. 2018; Springel et al. 2018) perform poorly, with neither of these works having close to enough massive quiescent galaxies, failing to reproduce the observed number densities above z = 3. ILLUSTRIS-TNG100 does, however manage to reproduce the observed number densities at z = 2 − 3. Interestingly, FLAMINGO, SIMBA (Davé et al. 2019), and EAGLE (Schaye et al. 2015; Crain et al. 2015) all fail to produce the number densities at z = 2 − 3, despite this range being the one best-constrained by observations. GALFORM (Lacey et al. 2016) and ILLUSTRIS-TNG100 struggle at higher-z, i.e. above z = 3.5. SHARK (Lagos et al. 2018, 2024) performs better, generally matching the number densities up to z = 4, but deviates from our observations at higher redshifts. MAGNETICUM shows that it is possible to produce many quenched galaxies; however, it massively overproduces them, severely overpredicting the number densities within the redshift range z = 2 − 5 which are the most tightly constrained by observations.

We also explored including errors in the simulations to better mimic observations, finding that for most simulations, while this changes the number densities, it cannot account for the discrepancies seen (although this pushes SHARK to accurately predict the number densities up to z = 6). This highlights the difficulty in obtaining accurate number densities at all redshifts with simulations.

A more in-depth question is whether the simulations can reproduce the observed mass distribution of these massive quiescent galaxies and this is tested by the SMF. We find that none of the simulations appears to reproduce the quiescent galaxy SMF as observed in this work (or in Weaver et al. 2023) at any redshift (although ILLUSTRIS-TNG100 is close at z = 2 − 3). Predictions are again inconsistent with some simulations obtaining enough high-mass quiescent galaxies, but failing at the low-mass end, whilst others can reproduce the low-mass, but fail at the high-mass end. These discrepancies still hold after addingGaussian-distributed errors on simulated quantities (this just has the effect of broadening the simulated SMFs).

We also see that several simulations fail to fully reproduce the SMD of quiescent galaxies across cosmic time. Interestingly, most remain within 2σ of the observations (with the exception of EAGLE and MAGNETICUM), and the dramatic differences seen in the SMF are mostly washed out, with most simulations producing SMDs within 0.3 dex of each other. EAGLE dramatically under-predicts the SMD at all redshifts, whilst MAGNETICUM significantly overpredicts it. ILLUSTRIS-TNG100 fails at z >  3.5 due to a lack of quiescent galaxies. Despite obtaining the closest number densities, SHARK overpredicts the SMD at z >  3 due to its significantly larger population of lower mass quiescent galaxies. SIMBA and GALFORM appear mostly able to reproduce the observed SMD (despite struggling with the number densities and SMFs).

As has previously been reported in Lagos et al. (2025) for z = 3, these findings for the simulations are likely due to the varying feedback prescriptions with them, particularly the onset and strength of AGN feedback. Simulations where it only kicks in above a certain mass range (e.g. ILLUSTRIS-TNG100) struggle to reproduce any of the less massive quiescent galaxies. Meanwhile, other simulations such as EAGLE reproduce the low-mass end of the SMFs well but cannot produce high-mass quiescent galaxies, showing that their high-mass quenching mechanisms need stronger or more efficient feedback. We see from the SMFs that the lower mass quenching mechanisms appear to be much too strong within the SAMs SHARK and GALFORM. SIMBA fails to reproduce the shape of the SMF at either the low or high-mass ends, suggesting both require adjustments.

Resolution effects and box sizes are also important. As has been shown previously in Baker et al. (2025a), simulation based number densities are also subject to cosmic variance because of their inherent small box sizes. Whilst large volume simulations exist they often struggle to match the required small-scale physics to produce enough massive quiescent galaxies (as seen for FLAMINGO in Baker et al. 2025a). This suggests that large-volume simulations are crucial to minimise the effects of cosmic variance even within the simulations.

On the other hand, quenching both massive and low-mass galaxies is a multi-scale problem (Man & Belli 2018; De Lucia et al. 2025) with many of the crucial feedback processes taking place at much smaller scales. This motivates further research into the quenching mechanisms within high resolution simulations. However, this proves to be a big challenge for simulations as they can either aim for a large box (sacrificing resolution and resorting to sub-grid physics) or they can resolve the small-scale physics but only for a handful of galaxies which is too small for number densities or SMFs (for a review of the problem see Vogelsberger et al. 2020). Small-scale simulations also struggle to find massive quiescent galaxies at high z due to their relative rarity within the overall galaxy population. Hence, cosmological simulations also need more complicated feedback prescriptions in order to reproduce observations. Overall, future wide and deep JWST surveys (combined with legacy data) and the next generation of cosmological hydrodynamical simulations and SAMs will start to address these issues.

7.3. Caveats and limitations

Within any work of this kind there are necessarily caveats and limitations. These correspond to both the observational part of the analysis and the simulations. We will focus on the observational part here and refer readers to Lagos et al. (2025) for the simulations. One of our (and many other works) primary caveats is the SED modelling. This takes two forms within our work – the initial EAZY template fitting run and then the more detailed Bayesian BAGPIPES fitting. The usual caveats corresponding to SED modelling apply (e.g. Conroy 2013). We note, however, that as we are exploring massive quiescent galaxies aspects such as outshining (e.g. Sawicki & Yee 1998) should be insignificant in comparison to other works involving star-forming galaxies. The usual caveats about systematic effects apply, however, such as star-formation histories (Leja et al. 2019a; Wang et al. 2023), metallicities, IMFs (e.g. van Dokkum & Conroy 2024) and abundance ratios (Beverage et al. 2025).

Uncertainties on the photo-zs for these type of massive quiescent galaxies are another source of uncertainty but have been shown to be sub-dominant in Ito et al. (2025a), where they cover the lower redshift range of our study, although photo-z uncertainty is more significant for our highest-z candidates. We also note that we fix the BAGPIPES redshift to that of the EAZY photo-z as opposed to allowing it to vary (e.g. Weibel et al. 2024), as it has been shown to be more accurate at recovering spectroscopic redshifts (Weibel et al. 2024; Ito et al. 2025a). This may have the effect of ruling out potential areas of the Bayesian posterior space in BAGPIPES, but, due to the improved accuracy of the EAZY photo-zs, it should primarily rule out less feasible solutions.

Another key aspect is the role of AGN within these galaxies. Recent spectroscopic JWST works have found a number of high-z massive quiescent galaxies show signs of current AGN activity (Bugiani et al. 2025; D’Eugenio et al. 2024; Carnall et al. 2023b; Baker et al. 2025a) which alter their SEDs compared to non-AGN host galaxies. We do not include AGN models within our BAGPIPES SED modelling. This could increase or reduce the number of quiescent galaxies within our sample. Firstly, there is the possibility for interlopers due to extreme emission lines at certain redshifts which could mimic a Balmer break. Specifically in our case this is an issue at the highest redshifts due to the prevalence of LRDs (Matthee et al. 2024) which we do our best to remove via visual inspection. Secondly, solutions involving emissions lines within BAGPIPES will attribute those emission lines to star-formation rather than AGN activity, potentially misclassifying AGN activity in quiescent galaxies as SF, leading to an underestimate. The first of these should be minor for a population study of this size, at least up to z∼5, but is likely to be a bigger problem at high-z. This will require larger spectroscopic surveys to investigate in depth. The second should be lessened by our extended UVJ criteria in the initial EAZY selection.

We also note that we are exploring the integrated properties of these galaxies and on spatially resolved scales individual properties can vary due to differing morphological components, stellar populations, dust properties and other evolutionary processes (e.g. Tacchella et al. 2015; D’Eugenio et al. 2024; Mosleh et al. 2025; Baker et al. 2025c).

To summarise, we have the same uncertainties and caveats as expected for studies involving integrated SED modelling of quiescent galaxies. We do not expect these to be significant within the bulk of the analysis, which is restricted primarily to z = 2 − 5 where we have excellent number statistics. However, above redshift 5 where we are restricted to a handful of candidates, these are likely to be a cause of more uncertainty.

8. Conclusions

We have assembled a robust sample of over 700 massive quiescent galaxies from z = 2 − 7 from a combination of some of the deepest JWST multi-band surveys with an overall area 5× larger than existing JWST studies. We computed quiescent galaxy number densities at redshifts z = 2 − 7 and explored SMFs and the cosmic stellar mass density from z = 2 − 5. We compare with a range of simulations and SAMs with matching selection criteria to provide an accurate comparison to the state-of-the-art galaxy evolutionary modelling. Our key findings are as follows.

  1. We confirm an overabundance of massive quiescent galaxies relative to pre-JWST works and models at high-z.

  2. We find significant numbers of observed massive quiescent galaxies that would be missed by the traditional UVJ colour selection criteria, further motivating cuts based on sSFR.

  3. We find that the cosmological hydrodynamical simulations we explore cannot accurately predict the observed number density of massive quiescent galaxies in the range z = 2 − 7.

  4. Simulations cannot reproduce the observed quiescent galaxy SMF at any redshift from z = 2 − 5. All fail to match the shape and normalisation of the observed SMF. Some models overpredict lower-mass quiescent galaxies (i.e. SHARK, GALFORM) and fail at the high-mass end, whilst others such as ILLUSTRIS-TNG100 match the high-mass end but fail to quench any lower-mass galaxies. This is connected to the various feedback prescriptions in the simulations.

  5. Quiescent galaxy stellar mass functions can vary significantly with differing selection criteria, and we uncover more quiescent galaxies at the lower-mass end than previous studies. This is due to our extended colour selection and our sSFR cut being better able to uncover the lower-mass quiescent galaxy population.

  6. We show the impact of field-to-field variation in SMFs due to cosmic variance and demonstrate how it significantly increases at higher redshift. We also show significant variations when different colour selection criteria are used compared to a sSFR cut.

  7. We find that the stellar mass density of quiescent galaxies evolves more steeply between z = 5 and z = 2 than has been found in previous work, with ρ* ∝ (1 + z)−7.2 ± 0.3, indicating a significant increase in the importance of galaxy quenching within these epochs.

  8. As part of the sample, we report another robust high-z (z >  6) quiescent galaxy candidate.

Our best models for galaxy evolution, as encapsulated by current cutting-edge cosmological simulations, are unable to accurately reproduce the high-z quiescent galaxy population as observed. This presents a wonderful opportunity to improve our knowledge of high-z galaxy evolutionary theory with the next generation of cosmological simulations.

Data Availability

The quiescent galaxy catalogue and observed SMFs are available via the following DOI:https://zenodo.org/records/16942039. The raw JWST data is available via the DAWN JWST Archive (DJA).


1

We note that the mass ranges probed are also important; the JWST has uncovered a population of low-mass (M <  109.0M) “mini” or “rapidly” quenched galaxies at z > 5 (Strait et al. 2023; Looser et al. 2024; Baker et al. 2025b). These galaxies are not standard quiescent galaxies because while they have visible Balmer breaks they also have low masses and blue colours, an indication of very recent quenching t quench 30 Myr $ \rm t_{quench}\leq30\,Myr $.

3

We note that the simulations use a Chabrier (2003) IMF, but that the difference between the two is minor and less than the errors in obtaining stellar masses and SFRs from photometric SED modelling.

4

As an example, we remove galaxies with only F200W, F277W, F356W and F444W, if the break is bluewards of F200W. We also remove any galaxy with fewer than four filters (there are a select number of well-constrained z = 2 galaxies with four filters that properly probe the full NIRCam range). We note that this only applies to the BAGPIPES fitting – we use the full JWST + Hubble filters in the initial EAZY based selection.

5

Which following Donnan et al. (2024) we take the 5σ global depth in F444W to be 27.9 mag in the shallowest region of the Primer-UDS field.

6

We note that this galaxy has been recovered as part of our photometric sample in this work.

7

We restrict the analysis here to the simulations used in Lagos et al. (2025) which make their data publicly available.

8

In this work they used ground-based photometry based on the full COSMOS field, which corresponds to an area of 1.27 square degrees. The advantage of that work is the significantly larger volume than our ∼800 arcmin2. However, the ground-based imaging naturally has reduced depth (approaching only 26 mag in H band) and it lacks the longer wavelength filters we have access to (which enable a better probe of the rest-frame optical). In addition, by using a combination of different fields in different regions on the sky we benefit from multiple different sightlines, helping reduce the effect of cosmic variance compared to a single large field.

9

We test varying this by integrating from 109M, as the simulations only go down to 109M, and find it makes no appreciable difference, as the very low-mass end is sub-dominant.

Acknowledgments

WMB thanks Lucas Kimmig and Rhea-Silvia Remus for providing data from Magneticum Pathfinder for the comparisons, and also for useful discussions. In addition, WMB would like to thank Francesco D’Eugenio and Marko Shuntov for helpful and enlightening discussions. WMB would like to acknowledge support from DARK via the DARK Fellowship. This work was supported by a research grant (VIL54489) from VILLUM FONDEN. Some of the data products presented herein were retrieved from the Dawn JWST Archive (DJA). DJA is an initiative of the Cosmic Dawn Center, which is funded by the Danish National Research Foundation under grant DNRF140. FV and KI acknowledge support from the Independent Research Fund Denmark (DFF) under grant 3120-00043B. This study was supported by JSPS KAKENHI Grant Number JP23K13141. AS is supported by a Villum Experiment grant (VIL69896) and research grant (VIL54489) from VILLUM FONDEN. We acknowledge use of ASTROPY (Astropy Collabration 2013), FSPS (Conroy et al. 2009; Conroy & Gunn 2010), EAZY (Brammer et al. 2008), GRIZLI (Brammer 2023), BAGPIPES (Carnall et al. 2018, 2019, NUMPY (Harris et al. 2020), DYNESTY (Speagle 2020), MATPLOTLIB (Hunter 2007), and TOPCAT (Taylor 2005). This work is based [in part] on observations made with the NASA/ESA/CSA James Webb Space Telescope. The data were obtained from the Mikulski Archive for Space Telescopes at the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-03127 for JWST. These observations are associated with programs #1345, #1837, #1180, #1181, #1210, #1287, and #1215.

References

  1. Adams, N. J., Bowler, R. A. A., Jarvis, M. J., Häußler, B., & Lagos, C. D. P. 2021, MNRAS, 506, 4933 [CrossRef] [Google Scholar]
  2. Alberts, S., Williams, C. C., Helton, J. M., et al. 2024, ApJ, 975, 85 [Google Scholar]
  3. Antwi-Danso, J., Papovich, C., Leja, J., et al. 2023, ApJ, 943, 166 [NASA ADS] [CrossRef] [Google Scholar]
  4. Antwi-Danso, J., Papovich, C., Esdaile, J., et al. 2025, ApJ, 978, 90 [Google Scholar]
  5. Astropy Collabration (Robitaille, T. P., et al.) 2013, A&A, 558, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  6. Baggen, J. F. W., van Dokkum, P., Brammer, G., et al. 2024, ApJ, 977, L13 [NASA ADS] [CrossRef] [Google Scholar]
  7. Baker, W. M., Maiolino, R., Bluck, A. F. L., et al. 2024, MNRAS, 534, 30 [Google Scholar]
  8. Baker, W. M., Lim, S., D’Eugenio, F., et al. 2025a, MNRAS, 539, 557 [Google Scholar]
  9. Baker, W. M., D’Eugenio, F., Maiolino, R., et al. 2025b, A&A, 697, A90 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  10. Baker, W. M., Tacchella, S., Johnson, B. D., et al. 2025c, Nat. Astron., 9, 141 [Google Scholar]
  11. Barrufet, L., Oesch, P. A., Marques-Chaves, R., et al. 2025, MNRAS, 537, 3453 [Google Scholar]
  12. Beckmann, R. S., Devriendt, J., Slyz, A., et al. 2017, MNRAS, 472, 949 [NASA ADS] [CrossRef] [Google Scholar]
  13. Behroozi, P., Wechsler, R. H., Hearin, A. P., & Conroy, C. 2019, MNRAS, 488, 3143 [NASA ADS] [CrossRef] [Google Scholar]
  14. Belli, S., Newman, A. B., & Ellis, R. S. 2019, ApJ, 874, 17 [Google Scholar]
  15. Beverage, A. G., Slob, M., Kriek, M., et al. 2025, ApJ, 979, 249 [Google Scholar]
  16. Brammer, G. 2023, https://doi.org/10.5281/zenodo.8370018 [Google Scholar]
  17. Brammer, G. B., van Dokkum, P. G., & Coppi, P. 2008, ApJ, 686, 1503 [Google Scholar]
  18. Bressan, A., Marigo, P., Girardi, L., et al. 2012, MNRAS, 427, 127 [NASA ADS] [CrossRef] [Google Scholar]
  19. Bruzual, G., & Charlot, S. 2003, MNRAS, 344, 1000 [NASA ADS] [CrossRef] [Google Scholar]
  20. Bugiani, L., Belli, S., Park, M., et al. 2025, ApJ, 981, 25 [NASA ADS] [CrossRef] [Google Scholar]
  21. Byler, N., Dalcanton, J. J., Conroy, C., & Johnson, B. D. 2017, ApJ, 840, 44 [Google Scholar]
  22. Calzetti, D., Armus, L., Bohlin, R. C., et al. 2000, ApJ, 533, 682 [NASA ADS] [CrossRef] [Google Scholar]
  23. Carnall, A. C., McLure, R. J., Dunlop, J. S., & Davé, R. 2018, MNRAS, 480, 4379 [Google Scholar]
  24. Carnall, A. C., McLure, R. J., Dunlop, J. S., et al. 2019a, MNRAS, 490, 417 [Google Scholar]
  25. Carnall, A. C., Leja, J., Johnson, B. D., et al. 2019b, ApJ, 873, 44 [Google Scholar]
  26. Carnall, A. C., McLeod, D. J., McLure, R. J., et al. 2023a, MNRAS, 520, 3974 [NASA ADS] [CrossRef] [Google Scholar]
  27. Carnall, A. C., McLure, R. J., Dunlop, J. S., et al. 2023b, Nature, 619, 716 [NASA ADS] [CrossRef] [Google Scholar]
  28. Carnall, A. C., Cullen, F., McLure, R. J., et al. 2024, MNRAS, 534, 325 [NASA ADS] [CrossRef] [Google Scholar]
  29. Casey, C. M., Kartaltepe, J. S., Drakos, N. E., et al. 2023, ApJ, 954, 31 [NASA ADS] [CrossRef] [Google Scholar]
  30. Chabrier, G. 2003, PASP, 115, 763 [Google Scholar]
  31. Chevallard, J., & Charlot, S. 2016, MNRAS, 462, 1415 [NASA ADS] [CrossRef] [Google Scholar]
  32. Chittenden, H. G., Glazebrook, K., Nanayakkara, T., et al. 2025, ArXiv e-prints [arXiv:2504.19696] [Google Scholar]
  33. Chuang, C.-Y., Jespersen, C. K., Lin, Y.-T., Ho, S., & Genel, S. 2024, ApJ, 965, 101 [Google Scholar]
  34. Cimatti, A., Daddi, E., Renzini, A., et al. 2004, Nature, 430, 184 [NASA ADS] [CrossRef] [Google Scholar]
  35. Conroy, C. 2013, ARA&A, 51, 393 [NASA ADS] [CrossRef] [Google Scholar]
  36. Conroy, C., & Gunn, J. E. 2010, ApJ, 712, 833 [Google Scholar]
  37. Conroy, C., Gunn, J. E., & White, M. 2009, ApJ, 699, 486 [Google Scholar]
  38. Crain, R. A., Schaye, J., Bower, R. G., et al. 2015, MNRAS, 450, 1937 [NASA ADS] [CrossRef] [Google Scholar]
  39. Croton, D. J., Springel, V., White, S. D. M., et al. 2006, MNRAS, 365, 11 [Google Scholar]
  40. Curtis-Lake, E., Bluck, A., d’Eugenio, F., Maiolino, R., & Sijacki, D. 2023, Nat. Astron., 7, 247 [Google Scholar]
  41. Daddi, E., Renzini, A., Pirzkal, N., et al. 2005, ApJ, 626, 680 [NASA ADS] [CrossRef] [Google Scholar]
  42. Davé, R., Anglés-Alcázar, D., Narayanan, D., et al. 2019, MNRAS, 486, 2827 [Google Scholar]
  43. Davidzon, I., Ilbert, O., Laigle, C., et al. 2017, A&A, 605, A70 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  44. de Graaff, A., Setton, D. J., Brammer, G., et al. 2025a, Nat. Astron., 9, 280 [Google Scholar]
  45. de Graaff, A., Brammer, G., Weibel, A., et al. 2025b, A&A, 697, A189 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  46. De Lucia, G., Fontanot, F., Xie, L., & Hirschmann, M. 2024, A&A, 687, A68 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  47. De Lucia, G., Fontanot, F., Hirschmann, M., & Xie, L. 2025, ArXiv e-prints [arXiv:2502.01724] [Google Scholar]
  48. Dekel, A., & Silk, J. 1986, ApJ, 303, 39 [Google Scholar]
  49. D’Eugenio, F., Pérez-González, P. G., Maiolino, R., et al. 2024, Nat. Astron., 8, 1443 [CrossRef] [Google Scholar]
  50. Donnan, C. T., McLure, R. J., Dunlop, J. S., et al. 2024, MNRAS, 533, 3222 [NASA ADS] [CrossRef] [Google Scholar]
  51. Donnari, M., Pillepich, A., Joshi, G. D., et al. 2021a, MNRAS, 500, 4004 [Google Scholar]
  52. Donnari, M., Pillepich, A., Nelson, D., et al. 2021b, MNRAS, 506, 4760 [NASA ADS] [CrossRef] [Google Scholar]
  53. Dressler, A. 1980, ApJ, 236, 351 [Google Scholar]
  54. Dunlop, J., Peacock, J., Spinrad, H., et al. 1996, Nature, 381, 581 [Google Scholar]
  55. Dunlop, J. S., Abraham, R. G., Ashby, M. L. N., et al. 2021, PRIMER: Public Release IMaging for Extragalactic Research, JWST Proposal. Cycle 1, ID. 1837 [Google Scholar]
  56. Eddington, A. S. 1913, MNRAS, 73, 359 [Google Scholar]
  57. Eisenstein, D. J., Willott, C., Alberts, S., et al. 2023a, ArXiv e-prints [arXiv:2306.02465] [Google Scholar]
  58. Eisenstein, D. J., Johnson, B. D., Robertson, B., et al. 2023b, ArXiv e-prints [arXiv:2310.12340] [Google Scholar]
  59. Euclid Collaboration (Mellier, Y., et al.) 2025, A&A, 697, A1 [Google Scholar]
  60. Fabian, A. C. 2012, ARA&A, 50, 455 [Google Scholar]
  61. Falcón-Barroso, J., Sánchez-Blázquez, P., Vazdekis, A., et al. 2011, A&A, 532, A95 [Google Scholar]
  62. Finkelstein, S. L., Bagley, M. B., Arrabal Haro, P., et al. 2025, ApJ, 983, L4 [Google Scholar]
  63. Franx, M., van Dokkum, P. G., Förster Schreiber, N. M., et al. 2008, ApJ, 688, 770 [NASA ADS] [CrossRef] [Google Scholar]
  64. Gallazzi, A., Bell, E. F., Zibetti, S., Brinchmann, J., & Kelson, D. D. 2014, ApJ, 788, 72 [Google Scholar]
  65. Gaspari, M., Ruszkowski, M., & Oh, S. P. 2013, MNRAS, 432, 3401 [NASA ADS] [CrossRef] [Google Scholar]
  66. Giavalisco, M., Ferguson, H. C., Koekemoer, A. M., et al. 2004, ApJ, 600, L93 [NASA ADS] [CrossRef] [Google Scholar]
  67. Glazebrook, K., Schreiber, C., Labbé, I., et al. 2017, Nature, 544, 71 [Google Scholar]
  68. Gottumukkala, R., Barrufet, L., Oesch, P. A., et al. 2024, MNRAS, 530, 966 [NASA ADS] [CrossRef] [Google Scholar]
  69. Grazian, A., Fontana, A., Santini, P., et al. 2015, A&A, 575, A96 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  70. Grogin, N. A., Kocevski, D. D., Faber, S. M., et al. 2011, ApJS, 197, 35 [NASA ADS] [CrossRef] [Google Scholar]
  71. Gunn, J. E., & Gott, J. R., III 1972, ApJ, 176, 1 [Google Scholar]
  72. Hamadouche, M. L., McLure, R. J., Carnall, A. C., et al. 2025, MNRAS, 541, 463 [Google Scholar]
  73. Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357 [NASA ADS] [CrossRef] [Google Scholar]
  74. Heintz, K. E., Brammer, G. B., Watson, D., et al. 2025, A&A, 693, A60 [Google Scholar]
  75. Higson, E., Handley, W., Hobson, M., & Lasenby, A. 2019, Stat. Comput., 29, 891 [Google Scholar]
  76. Hogg, D. W., Bovy, J., & Lang, D. 2010, ArXiv e-prints [arXiv:1008.4686] [Google Scholar]
  77. Hunter, J. D. 2007, Comput. Sci. Eng., 9, 90 [NASA ADS] [CrossRef] [Google Scholar]
  78. Inayoshi, K., & Maiolino, R. 2025, ApJ, 980, L27 [Google Scholar]
  79. Ito, K., Valentino, F., Brammer, G., et al. 2025a, A&A, submitted, ArXiv e-prints [arXiv:2506.22642] [Google Scholar]
  80. Ito, K., Valentino, F., Farcy, M., et al. 2025b, A&A, 697, A111 [Google Scholar]
  81. Jespersen, C. K., Cranmer, M., Melchior, P., et al. 2022, ApJ, 941, 7 [NASA ADS] [CrossRef] [Google Scholar]
  82. Jespersen, C. K., Steinhardt, C. L., Somerville, R. S., & Lovell, C. C. 2025a, ApJ, 982, 23 [Google Scholar]
  83. Jespersen, C. K., Carnall, A. C., & Lovell, C. C. 2025b, ApJ, 988, L19 [Google Scholar]
  84. Jespersen, C.K., Melchior, P., Spergel, D.N., et al. 2025c, ArXiv e-prints [arXiv:2503.03816] [Google Scholar]
  85. Ji, Z., Williams, C.C., Suess, K.A., et al. 2024, ArXiv e-prints [arXiv:2401.00934] [Google Scholar]
  86. Jin, S., Sillassen, N. B., Magdis, G. E., et al. 2024, A&A, 683, L4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  87. Kakimoto, T., Tanaka, M., Onodera, M., et al. 2024, ApJ, 963, 49 [NASA ADS] [CrossRef] [Google Scholar]
  88. Kauffmann, O. B., Le Fèvre, O., Ilbert, O., et al. 2020, A&A, 640, A67 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  89. Kimmig, L. C., Remus, R.-S., Seidel, B., et al. 2025, ApJ, 979, 15 [Google Scholar]
  90. Koekemoer, A. M., Faber, S. M., Ferguson, H. C., et al. 2011, ApJS, 197, 36 [NASA ADS] [CrossRef] [Google Scholar]
  91. Kokorev, V., Caputi, K. I., Greene, J. E., et al. 2024, ApJ, 968, 38 [NASA ADS] [CrossRef] [Google Scholar]
  92. Kroupa, P. 2001, MNRAS, 322, 231 [NASA ADS] [CrossRef] [Google Scholar]
  93. Kugel, R., Schaye, J., Schaller, M., et al. 2023, MNRAS, 526, 6103 [NASA ADS] [CrossRef] [Google Scholar]
  94. Lacey, C. G., Baugh, C. M., Frenk, C. S., et al. 2016, MNRAS, 462, 3854 [Google Scholar]
  95. Lagos, C. d. P., Tobar, R. J., Robotham, A. S. G., et al. 2018, MNRAS, 481, 3573 [CrossRef] [Google Scholar]
  96. Lagos, C. d. P., Bravo, M., Tobar, R., et al. 2024, MNRAS, 531, 3551 [CrossRef] [Google Scholar]
  97. Lagos, C.d.P., Valentino, F., Wright, R.J., et al. 2025, MNRAS, 536, 2324 [Google Scholar]
  98. Langeroodi, D., & Hjorth, J. 2023, ApJ, 957, L27 [NASA ADS] [CrossRef] [Google Scholar]
  99. Larson, R. B. 1974, MNRAS, 169, 229 [NASA ADS] [CrossRef] [Google Scholar]
  100. Leja, J., Carnall, A. C., Johnson, B. D., Conroy, C., & Speagle, J. S. 2019a, ApJ, 876, 3 [Google Scholar]
  101. Leja, J., Johnson, B. D., Conroy, C., et al. 2019b, ApJ, 877, 140 [NASA ADS] [CrossRef] [Google Scholar]
  102. Leja, J., Speagle, J. S., Johnson, B. D., et al. 2020, ApJ, 893, 111 [Google Scholar]
  103. Lim, S., Tacchella, S., Maiolino, R., Schaye, J., & Schaller, M. 2025, ArXiv e-prints [arXiv:2504.02027] [Google Scholar]
  104. Long, A. S., Antwi-Danso, J., Lambrides, E. L., et al. 2024, ApJ, 970, 68 [Google Scholar]
  105. Looser, T. J., D’Eugenio, F., Maiolino, R., et al. 2024, Nature, 629, 53 [Google Scholar]
  106. Lovell, C. C., Roper, W., Vijayan, A. P., et al. 2023, MNRAS, 525, 5520 [NASA ADS] [Google Scholar]
  107. Madau, P., & Dickinson, M. 2014, ARA&A, 52, 415 [Google Scholar]
  108. Man, A., & Belli, S. 2018, Nat. Astron., 2, 695 [NASA ADS] [CrossRef] [Google Scholar]
  109. Marigo, P., Bressan, A., Nanni, A., Girardi, L., & Pumo, M. L. 2013, MNRAS, 434, 488 [Google Scholar]
  110. Martig, M., Bournaud, F., Teyssier, R., & Dekel, A. 2009, ApJ, 707, 250 [Google Scholar]
  111. Matthee, J., Naidu, R. P., Brammer, G., et al. 2024, ApJ, 963, 129 [NASA ADS] [CrossRef] [Google Scholar]
  112. McLeod, D. J., McLure, R. J., Dunlop, J. S., et al. 2021, MNRAS, 503, 4413 [NASA ADS] [CrossRef] [Google Scholar]
  113. Merlin, E., Fortuni, F., Torelli, M., et al. 2019, MNRAS, 490, 3309 [CrossRef] [Google Scholar]
  114. Mosleh, M., Riahi-Zamin, M., & Tacchella, S. 2025, ApJ, 983, 181 [Google Scholar]
  115. Moster, B. P., Somerville, R. S., Newman, J. A., & Rix, H.-W. 2011, ApJ, 731, 113 [Google Scholar]
  116. Muzzin, A., Marchesini, D., Stefanon, M., et al. 2013, ApJ, 777, 18 [NASA ADS] [CrossRef] [Google Scholar]
  117. Nanayakkara, T., Glazebrook, K., Jacobs, C., et al. 2024, Scientific Rep., 14, 3724 [Google Scholar]
  118. Nanayakkara, T., Glazebrook, K., Schreiber, C., et al. 2025, ApJ, 981, 78 [Google Scholar]
  119. Pascalau, R. G., D’Eugenio, F., Tacchella, S., et al. 2025, ArXiv e-prints [arXiv:2505.06349] [Google Scholar]
  120. Peng, Y.-J., Lilly, S. J., Kovač, K., et al. 2010, ApJ, 721, 193 [Google Scholar]
  121. Peng, Y., Maiolino, R., & Cochrane, R. 2015, Nature, 521, 192 [Google Scholar]
  122. Pillepich, A., Springel, V., Nelson, D., et al. 2018, MNRAS, 473, 4077 [Google Scholar]
  123. Planck Collaboration VI. 2020, A&A, 641, A6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  124. Puskás, D., Tacchella, S., Simmonds, C., et al. 2025, MNRAS, 540, 2146 [Google Scholar]
  125. Remus, R.-S., & Kimmig, L. C. 2025, ApJ, 982, 30 [Google Scholar]
  126. Rieke, M. J., Robertson, B., Tacchella, S., et al. 2023, ApJS, 269, 16 [NASA ADS] [CrossRef] [Google Scholar]
  127. Russell, T.A., Dobric, N., Adams, N.J., et al. 2024, ArXiv e-prints [arXiv:2412.11861] [Google Scholar]
  128. Ruszkowski, M., & Pfrommer, C. 2023, A&ARv, 31, 4 [NASA ADS] [CrossRef] [Google Scholar]
  129. Sánchez-Blázquez, P., Peletier, R. F., Jiménez-Vicente, J., et al. 2006, MNRAS, 371, 703 [Google Scholar]
  130. Santini, P., Castellano, M., Merlin, E., et al. 2021, A&A, 652, A30 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  131. Sawicki, M., & Yee, H. K. C. 1998, AJ, 115, 1329 [NASA ADS] [CrossRef] [Google Scholar]
  132. Schaye, J., Crain, R. A., Bower, R. G., et al. 2015, MNRAS, 446, 521 [Google Scholar]
  133. Schaye, J., Kugel, R., Schaller, M., et al. 2023, MNRAS, 526, 4978 [NASA ADS] [CrossRef] [Google Scholar]
  134. Schechter, P. 1976, ApJ, 203, 297 [Google Scholar]
  135. Schreiber, C., Pannella, M., Elbaz, D., et al. 2015, A&A, 575, A74 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  136. Schreiber, C., Glazebrook, K., Nanayakkara, T., et al. 2018, A&A, 618, A85 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  137. Shuntov, M., Ilbert, O., Toft, S., et al. 2025, A&A, 695, A20 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  138. Silk, J., & Rees, M. J. 1998, A&A, 331, L1 [NASA ADS] [Google Scholar]
  139. Skilling, J. 2004, Am. Inst. Phys. Conf. Ser., 735, 395 [NASA ADS] [Google Scholar]
  140. Somerville, R. S., & Davé, R. 2015, ARA&A, 53, 51 [Google Scholar]
  141. Speagle, J. S. 2020, MNRAS, 493, 3132 [Google Scholar]
  142. Springel, V., Pakmor, R., Pillepich, A., et al. 2018, MNRAS, 475, 676 [Google Scholar]
  143. Steinhardt, C. L., Jespersen, C. K., & Linzer, N. B. 2021, ApJ, 923, 8 [NASA ADS] [CrossRef] [Google Scholar]
  144. Strait, V., Brammer, G., Muzzin, A., et al. 2023, ApJ, 949, L23 [NASA ADS] [CrossRef] [Google Scholar]
  145. Suess, K. A., Weaver, J. R., Price, S. H., et al. 2024, ApJ, 976, 101 [Google Scholar]
  146. Tacchella, S., Carollo, C. M., Renzini, A., et al. 2015, Science, 348, 314 [NASA ADS] [CrossRef] [Google Scholar]
  147. Tanaka, M., Valentino, F., Toft, S., et al. 2019, ApJ, 885, L34 [Google Scholar]
  148. Taylor, M. B. 2005, ASP Conf. Ser., 347, 29 [Google Scholar]
  149. Trussler, J., Maiolino, R., Maraston, C., et al. 2020, MNRAS, 491, 5406 [Google Scholar]
  150. Turner, C., Tacchella, S., D’Eugenio, F., et al. 2025, MNRAS, 537, 1826 [NASA ADS] [CrossRef] [Google Scholar]
  151. Urbano Stawinski, S. M., Cooper, M. C., Forrest, B., et al. 2024, Open J. Astrophys., 7, 46 [Google Scholar]
  152. Valentino, F., Tanaka, M., Davidzon, I., et al. 2020, ApJ, 889, 93 [Google Scholar]
  153. Valentino, F., Brammer, G., Gould, K. M. L., et al. 2023, ApJ, 947, 20 [NASA ADS] [CrossRef] [Google Scholar]
  154. Valentino, F., Heintz, K. E., Brammer, G., et al. 2025, A&A, 699, A358 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  155. van Dokkum, P., & Conroy, C. 2024, ApJ, 973, L32 [NASA ADS] [CrossRef] [Google Scholar]
  156. Veilleux, S., Cecil, G., & Bland-Hawthorn, J. 2005, ARA&A, 43, 769 [NASA ADS] [CrossRef] [Google Scholar]
  157. Vogelsberger, M., Marinacci, F., Torrey, P., & Puchwein, E. 2020, Nat. Rev. Phys., 2, 42 [Google Scholar]
  158. Wang, B., Fujimoto, S., Labbé, I., et al. 2023, ApJ, 957, L34 [NASA ADS] [CrossRef] [Google Scholar]
  159. Weaver, J. R., Davidzon, I., Toft, S., et al. 2023, A&A, 677, A184 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  160. Weaver, J. R., Taamoli, S., McPartland, C. J. R., et al. 2025, A&A, 697, A16 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  161. Weibel, A., Oesch, P. A., Barrufet, L., et al. 2024, MNRAS, 533, 1808 [NASA ADS] [CrossRef] [Google Scholar]
  162. Weibel, A., de Graaff, A., Setton, D. J., et al. 2025, ApJ, 983, 11 [Google Scholar]
  163. Williams, C. C., Tacchella, S., Maseda, M. V., et al. 2023, ApJS, 268, 64 [NASA ADS] [CrossRef] [Google Scholar]
  164. Wu, P.-F. 2025, ApJ, 978, 131 [Google Scholar]

Appendix A: BAGPIPES set-up

Here we explain in more depth our BAGPIPES set-up. We use the updated Bruzual & Charlot (2003) stellar population models (Chevallard & Charlot 2016) alongside the MILES library of stellar spectra (Sánchez-Blázquez et al. 2006; Falcón-Barroso et al. 2011) and the updated stellar tracks from Bressan et al. (2012), Marigo et al. (2013). Nebular components, i.e. line and continuum emission is incorporated following the procedure in Byler et al. (2017). We enable the ionisation parameter U to vary between -4.5 and -0.5 in log space. We use a Calzetti et al. (2000) dust attenuation prescription, with the dust extinction in the V band allowed to vary between zero and eight magnitudes.

For our star-formation history we use a parametric double powerlaw prescription (Carnall et al. 2018, 2019a). There is much debate about how best to model high-z star-formation histories (see Carnall et al. 2019b; Leja et al. 2019a, for more of a review of the various methods). We chose a double powerlaw model due to the increased speed in model fitting and since we are primarily fitting broad band photometry we are not particularly sensitive to the exact details of the star-formation history. It should be noted that increases in stellar masses are found when using a more flexible ’continuity’ SFH (Leja et al. 2019b); however it remains unclear whether the ’continuity’ prior used in the fitting is really physically motivated at high-z (see Wang et al. 2023; Turner et al. 2025; Mosleh et al. 2025). In light of this we stick with the more simple prescription. We use a uniform prior between 0.01 and 1000 for the indices α and β in the double powerlaw SFH. We leave log stellar metallicity free to vary between -2.5 and 0.19 (i.e. log(Z/Z)=(−2.5, 0.19)), whilst noting that we cannot constrain this parameter from photometry. During our initial run we left the redshift free to vary and then compared the results for 18 spectroscopically confirmed galaxies from Baker et al. (2025a) (see Appendix E). We found that the EAZY redshifts were more accurate than the BAGPIPES redshifts so we fix the BAGPIPES redshifts to that of the EAZY values.

thumbnail Fig. A.1.

Observed F444W magnitude versus stellar mass as obtained via BAGPIPES SED modelling colour coded in bins of redshift for our sample. It shows that we are not missing any quiescent galaxies within our analysis due to being magnitude limited.

Appendix B: Completeness

Fig. B.1 shows the F444W magnitude versus stellar mass (obtained via fitting with BAGPIPES), colour-coded by bins of redshift for our entire sample. The key takeaway is that due to our mass cut of M* >  109.3M for the SMFs and M* >  109.5M for the number densities we are able to see any massive quiescent galaxy within the survey area. The observed F444W magnitude versus stellar mass figure shows that, due to our mass cut, we are clearly complete and are not missing sources due to being magnitude limited. We also see the standard expected evolution with redshift, with the higher redshift sources appearing fainter.

Appendix C: Stellar mass function fitting

The first aspect to note is that uncertainties on individual quantities of course end up affecting the SMF itself. For example, errors on redshifts and stellar masses resulting from SED modelling can lead galaxies to be in the wrong bins, which can be hard to diagnose since SED codes generally produce overconfident posteriors due to model misspecification Jespersen et al. (2025c). Therefore we consider typical uncertainties/biases derived from other studies. Generally photometric redshifts are measured accurately due to the number of bands and flexibility of templates, but there are offsets of 0.1-0.2 dex (Baker et al. 2025a, and see Appendix E). Stellar masses can have significant systematic offsets of up to 0.3 dex (Conroy 2013).

In the actual measurements of the SMF, there are three common forms of error (e.g. Adams et al. 2021; Weaver et al. 2023; Shuntov et al. 2025). The first of these is completeness. For a photometric study such as this one this takes the form of the well-known Malmquist bias – we are limited in magnitude as to the faintest object we can see. In our case we have a mass cut of greater than M >  109.3M, and as can be seen in Appendix Fig B.1 our faintest object is far above our limiting magnitude. So for our purposes our sample is mass complete above M >  109.3M

The second issue is cosmic variance. With deep extragalactic surveys we can only probe a small fraction of the sky hence our volume is necessarily limited. Whilst the Universe is likely homogeneous on large enough scales, we cannot probe those exact scales. We help minimise the effect of cosmic variance by using a large survey area (our area is approximately 6x the size of other comparable studies), but also by using fields in various different regions of the sky (as in Valentino et al. 2023). In computing our observational errors we use a prescription for fractional cosmic variance for each redshift and mass bin based on the approach of Jespersen et al. (2025a).

The final source is the so called ’Eddington Bias’ (Eddington 1913). This occurs due to measurement errors in quantities. For our work this almost entirely corresponds to the stellar mass of the galaxies. Due to the impossibility of knowing their masses exactly, galaxies can scatter to higher or lower masses than their ’true’ values. If the masses were equally distributed this would result in the same number of galaxies scattering to higher stellar masses as the number scattering to lower stellar masses. However, as can be seen in most papers exploring SMFs (e.g. Davidzon et al. 2017; Adams et al. 2021; Weaver et al. 2023; Shuntov et al. 2025) there are more galaxies at certain masses than at others. This means that this scatter has a preferred direction, meaning this result must be accounted for when fitting a SMF. This can be taken into account in multiple ways. The approach we will use is to take into account the posterior samples for each galaxy fit from BAGPIPES. We then fit a Voigt profile (a convolution of a Gaussian with a Lorentzian profile) to this normalised distribution to obtain a measure of the scatter. This Voigt profile is then convolved with the SMF and the convolution is fit to the data. It is also possible to create kernels based upon the posterior distributions (e.g. Shuntov et al. 2025), but we find that the Voigt profile fits our posterior distributions well (see Appendix Fig. C.1). An alternative approach is to use a likelihood based methodology when fitting (Leja et al. 2019b), but we do not use this approach in our work due to our smaller sample size.

thumbnail Fig. C.1.

Density kernels produced by stacking the normalised log stellar masses of all galaxies within each redshift bin. These are fit with a Voigt profile which is then convolved with the SMF for that redshift in order to account for Eddington (1913) bias.

Appendix D: EAZY fitting and posterior for high-z target

In this section, we explore in more depth our selection for high-redshift candidates. This is the most challenging in terms of rooting out possible contaminants due to the combined contamination of both BDs and LRDs. As mentioned earlier, BDs are removed by a colour cut following the criteria of Langeroodi & Hjorth (2023) and Kokorev et al. (2024) with any galaxy whose colour falls within the tracks of the modelled BDs being removed. The effect of this cut is to remove galaxies at z >  5 with both F150W-F200W< 0.2 mag and F200W-F277W> 1 mag. However, we acknowledge with this method it is possible that BDs still remain due to uncertainties in the blue bands of the photometry, although we have done our best to mitigate this with the visual inspection.

LRD contamination remains more challenging as they could be either AGN or quiescent galaxies and the populations exact nature is unconfirmed at this time. We want to minimise contamination from sources with significant non-stellar flux. At these redshifts almost all these sources are point-like so in some ways the difference between a LRD and a quiescent galaxy is a tricky question, especially as there is only one spectroscpically confirmed quiescent galaxy at z >  5 and it is only marginally resolved (Weibel et al. 2025). Our approach is to remove galaxies based on visual inspection of their SEDs and best fit BAGPIPES models. We do not use any AGN templates in our BAGPIPES SED modelling (although note that this is possible, e.g. Carnall et al. 2023b), hence we remove galaxies that appear poorly fit, likely due to an AGN contribution to the continuum or other contamination effect. We also remove galaxies with a clear ’V’ shaped SED.

In Fig. D.1 we show the best-fit EAZY model and the posterior for our high-z candidate 30744 which is found within the Primer Cosmos field. We also show the same for 66283 which is the only spectroscopically confirmed high-z quiescent galaxy (Weibel et al. 2025). We see that the posterior of our candidate is fully consistent with a redshift of around 6. We again note that this candidate has a MIRI detection which is not shown in the best-fit EAZY model.

thumbnail Fig. D.1.

Left: Best-fit EAZY templates to 30744 (upper) and 66283 (lower). We note that both have MIRI constraints that are not shown here. Right: the EAZY redshift posterior distribution for the sources.

Appendix E: Comparison to spectra

We can compare our SED fit quantities to other spectroscopic samples in order to see how well we are able to recover quantities from photometry alone. We compare to the 18 massive quiescent galaxies from Baker et al. (2025a) who used both NIRSpec PRISM spectra and photometry in their SED modelling. The results of this is shown in Fig. E.1. The upper left panel shows that our EAZY photometric redshifts are accurate for these galaxies with no significant offsets. The lower left panel shows the results of fits where the redshift was left free in BAGPIPES, compared to the spectroscopic redshift. We see increased scatter between the spectroscopic redshifts and the photometric redshifts from BAGPIPES (lower left) compared to those from EAZY (upper left). Our stellar masses are slightly lower (upper middle panel), but this is also likely due to the different SFHs between the Baker et al. (2025a) work and our own (for more details between offsets in SFHs see Leja et al. 2019a), alongside the differences between PSF matched and non-PSF matched photometry. However, we see that we obtain significant differences in the formation and quenching times.With the assumption that the spectroscopic formation and quenching times are closer to the ’true’ value, this suggests that with photometry alone we do not want to place undue importance on the formation and quenching time values. However, a much more detailed and comprehensive study is required to make significant conclusions on the accuracy of these values measured from photometry alone. This is because our comparison is still affected by the usual SED modelling assumptions, including different fitting codes and their stellar isocrones and stellar libraries, SFH models, and various other model parameters. For this work therefore, we shall simply accept that we do not have enough precision to explore formation and quenching times for our sample. This motivates further spectroscopic and photometric exploration of large samples of massive quiescent galaxies in order to understand their SFHs on a population level.

thumbnail Fig. E.1.

Comparison between quantities for a sample of 18 massive quiescent galaxies from Baker et al. (2025a) compared with our work. Upper left: comparison of our photometric EAZY redshifts versus the spectroscopic redshifts. Lower left: comparison of photometric redshifts from EAZY and those from BAGPIPES. Upper middle: comparison between the stellar masses obtained via photometric fitting with bagpipes and those from combined spectrophotometric fitting. Lower middle: comparison between the stellar masses obtained from photometry with BAGPIPES vs EAZY. Upper right: comparison of the formation times (t50) and lower right a comparison of the quenching times (t90).

All Tables

Table 1.

Number densities.

Table 2.

Best-fit stellar mass function parameters.

All Figures

thumbnail Fig. 1.

Upper panel: U–V and V–J colour selection colour-coded by redshift for the massive quiescent galaxies in our full sample. The black line corresponds to the Schreiber et al. (2015) quenching criterion with the grey line addition of the Belli et al. (2019) fast quenching criterion. The dotted red line corresponds to the selection criteria from Baker et al. (2025a) which we use as our initial selection criteria in this work.

In the text
thumbnail Fig. 2.

Quiescent galaxy number densities of those with M >  109.5 M from our sample (red points). We compare to Carnall et al. (2023a) green points, Valentino et al. (2023) blue points, and the single galaxy from Weibel et al. (2025) as the grey point. We offset the points ever so slightly in redshift for the Valentino et al. (2023) and Carnall et al. (2023a) number densities so as to make their errors visible within the figure.

In the text
thumbnail Fig. 3.

Quiescent galaxy number densities for M >  1010 M from our sample (red points). Upper panel: Comparison to other observational studies, Carnall et al. (2023a), green points, a spectroscopically corrected sample, Baker et al. (2025a), orange points, a spectroscopic sample, Nanayakkara et al. (2025), light green point, and the Weibel et al. (2025) single galaxy point. Middle panel: Comparison to five cosmological hydrodynamical simulations, EAGLE, ILLUSTRIS-TNG100, SIMBA, FLAMINGO, and MAGNETICUM (Lagos et al. 2025; Baker et al. 2025a; Remus & Kimmig 2025) and two SAMs, SHARK and GALFORM. Bottom panel: Four cosmological hydrodynamical simulations and two SAMs, but with Gaussian scatter included in the stellar masses and star-formation rates. We see that simulations struggle to reproduce the observed number densities at most redshifts.

In the text
thumbnail Fig. 4.

Upper: observed stellar mass function (SMF, Φ) versus stellar mass for massive quiescent galaxies in bins of redshift. Lower: Fitted SMF (Φ) versus stellar mass for massive quiescent galaxies in bins of redshift. The solid line is our best-fit SMFs including the correction for Eddington bias, the dotted lines exclude the Eddington bias correction. The dotted lines show that we would significantly overestimate the number of most massive quiescent galaxies at z = 2 − 2.5 without the correction.

In the text
thumbnail Fig. 5.

Comparison between our observed stellar mass function (solid lines) at z = 2 − 2.5 (upper panel) z = 2.5 − 3 (upper middle panel), z = 3 − 3.5 (lower middle panel) and z = 4 − 5 (lower panel), alongside six cosmological simulations denoted by dashed lines (left hand panels, Lagos et al. 2025). The right hand panel shows the same, but with Gaussian scatter added to the stellar masses and SFRs for the six simulations. We see that the simulations cannot accurately reproduce the observed SMFs.

In the text
thumbnail Fig. 6.

Upper panels: Field-to-field variations in the SMF with the same selection criteria as previously. We also plot the fitted SMF from Weaver et al. (2023) in blue for their quiescent galaxy sample. Left for z = 2 − 2.5, middle for z = 2.5 − 3.0 and right for z = 3.0 − 3.5. We see significant field-to-field variations underlining the effects of cosmic variance in small field observations. This increases at higher redshifts. Lower panels: Same, but using a stricter UVJ cut to better replicate the selection criteria of previous works McLeod et al. (2021), Weaver et al. (2023). The observed quiescent galaxy SMFs from this strict UVJ selection are denoted by the green line. We find greater consistency with the Weaver et al. (2023) and McLeod et al. (2021) results, highlighting the importance of selection criteria in obtaining as many quiescent galaxies as possible. We plot the SMF for the sample with our main selection criteria as the black line, showing how the largest difference between the selection criteria is for low-mass quiescent galaxies and at higher redshifts.

In the text
thumbnail Fig. 7.

Stellar mass density (SMD) of quiescent galaxies versus redshift based upon our best-fit SMFs (red points) with our best-fit overplotted (red line and shaded region). Also overplotted is a comparison for quiescent galaxies from Weaver et al. (2023), blue points, Santini et al. (2021), grey points, McLeod et al. (2021), light blue points, and theoretical predictions from UNIVERSEMACHINE (Behroozi et al. 2019), dashed black line. We slightly offset the z = 3.25 points for the McLeod et al. (2021) and Weaver et al. (2023) stellar mass densities in order to aid visibility within this figure. In addition, we plot SMDs from SIMBA, SHARK, ILLUSTRIS-TNG100, GALFORM, EAGLE and MAGNETICUM obtained by integrating their SMFs. We see a steeper increase in the observed SMD from z = 5 to z = 2 corresponding to ρ* ∝ (1 + z)−7.2 ± 0.3, highlighting the increasing importance of galaxy quenching over this epoch.

In the text
thumbnail Fig. 8.

BAGPIPES SED fits for our two highest redshift candidates (the highest-z of which is confirmed by spectroscopy Weibel et al. 2025). The yellow points are the observed photometry and the grey points correspond to non-detections with S/N < 3. The red-squares correspond to the best-fit model photometry and the red line corresponds to the best-fit model spectrum. Uncertainties in the best-fit model spectrum are denoted by the shaded regions which correspond to the 16th and 84th percentiles of the resulting distribution. An RGB image of the galaxy corresponding to F115W-F200W-F444W is included in the upper right of the figure. Both galaxies have MIRI coverage constraining the NIR part of the SED.

In the text
thumbnail Fig. A.1.

Observed F444W magnitude versus stellar mass as obtained via BAGPIPES SED modelling colour coded in bins of redshift for our sample. It shows that we are not missing any quiescent galaxies within our analysis due to being magnitude limited.

In the text
thumbnail Fig. C.1.

Density kernels produced by stacking the normalised log stellar masses of all galaxies within each redshift bin. These are fit with a Voigt profile which is then convolved with the SMF for that redshift in order to account for Eddington (1913) bias.

In the text
thumbnail Fig. D.1.

Left: Best-fit EAZY templates to 30744 (upper) and 66283 (lower). We note that both have MIRI constraints that are not shown here. Right: the EAZY redshift posterior distribution for the sources.

In the text
thumbnail Fig. E.1.

Comparison between quantities for a sample of 18 massive quiescent galaxies from Baker et al. (2025a) compared with our work. Upper left: comparison of our photometric EAZY redshifts versus the spectroscopic redshifts. Lower left: comparison of photometric redshifts from EAZY and those from BAGPIPES. Upper middle: comparison between the stellar masses obtained via photometric fitting with bagpipes and those from combined spectrophotometric fitting. Lower middle: comparison between the stellar masses obtained from photometry with BAGPIPES vs EAZY. Upper right: comparison of the formation times (t50) and lower right a comparison of the quenching times (t90).

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.