| Issue |
A&A
Volume 707, March 2026
|
|
|---|---|---|
| Article Number | A34 | |
| Number of page(s) | 12 | |
| Section | Cosmology (including clusters of galaxies) | |
| DOI | https://doi.org/10.1051/0004-6361/202558239 | |
| Published online | 25 February 2026 | |
Connecting clustering and the cosmic web
Observational constraints on secondary halo bias
1
CONICET. Instituto de Astronomía Teórica y Experimental (IATE) Laprida 854 Córdoba X5000BGR, Argentina
2
Universidad Nacional de Córdoba (UNC). Observatorio Astronómico de Córdoba (OAC) Laprida 854 Córdoba X5000BGR, Argentina
3
Departamento de Física, Universidad Técnica Federico Santa María Avenida Vicuña Mackenna 3939 San Joaquín Santiago, Chile
★ Corresponding authors: This email address is being protected from spambots. You need JavaScript enabled to view it.
; This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
24
November
2025
Accepted:
29
December
2025
Abstract
Context. Cosmological simulations predict significant secondary dependences of halo clustering on a variety of internal halo properties, and on environmental factors. Detecting these often subtle signals in observational data remains a significant challenge, with important ramifications for both galaxy evolution and cosmology.
Aims. We probed secondary halo bias in observational survey data, employing galaxy groups as proxies for dark matter haloes. We quantified the strength of multiple secondary bias signals defined by the colour of the central galaxy and various environmental diagnostics.
Methods. We employed an extended and refined galaxy group catalogue constructed from the Sloan Digital Sky Survey. Secondary bias is defined as any deviation in the clustering strength of groups at fixed mass, quantified through the projected two-point correlation function. Our environmental analysis uses the DisPerSE cosmic-web reconstruction algorithm to compute the distances to the critical points of the density field and incorporates local group overdensity measurements on multiple spatial scales.
Results. We robustly detected several forms of secondary bias in the clustering of galaxy groups. At fixed mass, groups hosting red central galaxies are more strongly clustered than those with blue centrals, with brelative ranging from ∼1.2 for the 15% reddest centrals to ∼0.8 for the bluest. Environmental dependences based on cosmic–web distances are also present, though significantly weaker and largely mass-independent. The strongest signal arises from local overdensity: groups in the densest 15% of environments reach brelative ∼ 1.4, while those in the least dense regions fall to brelative ∼ 0.7. These results establish a clear observational hierarchy for secondary halo bias.
Conclusions. The colour of central galaxies correlates with the local group overdensity, which in turn correlates with the bias at fixed group mass. Assuming that central galaxy colour traces halo assembly history, this three-stage picture offers a conceptual link between our results and halo assembly bias.
Key words: methods: statistical / galaxies: groups: general / galaxies: halos / dark matter / large-scale structure of Universe
© The Authors 2026
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe to Open model. This email address is being protected from spambots. You need JavaScript enabled to view it. to support open access publication.
1. Introduction
The large-scale linear bias of dark matter (DM) haloes encodes fundamental information on the process of halo formation from the underlying matter density field. Halo bias, which can be measured by comparing the clustering amplitude of haloes and the matter field on large scales, depends primarily on the peak height of density fluctuations, ν, as it can be analytically derived from structure formation formalisms (see e.g. Kaiser 1984; Bardeen et al. 1986; Mo & White 1996; Sheth & Tormen 1999; Sheth et al. 2001; Tinker et al. 2010). This fundamental dependence is naturally reflected in the halo virial mass: more massive haloes are more tightly clustered (have a higher bias) than less massive haloes. At fixed ν or halo mass, halo bias has been shown in cosmological simulations to depend on a variety of secondary halo properties, including formation time, concentration, spin, or shape (see e.g. Sheth & Tormen 2004; Gao et al. 2005; Wechsler et al. 2006; Gao & White 2007; Dalal et al. 2008; Angulo et al. 2008; Li et al. 2008; Faltenbacher & White 2010; Lazeyras et al. 2017; Salcedo et al. 2018; Mao et al. 2018; Han et al. 2019; Sato-Polito et al. 2019; Johnson et al. 2019; Paranjape et al. 2018; Ramakrishnan et al. 2019; Contreras et al. 2019; Tucci et al. 2021; Contreras et al. 2021; Montero-Dorta et al. 2021; Montero-Dorta & Rodriguez 2024; Balaguera-Antolínez et al. 2024). The secondary dependence of bias on internal halo properties beyond halo mass extends back to the initial field, i.e. to the properties of the initial collapsing regions that give rise to haloes (Montero-Dorta et al. 2025).
As can be also inferred from analytical models of the evolution of structure, halo bias is known to depend on the local and large-scale distribution of matter. At fixed halo mass, the dependences on the local density across multiple scales, on the geometry of the tidal field and its anisotropic (traceless) component, and on the specific location of haloes within the cosmic web have been extensively characterised in simulations (e.g. Borzyszkowski et al. 2017; Musso et al. 2018; Paranjape et al. 2018; Ramakrishnan et al. 2019; Montero-Dorta & Rodriguez 2024; Balaguera-Antolínez et al. 2024). Recognising that linear bias, overdensity, and cosmic-web geometry all reflect the same underlying density field, in this work we distinguish between local overdensity and cosmic-web architecture to isolate their specific impacts on secondary halo bias. In Montero-Dorta & Rodriguez (2024), in particular, we used the IllustrisTNG hydrodynamical simulation to measure secondary bias based on the distance of central galaxies (i.e. distinct haloes) to the critical points of the density field, identified through a cosmic web reconstruction with the Discrete Persistent Structures Extractor (DisPerSE; Sousbie 2011; Sousbie et al. 2011). The resulting dependences at fixed halo mass are shown to be very significant, largely exceeding the signal measured for halo assembly bias, the secondary dependence of halo bias on the assembly history of haloes at fixed mass.
The complete set of internal and environmental dependences of halo clustering at fixed halo mass, which we call here secondary halo bias, provides a detailed description of the halo-matter connection, which has important applications for cosmology (Wechsler & Tinker 2018). Observationally, the dependence of galaxy clustering on galaxy properties and environment has been extensively investigated using spectroscopic surveys. The investigations show that galaxies of higher mass, redder colours, and early-type morphologies, and those in denser regions, are more tightly clustered across a wide range of scales than the rest of the population (e.g. Abbas & Sheth 2006; Meneux et al. 2006; Coil et al. 2006; Abbas & Sheth 2007; Skibba et al. 2009; Zehavi et al. 2011; Hartley et al. 2013; McNaught-Roberts et al. 2014; Zhai et al. 2023). Specifically, Zehavi et al. (2011) used the SDSS to show, at fixed luminosity, that redder galaxies exhibit a higher-amplitude and steeper correlation function, a trend consistent with an increased satellite fraction in massive haloes. These general trends persist mostly when the analysis is performed at fixed stellar mass (e.g. Li et al. 2006; Zehavi et al. 2011; Law-Smith & Eisenstein 2017). Serious challenges arise, however, when we attempt to study these trends at fixed halo (or group) mass, largely due to the observational uncertainties involved in estimating halo masses. Potential dependences of galaxy clustering at fixed halo mass that can be directly attributed to secondary halo properties (such as concentration or formation time) are often considered manifestations of galaxy assembly bias, a phenomenon that remains to be robustly established observationally.
The study of secondary bias at fixed halo mass using observations has yielded a variety of results, often depending on the selection method and the proxy used for halo mass. For example, Wang et al. (2008), using an SDSS galaxy group catalogue, reported a dependence of group clustering on central galaxy colour at fixed group mass, whereby groups that have red centrals cluster more strongly, particularly in less massive haloes. These results are in line with those of Montero-Dorta et al. (2017), who reported a similar dependence, but for luminous red galaxies from the Baryon Oscillation Spectroscopic Survey (BOSS; Dawson et al. 2013). This trend was later shown by Niemiec et al. (2018) to be consistent with a pattern occurring at fixed halo mass based on weak lensing. Compatible results splitting the galaxy population using the stellar age were found by Lacerna et al. (2014). In contrast, other observational studies have found no evidence of a significant secondary signal. Using the Sloan Digital Sky Survey (SDSS; York et al. 2000) and the Yang et al. (2007) group catalogue, Lin et al. (2016) found no evidence for a dependence of galaxy clustering on star formation history, for samples selected at approximately fixed halo mass using weak lensing. More recently, Sunayama et al. (2022) also reported no significant secondary signals when probing clustering dependences at fixed halo mass using a variety of central galaxy properties from the redMaPPer cluster catalogue (Rykoff et al. 2014) from the SDSS.
In the context of galaxy assembly bias, secondary dependences at fixed halo mass have been reported, using indirect methods, in the anisotropic clustering of massive galaxies, Obuljen et al. (2019, 2020), halo occupation statistics (also known as occupancy variations1 Yuan et al. 2021; Wang et al. 2022; Pearl et al. 2024), and the stellar-to-halo mass relation (Oyarzún et al. 2024). Focusing on halo spin and applying a novel proxy based on the coherent motion of galaxies inside and around clusters, Kim et al. (2025) also presented observational indications of halo spin bias, the secondary dependence of halo bias on spin at fixed halo mass.
For this paper we used an extended SDSS group catalogue based on a refined version of the Rodriguez & Merchán (2020) method to provide observational constraints on secondary halo bias, understood here as any potential dependence of group (central galaxy) bias at fixed group mass. A key challenge stems from the uncertainty in the estimation of halo masses, an inherent limitation affecting many observational studies of galaxy and halo bias. This includes analyses that seek to detect assembly bias from galaxy clustering, especially those that do not explicitly rely on mock catalogues for validation. In our work, we employ halo masses derived from abundance matching. Although not a direct measurement of mass, the mass calibration in our group catalogue has been shown to align well with weak gravitational lensing estimates (e.g. Gonzalez et al. 2021) and is widely considered an effective indicator of the mass of the halo hosting the central galaxy.
More restrictive definitions of secondary bias have been adopted in the literature. However, as mentioned above, we adopted a broad and operational definition of secondary halo bias that encompasses both environmental and central galaxy properties. This definition is intentionally generic and primarily motivated by observational considerations, which allowed us to compare different dependences within the same framework consistently. We explored several secondary dependences, including both central galaxy properties and environmental metrics. To characterise the environment, we used local density measurements on multiple physical scales, as well as the distance of galaxies to the critical points of the density field, as defined by the cosmic web reconstruction from DisPerSE (Malavasi et al. 2020a). Our work provides key observational constraints on secondary bias, establishing a hierarchy of dependence strengths that can be directly compared with simulations. Previous studies explored this environmental dependence with different approaches. In simulations, Paranjape et al. (2018) proposed the key role of tidal anisotropy as a mediator between bias and internal halo properties, whereas Montero-Dorta & Rodriguez (2024) provide detailed predictions for the dependence of clustering on the distance to the DisPerSE critical points in the IllustrisTNG hydrodynamical simulation. On the observational side, Alam et al. (2019) measured the dependence of clustering on several of these environmental properties and used these results to constrain modified halo occupation distribution (HOD) models aimed at testing assembly bias. Our analysis complements these efforts by providing a direct measurement, using a cluster catalogue to achieve precise halo mass control and isolate the geometric influence of the density and cosmic web.
This paper is organised as follows. In Sect. 2 we describe the data used, including the SDSS DR18 galaxy catalogue and our extended group catalogue, as well as the cosmic web information from DisPerSE. Section 3 details the methodology followed to measure secondary bias based on the ratios of the correlation functions. Section 4 presents our main results, focusing on the dependence of secondary bias on intrinsic (colour) and environmental properties (distances to critical points and local density), accompanied by a robustness analysis. Finally, in Sect. 5 we discuss the implications of our findings and present our conclusions. Throughout this work, we adopt the standard ΛCDM cosmology (Planck Collaboration XIII 2016), with parameters Ωm = 0.3089, Ωb = 0.0486, ΩΛ = 0.6911, and H0 = 100 h km s−1 Mpc−1, where h = 0.6774, σ8 = 0.8159, and ns = 0.9667.
2. Data
2.1. The SDSS galaxy and group catalogue
We base our analysis on the Main Galaxy Sample from the SDSS Data Release 18 (DR18; Almeida et al. 2023). This latest data release provides comprehensive multiband photometry (u, g, r, i, z) over a sky area exceeding 10 000 square degrees, complemented by spectroscopic redshifts for over 1.2 million galaxies, extending to z ∼ 0.3. The sample’s spectroscopic completeness is well defined for galaxies brighter than an r-band apparent magnitude of 17.77, making it an exceptional resource for statistical studies of cosmic structure. For each object, the catalogue provides precise astrometry, photometry, redshift, and morphological parameters.
The identification of galaxy groups within this dataset was performed using the methodology of Rodriguez & Merchán (2020), which integrates a Friends-of-Friends (FoF) percolation scheme (Merchán & Zandivarez 2005) with a subsequent iterative halo-based refinement phase (Yang et al. 2005). A key adaptation in our implementation for this work was to extend the selection of potential central galaxies to those with magnitudes as faint as Mr = −18. This modification enhances the catalogue’s completeness for lower-mass haloes, allowing us to investigate a broader dynamic range of group environments. The algorithm first identifies candidate systems through redshift-space FoF linking. An initial halo mass is then assigned to each candidate based on the total luminosity of its member galaxies. In the iterative phase, these mass estimates are used to refine membership and recalculate halo properties until a stable solution is achieved, robustly identifying systems ranging from poor groups to massive clusters.
The final group catalogue defines galaxy memberships and provides estimates for key halo properties, such as the halo mass (Mgroup), which is derived through abundance matching (Vale & Ostriker 2004; Kravtsov et al. 2004; Conroy et al. 2006; Behroozi et al. 2010). This technique calibrates the relationship between a group’s characteristic luminosity and the mass of its underlying DM halo. Furthermore, galaxies are classified as centrals (the brightest group member) or satellites (e.g. Rodriguez et al. 2021). The reliability of this kind of group identification is well established: its mass estimates show excellent agreement with weak lensing measurements (e.g. Gonzalez et al. 2021), and it accurately reproduces galaxy population statistics (e.g. the halo occupancy distribution) from simulations across diverse environments (e.g. Alfaro et al. 2022; Rodríguez-Medrano et al. 2023). For our study, this data set provides the essential spectroscopic, photometric, and group-based properties required to conduct a detailed investigation of secondary halo bias. To ensure a robust cosmic web classification, we restrict our analysis to the central, most complete region of the catalogue, using all groups up to a redshift of z = 0.1, where the large-scale cosmic environments, as determined by DisPerSE, are well defined. This sample comprises a total of more than 180 000 groups with a mass range of 1011.3–1015.0 h−1 M⊙.
Throughout this work, we discuss groups, their masses, and related properties, but it is important to note that our analysis of secondary bias primarily relies on the central galaxies (the brightest member of each group). For most groups, especially those of low mass, this refers to a single galaxy, which is by definition the central galaxy.
Figure 1 displays the mass distribution of the groups analysed in this work (upper panel). This observed distribution follows the characteristic shape of the group mass function, where the number of systems decreases steeply with increasing mass, a well-known manifestation of the hierarchical halo formation paradigm. The bottom panel of Fig. 1 shows the colour distribution of the central galaxies within these groups, revealing the expected bimodal form that separates the red and blue galaxy subpopulations.
![]() |
Fig. 1. Main properties of the galaxy groups analysed in this study. Top panel: group mass distribution exhibiting the characteristic decreasing shape. Bottom panel: colour (g − r) distribution of the central galaxies within these groups. The bimodal nature of this distribution clearly distinguishes between the red and blue galaxy subpopulations. |
2.2. Cosmic web characterisation
This work provides observational constraints on secondary halo bias using central galaxies and groups located in different cosmic environments. The cosmic web is characterised based on the publicly available filament catalogues derived by Malavasi et al. (2020a) using the Discrete Persistent Structures Extractor (DisPerSE; Sousbie 2011; Sousbie et al. 2011). DisPerSE identifies persistent topological features (such as filaments, walls, and voids) by applying discrete Morse theory (Morse 1934, and references therein) to the cosmic density field. The algorithm first computes the density field from the discrete galaxy distribution using the Delaunay Tessellation Field Estimator (DTFE; van de Weygaert & Schaap 2009). It then locates critical points (Maxima, Minima, and Saddles) where the density gradient vanishes and reconstructs the manifold structure (the Morse complex) that defines the cosmic web. In this framework, the Maxima are typically associated with high-density regions, the saddles correspond to Filaments and Walls, and the Minima are linked to voids or empty cosmic areas.
The implementation of Malavasi et al. (2020a) adapts DisPerSE to account for the observational limitations and projection effects inherent to the SDSS spectroscopic survey. Their analysis, performed on the SDSS DR12 Main Galaxy Sample, produced multiple catalogues of critical points and filaments by varying the persistence threshold (which controls the significance of detected features) and the smoothing scale applied to the density field2. These catalogues have been used in several previous studies (Malavasi et al. 2020b; Tanimura et al. 2020a; Bonjean et al. 2020; Tanimura et al. 2020b; Izzo Dominguez et al. 2025; Rodriguez et al. 2025). Although we conducted this analysis using DR18, the resulting critical points and structures of the cosmic web are expected to remain consistent with the latest data releases. This consistency is largely due to the minimal large-scale differences observed in the spectroscopic samples. Additionally, the derived critical points should remain largely unaffected by minor variations in data releases. In future analyses, these critical points, which act as stable structural references, can be integrated with galaxy samples from more recent data publications.
In our analysis, we employ the catalogue generated from the Legacy North SDSS galaxy distribution, constructed using a 5σ persistence threshold and two smoothing cycles. This specific parameter choice is the most conservative and restrictive, ensuring that the identified filaments are highly reliable. A consequence of this strict selection is a significant reduction in the number of detected Minima and, even more so, of Maxima, leaving only a few of these critical points in the final catalogue. The specific combination of persistence threshold and smoothing adopted here was selected based on the extensive analysis of Izzo Dominguez et al. (2025), who used these same DisPerSE catalogues to investigate potential variations in the HOD as a function of environment, a probe inherently linked to secondary bias. We have verified that using less restrictive catalogues yields consistent trends, although the amplitude of the relative bias differences decreases due to the inclusion of less prominent and noisy structures.
With this description of the cosmic web, the environment of each central galaxy in the Rodriguez & Merchán (2020) group catalogue was characterised by quantifying its relative location within the geometry of the cosmic web, rather than using a purely categorised environmental label. This was achieved by calculating the minimum three-dimensional distance of each galaxy to the various critical points defined by DisPerSE. These distances are not a direct measure of local density, but rather an indicator of the proximity to the nearest large-scale morphological structure (nodes, filaments, or walls). We specifically focus on four types of critical points: Maxima, Filament saddles, Wall saddles, and Minima, as they represent the most commonly defined cosmic web environments. Throughout this work, we denote the corresponding distances as dMax, dFsadd, dWsadd, and dMin, respectively. This distance-based quantitative approach provides a continuous measure of environmental influence. It allows us to classify the structures in a manner directly analogous to the methodology established in the simulation-based work of Montero-Dorta & Rodriguez (2024).
The distribution of the minimum distance to the DisPerSE critical points is shown as histograms in Fig. 2. In addition, Fig. 3 presents the log10-scaled, colour-coded maps tracing how these distances vary across the large-scale structure within a −5°-to-15° declination slice of the SDSS Legacy North survey. The vast distances to Maxima seen in these figures are a direct reflection of our conservative DisPerSE parameter choice, which naturally yields a very low number of nodes.
![]() |
Fig. 2. Distribution of minimum distances to the cosmic web critical points obtained from DisPerSE. The histograms show the number of central galaxies as a function of their minimum three-dimensional distance to Maxima, Minima, Filament saddles, and Wall saddles. |
![]() |
Fig. 3. Spatial distribution of central galaxies relative to cosmic web topology. The figure shows central galaxies within a −5° to 15° declination slice of the SDSS Legacy North survey. The panels display the spatial positions of central galaxies, characterised by their minimum three-dimensional distance to the four DisPerSE critical point types (Maxima, Minima, Filament saddles, and Wall saddles). The colour-coding represents the scaled distance to the corresponding critical point, given by log10(distance + 1). |
2.3. Local group overdensity characterisation
We complement our analysis of the environment by measuring the local group overdensity (δ) within spheres of 5 and 10 h−1 Mpc around each central galaxy. This overdensity is calculated by comparing the number of observed galaxies (Ngal) to the number expected in a homogeneous universe (Nrand). The reference sample Nrand is generated randomly to match the observational mask and the redshift distribution of the SDSS survey. The random catalogue used for this purpose contains more than 50 times the number of galaxies in our observed sample (Ntotal, rand > 50 × Ntotal, gal), ensuring minimal Poisson noise in our density estimates. The overdensity δ is defined as the relative fluctuation about the mean density. However, as a global normalisation is required, the definition becomes
(1)
where Ntotal, gal and Ntotal, rand are the total number of galaxies in the observed and random catalogues, respectively. The measurements corresponding to 5 and 10 h−1 Mpc radii are hereafter referred to as δ5 and δ10. To enhance the statistical robustness of this measurement, all galaxies (both centrals and satellites) are included in the Ngal count. We have verified that our results are qualitatively consistent when only central galaxies are used, demonstrating that our conclusions are not sensitive to this choice.
Figure 4 shows the distance to critical points as a function of local group overdensity δ5, with each central galaxy colour-coded based on its g − r colour. Upon visual examination, it is evident that there is a very weak correlation between the distances to the various critical points (dMax, dFsadd, dWsadd, and dMin) and the local density. To complement this qualitative assessment, we computed the Pearson correlation coefficient and found that, in all cases, its absolute value is below 0.025, confirming the absence of any significant correlation. However, we acknowledge that even very weak environmental correlations could be non-negligible in the context of halo clustering, as demonstrated by studies such as Zehavi et al. (2018), Artale et al. (2018), and Montero-Dorta et al. (2020). Specifically, additional tests show that while no significant trends are observed for the other critical points, the mean values of dMax can vary by up to ∼20% across the overdensity range considered. Although this trend is subtle and represents the most notable signal among the distances analysed, such a level of correlation has been shown to potentially induce non-negligible effects on occupancy variations and, consequently, on the resulting clustering signals. In contrast, there is a significant correlation between this local density δ5 and the g − r colour, as higher densities are systematically related to red central galaxies. As for the relationship between distances to critical points and colour, no clear trend is evident between galaxy colour and proximity to critical points such as minima, filaments, or walls. We have verified that using the local density measured on the scale of 10 h−1 Mpc yields qualitatively similar results. It is important to emphasise again that we are analysing the distances to critical points, not necessarily the environments themselves. This explains why the relationship between these distances and properties such as the colour of central galaxies does not appear to be straightforward. Furthermore, because galaxies of all masses are mixed in Fig. 4, both red and blue systems appear across the full density range, including high-density regions. This scatter is amplified by the small number of identified critical points – especially maxima and minima – which provides only a sparse sampling of the cosmic-web skeleton. As a result, galaxies of all colours and densities are found at a wide range of distances from critical points, diluting any clear trends.
![]() |
Fig. 4. Distribution of central galaxy colour as a function of cosmic web location and local overdensity. In each panel, a scatter diagram shows the relation between the distance to each critical point of the density field and the local group overdensity (log10(δ)) on 5 h−1 Mpc scales for all central galaxies in our sample, colour-coded by their g − r colour. From top to bottom, the panels illustrate the distribution with respect to the distance to (1) Maxima (dMax), (2) Filament saddles (dFsadd), (3) Wall saddles (dWSadd), and (4) Minima (dMin). The contour lines (black lines) represent the progressively lower density levels (down to 5% of the peak value). The horizontal dashed lines (green and magenta) mark the 15th (P15) and 85th (P85) percentiles of the distance distribution, respectively, used to define environment thresholds. |
3. Secondary bias measurement
To quantify secondary bias in our galaxy group sample, we measure the relative clustering at fixed group mass of several central galaxy (group) subsets defined by both environmental and intrinsic properties. The large-scale environment of each central galaxy is characterised using the distance to various critical points of the cosmic web (Maxima, Minima, Filament saddles, and Wall saddles), as identified by DisPerSE, whereas the density within 5 and 10 h−1 Mpc is employed as a local environment descriptor. The g − r colour is chosen as the intrinsic central galaxy/group property.
Central galaxies within each group mass bin are divided into subsets based on percentile rankings according to the aforementioned properties. We explore multiple sample definitions by varying both the percentile thresholds used to define subsets and the specific property ranges examined. This multifaceted approach allows us to probe different manifestations of secondary bias while rigorously controlling for halo mass.
We compute the relative bias brelative between these selected subsets following the approach of Salcedo et al. (2018), Sato-Polito et al. (2019), Montero-Dorta et al. (2021), Montero-Dorta & Rodriguez (2024), which relies on cross-correlations with the full sample of groups to maximise signal-to-noise. The relative bias is calculated from the projected cross-correlation functions as
(2)
where wp, [S, all](rp) is the projected cross-correlation function between central galaxies in the subset selected by property S in a given mass bin Mi and the entire galaxy group sample, and wp, [Mi, all](rp) is the projected cross-correlation between all central galaxies in the same mass bin Mi and the full group galaxy sample. These measurements are performed on scales ranging from 0.2 to 30 h−1 Mpc, covering both the one-halo and two-halo regimes where the correlation functions scale approximately as the square of the bias. The final relative bias parameter used to probe secondary bias is, however, obtained by averaging over scales of 5–20 h−1 Mpc, which is a criterion commonly used to isolate the large-scale effect in the context of secondary bias (e.g. Sato-Polito et al. 2019; Montero-Dorta et al. 2021; Montero-Dorta & Rodriguez 2024):
(3)
The projected cross-correlation function wp(rp) is calculated from the observable redshift-space coordinates using
(4)
where πmin = 0.1 h−1 Mpc, πmax = 20 h−1 Mpc, and the Ξ(rp, π) estimator is given by
(5)
and CG(rp, π) and CR(rp, π) represent the central-galaxy and central-random pair counts separated by rp and π (the line-of-sight distances), respectively; NG and NR are the total numbers of galaxies and random points in the sample. The random catalogue was generated by uniformly distributing points across the SDSS footprint, reproducing both the angular and redshift distribution of the SDSS galaxy sample. These computations were performed using the correlation code developed by Rodriguez et al. (2022), which calculated similar correlation functions for galaxy samples selected by colour, mass, and other physical properties.
It is important to note that the relative bias can itself be interpreted as a measure of the large-scale environment of galaxies, and is therefore expected to exhibit some degree of correlation with the environmental properties discussed here. Throughout this work, we implicitly distinguish between a global clustering property, quantified through two-point statistics, and environmental properties, which are either local, one-point statistics such as overdensities, or metrics associated with specific features of the cosmic web.
4. Observational constraints on secondary bias
This section presents a systematic investigation of secondary bias based on the different galaxy and environmental properties described above. We begin by establishing a reference signal using galaxy colour, a well-studied intrinsic property known to correlate with halo assembly history (e.g. Montero-Dorta et al. 2021). For each property, we measure the clustering of subsamples of central galaxies, divided by their specific characteristics, using narrow group mass bins to ensure a fixed-mass baseline and enable fair comparisons. The mass bins used are 0.2 dex wide in log10(Mgroup/h−1 M⊙) covering the range between 11.3 and 13.1, plus two additional higher-mass bins spanning 13.1–13.5 and 13.5–14.3.
As mentioned above, we employ a broad definition of secondary bias that includes any potential secondary dependence. Although the environmental distances and local density are not intrinsic halo properties, they may serve as probes of environmentally dependent accretion and formation history. As discussed in the literature based on simulations, a link between assembly bias and these environment-based secondary bias trends can be established, arising from the physical processes that affect halo accretion depending on the specific location of haloes within the cosmic web and the geometry of the surrounding tidal field (Dalal et al. 2008; Hahn et al. 2009; Paranjape et al. 2018; Borzyszkowski et al. 2017; Musso et al. 2018; Montero-Dorta & Rodriguez 2024).
4.1. Dependence on galaxy colour
We begin our analysis by examining the secondary bias produced by galaxy colour (g − r), an intrinsic property known to correlate with star formation history and halo assembly. Previous observational works have mostly focused on the more directly measurable dependence of clustering on colour at fixed stellar mass (e.g. Law-Smith & Eisenstein 2017). However, Wang et al. (2008) reported a significant dependence of group clustering on central galaxy colour at fixed group mass, with groups hosting red centrals clustering more strongly, particularly for less massive haloes. Results at fixed halo mass are well established in hydrodynamical simulations and are consistent with these observational trends (e.g. Montero-Dorta et al. 2020).
Figure 5 presents the relative bias for groups split by the colour of their central galaxy, using two selections at fixed halo mass: the 15% reddest and bluest centrals (saturated colours) and the 50% reddest and bluest centrals (lighter colours). Throughout this figure and the remainder of our analysis, uncertainties are estimated using the jackknife resampling technique, dividing the sample into 50 subsamples for the calculation of correlation functions and error propagation for brelative calculations. We find that groups with redder centrals exhibit systematically stronger clustering than those with bluer centrals. The amplitude of this effect is largest at low halo masses, reaching brelative ∼ 1.2 for the 15% reddest centrals and brelative ∼ 0.8 for the bluest ones, which is consistent with expectations. While the signal is strongest for the extreme 15% colour selection, it is also noisier due to the smaller sample size. The 50% split reduces the amplitude slightly, as expected, but provides a significantly cleaner measurement. This secondary bias is most pronounced below log10(Mgroup/h−1 M⊙)∼13.5 and diminishes at higher masses, a trend consistent with both observational results and simulations (Wang et al. 2008; Montero-Dorta et al. 2020). Furthermore, the amplitude and mass dependence of the signal agree remarkably well with the simulation results from Fig. 5 of Montero-Dorta et al. (2020), showing very similar bias values across the relevant mass range. Despite inherent mass-assignment uncertainties, which could systematically shift or add noise to the signal, the clear trend we recover remains physically meaningful and consistent with simulations.
![]() |
Fig. 5. Secondary bias measurements for subsets of groups split by the colour of their central galaxies (colour bias). Results are shown for both the extreme 15% reddest and bluest centrals (darker colours) and the 50% reddest and bluest centrals (lighter colours). Uncertainties are estimated using jackknife resampling with 50 subsamples for the correlation functions and propagated to the relative bias. |
Our analysis confirms and extends the early findings of Wang et al. (2008) to a broader mass range, while showing consistent results in the overlapping mass regime. This demonstrates that our sample is capable of directly recovering the colour bias reported in both observations and simulations. These results highlight the role of central galaxy colour as a sensitive tracer of secondary bias, encapsulating the interplay between star formation history, halo assembly, and the large-scale environment. The detection of this effect in our sample provides a benchmark against which other secondary dependences can be tested.
4.2. Dependence on cosmic web location and environment
Building on the colour analysis, we now turn to the analysis of an environmental factor: the specific location within the cosmic web, which has been shown to influence group and galaxy properties (e.g. Einasto et al. 2024). We investigate how secondary bias relates to this cosmic web environment by measuring the clustering dependence on the distance to its critical points. The results of our analysis are presented in Fig. 6. The figure shows the relative bias as a function of the group mass for populations split by their distance from the four critical features identified by DisPerSE: Maxima, Filament saddles, Wall saddles, and Minima. Here, the distance from the closest point among these categories is adopted.
![]() |
Fig. 6. Secondary halo bias based on the distance to the critical points of the cosmic web. Each panel shows the relative bias (brelative) as a function of the logarithm of the group mass (log10(Mgroup/h−1 M⊙)). The panels are organised into two rows and four columns; each column represents a specific cosmic web feature identified by the DisPerSE algorithm: Maxima (first column), Filament saddles (second), Wall saddles (third), and Minima (fourth). In each panel, the cyan and orange lines represent the 15% of galaxies farthest from and closest to the feature, respectively. The green and red lines represent the 50% of galaxies farthest from and closest to the feature, respectively. In each panel, the orange line represents the subset of galaxies closer to the feature, and the light blue line shows the subset of galaxies farther away. The dashed black line at brelative = 1.00 serves as a reference point for the relative bias. The error bars on each data point were calculated using the jackknife method. |
A primary result, consistent with Montero-Dorta & Rodriguez (2024), is that the distance parameters induce significant secondary bias, although the nature of the signal is highly dependent on the specific feature. The analysis reveals a clear difference when comparing the groups with the 15% largest and smallest distance to the Maxima, where the farthest population exhibits a relative bias brelative < 0.89 in all the range bins. When using the median split (50% closest and farthest), the differences remain pronounced, but exhibit greater symmetry; the populations farthest from the features show brelative ∼ 0.97, while the closest show brelative ∼ 1.03. This symmetric trend is evident across the entire mass range.
For Filament saddles, the observed trend aligns with that of the Maxima, but exhibits a key difference: the bias signals for the 15% and 50% subsets are more consistent with each other. The relative bias stabilises at approximately 1.05 for haloes closest to the filaments and 0.95 for those farthest away.
An analysis of Fig. 6 reveals that the secondary bias signal associated with Wall Saddles is notably weaker than that of Maxima or Filament Saddles. A consistent signal, comparable to that of Filament Saddles, is only observed for 15% of groups farthest from the Wall Saddles. For all other cases, particularly when considering groups closest to these features or when using the 50% split, the relative bias remains close to unity, showing no significant effect. This overall subdued signal stands in contrast to the strong secondary bias predicted for Wall Saddles in Montero-Dorta & Rodriguez (2024).
The results for the Minima align with the expected theoretical framework, yet reveal important quantitative details. We find a symmetric signal where groups closer to the voids exhibit a relative bias of approximately 0.97, while those farther away show a bias of about 1.03. This pattern is remarkably consistent for both the 15% and 50% splits, underscoring the robustness of the measurement. However, this symmetric behaviour presents a nuance when compared to Montero-Dorta & Rodriguez (2024); in that finding, the signal was less balanced, as groups far from the minima displayed a relative bias much closer to unity, lacking a significant signal of enhanced clustering.
These results support a physical interpretation where the clustering strength of haloes, at fixed mass, is modulated by the anisotropy of their large-scale environment as traced by the cosmic web skeleton. Haloes near the most anisotropic features (Maxima and Filament saddles) show enhanced clustering, while those near Minima exhibit reduced clustering. The complex behaviour of Wall Saddles, where only the 15% most distant groups show a signal comparable to filaments, suggests their environmental influence is more subtle and may represent a transitional regime between isotropic voids and anisotropic filaments. This nuanced picture contrasts with the stronger secondary bias predicted for all these environments in Montero-Dorta & Rodriguez (2024). Further investigation is needed to confirm that these results are not influenced by systematics in the identification of the cosmic web components.
Our observational results reveal an important distinction in secondary bias signals: the amplitude of the environmental dependence based on distance to cosmic web critical points, while clearly detectable, is generally weaker than the colour bias reported in Sect. 4.1. This hierarchy is particularly noteworthy given that this signal exhibits a remarkably constant amplitude across the entire mass range studied. This behaviour presents a double contrast: first, with the colour-based secondary bias, whose strength diminishes at higher masses; and second, with predictions from hydrodynamical simulations like those from Montero-Dorta & Rodriguez (2024), which found a pronounced declining mass dependence in analogous environmental measures. Our results suggest a distinct hierarchy in observational data: internal properties like colour dominate secondary bias, while the influence of the cosmic web environment, though robustly detected, is weaker and mass-independent. This may indicate that this signal is diluted or modulated in observational datasets, potentially due to the challenges in accurately reconstructing the cosmic web from galaxy surveys.
Finally, we examine the relationship between secondary bias and local group overdensity, a key environmental property that captures the immediate surroundings of each group. This local density measurement complements the large-scale environmental information from cosmic web distances, providing a more complete picture of how environment impacts secondary bias. Figure 7 shows results for both 15% and 50% subsets selected by density. A clear and robust trend emerges across both selections: groups in higher-density environments exhibit significantly stronger clustering, reaching relative bias values up to 1.4 for the 15% highest-density regions (δ10) and 1.3 for δ5. Conversely, groups in lower-density environments show systematically weaker clustering, with brelative values dropping below 0.8 (δ10) and 0.7 (δ5) for the 15% least dense regions. The signal displays a weak mass dependence and shows a clear amplitude increase when moving from the 50% to the 15% selection. Notably, these density-based measurements produce the strongest secondary bias signal in our analysis, surpassing even the well-established colour-based bias. The consistency of these results is particularly meaningful for the δ5 measurement, where the density is computed within a 5 h−1 Mpc radius that does not overlap with the scales used for bias calculation (5–20 h−1 Mpc), confirming this as a genuine environmental effect rather than a scale-dependent artifact. We note that Montero-Dorta & Rodriguez (2024) also identify overdensity as the environmental property showing the strongest secondary bias dependence (see their appendix), although with a smaller overall amplitude. Moreover, while our overdensity- and distance-based signals are weaker than those reported in that work, the colour-based secondary bias in our measurements reaches amplitudes comparable to those found in Montero-Dorta & Rodriguez (2024).
![]() |
Fig. 7. Secondary bias based on local group overdensity presented in the same format as Fig. 6. The four lines represent galaxy subsamples split by their local density, with two lines for the higher densities (15% and 50%) and two for the lower densities (15% and 50%). The error bars on each data point were estimated using a jackknife resampling method. |
The larger error bars in Fig. 6 compared to Fig. 7 reflect a physical finding: the environmental classification based on the cosmic web geometry is noisier or less efficient at segregating haloes than that based on local density. This implies a greater intrinsic variance in secondary bias when accounting for the geometry of the environment.
Our analysis reveals a definitive hierarchy in secondary bias drivers. The strongest signal originates from local group overdensity, followed by the secondary bias based on galaxy colour or colour bias. The influence of distance to cosmic web critical points, while clearly detectable, constitutes the weakest of the three effects. This establishes that local environmental measures and internal properties are more dominant determinants of secondary bias than the large-scale cosmic web structure in our observational sample.
4.3. Signal significance analysis
A potential concern when measuring secondary bias in finite mass bins is that the intrinsic dependence of clustering on mass can be can be confused with genuine secondary bias signals. Even within a narrow mass bin, differences in the mean mass of two subsamples will translate into an expected difference in their clustering strength due to the primary halo bias relation. The following analysis quantifies whether our measured secondary bias signals are significantly larger than the differences expected from this residual intra-bin mass spread. We note, however, that this procedure neglects the potential impact of halo mass uncertainties, so it must be regarded as a first-order theoretical test.
Figure 8 assesses the robustness of our secondary bias signals against potential artifacts from the finite width of our mass bins. We quantify the expected bias variation arising from intra-bin mass spread using established bias models. The coloured lines represent the predicted relative bias for each subsample, computed by applying the theoretical Sheth et al. (2001) bias model to each subset’s mean mass. For reference, the shaded regions show the maximum theoretical bias difference across the entire mass bin, obtained from both the analytical Sheth et al. (2001) model and the simulation-calibrated Tinker et al. (2010) fit evaluated at the bin edges.
![]() |
Fig. 8. Assessment on the robustness of the secondary bias measurements against potential intra-bin mass dependences of group bias. Each line corresponds to the relative bias of the upper or lower subsets computed assuming the Sheth et al. (2001) halo bias model, where the mass assigned to each subsample corresponds to the mean mass. For reference, the shaded grey and green regions show the maximum intra-bin bias difference, obtained from the edges of the group mass bin; here both the Sheth et al. (2001) model and the Tinker et al. (2010) analytical fit are employed. |
The key result is that the bias differences measured in our analysis (shown in previous sections) are substantially larger than the intra-bin mass-dominated bias differences derived from theoretical models. The signals derived from the subsets’ mean masses (colored lines) are significantly weaker than our measurements, being consistent with 1 for most of the mass range. Even the maximum possible variation from the full bin width (shaded regions) would be insufficient to explain our observed signals below log10(Mgroup/h−1 M⊙)≲13. This confirms that our detected secondary bias cannot be attributed to residual mass dependences within the bins. We note that the increase in the theoretical expectations at the highest-mass bins is due to the larger bin widths employed in that regime.
5. Discussion and conclusions
This work provides detailed observational constraints on secondary halo bias: the dependence of halo clustering strength on properties beyond halo mass. We constructed an extended galaxy group catalogue from the SDSS-DR18 specifically for this analysis, applying a refined version of the Rodriguez & Merchán (2020) group-finder algorithm to improve the recovery of low-mass systems and thereby extend the mass range of our determinations. Using this catalogue, our results clearly confirm a key prediction from cosmological simulations and analytical models of structure formation: the large-scale clustering of haloes, traced here by galaxy groups, is significantly influenced by secondary properties at fixed halo mass (e.g. Gao & White 2007; Dalal et al. 2008; Wechsler & Tinker 2018; Paranjape et al. 2018; Sato-Polito et al. 2019; Montero-Dorta & Rodriguez 2024).
We began by evaluating secondary bias based on central galaxy colour, a well-established intrinsic property. Our analysis, conducted across a broader mass range than typically explored in observational studies – more comparable to the dynamic range accessible in simulations – reveals that groups with redder centrals exhibit systematically stronger clustering than their bluer counterparts at fixed mass. This signal is particularly pronounced at lower masses (log10(Mgroup/h−1 M⊙)∼11.5 − 12.7). We quantify this effect by computing the final relative bias parameter from scales of 5–20 h−1 Mpc, a conventional range for isolating secondary bias signals. This robust detection confirms and extends the early observational findings of Wang et al. (2008) to a broader mass range, while showing consistent results in the overlapping mass regime. The observed trend aligns with expectations from hydrodynamic simulations (Li et al. 2006; Montero-Dorta et al. 2020), which show similar colour-dependent clustering at fixed halo mass. Since central galaxy colour correlates with halo assembly history (Hearin & Watson 2013; Wechsler & Tinker 2018; Behroozi et al. 2019; Chaves-Montero & Hearin 2020; Montero-Dorta et al. 2021), this observed dependence provides an important observational test of halo assembly bias, suggesting that colour bias may serve as an accessible proxy tracing the formation and evolutionary pathways of haloes, even if not directly equivalent to assembly bias itself.
We performed the first detailed observational measurement of secondary bias based on the distance to the cosmic web critical points, identified using the DisPerSE algorithm. Our results reveal a clear environmental dependence: groups of identical masses located near high-density regions (Maxima and Filament saddles) tend to exhibit enhanced clustering (specially when 15% subsets are employed to optimise the signal), while those in underdense regions (voids) or distant from the major structures show suppressed clustering. This demonstrates that the geometric location of groups within the cosmic web fundamentally influences their clustering beyond halo mass, supporting the scenario where haloes residing in different cosmic web environments originate from distinct initial conditions, leading to bias variations even at fixed mass (e.g. Hahn et al. 2009; Borzyszkowski et al. 2017; Musso et al. 2018; Paranjape et al. 2018; Montero-Dorta & Rodriguez 2024).
Our observational results show both significant agreement and notable discrepancies with the predictions from the TNG300 simulation (Montero-Dorta & Rodriguez 2024). As shown in Fig. 6, we confirm the general theoretical expectation that cosmic web location modulates secondary bias: haloes near anisotropic features (Maxima and Filament saddles) exhibit enhanced clustering (brelative > 1), while those in isotropic underdense regions (near Minima) show suppressed clustering (brelative < 1). However, two key differences emerge from our observational measurements. First, we find that the secondary bias signal for Wall Saddles is substantially weaker in observations than predicted by simulations. While Montero-Dorta & Rodriguez (2024) identified wall-type saddles as the most powerful environmental discriminators, our measurements show the relative bias for groups segregated by distance to Wall Saddles remains consistent with unity across most of the mass range, with only the 15% most distant groups showing a modest signal. Second, the mass dependence of environmental secondary bias appears flatter in our observational data compared to the pronounced decrease with mass seen in simulations. These discrepancies could arise from systematic limitations in cosmic web identification within galaxy surveys, particularly for wall-like structures that are challenging to reconstruct from discrete, redshift-space galaxy distributions. To rigorously distinguish between physical origins and methodological artifacts, future work will require mock catalogues that replicate our observational setup, enabling direct comparison with simulation predictions while accounting for survey-specific systematics.
Complementing the cosmic web analysis, we find that local group overdensity produces the strongest secondary bias signals in our study (Fig. 7). Groups in high-density environments exhibit dramatically enhanced clustering, with relative bias reaching up to 1.4 for the 15% highest-density regions, while those in low-density environments show suppression down to brelative ∼ 0.7. This local density signal, measured on scales (5 h−1 Mpc) that do not overlap with our bias calculation range, represents a genuine environmental effect that surpasses even the well-established colour-based secondary bias. The exceptional strength of local density as a secondary bias indicator aligns with findings from hydrodynamical simulations (e.g. Borzyszkowski et al. 2017; Paranjape et al. 2018; Ramakrishnan et al. 2019; Montero-Dorta & Rodriguez 2024) and theoretical expectations, given that local environment is a fundamental determinant of structure formation and bias. While the local density signal broadly mirrors the environmental trends observed with cosmic web distances – with dense regions showing enhanced clustering and underdense regions suppression – its significantly larger amplitude underscores local density as a more direct and powerful environmental factor in shaping halo clustering, operating somewhat independently of the topological cosmic web framework.
Our results establishes a clear hierarchy in observational secondary bias. In Fig. 9 we compare the strength of several secondary bias signals using the average difference in relative bias between the upper and lower subsets within the representative mass range log10(Mgroup/h−1 M⊙) = [12, 12.5]. This estimator, ΔSB, provides a direct way to quantify and visualise the hierarchy among the different secondary bias effects. The largest integrated signal corresponds to the local overdensity measured on a 10 h−1 Mpc scale (δ10), with ΔSB ≃ 0.6 for the 15% subset. The overdensity on a smaller scale, 5 h−1 Mpc (δ5), follows with a similarly strong but slightly smaller signal (ΔSB ≃ 0.3). Central galaxy colour also shows a clearly positive signal (ΔSB ≃ 0.35 for the 15% subset). The geometric quantities (distances to Minima, Wall saddles, Filament saddles, and Maxima) exhibit comparatively weaker signals, with values ranging from mildly positive to mildly negative, and with Wall saddles remaining consistent with zero. The overall hierarchy is robust, as the ordering of signal strengths is consistent between the 15% and 50% subsets. We emphasise that, as with any observational study, we have the inherent uncertainty of mass assignment that could affect our results. Nevertheless, the clear and consistent trends we observe remain robust and physically meaningful. Our hierarchy of secondary bias factors holds despite these limitations, and our findings provide valuable observational benchmarks for testing theoretical models of halo formation.
![]() |
Fig. 9. Comparison of the secondary bias signal for different properties. The stars represent the difference between the average relative bias of the upper subsets and the average relative bias of the lower subsets, in the mass range of log10(Mgroup/h−1 M⊙) 12 and 12.5. The results are shown for the 15% (purple) and 50% (green) subsets. The properties analysed are local density, Minimum density, Wall saddles, Filament saddles, Maximum density, and colour. |
The results presented in this paper provide a robust observational perspective on environmental secondary bias that, at face value, appears to be in slight tension with the conclusions of Alam et al. (2019), who found no evidence of a direct effect of the cosmic web beyond its modulation of the halo mass function. Methodological differences might, however, explain these apparent discrepancies. While Alam et al. (2019) analysed individual galaxies and inferred halo mass using a halo occupation distribution model, our direct use of a cluster catalogue allows for more precise control of halo mass, which enables us to disentangle and more clearly detect the direct influence of cosmic web geometry on the clustering of fixed-mass haloes. This is consistent with Izzo Dominguez et al. (2025), who found minimal HOD variation across environments, confirming that the cosmic web influences halo clustering while only minimally altering their internal galaxy content. Our findings complement these previous results by demonstrating that a secondary bias signal emerges once we effectively control for halo mass.
To explore the connection between the observed environmental secondary bias signals and potential assembly bias effects, we analysed the relationship between our selected subsamples and galaxy colour, a property expected to correlate with halo assembly history (Hearin & Watson 2013; Wechsler & Tinker 2018; Behroozi et al. 2019; Chaves-Montero & Hearin 2020; Montero-Dorta et al. 2021). Figure 10 presents this analysis, showing the colour distributions of galaxies across different density environments. The clear separation in colour between high- and low-density subsets suggests that environmental secondary bias based on local density may be linked to halo assembly bias through halo accretion history. These results align with the broader picture in which (local) environmental diagnostics act as mediators between internal halo properties and large-scale bias, as indicated by some hydrodynamical simulation results (Paranjape et al. 2018; Montero-Dorta & Rodriguez 2024).
![]() |
Fig. 10. Colour distribution of central galaxies with different density environments. The vertical lines indicate the medians of the respective samples. |
However, as shown in Fig. 4, we do not find a clear connection between the distance of central galaxies to the critical points of the cosmic web and their colour, which complicates efforts to probe the aforementioned link based on our current cosmic web characterisation. Although simulations show that the relation between colour (or assembly history) and DisPerSE-derived distances is weak, Montero-Dorta & Rodriguez (2024) demonstrated that the halo assembly bias signal changes significantly when the sample is conditioned on proximity to critical points. Further investigation, incorporating additional cosmic web identifiers, will be necessary to clarify whether these results are affected by the specific characteristics of the adopted cosmic web descriptor.
In the future, it would be interesting to investigate how two-halo galactic conformity relates to the secondary bias trends reported here, especially those involving central galaxy colour and environment (e.g. Kauffmann 2015; Hearin et al. 2016; Hatfield & Jarvis 2017; Lacerna et al. 2022, 2025).
In conclusion, our study provides comprehensive observational evidence that secondary bias is a multifaceted and widespread phenomenon in the galaxy distribution. We have established a definitive hierarchy of secondary bias drivers: local environmental measures, particularly overdensity, produce the strongest signals, followed by intrinsic galaxy properties like colour, with cosmic web geometry generating more subtle though significant effects. This hierarchy underscores the complex interplay between local environmental processes and larger-scale cosmic web structure in shaping halo clustering. The consistent detection of these signals across multiple environmental metrics confirms that a halo’s location within the cosmic web is a fundamental determinant of its clustering strength, independent of mass. Furthermore, our identification of colour bias and its mediation through local density provides a direct observational link to galaxy assembly bias, demonstrating that secondary dependences at fixed halo mass are not merely theoretical predictions but observable features of the galaxy population. Our work thus contributes to the growing body of observational evidence of assembly bias, joining findings from anisotropic clustering (Obuljen et al. 2019, 2020), halo occupancy variations (Yuan et al. 2021; Wang et al. 2022; Pearl et al. 2024), the stellar-to-halo mass relation (Oyarzún et al. 2024), and spin-dependent clustering (Kim et al. 2025). The robust quantification of these effects presented here provides crucial benchmarks for theoretical models and highlights the importance of accounting for secondary bias in precision cosmological analyses.
Acknowledgments
We thank the anonymous referee for their constructive comments and suggestions, which helped to improve the clarity and quality of this manuscript. FR thanks the support by Agencia Nacional de Promoción Científica y Tecnológica, the Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET, Argentina) and the Secretaría de Ciencia y Tecnología de la Universidad Nacional de Córdoba (SeCyT-UNC, Argentina). ADMD acknowledges support from the Universidad Técnica Federico Santa María through the Proyecto Interno Regular PI_LIR_25_04. FR and ADMD thank the ICTP for their hospitality and financial support through the Junior Associates Programme 2023–2028 and Regular Associates Programme 2022–2027, respectively.
References
- Abbas, U., & Sheth, R. K. 2006, MNRAS, 372, 1749 [Google Scholar]
- Abbas, U., & Sheth, R. K. 2007, MNRAS, 378, 641 [Google Scholar]
- Alam, S., Zu, Y., Peacock, J. A., & Mandelbaum, R. 2019, MNRAS, 483, 4501 [NASA ADS] [CrossRef] [Google Scholar]
- Alfaro, I. G., Rodriguez, F., Ruiz, A. N., Luparello, H. E., & Lambas, D. G. 2022, A&A, 665, A44 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Almeida, A., Anderson, S. F., Argudo-Fernández, M., et al. 2023, ApJS, 267, 44 [NASA ADS] [CrossRef] [Google Scholar]
- Angulo, R. E., Baugh, C. M., & Lacey, C. G. 2008, MNRAS, 387, 921 [NASA ADS] [CrossRef] [Google Scholar]
- Artale, M. C., Zehavi, I., Contreras, S., & Norberg, P. 2018, MNRAS, 480, 3978 [NASA ADS] [CrossRef] [Google Scholar]
- Balaguera-Antolínez, A., Montero-Dorta, A. D., & Favole, G. 2024, A&A, 685, A61 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Bardeen, J. M., Bond, J. R., Kaiser, N., & Szalay, A. S. 1986, ApJ, 304, 15 [Google Scholar]
- Behroozi, P. S., Conroy, C., & Wechsler, R. H. 2010, ApJ, 717, 379 [Google Scholar]
- Behroozi, P., Wechsler, R. H., Hearin, A. P., & Conroy, C. 2019, MNRAS, 488, 3143 [NASA ADS] [CrossRef] [Google Scholar]
- Bonjean, V., Aghanim, N., Douspis, M., Malavasi, N., & Tanimura, H. 2020, A&A, 638, A75 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Borzyszkowski, M., Porciani, C., Romano-Díaz, E., & Garaldi, E. 2017, MNRAS, 469, 594 [NASA ADS] [CrossRef] [Google Scholar]
- Chaves-Montero, J., & Hearin, A. 2020, MNRAS, 495, 2088 [NASA ADS] [CrossRef] [Google Scholar]
- Coil, A. L., Newman, J. A., Cooper, M. C., et al. 2006, ApJ, 644, 671 [NASA ADS] [CrossRef] [Google Scholar]
- Conroy, C., Wechsler, R. H., & Kravtsov, A. V. 2006, ApJ, 647, 201 [Google Scholar]
- Contreras, S., Zehavi, I., Padilla, N., et al. 2019, MNRAS, 484, 1133 [NASA ADS] [CrossRef] [Google Scholar]
- Contreras, S., Chaves-Montero, J., Zennaro, M., & Angulo, R. E. 2021, MNRAS, 507, 3412 [NASA ADS] [CrossRef] [Google Scholar]
- Dalal, N., White, M., Bond, J. R., & Shirokov, A. 2008, ApJ, 687, 12 [NASA ADS] [CrossRef] [Google Scholar]
- Dawson, K. S., Schlegel, D. J., Ahn, C. P., et al. 2013, AJ, 145, 10 [Google Scholar]
- Einasto, M., Einasto, J., Tenjes, P., et al. 2024, A&A, 681, A91 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Faltenbacher, A., & White, S. D. M. 2010, ApJ, 708, 469 [NASA ADS] [CrossRef] [Google Scholar]
- Gao, L., & White, S. D. M. 2007, MNRAS, 377, L5 [NASA ADS] [CrossRef] [Google Scholar]
- Gao, L., Springel, V., & White, S. D. 2005, MNRAS, 363, L66 [NASA ADS] [CrossRef] [Google Scholar]
- Gonzalez, E. J., Rodriguez, F., Merchán, M., et al. 2021, MNRAS, 504, 4093 [NASA ADS] [CrossRef] [Google Scholar]
- Hahn, O., Porciani, C., Dekel, A., & Carollo, C. M. 2009, MNRAS, 398, 1742 [CrossRef] [Google Scholar]
- Han, J., Li, Y., Jing, Y., et al. 2019, MNRAS, 482, 1900 [NASA ADS] [CrossRef] [Google Scholar]
- Hartley, W. G., Wilkins, S. M., Conselice, C. J., et al. 2013, MNRAS, 431, 3045 [Google Scholar]
- Hatfield, P. W., & Jarvis, M. J. 2017, MNRAS, 472, 3570 [Google Scholar]
- Hearin, A. P., & Watson, D. F. 2013, MNRAS, 435, 1313 [Google Scholar]
- Hearin, A. P., Behroozi, P. S., & van den Bosch, F. C. 2016, MNRAS, 461, 2135 [Google Scholar]
- Izzo Dominguez, V., Rodriguez, F., & Montero-Dorta, A. D. 2025, A&A, 706, A42 [Google Scholar]
- Johnson, J. W., Maller, A. H., Berlind, A. A., Sinha, M., & Holley-Bockelmann, J. K. 2019, MNRAS, 486, 1156 [CrossRef] [Google Scholar]
- Kaiser, N. 1984, ApJ, 284, L9 [NASA ADS] [CrossRef] [Google Scholar]
- Kauffmann, G. 2015, MNRAS, 454, 1840 [NASA ADS] [CrossRef] [Google Scholar]
- Kim, Y., Montero-Dorta, A. D., Smith, R., & Shinn, J.-H. 2025, A&A, 700, A149 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Kravtsov, A. V., Berlind, A. A., Wechsler, R. H., et al. 2004, ApJ, 609, 35 [Google Scholar]
- Lacerna, I., Padilla, N., & Stasyszyn, F. 2014, MNRAS, 443, 3107 [NASA ADS] [CrossRef] [Google Scholar]
- Lacerna, I., Rodriguez, F., Montero-Dorta, A. D., et al. 2022, MNRAS, 513, 2271 [NASA ADS] [CrossRef] [Google Scholar]
- Lacerna, I., Padilla, N., & Palma, D. 2025, A&A, 703, A247 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Law-Smith, J., & Eisenstein, D. J. 2017, ApJ, 836, 87 [Google Scholar]
- Lazeyras, T., Musso, M., & Schmidt, F. 2017, JCAP, 2017, 059 [Google Scholar]
- Li, C., Kauffmann, G., Jing, Y. P., et al. 2006, MNRAS, 368, 21 [NASA ADS] [CrossRef] [Google Scholar]
- Li, Y., Mo, H. J., & Gao, L. 2008, MNRAS, 389, 1419 [NASA ADS] [CrossRef] [Google Scholar]
- Lin, Y.-T., Mandelbaum, R., Huang, Y.-H., et al. 2016, ApJ, 819, 119 [NASA ADS] [CrossRef] [Google Scholar]
- Malavasi, N., Aghanim, N., Douspis, M., Tanimura, H., & Bonjean, V. 2020a, A&A, 642, A19 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Malavasi, N., Aghanim, N., Tanimura, H., Bonjean, V., & Douspis, M. 2020b, A&A, 634, A30 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Mao, Y.-Y., Zentner, A. R., & Wechsler, R. H. 2018, MNRAS, 474, 5143 [NASA ADS] [CrossRef] [Google Scholar]
- McNaught-Roberts, T., Norberg, P., Driver, S. P., et al. 2014, MNRAS, 445, 2125 [Google Scholar]
- Meneux, B., Le Fèvre, O., Guzzo, L., et al. 2006, A&A, 452, 387 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Merchán, M. E., & Zandivarez, A. 2005, ApJ, 630, 759 [CrossRef] [Google Scholar]
- Mo, H. J., & White, S. D. M. 1996, MNRAS, 282, 347 [Google Scholar]
- Montero-Dorta, A. D., & Rodriguez, F. 2024, MNRAS, 531, 290 [NASA ADS] [CrossRef] [Google Scholar]
- Montero-Dorta, A. D., Artale, M. C., Abramo, L. R., et al. 2020, MNRAS, 496, 1182 [NASA ADS] [CrossRef] [Google Scholar]
- Montero-Dorta, A. D., Pérez, E., Prada, F., et al. 2017, ApJ, 848, L2 [NASA ADS] [CrossRef] [Google Scholar]
- Montero-Dorta, A. D., Chaves-Montero, J., Artale, M. C., & Favole, G. 2021, MNRAS, 508, 940 [NASA ADS] [CrossRef] [Google Scholar]
- Montero-Dorta, A. D., Contreras, S., Artale, M. C., Rodriguez, F., & Favole, G. 2025, A&A, 695, A159 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Morse, M. 1934, The Calculus of Variations in the Large (American Mathematical Soc.), 18 [Google Scholar]
- Musso, M., Cadiou, C., Pichon, C., et al. 2018, MNRAS, 476, 4877 [Google Scholar]
- Niemiec, A., Jullo, E., Montero-Dorta, A. D., et al. 2018, MNRAS, 477, L1 [NASA ADS] [CrossRef] [Google Scholar]
- Obuljen, A., Dalal, N., & Percival, W. J. 2019, JCAP, 2019, 020 [Google Scholar]
- Obuljen, A., Percival, W. J., & Dalal, N. 2020, JCAP, 2020, 058 [CrossRef] [Google Scholar]
- Oyarzún, G. A., Tinker, J. L., Bundy, K., Xhakaj, E., & Wyithe, J. S. B. 2024, ApJ, 974, 29 [Google Scholar]
- Paranjape, A., Hahn, O., & Sheth, R. K. 2018, MNRAS, 476, 3631 [NASA ADS] [CrossRef] [Google Scholar]
- Pearl, A. N., Zentner, A. R., Newman, J. A., et al. 2024, ApJ, 963, 116 [Google Scholar]
- Planck Collaboration XIII. 2016, A&A, 594, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Ramakrishnan, S., Paranjape, A., Hahn, O., & Sheth, R. K. 2019, MNRAS, 489, 2977 [NASA ADS] [CrossRef] [Google Scholar]
- Rodriguez, F., & Merchán, M. 2020, A&A, 636, A61 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Rodriguez, F., Merchán, M., & Artale, M. C. 2022, MNRAS, 514, 1077 [NASA ADS] [CrossRef] [Google Scholar]
- Rodriguez, F., Merchán, M., Galárraga-Espinosa, D., et al. 2025, A&A, 699, A215 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Rodriguez, F., Montero-Dorta, A. D., Angulo, R. E., Artale, M. C., & Merchán, M. 2021, MNRAS, 505, 3192 [NASA ADS] [CrossRef] [Google Scholar]
- Rodríguez-Medrano, A. M., Paz, D. J., Stasyszyn, F. A., et al. 2023, MNRAS, 521, 916 [CrossRef] [Google Scholar]
- Rykoff, E. S., Rozo, E., Busha, M. T., et al. 2014, ApJ, 785, 104 [Google Scholar]
- Salcedo, A. N., Maller, A. H., Berlind, A. A., et al. 2018, MNRAS, 475, 4411 [NASA ADS] [CrossRef] [Google Scholar]
- Sato-Polito, G., Montero-Dorta, A. D., Abramo, L. R., Prada, F., & Klypin, A. 2019, MNRAS, 487, 1570 [NASA ADS] [CrossRef] [Google Scholar]
- Sheth, R. K., & Tormen, G. 1999, MNRAS, 308, 119 [Google Scholar]
- Sheth, R. K., & Tormen, G. 2004, MNRAS, 350, 1385 [NASA ADS] [CrossRef] [Google Scholar]
- Sheth, R. K., Mo, H. J., & Tormen, G. 2001, MNRAS, 323, 1 [NASA ADS] [CrossRef] [Google Scholar]
- Skibba, R. A., Bamford, S. P., Nichol, R. C., et al. 2009, MNRAS, 399, 966 [NASA ADS] [CrossRef] [Google Scholar]
- Sousbie, T. 2011, MNRAS, 414, 350 [NASA ADS] [CrossRef] [Google Scholar]
- Sousbie, T., Pichon, C., & Kawahara, H. 2011, MNRAS, 414, 384 [NASA ADS] [CrossRef] [Google Scholar]
- Sunayama, T., More, S., & Miyatake, H. 2022, ArXiv e-prints [arXiv:2205.03277] [Google Scholar]
- Tanimura, H., Aghanim, N., Bonjean, V., Malavasi, N., & Douspis, M. 2020a, A&A, 637, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Tanimura, H., Aghanim, N., Kolodzig, A., Douspis, M., & Malavasi, N. 2020b, A&A, 643, L2 [EDP Sciences] [Google Scholar]
- Tinker, J. L., Robertson, B. E., Kravtsov, A. V., et al. 2010, ApJ, 724, 878 [NASA ADS] [CrossRef] [Google Scholar]
- Tucci, B., Montero-Dorta, A. D., Abramo, L. R., Sato-Polito, G., & Artale, M. C. 2021, MNRAS, 500, 2777 [Google Scholar]
- Vale, A., & Ostriker, J. 2004, MNRAS, 353, 189 [NASA ADS] [CrossRef] [Google Scholar]
- van de Weygaert, R., & Schaap, W. 2009, Data Anal. Cosmol., 291 [Google Scholar]
- Wang, K., Mao, Y.-Y., Zentner, A. R., et al. 2022, MNRAS, 516, 4003 [Google Scholar]
- Wang, Y., Yang, X., Mo, H. J., et al. 2008, MNRAS, 385, 1511 [NASA ADS] [CrossRef] [Google Scholar]
- Wechsler, R. H., & Tinker, J. L. 2018, ARA&A, 56, 435 [NASA ADS] [CrossRef] [Google Scholar]
- Wechsler, R. H., Zentner, A. R., Bullock, J. S., Kravtsov, A. V., & Allgood, B. 2006, ApJ, 652, 71 [NASA ADS] [CrossRef] [Google Scholar]
- Yang, X., Mo, H., Van Den Bosch, F. C., & Jing, Y. 2005, MNRAS, 356, 1293 [NASA ADS] [CrossRef] [Google Scholar]
- Yang, X., Mo, H. J., van den Bosch, F. C., et al. 2007, ApJ, 671, 153 [Google Scholar]
- York, D. G., Adelman, J., Anderson, J. E., Jr, et al. 2000, AJ, 120, 1579 [Google Scholar]
- Yuan, S., Hadzhiyska, B., Bose, S., Eisenstein, D. J., & Guo, H. 2021, MNRAS, 502, 3582 [NASA ADS] [CrossRef] [Google Scholar]
- Zehavi, I., Zheng, Z., Weinberg, D. H., et al. 2011, ApJ, 736, 59 [NASA ADS] [CrossRef] [Google Scholar]
- Zehavi, I., Contreras, S., Padilla, N., et al. 2018, ApJ, 853, 84 [NASA ADS] [CrossRef] [Google Scholar]
- Zhai, Z., Percival, W. J., & Guo, H. 2023, MNRAS, 523, 5538 [Google Scholar]
The concept and specific term occupancy variation were first introduced and defined in the studies by Zehavi et al. (2018) and Artale et al. (2018).
All Figures
![]() |
Fig. 1. Main properties of the galaxy groups analysed in this study. Top panel: group mass distribution exhibiting the characteristic decreasing shape. Bottom panel: colour (g − r) distribution of the central galaxies within these groups. The bimodal nature of this distribution clearly distinguishes between the red and blue galaxy subpopulations. |
| In the text | |
![]() |
Fig. 2. Distribution of minimum distances to the cosmic web critical points obtained from DisPerSE. The histograms show the number of central galaxies as a function of their minimum three-dimensional distance to Maxima, Minima, Filament saddles, and Wall saddles. |
| In the text | |
![]() |
Fig. 3. Spatial distribution of central galaxies relative to cosmic web topology. The figure shows central galaxies within a −5° to 15° declination slice of the SDSS Legacy North survey. The panels display the spatial positions of central galaxies, characterised by their minimum three-dimensional distance to the four DisPerSE critical point types (Maxima, Minima, Filament saddles, and Wall saddles). The colour-coding represents the scaled distance to the corresponding critical point, given by log10(distance + 1). |
| In the text | |
![]() |
Fig. 4. Distribution of central galaxy colour as a function of cosmic web location and local overdensity. In each panel, a scatter diagram shows the relation between the distance to each critical point of the density field and the local group overdensity (log10(δ)) on 5 h−1 Mpc scales for all central galaxies in our sample, colour-coded by their g − r colour. From top to bottom, the panels illustrate the distribution with respect to the distance to (1) Maxima (dMax), (2) Filament saddles (dFsadd), (3) Wall saddles (dWSadd), and (4) Minima (dMin). The contour lines (black lines) represent the progressively lower density levels (down to 5% of the peak value). The horizontal dashed lines (green and magenta) mark the 15th (P15) and 85th (P85) percentiles of the distance distribution, respectively, used to define environment thresholds. |
| In the text | |
![]() |
Fig. 5. Secondary bias measurements for subsets of groups split by the colour of their central galaxies (colour bias). Results are shown for both the extreme 15% reddest and bluest centrals (darker colours) and the 50% reddest and bluest centrals (lighter colours). Uncertainties are estimated using jackknife resampling with 50 subsamples for the correlation functions and propagated to the relative bias. |
| In the text | |
![]() |
Fig. 6. Secondary halo bias based on the distance to the critical points of the cosmic web. Each panel shows the relative bias (brelative) as a function of the logarithm of the group mass (log10(Mgroup/h−1 M⊙)). The panels are organised into two rows and four columns; each column represents a specific cosmic web feature identified by the DisPerSE algorithm: Maxima (first column), Filament saddles (second), Wall saddles (third), and Minima (fourth). In each panel, the cyan and orange lines represent the 15% of galaxies farthest from and closest to the feature, respectively. The green and red lines represent the 50% of galaxies farthest from and closest to the feature, respectively. In each panel, the orange line represents the subset of galaxies closer to the feature, and the light blue line shows the subset of galaxies farther away. The dashed black line at brelative = 1.00 serves as a reference point for the relative bias. The error bars on each data point were calculated using the jackknife method. |
| In the text | |
![]() |
Fig. 7. Secondary bias based on local group overdensity presented in the same format as Fig. 6. The four lines represent galaxy subsamples split by their local density, with two lines for the higher densities (15% and 50%) and two for the lower densities (15% and 50%). The error bars on each data point were estimated using a jackknife resampling method. |
| In the text | |
![]() |
Fig. 8. Assessment on the robustness of the secondary bias measurements against potential intra-bin mass dependences of group bias. Each line corresponds to the relative bias of the upper or lower subsets computed assuming the Sheth et al. (2001) halo bias model, where the mass assigned to each subsample corresponds to the mean mass. For reference, the shaded grey and green regions show the maximum intra-bin bias difference, obtained from the edges of the group mass bin; here both the Sheth et al. (2001) model and the Tinker et al. (2010) analytical fit are employed. |
| In the text | |
![]() |
Fig. 9. Comparison of the secondary bias signal for different properties. The stars represent the difference between the average relative bias of the upper subsets and the average relative bias of the lower subsets, in the mass range of log10(Mgroup/h−1 M⊙) 12 and 12.5. The results are shown for the 15% (purple) and 50% (green) subsets. The properties analysed are local density, Minimum density, Wall saddles, Filament saddles, Maximum density, and colour. |
| In the text | |
![]() |
Fig. 10. Colour distribution of central galaxies with different density environments. The vertical lines indicate the medians of the respective samples. |
| In the text | |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.









