Euclid: Photometric redshift calibration performance with the clustering-redshifts technique in the Flagship 2 simulation

W. d’Assignies; M. Manera; C. Padilla; O. Ilbert; H. Hildebrandt; L. Reynolds; J. Chaves-Montero; A. H. Wright; P. Tallada-Crespí; M. Eriksen; J. Carretero; W. Roster; Y. Kang; K. Naidoo; R. Miquel; B. Altieri; A. Amara; S. Andreon; N. Auricchio; C. Baccigalupi; D. Bagot; M. Baldi; A. Balestra; S. Bardelli; P. Battaglia; A. Biviano; E. Branchini; M. Brescia; S. Camera; V. Capobianco; C. Carbone; V. F. Cardone; S. Casas; F. J. Castander; M. Castellano; G. Castignani; S. Cavuoti; K. C. Chambers; A. Cimatti; C. Colodro-Conde; G. Congedo; C. J. Conselice; L. Conversi; Y. Copin; F. Courbin; H. M. Courtois; M. Crocce; A. Da Silva; H. Degaudenzi; S. de la Torre; G. De Lucia; M. Douspis; X. Dupac; A. Ealet; S. Escoffier; M. Farina; F. Faustini; S. Ferriol; F. Finelli; P. Fosalba; S. Fotopoulou; M. Frailis; E. Franceschi; M. Fumana; S. Galeotta; K. George; B. Gillis; C. Giocoli; P. Gómez-Alvarez; J. Gracia-Carpio; A. Grazian; F. Grupp; W. Holmes; I. M. Hook; A. Hornstrup; K. Jahnke; M. Jhabvala; B. Joachimi; E. Keihänen; S. Kermiche; A. Kiessling; B. Kubik; M. Kümmel; M. Kunz; H. Kurki-Suonio; O. Lahav; A. M. C. Le Brun; S. Ligori; P. B. Lilje; V. Lindholm; I. Lloro; G. Mainetti; D. Maino; E. Maiorano; O. Mansutti; S. Marcin; O. Marggraf; K. Markovic; M. Martinelli; N. Martinet; F. Marulli; R. Massey; D. C. Masters; E. Medinaceli; S. Mei; M. Melchior; Y. Mellier; M. Meneghetti; E. Merlin; G. Meylan; A. Mora; M. Moresco; L. Moscardini; C. Neissner; S.-M. Niemi; S. Paltani; F. Pasian; K. Pedersen; V. Pettorino; S. Pires; G. Polenta; M. Poncet; L. A. Popa; L. Pozzetti; F. Raison; R. Rebolo; A. Renzi; J. Rhodes; G. Riccio; E. Romelli; M. Roncarelli; E. Rossetti; R. Saglia; Z. Sakr; D. Sapone; B. Sartoris; J. A. Schewtschenko; P. Schneider; T. Schrabback; A. Secroun; E. Sefusatti; G. Seidel; M. Seiffert; S. Serrano; P. Simon; C. Sirignano; G. Sirri; A. Spurio Mancini; L. Stanco; J. Steinwagner; D. Tavagnacco; A. N. Taylor; H. I. Teplitz; I. Tereno; N. Tessore; S. Toft; R. Toledo-Moreo; F. Torradeflot; A. Tsyganov; I. Tutusaus; L. Valenziano; J. Valiviita; T. Vassallo; G. Verdoes Kleijn; Y. Wang; J. Weller; G. Zamorani; E. Zucca; M. Bolzonella; C. Burigana; L. Gabarra; J. Martín-Fleitas; I. Risso; V. Scottez; M. Viel

doi:10.1051/0004-6361/202555551

Home

All issues

Volume 702 (October 2025)

A&A, 702 (2025) A155

Full HTML

Open Access

Issue		A&A Volume 702, October 2025


Article Number		A155
Number of page(s)		31
Section		Cosmology (including clusters of galaxies)
DOI		https://doi.org/10.1051/0004-6361/202555551
Published online		21 October 2025

A&A, 702, A155 (2025)

Euclid: Photometric redshift calibration performance with the clustering-redshifts technique in the Flagship 2 simulation^⋆

W. d’Assignies¹^⋆⋆, M. Manera²^,1, C. Padilla¹, O. Ilbert³, H. Hildebrandt⁴, L. Reynolds¹^,5, J. Chaves-Montero¹, A. H. Wright⁴, P. Tallada-Crespí⁶^,7, M. Eriksen¹^,7, J. Carretero⁶^,7, W. Roster⁸, Y. Kang⁹, K. Naidoo¹⁰, R. Miquel¹^,11, B. Altieri¹², A. Amara¹³, S. Andreon¹⁴, N. Auricchio¹⁵, C. Baccigalupi¹⁶^,17^,18^,19, D. Bagot²⁰, M. Baldi²¹^,15^,22, A. Balestra²³, S. Bardelli¹⁵, P. Battaglia¹⁵, A. Biviano¹⁷^,16, E. Branchini²⁴^,25^,14, M. Brescia²⁶^,27, S. Camera²⁸^,29^,30, V. Capobianco³⁰, C. Carbone³¹, V. F. Cardone³²^,33, S. Casas³⁴, F. J. Castander³⁵^,36, M. Castellano³², G. Castignani¹⁵, S. Cavuoti²⁷^,37, K. C. Chambers³⁸, A. Cimatti³⁹, C. Colodro-Conde⁴⁰, G. Congedo⁴¹, C. J. Conselice⁴², L. Conversi⁴³^,12, Y. Copin⁴⁴, F. Courbin⁴⁵^,11, H. M. Courtois⁴⁶, M. Crocce³⁵^,36, A. Da Silva⁴⁷^,48, H. Degaudenzi⁹, S. de la Torre³, G. De Lucia¹⁷, M. Douspis⁴⁹, X. Dupac¹², A. Ealet⁴⁴, S. Escoffier⁵⁰, M. Farina⁵¹, F. Faustini³²^,52, S. Ferriol⁴⁴, F. Finelli¹⁵^,53, P. Fosalba³⁶^,35, S. Fotopoulou⁵⁴, M. Frailis¹⁷, E. Franceschi¹⁵, M. Fumana³¹, S. Galeotta¹⁷, K. George⁵⁵, B. Gillis⁴¹, C. Giocoli¹⁵^,22, P. Gómez-Alvarez⁵⁶^,12, J. Gracia-Carpio⁸, A. Grazian²³, F. Grupp⁸^,55, W. Holmes⁵⁷, I. M. Hook⁵⁸, A. Hornstrup⁵⁹^,60, K. Jahnke⁶¹, M. Jhabvala⁶², B. Joachimi⁶³, E. Keihänen⁶⁴, S. Kermiche⁵⁰, A. Kiessling⁵⁷, B. Kubik⁴⁴, M. Kümmel⁵⁵, M. Kunz⁶⁵, H. Kurki-Suonio⁶⁶^,67, O. Lahav⁶³, A. M. C. Le Brun⁶⁸, S. Ligori³⁰, P. B. Lilje⁶⁹, V. Lindholm⁶⁶^,67, I. Lloro⁷⁰, G. Mainetti⁷¹, D. Maino⁷²^,31^,73, E. Maiorano¹⁵, O. Mansutti¹⁷, S. Marcin⁷⁴, O. Marggraf⁷⁵, K. Markovic⁵⁷, M. Martinelli³²^,33, N. Martinet³, F. Marulli⁷⁶^,15^,22, R. Massey⁷⁷, D. C. Masters⁷⁸, E. Medinaceli¹⁵, S. Mei⁷⁹^,80, M. Melchior⁸¹, Y. Mellier⁸²^,83, M. Meneghetti¹⁵^,22, E. Merlin³², G. Meylan⁸⁴, A. Mora⁸⁵, M. Moresco⁷⁶^,15, L. Moscardini⁷⁶^,15^,22, C. Neissner¹^,7, S.-M. Niemi⁸⁶, S. Paltani⁹, F. Pasian¹⁷, K. Pedersen⁸⁷, V. Pettorino⁸⁶, S. Pires⁸⁸, G. Polenta⁵², M. Poncet²⁰, L. A. Popa⁸⁹, L. Pozzetti¹⁵, F. Raison⁸, R. Rebolo⁴⁰^,90^,91, A. Renzi⁹²^,93, J. Rhodes⁵⁷, G. Riccio²⁷, E. Romelli¹⁷, M. Roncarelli¹⁵, E. Rossetti²¹, R. Saglia⁵⁵^,8, Z. Sakr⁹⁴^,95^,96, D. Sapone⁹⁷, B. Sartoris⁵⁵^,17, J. A. Schewtschenko⁴¹, P. Schneider⁷⁵, T. Schrabback⁹⁸, A. Secroun⁵⁰, E. Sefusatti¹⁷^,16^,18, G. Seidel⁶¹, M. Seiffert⁵⁷, S. Serrano³⁶^,99^,35, P. Simon⁷⁵, C. Sirignano⁹²^,93, G. Sirri²², A. Spurio Mancini¹⁰⁰, L. Stanco⁹³, J. Steinwagner⁸, D. Tavagnacco¹⁷, A. N. Taylor⁴¹, H. I. Teplitz⁷⁸, I. Tereno⁴⁷^,101, N. Tessore⁶³, S. Toft¹⁰²^,103, R. Toledo-Moreo¹⁰⁴, F. Torradeflot⁷^,6, A. Tsyganov¹⁰⁵, I. Tutusaus⁹⁵, L. Valenziano¹⁵^,53, J. Valiviita⁶⁶^,67, T. Vassallo⁵⁵^,17, G. Verdoes Kleijn¹⁰⁶, Y. Wang⁷⁸, J. Weller⁵⁵^,8, G. Zamorani¹⁵, E. Zucca¹⁵, M. Bolzonella¹⁵, C. Burigana¹⁰⁷^,53, L. Gabarra¹⁰⁸, J. Martín-Fleitas¹⁰⁹, I. Risso¹¹⁰, V. Scottez⁸²^,111 and M. Viel¹⁶^,17^,19^,18^,112

¹ Institut de Física d’Altes Energies (IFAE), The Barcelona Institute of Science and Technology, Campus UAB, 08193 Bellaterra (Barcelona), Spain
² Serra Húnter Fellow, Departament de Física, Universitat Autònoma de Barcelona, E-08193 Bellaterra, Spain
³ Aix-Marseille Université, CNRS, CNES, LAM, Marseille, France
⁴ Ruhr University Bochum, Faculty of Physics and Astronomy, Astronomical Institute (AIRUB), German Centre for Cosmological Lensing (GCCL), 44780 Bochum, Germany
⁵ Departament de Física, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain
⁶ Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas (CIEMAT), Avenida Complutense 40, 28040 Madrid, Spain
⁷ Port d’Informació Científica, Campus UAB, C. Albareda s/n, 08193 Bellaterra (Barcelona), Spain
⁸ Max Planck Institute for Extraterrestrial Physics, Giessenbachstr. 1, 85748 Garching, Germany
⁹ Department of Astronomy, University of Geneva, ch. d’Ecogia 16, 1290 Versoix, Switzerland
¹⁰ Institute of Cosmology and Gravitation, University of Portsmouth, Portsmouth PO1 3FX, UK
¹¹ Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig de Lluís Companys 23, 08010 Barcelona, Spain
¹² ESAC/ESA, Camino Bajo del Castillo, s/n., Urb. Villafranca del Castillo, 28692 Villanueva de la Cañada, Madrid, Spain
¹³ School of Mathematics and Physics, University of Surrey, Guildford, Surrey GU2 7XH, UK
¹⁴ INAF-Osservatorio Astronomico di Brera, Via Brera 28, 20122 Milano, Italy
¹⁵ INAF-Osservatorio di Astrofisica e Scienza dello Spazio di Bologna, Via Piero Gobetti 93/3, 40129 Bologna, Italy
¹⁶ IFPU, Institute for Fundamental Physics of the Universe, Via Beirut 2, 34151 Trieste, Italy
¹⁷ INAF-Osservatorio Astronomico di Trieste, Via G. B. Tiepolo 11, 34143 Trieste, Italy
¹⁸ INFN, Sezione di Trieste, Via Valerio 2, 34127 Trieste TS, Italy
¹⁹ SISSA, International School for Advanced Studies, Via Bonomea 265, 34136 Trieste TS, Italy
²⁰ Centre National d’Etudes Spatiales – Centre spatial de Toulouse, 18 avenue Edouard Belin, 31401 Toulouse Cedex 9, France
²¹ Dipartimento di Fisica e Astronomia, Università di Bologna, Via Gobetti 93/2, 40129 Bologna, Italy
²² INFN-Sezione di Bologna, Viale Berti Pichat 6/2, 40127 Bologna, Italy
²³ INAF-Osservatorio Astronomico di Padova, Via dell’Osservatorio 5, 35122 Padova, Italy
²⁴ Dipartimento di Fisica, Università di Genova, Via Dodecaneso 33, 16146 Genova, Italy
²⁵ INFN-Sezione di Genova, Via Dodecaneso 33, 16146 Genova, Italy
²⁶ Department of Physics "E. Pancini", University Federico II, Via Cinthia 6, 80126 Napoli, Italy
²⁷ INAF-Osservatorio Astronomico di Capodimonte, Via Moiariello 16, 80131 Napoli, Italy
²⁸ Dipartimento di Fisica, Università degli Studi di Torino, Via P. Giuria 1, 10125 Torino, Italy
²⁹ INFN-Sezione di Torino, Via P. Giuria 1, 10125 Torino, Italy
³⁰ INAF-Osservatorio Astrofisico di Torino, Via Osservatorio 20, 10025 Pino Torinese (TO), Italy
³¹ INAF-IASF Milano, Via Alfonso Corti 12, 20133 Milano, Italy
³² INAF-Osservatorio Astronomico di Roma, Via Frascati 33, 00078 Monteporzio Catone, Italy
³³ INFN-Sezione di Roma, Piazzale Aldo Moro, 2 – c/o Dipartimento di Fisica, Edificio G. Marconi, 00185 Roma, Italy
³⁴ Institute for Theoretical Particle Physics and Cosmology (TTK), RWTH Aachen University, 52056 Aachen, Germany
³⁵ Institute of Space Sciences (ICE, CSIC), Campus UAB, Carrer de Can Magrans, s/n, 08193 Barcelona, Spain
³⁶ Institut d’Estudis Espacials de Catalunya (IEEC), Edifici RDIT, Campus UPC, 08860 Castelldefels, Barcelona, Spain
³⁷ INFN section of Naples, Via Cinthia 6, 80126 Napoli, Italy
³⁸ Institute for Astronomy, University of Hawaii, 2680 Woodlawn Drive, Honolulu, HI 96822, USA
³⁹ Dipartimento di Fisica e Astronomia “Augusto Righi” – Alma Mater Studiorum Università di Bologna, Viale Berti Pichat 6/2, 40127 Bologna, Italy
⁴⁰ Instituto de Astrofísica de Canarias, Vía Láctea, 38205 La Laguna, Tenerife, Spain
⁴¹ Institute for Astronomy, University of Edinburgh, Royal Observatory, Blackford Hill, Edinburgh EH9 3HJ, UK
⁴² Jodrell Bank Centre for Astrophysics, Department of Physics and Astronomy, University of Manchester, Oxford Road, Manchester M13 9PL, UK
⁴³ European Space Agency/ESRIN, Largo Galileo Galilei 1, 00044 Frascati, Roma, Italy
⁴⁴ Université Claude Bernard Lyon 1, CNRS/IN2P3, IP2I Lyon, UMR 5822, Villeurbanne, F-69100, France
⁴⁵ Institut de Ciències del Cosmos (ICCUB), Universitat de Barcelona (IEEC-UB), Martí i Franquès 1, 08028 Barcelona, Spain
⁴⁶ UCB Lyon 1, CNRS/IN2P3, IUF, IP2I Lyon, 4 rue Enrico Fermi, 69622 Villeurbanne, France
⁴⁷ Departamento de Física, Faculdade de Ciências, Universidade de Lisboa, Edifício C8, Campo Grande, PT1749-016 Lisboa, Portugal
⁴⁸ Instituto de Astrofísica e Ciências do Espaço, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016 Lisboa, Portugal
⁴⁹ Université Paris-Saclay, CNRS, Institut d’astrophysique spatiale, 91405 Orsay, France
⁵⁰ Aix-Marseille Université, CNRS/IN2P3, CPPM, Marseille, France
⁵¹ INAF-Istituto di Astrofisica e Planetologia Spaziali, via del Fosso del Cavaliere, 100, 00100 Roma, Italy
⁵² Space Science Data Center, Italian Space Agency, via del Politecnico snc, 00133 Roma, Italy
⁵³ INFN-Bologna, Via Irnerio 46, 40126 Bologna, Italy
⁵⁴ School of Physics, HH Wills Physics Laboratory, University of Bristol, Tyndall Avenue, Bristol BS8 1TL, UK
⁵⁵ Universitäts-Sternwarte München, Fakultät für Physik, Ludwig-Maximilians-Universität München, Scheinerstrasse 1, 81679 München, Germany
⁵⁶ FRACTAL S.L.N.E., calle Tulipán 2, Portal 13 1A, 28231 Las Rozas de Madrid, Spain
⁵⁷ Jet Propulsion Laboratory, California Institute of Technology, 4800 Oak Grove Drive, Pasadena, CA 91109, USA
⁵⁸ Department of Physics, Lancaster University, Lancaster LA1 4YB, UK
⁵⁹ Technical University of Denmark, Elektrovej 327, 2800 Kgs. Lyngby, Denmark
⁶⁰ Cosmic Dawn Center (DAWN), Denmark
⁶¹ Max-Planck-Institut für Astronomie, Königstuhl 17, 69117 Heidelberg, Germany
⁶² NASA Goddard Space Flight Center, Greenbelt, MD 20771, USA
⁶³ Department of Physics and Astronomy, University College London, Gower Street, London WC1E 6BT, UK
⁶⁴ Department of Physics and Helsinki Institute of Physics, Gustaf Hällströmin katu 2, 00014 University of Helsinki, Finland
⁶⁵ Université de Genève, Département de Physique Théorique and Centre for Astroparticle Physics, 24 quai Ernest-Ansermet, CH-1211 Genève 4, Switzerland
⁶⁶ Department of Physics, P.O. Box 64 00014 University of Helsinki, Finland
⁶⁷ Helsinki Institute of Physics, Gustaf Hällströmin katu 2, University of Helsinki, Helsinki, Finland
⁶⁸ Laboratoire d’etude de l’Univers et des phenomenes eXtremes, Observatoire de Paris, Université PSL, Sorbonne Université, CNRS, 92190 Meudon, France
⁶⁹ Institute of Theoretical Astrophysics, University of Oslo, P.O. Box 1029 Blindern 0315 Oslo, Norway
⁷⁰ SKA Observatory, Jodrell Bank, Lower Withington, Macclesfield, Cheshire SK11 9FT, UK
⁷¹ Centre de Calcul de l’IN2P3/CNRS, 21 avenue Pierre de Coubertin, 69627 Villeurbanne Cedex, France
⁷² Dipartimento di Fisica “Aldo Pontremoli”, Università degli Studi di Milano, Via Celoria 16, 20133 Milano, Italy
⁷³ INFN-Sezione di Milano, Via Celoria 16, 20133 Milano, Italy
⁷⁴ University of Applied Sciences and Arts of Northwestern Switzerland, School of Computer Science, 5210 Windisch, Switzerland
⁷⁵ Universität Bonn, Argelander-Institut für Astronomie, Auf dem Hügel 71, 53121 Bonn, Germany
⁷⁶ Dipartimento di Fisica e Astronomia “Augusto Righi” – Alma Mater Studiorum Università di Bologna, via Piero Gobetti 93/2, 40129 Bologna, Italy
⁷⁷ Department of Physics, Institute for Computational Cosmology, Durham University, South Road, Durham DH1 3LE, UK
⁷⁸ Infrared Processing and Analysis Center, California Institute of Technology, Pasadena, CA 91125, USA
⁷⁹ Université Paris Cité, CNRS, Astroparticule et Cosmologie, 75013 Paris, France
⁸⁰ CNRS-UCB International Research Laboratory, Centre Pierre Binétruy, IRL2007, CPB-IN2P3, Berkeley, USA
⁸¹ University of Applied Sciences and Arts of Northwestern Switzerland, School of Engineering, 5210 Windisch, Switzerland
⁸² Institut d’Astrophysique de Paris, 98bis Boulevard Arago, 75014 Paris, France
⁸³ Institut d’Astrophysique de Paris, UMR 7095, CNRS, and Sorbonne Université, 98 bis boulevard Arago, 75014 Paris, France
⁸⁴ Institute of Physics, Laboratory of Astrophysics, Ecole Polytechnique Fédérale de Lausanne (EPFL), Observatoire de Sauverny, 1290 Versoix, Switzerland
⁸⁵ Telespazio UK S.L. for European Space Agency (ESA), Camino bajo del Castillo, s/n, Urbanizacion Villafranca del Castillo, Villanueva de la Cañada, 28692 Madrid, Spain
⁸⁶ European Space Agency/ESTEC, Keplerlaan 1, 2201 AZ Noordwijk, The Netherlands
⁸⁷ DARK, Niels Bohr Institute, University of Copenhagen, Jagtvej 155, 2200 Copenhagen, Denmark
⁸⁸ Université Paris-Saclay, Université Paris Cité, CEA, CNRS, AIM, 91191 Gif-sur-Yvette, France
⁸⁹ Institute of Space Science, Str. Atomistilor, nr. 409 Măgurele, Ilfov 077125, Romania
⁹⁰ Consejo Superior de Investigaciones Cientificas, Calle Serrano 117, 28006 Madrid, Spain
⁹¹ Universidad de La Laguna, Departamento de Astrofísica, 38206 La Laguna, Tenerife, Spain
⁹² Dipartimento di Fisica e Astronomia “G. Galilei”, Università di Padova, Via Marzolo 8, 35131 Padova, Italy
⁹³ INFN-Padova, Via Marzolo 8, 35131 Padova, Italy
⁹⁴ Institut für Theoretische Physik, University of Heidelberg, Philosophenweg 16, 69120 Heidelberg, Germany
⁹⁵ Institut de Recherche en Astrophysique et Planétologie (IRAP), Université de Toulouse, CNRS, UPS, CNES, 14 Av. Edouard Belin, 31400 Toulouse, France
⁹⁶ Université St Joseph; Faculty of Sciences, Beirut, Lebanon
⁹⁷ Departamento de Física, FCFM, Universidad de Chile, Blanco Encalada 2008, Santiago, Chile
⁹⁸ Universität Innsbruck, Institut für Astro- und Teilchenphysik, Technikerstr. 25/8, 6020 Innsbruck, Austria
⁹⁹ Satlantis, University Science Park, Sede Bld 48940, Leioa-Bilbao, Spain
¹⁰⁰ Department of Physics, Royal Holloway, University of London, TW20 0EX, UK
¹⁰¹ Instituto de Astrofísica e Ciências do Espaço, Faculdade de Ciências, Universidade de Lisboa, Tapada da Ajuda, 1349-018 Lisboa, Portugal
¹⁰² Cosmic Dawn Center (DAWN)
¹⁰³ Niels Bohr Institute, University of Copenhagen, Jagtvej 128, 2200 Copenhagen, Denmark
¹⁰⁴ Universidad Politécnica de Cartagena, Departamento de Electrónica y Tecnología de Computadoras, Plaza del Hospital 1, 30202 Cartagena, Spain
¹⁰⁵ Centre for Information Technology, University of Groningen, PO Box 11044 9700 CA Groningen, The Netherlands
¹⁰⁶ Kapteyn Astronomical Institute, University of Groningen, PO Box 800 9700 AV Groningen, The Netherlands
¹⁰⁷ INAF, Istituto di Radioastronomia, Via Piero Gobetti 101, 40129 Bologna, Italy
¹⁰⁸ Department of Physics, Oxford University, Keble Road, Oxford OX1 3RH, UK
¹⁰⁹ Aurora Technology for European Space Agency (ESA), Camino bajo del Castillo, s/n, Urbanizacion Villafranca del Castillo, Villanueva de la Cañada, 28692 Madrid, Spain
¹¹⁰ INAF-Osservatorio Astronomico di Brera, Via Brera 28, 20122 Milano, Italy, and INFN-Sezione di Genova, Via Dodecaneso 33, 16146 Genova, Italy
¹¹¹ ICL, Junia, Université Catholique de Lille, LITL, 59000 Lille, France
¹¹² ICSC – Centro Nazionale di Ricerca in High Performance Computing, Big Data e Quantum Computing, Via Magnanelli 2, Bologna, Italy

^⋆⋆ Corresponding author: wdoumerg@ifae.es

Received: 16 May 2025
Accepted: 28 July 2025

Abstract

Aims. The precision of cosmological constraints from imaging surveys hinges on an accurately estimated redshift distribution n(z) of the tomographic bins, especially their mean redshifts. We assess the effectiveness of the clustering-redshifts technique in constraining Euclid tomographic redshift bins to meet the target uncertainty of σ(⟨z⟩) < 0.002(1 + z). We inferred these mean redshifts from the small-scale angular clustering of Euclid galaxies, which were distributed into bins with spectroscopic samples localised in narrow redshift slices.

Methods. We generated spectroscopic mocks from the Flagship2 simulation for the Baryon Oscillation Spectroscopic Survey (BOSS), the Dark Energy Spectroscopic Instrument (DESI), and the Euclid Near-Infrared Spectrometer and Photometer (NISP) spectroscopic survey. We evaluated and optimised the clustering-redshifts pipeline, and we introduced a new method for measuring the photometric galaxy bias (clustering), which is the primary limitation of this technique.

Results. We have successfully constrained the means and standard deviations of the redshift distributions for all of the tomographic bins (with a maximum photometric redshift of 1.6). We achieved precision beyond the required thresholds. We have identified the main sources of bias, particularly the impact of the one-halo galaxy distribution, which imposed the minimal separation scale to be larger than 1.5 Mpc for evaluating cross-correlations. These results demonstrate that clustering-redshifts can meet the precision requirements for Euclid, and we highlighted several avenues for future improvements.

Key words: methods: data analysis / methods: statistical / techniques: photometric / techniques: spectroscopic / large-scale structure of Universe

^⋆

This paper is published on behalf of the Euclid Consortium.

© The Authors 2025

Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1. Introduction

The Euclid space telescope is currently mapping the positions and shapes of billions of galaxies. This provides data that are critical to understand the large-scale structure of the Universe, in particular, the mysterious nature of dark matter and dark energy (Euclid Collaboration: Mellier et al. 2025). Through its imaging survey, Euclid measures the flux of galaxies in the visible and near-infrared wavelengths using broadband filters. Additionally, it determines spectroscopic redshifts for a subset of galaxies via slitless spectroscopy, targeting emission-line galaxies (ELGs). These data will enable the precise determination of cosmological parameters and models, primarily through the study of galaxy clustering, galaxy-galaxy lensing, and cosmic shear (Euclid Collaboration: Blanchard et al. 2020), with galaxy samples divided into approximate line-of-sight tomographic bins.

It is essential to accurately estimate the redshift distributions of these galaxy samples to interpret cosmological measurements correctly (see e.g. Huterer et al. 2006; Bordoloi et al. 2010; Hildebrandt et al. 2021; Stölzner et al. 2021). The vast amount of forthcoming data means that it is not feasible to obtain spectroscopic redshifts for every individual galaxy because spectroscopy of large samples is time-consuming and costly. Photometric surveys provide redshift estimates for each galaxy based on multi-band photometry of that galaxy, a technique called photometric redshift, or photo-z. A large variety of photo-z methods exists (e.g. Tanaka et al. 2018; Salvato et al. 2019; Euclid Collaboration: Ilbert et al. 2021; Euclid Collaboration: Desprez et al. 2020). The photometric information can also be used to produce redshift estimates using scheme based on a self-organising map (hereafter, SOM), which allows a more general control over all the known potential sources of uncertainties that affect the estimates (Wright et al. 2020a,b; Myles et al. 2021; Giannini et al. 2024; Campos et al. 2024; Wright et al. 2025; Roster et al. 2025). The degeneracies between colours and redshift, however, and unrepresentative spectroscopic samples for training and calibration ultimately limit the performance of photometric methods (Wright et al. 2020a; Hartley et al. 2020).

Clustering-based redshift estimation methods offer another alternative to infer redshift distributions (see e.g. Newman 2008; Ménard et al. 2013; McQuinn & White 2013; Morrison et al. 2017; Scottez et al. 2018; Gatti et al. 2018; van den Busch et al. 2020; Hildebrandt et al. 2021; Gatti et al. 2022; Cawthon et al. 2022; Rau et al. 2023; Wright et al. 2025). Unlike photo-z, clustering-redshifts techniques statistically infer the redshift properties for entire bins instead of individual galaxies. Furthermore, this approach is independent of photometric uncertainties, observations over smaller deep fields, or representative spectroscopic samples.

The clustering-redshifts method relies on angular cross-correlations between spectroscopic samples, with secure redshifts, and photometric samples. Samples that overlap in redshift trace the same underlying dark matter field and are consequently correlated in their position. The amplitude of this angular correlation provides insight into their redshift overlap, which results in the measurement of the redshift distribution of the photometric samples. This amplitude is degenerate with galaxy biases (of both samples), however, which makes the measurement of these biases, and their redshift evolution, a critical step (McQuinn & White 2013; Gatti et al. 2018; Naidoo et al. 2023). While the galaxy bias for the spectroscopic sample can be directly constrained through auto-correlation functions, it is more challenging to estimate the galaxy bias for the photometric sample (van den Busch et al. 2020; Cawthon et al. 2022), and is sometimes evaluated using simulations (Gatti et al. 2022).

The precision of the clustering-redshifts method depends on the redshift and sky overlap of the photometric and spectroscopic data, but also on range of the clustering angular scale. To achieve the primary science goal of Euclid, the uncertainty on the mean redshift, σ(⟨z⟩), for each tomographic bin must remain below 0.002(1 + z) at 68% confidence (Laureijs et al. 2011). Naidoo et al. (2023) found that clustering-redshifts would be able to reach the statistical uncertainties required by Euclid for a sufficiently large sky overlap, typically several hundreds of deg², when analysing scales from 100 kpc to 1 Mpc, although systematic biases limit the accuracy. In particular, there were some unknown residual biases for high-redshift bins with their methods or simulated data.

This paper aims to re-evaluate the potential of photometric-spectroscopic cross-correlations as a core component of the redshift calibration for Euclid. Our method may be seen as a continuation, and a substantial improvement from Naidoo et al. (2023). We used 5000 deg² of simulated data from the Euclid Flagship2 simulation (Euclid Collaboration: Castander et al. 2025). Our goal is to evaluate the uncertainties associated with the clustering-redshifts calibration through a realistic framework, and to determine the limitation of the method. We created spectroscopic mock samples for the Dark Energy Spectroscopic Instruments (DESI), or alternatively, for the 4-metre Multi-Object Spectroscopic Telescope (4MOST), the Baryon Oscillation Spectroscopic Survey (BOSS), and Euclid. We assessed the potential systematic effects on the measurement, introduced a new method to measure photometric galaxy bias, optimised the clustering-z pipeline, and derived methods for inferring the redshift constraints from the amplitude of two-point correlations. We assumed the same flat ΛCDM cosmology as was used for the simulation, with Ω_m = 0.319, Ω_b = 0.049, Ω_Λ = 0.681, A_s = 2.1 × 10⁻⁹, n_s = 0.96, σ₈ = 0.813, and h = 0.67. Our code is publicly available¹.

This paper is organised as follows. In Sect. 2 we describe the Flagship2 simulation and the mocks we generated to mimic Euclid and spectroscopic data. In Sect. 3 we give a detailed overview of angular clustering, its application to clustering redshifts, and potential sources of errors. In Sect. 4 we present the different tests we ran to optimise the pipeline and address potential sources of bias. Finally in Sect. 5 we apply the results pipeline to realistic Euclid tomographic bins and compare our results with previous work and the requirements.

2. Simulated data

To assess the performance of the Euclid clustering-redshifts method, we constructed simulated datasets that were as similar as possible to the observational datasets. In this section, we describe how these samples can be generated from the Flagship simulation.

2.1. The Flagship simulation and survey mocks

For this study, we used the second edition of the Flagship catalogue (version 2.1.10), which was presented in detail in Euclid Collaboration: Castander et al. (2025). The Euclid Flagship N-body dark matter simulation comprises a simulation box measuring 3600 h⁻¹ Mpc on each side, containing 4 trillion particles. This simulation is the largest N-body simulation performed to date and encompasses a cosmological volume comparable to what the telescope will survey. It enables precise resolution of 16 billion dark matter haloes, which host even the faintest galaxies that Euclid intends to observe. They are identified with the ROCKSTAR halo finder (Behroozi et al. 2013). From this dark matter simulation, a synthetic galaxy catalogue was generated with a prescription including the halo occupation distribution (HOD) and abundance matching (AM) techniques, as well as empirical relations between galaxy properties (Euclid Collaboration: Castander et al. 2025).

Each galaxy entry in the catalogue is associated with observed fluxes for multiple survey bands, observed spectroscopic redshift (z_spec, including peculiar velocity), photometric redshift estimates (z_p), true redshift (z_true), true and magnified coordinates, lensing convergence κ and shear γ = γ₁ + iγ₂. Source photometry is provided in terms of fluxes F (in erg s⁻¹ cm⁻² Hz⁻¹) rather than in magnitude m. We generate magnified fluxes as in Lepori et al. (2025), using the relation

$\begin{matrix} F_{magn} = \frac{F}{{(1 - κ)}^{2} - γ_{1}^{2} - γ_{2}^{2}} . \end{matrix}$ $\begin{aligned} {F}_{\rm magn}=\frac{F}{(1-\kappa )^2-\gamma _1^2-\gamma _2^2}\,. \end{aligned}$ (1)

We do not include the Doppler effect, which is usually a very small correction, and note that the effect of magnification is achromatic: that is, it is the same for for the different bands and it cancels for colours.

We need mock samples Euclid-like photometric galaxies, Euclid-like Near-Infrared Spectrometer and Photometer Survey (NISP-S) galaxies, and other spectroscopic tracers (Dawson et al. 2013; Adame et al. 2024): BOSS-like low-redshift (LOWZ) and constant mass (CMASS) galaxies; DESI-like bright galaxies (BGs), luminous red galaxies (LRGs), and ELGs. The selection of each sample is described in the following sections. Differences in the redshift distributions or colours of observational and simulated galaxy samples may stem from the incompleteness of the target sample or limitations in the simulation’s ability to replicate accurate colour distributions. As in Naidoo et al. (2023), systematic issues such as variable photometry, variable survey depth, and complex masks are not addressed. We generated one version of these samples with unmagnified fluxes, which will be our default samples, and one version with the same cuts on magnified photometry. We produced numerous fast generations of galaxy catalogues, varying sample definitions and areas, using the web application CosmoHub (Carretero et al. 2017; Tallada et al. 2020). Our photometric and spectroscopic samples are plotted in Fig. 1.

Fig. 1.

Top: Simulated true-redshift distribution of each of the first ten tomographic bins (z_photo < 1.6). Bottom: Simulated redshift distribution of the spectroscopic samples from the BOSS, DESI, and Euclid surveys. The lack of spectroscopic samples for z_spec > 1.8 imposes the z_photo < 1.6 condition for clustering-redshifts calibration.

2.2. Euclid photometric sample

The photometric sample was constructed by imposing an upper limit on the I_E magnitude (Euclid Collaboration: Cropper et al. 2025), set at 24.5, to emulate the expected performance of Euclid (Laureijs et al. 2011). Several algorithms exist for evaluating the photometric redshift. We used the Deep-z estimate which was run for the full catalogue. The Deep-z photo-z method, introduced for the Physics of the Accelerating Universe (PAU) survey in Eriksen et al. (2020), uses a linear neural network with a mixture density network for predicting the redshift distributions. The network has not been pretrained on simulations, avoiding potential problems with over-fitting by using too similar simulations. The network was trained with 20 000 simulated galaxies, magnitude-limited to i_LSST < 23 without colour selection. The training used the Legacy Survey of Space and Time (LSST; LSST Science Collaboration 2009) bands plus Euclid near-infrared (NIR) bands, a batch size of 200 and 1000 epochs with varying learning rates. The photometric noise added to noiseless fluxes corresponds to the depth forecast in the southern hemisphere for the Euclid DR3 time. The limiting magnitudes at 10σ are 24.4, 25.6, 25.7, 25.0, 24.3, 24.3 for the ugrizy bands of LSST, and 25.0, 23.5, 23.5, 23.5 for the I_E, Y_E, J_E, H_E bands of Euclid (Euclid Collaboration: Cropper et al. 2025; Euclid Collaboration: Jahnke et al. 2025).

LSST photometry will not be available over the whole Euclid footprint, and therefore the photometry will be different in the northern region where photometry will be provided by UNIONS (Gwyn et al. 2025). However, for clustering redshifts, we do not depend on photo-z quality except for the tomographic bin definitions and photometric bias correction. Spectroscopy from DESI and BOSS-eBOSS will be available in the north, and 4MOST will provide DESI-like samples in the south. Thus, photometric angular anisotropies will be tested in practice.

We selected galaxies with photo-z between 0.2 and 1.6. We did not consider galaxies with a photo-z higher than 1.6 because we do not have large spectroscopic samples at redshifts greater than 1.8. Future studies could test the possibility of using quasar galaxy samples from BOSS, eBOSS, DESI, and 4MOST (Palanque-Delabrouille et al. 2016; Yèche et al. 2020) for high-z calibration. BOSS and eBOSS quasars were used for the clustering-redshifts DES calibration, but as a complement to ELGs and LRGs data sets for z < 1.1 (Gatti et al. 2022; Giannini et al. 2024). The low density of this sample, and clustering properties at high-z, require a dedicated study. Nonetheless, eBOSS QSOs will be used for the high-z calibration of the final DES data for 0.8 < z < 2.2 (d’Assignies et al., in prep.); thus one expects that DESI QSOs could be used up to z = 3.5 for Euclid. Generating DESI QSO mocks and testing clustering-redshifts with them is left for future work.

We produced ten tomographic bins with approximately equal numbers of galaxies, through photo-z cuts. The fiducial Euclid survey at data release 3 plans to use 13 bins instead of ten, and we can assume that three of these bins will contain galaxies with z_p > 1.6. The true redshift distributions (that we want to measure) of our simulated tomographic bins are reported in Fig. 1.

2.3. Euclid NISP sample

We defined mock NISP-S samples in Flagship2 by selecting galaxies with H α fluxes greater than 2 × 10⁻¹⁶erg cm⁻² s⁻¹ (Laureijs et al. 2011), in the expected redshift range, with

$\begin{matrix} logf_halpha_model 3_ext > - 15.7, \end{matrix}$ $\begin{aligned}&\mathtt {logf\_halpha\_model3\_ext} >-15.7\,,\end{aligned}$ (2)

$\begin{matrix} 0.9 < z_{spec} < 1.8 . \end{matrix}$ $\begin{aligned}&0.9<z_{\rm spec} < 1.8\,. \end{aligned}$ (3)

We obtained a galaxy density of 1990 deg⁻². Unlike the other spectroscopic samples, this dataset is expected to suffer from a significant level of contamination by interlopers (Euclid Collaboration: Le Brun et al. 2025, Risso et al., in prep.). These interlopers are galaxies whose true redshifts differ substantially from their estimated values, leading to incorrect associations in redshift space. Although our analysis did not employ realistic mocks that include such contaminants, we adopted the approach of Contarini et al. (2022): we uniformly down-sampled the galaxy catalogue, retaining only 60% of the originally selected galaxies. This down-sampled catalogue is then assumed to have 100% purity, effectively removing the impact of interlopers in our modelling. Thus, our fiducial density is 1200 deg⁻² as the sample from Naidoo et al. (2023), which was generated with a brighter cut on the H α flux. This is a pessimistic sample from the precision point of view (i.e. fewer spectra means larger uncertainty on the redshift distribution reconstruction; cf. McQuinn & White 2013), but optimistic in terms of accuracy, because our sample is interloper free (Addison et al. 2019). Interlopers may significantly degrade the performance of clustering-redshifts if not taken into consideration, as they are not necessarily distributed randomly in redshift. Indeed, clustering-redshifts might be helpful to quantify the interlopers’ properties such as their fraction and redshift distribution, by cross-correlating them with spectroscopic samples which do not suffer from the same misidentification. We will explore these two aspects in future work.

2.4. BOSS LOWZ and CMASS samples

We replicated the BOSS colour-magnitude selections specified in Dawson et al. (2013) in Flagship, as in Naidoo et al. (2023). We first define

$\begin{matrix} c_{‖} = 0.7 (g - r) + 1.2 (r - i - 0.18), \end{matrix}$ $\begin{aligned}&c_\parallel = 0.7(g-r)+1.2\,(r-i-0.18)\,,\end{aligned}$ (4)

$\begin{matrix} c_{⊥} = (r - i) - (g - r) / 4 - 0.18, \end{matrix}$ $\begin{aligned}&c_\perp =(r-i)-(g-r)/4-0.18\,,\end{aligned}$ (5)

$\begin{matrix} d_{⊥} = (r - i) - (g - r) / 8 . \end{matrix}$ $\begin{aligned}&d_\perp =(r-i)-(g-r)/8\,. \end{aligned}$ (6)

There are two spectroscopic BOSS LRG samples: LOWZ and CMASS. They correspond to adjacent redshift intervals 0.15 ≤ z_spec ≤ 0.43 and 0.43 < z_spec ≤ 0.7, respectively, with true densities of about 30 deg⁻² and 120 deg⁻², respectively.

The LOWZ sample is defined by

$\begin{matrix} | c_{⊥} | < 0.2, \end{matrix}$ $\begin{aligned}&\vert c_\perp \vert < 0.2\,,\end{aligned}$ (7)

$\begin{matrix} 16 < r < 19.5, \end{matrix}$ $\begin{aligned}&16 < r < 19.5\,,\end{aligned}$ (8)

$\begin{matrix} r < 13.6 + c_{‖} / 0.3, \end{matrix}$ $\begin{aligned}&r < 13.6+c_\parallel /0.3\,, \end{aligned}$ (9)

and the CMASS sample by

$\begin{matrix} d_{⊥} > 0.55, \end{matrix}$ $\begin{aligned}&d_\perp >0.55\,,\end{aligned}$ (10)

$\begin{matrix} 17.5 < i < 19.9, \end{matrix}$ $\begin{aligned}&17.5 < i < 19.9\,,\end{aligned}$ (11)

$\begin{matrix} i < 19.86 + 1.6 (d_{⊥} - 0.8) . \end{matrix}$ $\begin{aligned}&i < 19.86+1.6\,(d_\perp -0.8)\,. \end{aligned}$ (12)

We got LOWZ- and CMASS- like objects, with densities of 48.5 deg⁻² and 164.3 deg⁻². We then applied sparse sampling to get the desired densities. We refer to the joint LOWZ+CMASS sample as the BOSS sample in our study.

2.5. DESI samples

We generated DESI BG, LRG, and ELG mocks that mimic the sample characteristics reported in Adame et al. (2024). We associated these galaxy samples to the DESI survey, but similar samples are expected to be observed by 4MOST (Verdier et al. 2025). For BGs, and ELGs, we used the same cuts as Naidoo et al. (2023), which were based on DESI Collaboration (2016). For LRGs, we used a different selection to generate a sample covering the entire z < 1.1 redshift range. As we previously mentioned in Sect. 2.1, there is a mismatch in the Flagship2 colour distribution and data expectation, and using the same colour cuts can lead to some significant differences. We find this issue to be particularly problematic for LRGs. Therefore we did not apply the same selection as Adame et al. (2024), but rather we tried to closely reproduce the redshift distribution and galaxy properties with similar but modified cuts.

We selected the BG-like galaxies with

$\begin{matrix} r < 19.5, \end{matrix}$ $\begin{aligned}&r < 19.5\,,\end{aligned}$ (13)

$\begin{matrix} z_{spec} < 0.4 . \end{matrix}$ $\begin{aligned}&z_{\rm spec} < 0.4\,. \end{aligned}$ (14)

We adopted for LRGs the tailored cuts

$\begin{matrix} 18 < z < 21, \end{matrix}$ $\begin{aligned}& 18 < z < 21\,,\end{aligned}$ (15)

$\begin{matrix} - 6.5 < r - z - 0.4 z < - 5.8, \end{matrix}$ $\begin{aligned}&-6.5 < r-z-0.4\,z < -5.8\,,\end{aligned}$ (16)

$\begin{matrix} color_kind = 0 (i.e. red sequence) . \end{matrix}$ $\begin{aligned}&\mathtt {color\_kind} = 0 \text{(i.e.} \text{ red} \text{ sequence)}\,. \end{aligned}$ (17)

Finally, we selected ELGs with

$\begin{matrix} r < 23.4, \end{matrix}$ $\begin{aligned}& r < 23.4\,,\end{aligned}$ (18)

$\begin{matrix} r - z > 0.3, \end{matrix}$ $\begin{aligned}&r-z>0.3\,,\end{aligned}$ (19)

$\begin{matrix} g - r < 0.7, \end{matrix}$ $\begin{aligned}&g-r < 0.7\,,\end{aligned}$ (20)

$\begin{matrix} 0.6 < z_{spec} < 1.6, \end{matrix}$ $\begin{aligned}&0.6<z_{\rm spec} < 1.6\,,\end{aligned}$ (21)

$\begin{matrix} color_kind \neq 0, \end{matrix}$ $\begin{aligned}&\mathtt {color\_kind} \ne 0,\end{aligned}$ (22)

$\begin{matrix} logf_o 2_model 1_ext > - 16 . \end{matrix}$ $\begin{aligned}&\mathtt {logf\_o2\_model1\_ext} >-16\,. \end{aligned}$ (23)

These cuts provided BG-, LRG-, and ELG-like objects with densities of 850, 909, and 4158 deg⁻². We achieved the fiducial densities 700, 550, and 1140 deg⁻² with sparse sampling.

3. Theory and method

In this section, we provide a detailed description of the clustering-redshifts method. Our approach begins with an overview of galaxy clustering in Sect. 3.1. In Sect. 3.2 we explain how the clustering equations are simplified for some particular redshift distributions. We then apply this formalism to the clustering-redshifts context in Sect. 3.3. In Sect. 3.4, we explain how clustering-redshifts are evaluated with data in practice. Finally, in Sect. 3.5 we explain our strategy to deduce from the data-vectors constraints on the redshift distribution moments. In particular, in Sect. 3.5.1, we state the Euclid requirements for the n_p measurements, which must be fulfilled in order to provide robust constraints on cosmological parameters.

3.1. Angular galaxy clustering

In this section we provide a general introduction to photometric angular clustering, as a basis for the development of our clustering-redshifts method. It draws inspiration from many previous weak-lensing and clustering-z works such as Davis et al. (2018), Krause et al. (2021), Gatti et al. (2018, 2022), Cawthon et al. (2022), and Pandey et al. (2025).

3.1.1. Galaxy density contrast

The observed angular galaxy count of a sample a (photometric or spectroscopic) is

$\begin{matrix} N_{a}^{obs} (θ) = {\bar{N}}_{a} (1 + Δ_{a}^{obs} (θ)), \end{matrix}$ $\begin{aligned} N_{a}^\mathrm{obs}({\theta }) = \overline{N}_{a}\left(1+\Delta _{a}^\mathrm{obs}({\theta })\right)\,, \end{aligned}$ (24)

where ${\bar{N}}_{a}$ $\overline{N}_{a}$ is the mean (observed) galaxy count, and $Δ_{a}^{obs}$ $\Delta_{a}^{\mathrm{obs}}$ is the observed projected galaxy density contrast. It can be expressed as (Krause et al. 2021)

$\begin{matrix} Δ_{a}^{obs} (θ) \approx Δ_{a}^{D} (θ) + Δ_{a}^{RSD} (θ) + Δ_{a}^{μ} (θ), \end{matrix}$ $\begin{aligned} \Delta _{a}^\mathrm{obs}({\theta })\approx \Delta _{a}^\mathrm{D}({\theta })+ \Delta _{a}^\mathrm{RSD}({\theta })+ \Delta _{a}^{\mu }({\theta })\,, \end{aligned}$ (25)

where μ refers to magnification, RSD to redshift space distortion (in the linear regime²), and $Δ_{a}^{D}$ $\Delta_{a}^{\mathrm{D}}$ to the line of sight projection of the three-dimensional (3D) galaxy density contrast δ_a(z, ⃗θ):

$\begin{matrix} Δ_{a}^{D} (θ) = \int d z n_{a} (z) δ_{a} (z, θ), \end{matrix}$ $\begin{aligned} \Delta _{a}^\mathrm{D}({\theta }) = \int {\mathrm{d} z} \;n_{a}(z)\,\delta _{a}(z,{\theta })\,, \end{aligned}$ (26)

where n_a(z) is the redshift distribution of the observed sample, normalised to unity, and function of the observed redshift. All these quantities are associated with observations, and they depend on survey properties (depth, masks etc.), but for clarity, we drop the “obs” index for the rest of our work, as we do not consider observational systematics. We also do not model relativistic corrections.

3.1.2. Angular two-point correlation function

The angular two-point correlation function of samples a and b (with possibly a = b) from the observed projected density contrast can be written as

$\begin{matrix} \begin{matrix} w_{ab}^{full} (θ) & = ⟨ Δ_{a}^{D} Δ_{b}^{D} ⟩ (θ) + ⟨ Δ_{a}^{RSD} Δ_{b}^{D} ⟩ (θ) + ⟨ Δ_{a}^{D} Δ_{b}^{RSD} ⟩ (θ) \\ + ⟨ Δ_{a}^{D} Δ_{b}^{μ} ⟩ (θ) + ⟨ Δ_{a}^{μ} Δ_{b}^{D} ⟩ (θ) . \end{matrix} \end{matrix}$ $\begin{aligned} \begin{aligned} { w}_{ab}^\mathrm{full}(\theta )&=\langle \Delta ^\mathrm{D}_a\Delta ^\mathrm{D}_b \rangle (\theta )+\langle \Delta ^\mathrm{RSD}_a\Delta ^\mathrm{D}_b \rangle (\theta )+\langle \Delta ^\mathrm{D}_a\Delta ^\mathrm{RSD}_b \rangle (\theta )\\&+\langle \Delta ^\mathrm{D}_a\Delta ^{\mu }_b \rangle (\theta )+\langle \Delta ^{\mu }_a\Delta ^{D}_b \rangle (\theta )\,. \end{aligned} \end{aligned}$ (27)

where ⟨…⟩ indicates an average over angular scales, and θ is now a scalar³. The dominant corrections (RSD×D; μ ×D) are already small with respect to the dominant term (D×D), and we decided to omit even smaller corrections (Krause et al. 2021; Gatti et al. 2022).

We will mainly focus on the D × D correlation. Since it corresponds to the leading order, we omit its D index in the rest of the article. The clustering two-point correlation can be expressed as

$\begin{matrix} w_{ab} (θ) & = ⟨ Δ_{a}^{D} Δ_{b}^{D} ⟩ (θ) \end{matrix}$ $\begin{aligned} { w}_{ab}(\theta )&=\langle \Delta ^\mathrm{D}_a\Delta ^\mathrm{D}_b \rangle (\theta )\end{aligned}$ (28)

$\begin{matrix} = \int \int {d z}_{1} {d z}_{2} n_{a} (z_{1}) n_{b} (z_{2}) ⟨ δ_{a} δ_{b} ⟩ (z_{1}, z_{2}, θ) \end{matrix}$ $\begin{aligned}&=\int \int {\mathrm{d} z}_1 \,{\mathrm{d} z}_2\, n_a(z_1)\,n_b(z_2)\, \langle \delta _a\,\delta _b \rangle (z_1,z_2,\theta )\end{aligned}$ (29)

$\begin{matrix} = \int \int {d z}_{1} {d z}_{2} n_{a} (z_{1}) n_{b} (z_{2}) ξ_{ab} (z_{1}, z_{2}, θ), \end{matrix}$ $\begin{aligned}&=\int \int {\mathrm{d} z}_1\, {\mathrm{d} z}_2\, n_a(z_1)\,n_b(z_2)\, \xi _{ab}(z_1,z_2,\theta )\,, \end{aligned}$ (30)

where we have defined the 3D cross-correlation function ξ_ab(z_a, z_b, θ), and used ⟨δ_x⟩ = 0. While we have introduced the correlations as a function of sky separation θ, we can also express them as functions of the perpendicular comoving separation r_p, defined as

$\begin{matrix} r_{p} = θ χ (z), \end{matrix}$ $\begin{aligned} {r_{\rm p}}=\theta \; \chi (z)\,, \end{aligned}$ (31)

in the flat sky approximation, and small angle limit, where χ is the comoving radial distance (equal to the comoving angular-diameter distance with no spatial curvature). Working with r_p instead of θ is relevant when making particular scale choices for evaluating the correlation, but the Python package we use, TreeCorr (Jarvis 2015) has angular separation as the default convention.

3.1.3. Moving to matter statistics with galaxy bias

For large-scale correlations, the linear galaxy bias provides a relation between the galaxy overdensity and the matter density field δ_m

$\begin{matrix} δ_{g} (θ, z) = b_{g} (z) δ_{m} (θ, z), \end{matrix}$ $\begin{aligned} \delta _{\rm g}({\theta },\,z) = b_{\rm g}(z)\,\delta _{\rm m}({\theta },z)\,, \end{aligned}$ (32)

where b_g is the galaxy sample bias, and δ_m is the matter density field. Under this approximation, the angular correlation function becomes,

$\begin{matrix} w_{ab} (θ) = \int \int {d z}_{1} {d z}_{2} b_{a} (z_{1}) b_{b} (z_{2}) n_{a} (z_{1}) n_{b} (z_{2}) ξ_{m} (z_{1}, z_{2}, θ), \end{matrix}$ $\begin{aligned} { w}_{ab}(\theta ) = \int \int {\mathrm{d} z}_1\, {\mathrm{d} z}_2\, b_a(z_1)\,b_b(z_2)\,{n}_a(z_1)\,{n}_b(z_2)\,\xi _{\rm m}(z_1,z_2,\theta )\,, \end{aligned}$ (33)

where ξ_m is the 3D matter two-point correlation.

Following the literature, we used the linear bias expression of Eq. (32) by default. It is worth noting, however, that the linear bias expression is expected to break at scales around 6 h⁻¹ Mpc. In addition, the next-order expression would extend this scale range to around 4 h⁻¹ Mpc (Pandey et al. 2020). Thus, none of the equations (linear or non-linear e.g. Appendix A) are properly adapted to the standard clustering-redshifts scale ranges, (0.1–1) h⁻¹ Mpc, (0.5–1.5) h⁻¹ Mpc or (1–5) h⁻¹ Mpc. However, linear bias has been tested to give accurate enough modelling for clustering-redshifts analysis (Gatti et al. 2018, 2022; van den Busch et al. 2020; Cawthon et al. 2022; Naidoo et al. 2023). As a consequence, it is crucial to quantify and address potential deviations introduced in the reconstructed n(z). We will pay particular attention to the galaxy bias issue, nonlinear contributions, and the scale range choice.

3.1.4. Magnification

In this section, we describe the two magnification terms in Eq. (27),

$\begin{matrix} M_{ab} (r_{p}) & = ⟨ Δ_{a}^{D} Δ_{b}^{μ} ⟩ (r_{p}) + ⟨ Δ_{a}^{μ} Δ_{b}^{D} ⟩ (r_{p}) . \end{matrix}$ $\begin{aligned} M_{ab}({r_{\rm p}})&=\langle \Delta ^\mathrm{D}_{a}\Delta ^{\mu }_{b} \rangle ({r_{\rm p}})+\langle \Delta ^{\mu }_{a}\Delta ^{D}_{b} \rangle ({r_{\rm p}})\,. \end{aligned}$ (34)

Magnification is a lensing effect due to the gravitational bending of distant source light by the matter between those sources and the observer (Krause et al. 2021; Elvin-Poole et al. 2023). It introduces an angular position correlation for galaxies distant in redshift because of the lensing of a high-z galaxy sample by the matter traced by a lower-z sample⁴. Fortunately, this effect, which breaks the naive hypothesis that in clustering-redshifts only galaxies close in redshift are correlated, can be modelled.

Magnification impacts the apparent position of galaxies and their density. In regions of positive convergence⁵, the apparent separation between any two points on a source plane is enlarged, leading the telescope to capture a larger apparent solid angle, consequently reducing the apparent galaxy density locally. Another effect of positive convergence is an increase in distant source flux received by the telescope, potentially resulting in a change in the number of galaxies passing through the selection function.

Magnification was taken into account for clustering-z in DES, through direct modelling of its effect, and simulation-based estimation of magnification coefficients (Gatti et al. 2022; Cawthon et al. 2022). Complete inclusion of magnification should include position magnification, flux magnification, and its impact on photo-z, which can be significant (Legnani et al., in prep.), but we restricted this work to position and flux magnification. We detail how magnification can be modelled in the Appendix B. Since the tomographic bins are expected to be narrower for Euclid than for DES, the effect of magnification is expected to be smaller, as illustrated by the toy examples in Appendix B. Therefore, we do not include magnification in our default modelling. We will reassess the need to incorporate magnification in future work, based on the properties of the data bins.

3.1.5. RSD

In the linear regime, RSD contributes to the projected galaxy density contrast through the apparent large-scale flow of galaxies across the redshift boundaries of the bins (Krause et al. 2021). Equivalently, the redshift distribution of galaxies as a function of the observed redshift is slightly different from the distribution as a function of the Hubble redshift. The latter should be used, however, as the modelling assumes isotropy through the correlation function ξ. Since the photometric tomographic bins are quite large, it would mainly affect the spectroscopic sample which is divided into narrow redshift slices; the narrower the slices, the larger the effect is. Therefore, only the term $⟨ Δ_{s}^{RSD} Δ_{p}^{D} ⟩ (θ)$ $\langle \Delta^{\mathrm{RSD}}_{\mathrm{s}} \Delta^{\mathrm{D}}_{\mathrm{p}}\rangle(\theta)$ may contribute to the two-point correlation of Eq. (27). We tested in Sect. 4.7 the bias induced by the use of the spec-z instead of true-z for the spectroscopic slicing, in the Flagship simulation, since peculiar velocities are included, cf. Sect. 2.1.

3.2. Impact of the galaxy redshift distribution and its modelling

A tomographic bin will be referred to as a sample p, without referring to the tomographic ID, as the latter is always explicit. Different spectroscopic samples are considered in this article from different simulated surveys and tracers, such as DESI ELGs. For a given survey tracer, we will additionally distribute the sample into redshift slices, with spectroscopic redshift cuts. For a given redshift slice, the galaxies will be referred to as s_j, or {s, z_j} with z_j the mean of the redshift slice. We explore two cases for the modelling of the distribution and two-point correlations with a sliced spectroscopic sample.

3.2.1. Ideal case of a Dirac galaxy distribution

The situation often presented in clustering-redshifts literature (e.g. Gatti et al. 2022; Giannini et al. 2024) is the one of a sample localised in such a small redshift slice that it can be approximated by a Dirac distribution

$\begin{matrix} n_{a} (z) \approx δ_{D} (z - z_{i}) . \end{matrix}$ $\begin{aligned} n_a(z)\approx \delta _{\rm D}(z-z_i)\,. \end{aligned}$ (35)

The cross-correlation with a second sample is then simplified to

$\begin{matrix} w_{ab}^{Dirac} (θ) & = b_{a} (z_{i}) \int d z b_{b} (z) n_{b} (z) ξ_{m} (z_{i}, z, θ), \end{matrix}$ $\begin{aligned} { w}_{ab}^\mathrm{Dirac}(\theta )&=b_a(z_i)\int {\mathrm{d} z} \;b_b(z)\,{n}_b(z)\,\xi _{\rm m}(z_i,\,z,\,\theta ),\end{aligned}$ (36)

$\begin{matrix} \approx b_{a} (z_{i}) b_{b} (z_{i}) n_{b} (z_{i}) ξ_{m} (z_{i}, θ), \end{matrix}$ $\begin{aligned}&\approx b_a(z_i)\,b_b(z_i)\, n_b(z_i)\, \xi _{\rm m}(z_i,\theta )\,, \end{aligned}$ (37)

with

$\begin{matrix} ξ_{m} (z_{i}, θ) = \int d z ξ_{m} (z_{i}, z, θ) . \end{matrix}$ $\begin{aligned} \xi _{\rm m}(z_i,\theta ) = \int {\mathrm{d} z} \;\xi _{\rm m}(z_i,\,z,\,\theta )\,. \end{aligned}$ (38)

To go from Eq. (36) to Eq. (37), we assume the redshift evolution of the bias and distribution of the second sample is small in the support of ξ_m (i.e. in a small redshift interval centred in z_i).

3.2.2. Realistic case: Spectroscopic sample into small redshift slices

Using the Limber approximation, Eq. (33) reduces to,

$\begin{matrix} w_{ab}^{Limber - full - z} (θ) = \int d z b_{a} (z) b_{b} (z) n_{a} (z) n_{b} (z) ξ_{m} (z, θ), \end{matrix}$ $\begin{aligned} { w}_{ab}^\mathrm{Limber-full-z}(\theta ) = \int {\mathrm{d} z}\; b_a(z)\,b_b(z)\,{n}_a(z)\,{n}_b(z)\,\xi _{\rm m}(z,\,\theta )\,, \end{aligned}$ (39)

where ξ_m is the same as in Eq. (38). In the standard scenario of clustering redshifts, the spectroscopic galaxy sample is localised within small redshift slices z_i ± Δz_i/2, so that the distribution can be approximated as uniform

$\begin{matrix} n_{a} (z) = {\begin{matrix} 1 / Δ z_{i} & if | z - z_{i} | < Δ z_{i} / 2, \\ 0 & else, \end{matrix} \end{matrix}$ $\begin{aligned} n_a(z) = {\left\{ \begin{array}{ll} 1/\Delta z_i&\text{ if} \vert z-z_i\vert < \Delta z_i/2\,, \\ 0&\text{ else,} \end{array}\right.} \end{aligned}$ (40)

where Δz_i is typically between 0.01 and 0.05. This modelling is of course only approximated true in practice, and would lead to some deviations which we will evaluate. In particular, under the assumption of constant galaxy bias across the redshift bin, the spectroscopic auto-correlation (b = a) becomes

$\begin{matrix} w_{aa}^{Limber - 1 - bin} (θ) \approx \frac{b_{a}^{2} (z_{i})}{Δ z^{2}} \int_{- Δ z_{i} / 2}^{Δ z_{i} / 2} d z ξ_{m} (z_{i} + z, θ) . \end{matrix}$ $\begin{aligned} { w}_{aa}^\mathrm{Limber-1-bin}(\theta )\approx \frac{b_a^2(z_i)}{\Delta z^2}\int _{-\Delta z_i/2}^{\Delta z_i/2}{\mathrm{d} z}\; \xi _{\rm m}(z_i+z,\,\theta )\,. \end{aligned}$ (41)

Thus, given a cosmology, w_aa provides a straightforward measurement of the spectroscopic galaxy bias. We note that this equation diverges as Δz → 0, and that, more generally, both our modelling and the Limber approximation are expected to become less accurate for narrower redshift bins and on larger angular scales (Simon 2007).

For b ≠ a, when the redshift evolution of the biases b_a and b_b and the redshift distribution n_b is neglected, we obtain

$\begin{matrix} w_{ab}^{Limber - 1 - bin} (θ) = \frac{b_{a} (z_{i})}{Δ z_{i}} b_{b} (z_{i}) n_{b} (z_{i}) \int_{- Δ z_{i} / 2}^{Δ z_{i} / 2} d z ξ_{m} (z_{i} + z, θ) . \end{matrix}$ $\begin{aligned} { w}_{ab}^\mathrm{Limber-1-bin}(\theta ) = \frac{b_a(z_i)}{\Delta z_i}\,b_b(z_i)\,{n}_b(z_i)\,\int _{-\Delta z_i/2}^{\Delta z_i/2} {\mathrm{d} z}\;\xi _{\rm m}(z_i+z,\,\theta )\,. \end{aligned}$ (42)

The superscript Limber-1-bin refers to the integration of the matter distribution over a redshift bin equal to the spectroscopic slice; we uses the Limber approximation too. If one further neglects the matter correlation variation over Δz_i, one recovers the Dirac approximation Eq. (37); this is the approach of van den Busch et al. (2020), for example.

A limitation of Eq. (42) is the assumption that the redshift distribution n_b is constant across the bin. In practice, n_b could be larger near the bin edges and smaller near the centre (or the opposite), leading to an underestimation (resp. overestimation) of correlations with galaxies localised near z_i ± Δz_i. To address these issues without resorting to Eq. (39), we propose a correction by explicitly incorporating contributions from galaxies outside the spectroscopic redshift slice (three-bins):

$\begin{matrix} w_{ab}^{3 - bins} (θ) = \frac{b_{a} (z_{i})}{Δ z_{i}} \sum_{δ z \in {0, - Δ z_{i}, Δ z_{i}}} & b_{b} (z_{i} + δ z) n_{b} (z_{i} + δ z) \\ \times \int_{- Δ z_{i} / 2}^{Δ z_{i} / 2} d z ξ_{m}^{δ z} (z_{i} + z, θ), \end{matrix}$ $\begin{aligned} { w}_{ab}^\mathrm{3-bins}(\theta ) = \frac{b_a(z_i)}{\Delta z_i}\sum _{\delta z\in \{0,\,-\Delta z_i,\, \Delta z_i\}}&b_b(z_i+\delta z)\,{n}_b(z_i+\delta z)\\&\times \int _{-\Delta z_i/2}^{\Delta z_i/2} {\mathrm{d} z}\;\xi _{\rm m}^{\,\delta z}(z_i+z,\,\theta )\,,\nonumber \end{aligned}$ (43)

We introduce the phased correlations

$\begin{matrix} ξ_{m}^{δ z} (z, θ) = \int_{- Δ z_{i} / 2}^{Δ z_{i} / 2} d z' ξ_{m} (z, z + z' + δ z, θ), \end{matrix}$ $\begin{aligned} \xi _{\rm m}^{\,\delta z }(z,\,\theta ) = \int _{-\Delta z_i/2}^{\Delta z_i/2} {\mathrm{d} z}\prime \;\xi _{\rm m}(z,\, z+z\prime + \delta z,\,\theta )\,, \end{aligned}$ (44)

which account for the correlation of two bins of widths Δz_i, separated by δz. Here, the three-bins modelling does not uses the Limber approximation. One can write Eq. (43) as

$\begin{matrix} w_{ab} & (θ) \approx b_{a} (z_{i}) \int_{- Δ z_{i} / 2}^{Δ z_{i} / 2} d z ξ_{m}^{0} (z_{i} + z, θ) \\ \times \sum_{j \in {- 1, 0, 1}} b_{b} (z_{i} + j Δ z) n_{b} (z_{i} + j Δ z) η_{j} (z_{i}, θ), \end{matrix}$ $\begin{aligned} { w}_{ab}&(\theta )\approx b_a(z_i)\, \int _{-\Delta z_i/2}^{\Delta z_i/2} {\mathrm{d} z}\;\xi _{\rm m}^{\,0}(z_i+z,\,\theta )\\&\times \sum _{j\in \{-1,0,1\}} b_b(z_i+j\,\Delta z)\, n_b(z_i+j\,\Delta z) \;\eta _j(z_i,\theta ) \,,\nonumber \end{aligned}$ (45)

with η₀ = 1 and

$\begin{matrix} η_{\pm} (z_{i}, θ) = \frac{\int_{- Δ z_{i} / 2}^{Δ z_{i} / 2} d z ξ_{m}^{\pm Δ z_{i}} (z_{i} + z, θ)}{\int_{- Δ z_{i} / 2}^{Δ z_{i} / 2} d z ξ_{m}^{0} (z_{i} + z, θ)}, \end{matrix}$ $\begin{aligned} \eta _\pm (z_i,\, \theta ) = \frac{\int _{-\Delta z_i/2}^{\Delta z_i/2} {\mathrm{d} z}\;\xi _{\rm m}^{\pm \Delta z_i}(z_i+z,\,\theta )}{\int _{-\Delta z_i/2}^{\Delta z_i/2} {\mathrm{d} z}\;\xi _{\rm m}^{\,0}(z_i+z,\,\theta )}\,, \end{aligned}$ (46)

where ξ_m^0, ±Δz_i is defined in Eq. (44). This formulation allows for a linear inference of ⃗n from the two-point correlation, which can be implemented in practice, unlike the Limber-full-z case where n_p(z) is integrated over, complicating the process.

3.2.3. Comparison of the correlation modelling

In the previous sections, we introduced four different approximations of the two-point correlation function described by Eq. (33): $w_{ab}^{Dirac}$ $\mathit{w}_{ab}^{\mathrm{Dirac}}$ , $w_{ab}^{Limber}$ $\mathit{w}_{ab}^{\mathrm{Limber}}$ , $w_{ab}^{1 - bin}$ $\mathit{w}_{ab}^{\mathrm{1-bin}}$ , and $w_{ab}^{3 - bins}$ $\mathit{w}_{ab}^{\mathrm{3-bins}}$ . In this section, we investigate the differences between them with a toy model. We consider 10 samples a_i, each with a step-n(z) distribution as given by Eq. (40), centred at z_i = 0.325 + 0.05 i, with i = 0, 1…, 9, and Δz_i = 0.05. For the redshift distribution of sample b we take a Gaussian (μ, σ) = (0.5, 0.08). This reproduces an idealistic case scenario of clustering-z calibration, with sample b representing the photometric data, and samples a_i representing the spectroscopic data. Several of these n(z)s are plotted in the upper panel of Fig. 2. We assume a constant galaxy bias b_a(z) = b_b(z) = 1 for all samples, and evaluate correlation functions for r_p = 2 Mpc (a scale standard for clustering-z analysis). We numerically evaluated the functions with the CCL library⁶ (Chisari et al. 2019). We used the default configuration, which is based on predictions from CLASS (Blas et al. 2011) and halofit (Takahashi et al. 2012) for the nonlinear power spectrum. We evaluated the functions at scales where we know the predictions are not completely precise because of the galaxy-halo connection, but as will be explicit in Sect. 3.3, this is expected to have little impact on real data.

Fig. 2.

Top: Redshift distribution of sample b in blue, and four out of the ten samples a_i in red. Bottom: Deviation of approximated w_{a_ib} with respect to the exact one for the ten cross-correlations.

In the lower panel of Fig. 2, we show the deviation of the four approximations to the “exact” correlation Eq. (33), for the samples a_i, b. The level of the deviations depends on the n_b(z) where the correlation is evaluated: we observe a larger deviation near the peak and for the tails. We evaluated the root mean square (RMS) of the deviation to 1 and got 0.022 (Dirac), 0.021 (Limber-1-bin), 0.013 (3-bins), and 0.002 (Limber-full-z). We can see that the Dirac and one-bin approximations are very close. Taking into consideration the correlations from bin neighbours with three-bins, we get an improvement of 40%. Finally, the best approximation is the Limber-full-z case [i.e. integrating n²(z) instead of assuming some effective values], with a deviation of 0.2% for individual points, and an RMS smaller by an order of magnitude to the Dirac and one-bin approximations.

In clustering-z studies, however, the redshift distribution n_p(z) is an unknown quantity that we aim to constrain from the measurement of w_ab. As a result, the Limber-full-z case is challenging to apply in practice, as the unknown n_p(z) is integrated over. In contrast, for the Dirac and one-bin cases, n_p(z_i) enters the calculation linearly, making it easier to extract information. The same applies to the three-bins approximation, which provides a better approximation, as we have demonstrated.

3.3. Application to clustering redshifts

In the context of clustering redshifts, a photometric sample has an unknown redshift distribution n_p(z). To reconstruct it, we measure the cross-correlations with spectroscopic samples, distributed into fine redshift slices (usually 0.02 ≤ Δz ≤ 0.05). The distribution of these spectroscopic samples is assumed to be constant in each of these smaller slices (see Eq. 40). The amplitude of the photometric distribution at redshift z_i can then be inferred linearly from the cross-correlation function between a spectroscopic bin centred in z_i and the photometric sample [noted w_sp(z_i, θ) instead of w_{s_ip}(θ)], assuming the Dirac modelling Eq. (37):

$\begin{matrix} n_{p} (z_{i}) = \frac{w_{sp} (z_{i}, θ)}{b_{s} (z_{i}) b_{p} (z_{i}) ξ_{m} (z_{i}, θ)} . \end{matrix}$ $\begin{aligned} {n_{\rm p}}(z_i) = \frac{{{ w}_{\rm sp}}(z_i,\,\theta )}{b_{\rm s}(z_i)\,b_{\rm p}(z_i)\,\xi _{\rm m}(z_i,\,\theta )}\,. \end{aligned}$ (47)

One can write a similar expression for the one-bin and three-bins modelling, as described in Sect. 3.2.3.

The spectroscopic bias can be measured with the spectroscopic auto-correlation, following Eq. (41),

$\begin{matrix} n_{p} (z_{i}) = \frac{w_{sp} (z_{i}, θ)}{\sqrt{Δ z w_{ss} (z_{i}, θ) ξ_{m} (z_{i}, θ)} b_{p} (z_{i})} . \end{matrix}$ $\begin{aligned} {n_{\rm p}}(z_i) = \frac{{{ w}_{\rm sp}}(z_i,\,\theta )}{\sqrt{\Delta z\, { w}_{\rm ss}(z_i,\,\theta )\,\xi _{\rm m}(z_i,\,\theta )}\,b_{\rm p}(z_i)}\,. \end{aligned}$ (48)

The n_p(z) is still degenerate with the photometric galaxy bias. In the next section, we describe different possibilities for correcting this degeneracy.

3.3.1. Correcting the photometric galaxy bias

It is more complicated to measure the photometric bias than the spectroscopic bias since the sample cannot be split into small redshift slices and Eq. (41) cannot be applied. Here we introduce different methods.

M0: No correction at all. We assume there is no b_r and b_p redshift evolutions and renormalise the measured redshift distribution.
M1: Spectroscopic bias only. We assume there is no b_p redshift evolution and renormalise the measured redshift distribution.
M2: The photometric bias is measured for the whole tomographic bin. Hence we do not take into account the redshift evolution within the tomographic bin, but we are increasing the uncertainty on the n_p(z_i), and thus we are more conservative.
M3: The photometric bias is measured for relatively large photometric bins (Δz_photo = 0.1), and the redshift evolution is deduced from a polynomial fit. The photo-z spreading of the bin needs to be corrected before the fit (more details in Sect. 3.3.2).
M4: The photometric bias is measured for small photometric bins (Δz_photo = 0.02), and the redshift evolution is deduced from a polynomial fit. The photo-z spreading of the bin needs to be corrected before the fit (cf. Sect. 3.3.2).
M5: Full correction. The two biases are measured using the true redshift (simulation only).

Methods 0-1-2-5 were already tested in Naidoo et al. (2023). Methods 3 and 4 are new in that context, but similar to methods from Cawthon et al. (2022) and van den Busch et al. (2020). Ideally, we would like to measure w_ss and w_pp over the same redshift slices: [z_i±Δz/2[. In doing so, we would get

$\begin{matrix} n_{p} (z_{i}) = \frac{w_{sp} (z_{i}, θ)}{Δ z \sqrt{w_{ss} (z_{i}, θ) w_{pp} (z_{i}, θ)}} . \end{matrix}$ $\begin{aligned} {n_{\rm p}}(z_i) = \frac{{{ w}_{\rm sp}}(z_i,\,\theta )}{\Delta z\,\sqrt{ { w}_{\rm ss}(z_i,\,\theta )\,{ w}_{\rm pp}(z_i,\,\theta )}}\,. \end{aligned}$ (49)

We note that we derive this equation with the linear bias assumption. Indeed we show that it remains valid at the next-order of the bias expansion in Appendix A. This ideal situation corresponds to M5, the photometric sample being distributed into redshift slices with the true redshift variable. This method can only be evaluated via simulation. The aim of methods 3 and 4 is to estimate w_pp but with a different binning, and taking into account the photo-z spreading. With M2, we don’t correct for the redshift evolution, but we aim to take into account the variance induced by the photometric galaxy bias (Naidoo et al. 2023).

3.3.2. More details about the M3 and M4 corrections

With methods 3 and 4, the photometric sample is divided into bins (denoted j) based on photo-z cuts. These differ from the spectroscopic distribution described in Eq. (40), therefore requiring correction for the photo-z spreading. In other words, a photometric bin cannot be approximated by a redshift slice.

In practice, we measure the n(z) of these bins with the clustering-redshifts M1 method (cf. Sect. 3.3.1). This method assumes negligible photometric redshift evolution within each small bin. Then, we want to convert our imperfect-bin measurement to what we would measure for an ideal case with step n(z). Using the Limber case (cf. Sect. 3.2.3), we introduce the correction to the two-point auto-correlation function

$\begin{matrix} w_{pp}^{corr} (z_{j}, θ) = w_{pp}^{meas} (z_{j}, θ) \frac{{(Δ z)}^{- 2} \int_{- Δ z / 2}^{Δ z / 2} d z ξ_{m} (z_{j} + z, θ)}{\int_{0}^{\infty} d z n_{p_{j}}^{2} (z) ξ_{m} (z, θ)}, \end{matrix}$ $\begin{aligned} { w}_{\rm pp}^\mathrm{corr}(z_j,\,\theta ) = { w}_{\rm pp}^\mathrm{meas}(z_j,\,\theta )\; \frac{{(\Delta z)^{-2}}\int _{-\Delta z/2}^{\Delta z/2} {\mathrm{d} z} \;\xi _{\rm m}(z_j+z,\,\theta )}{\int _0^{\infty } {\mathrm{d} z}\; n_{\mathrm{p}_j}^2(z)\, \xi _{\rm m}(z,\,\theta )}\,, \end{aligned}$ (50)

where the Δz corresponds to the spectroscopic redshift bin of the w_sp correlation. We refer to this as the “full” correction, as it accounts for the redshift evolution of the matter correlation function across the bin. We use CCL to evaluate ξ_m at small scales.

A simpler approximation has been implemented for the DES redMaGiC and Maglim samples (Cawthon et al. 2022; Giannini et al. 2024). The correction was modelled with

$\begin{matrix} w_{pp}^{corr} (z_{j}, θ) = w_{pp}^{meas} (z_{j}, θ) \frac{{(Δ z)}^{- 1}}{\int_{0}^{\infty} d z n_{i}^{2} (z)}, \end{matrix}$ $\begin{aligned} { w}_{\rm pp}^\mathrm{corr}(z_j,\,\theta ) = { w}_{\rm pp}^\mathrm{meas}(z_j,\,\theta )\;\frac{(\Delta z)^{-1}}{\int _0^\infty {\mathrm{d} z}\, n_i^2(z)}\,, \end{aligned}$ (51)

where n_i(z) was estimated for the redMaGiC subsample having a spec-z and a photo-z (Gatti et al. 2022), or with clustering-redshifts (Cawthon et al. 2022). This is referred to as the “partial” correction, as it makes additional simplifying assumptions compared to the “full” correction in Eq. (50).

The “width-only” correction applies when using different bin widths for the auto- and cross-correlation functions, while still neglecting the effects of photo-z spreading across the bin. This correction simplifies to

$\begin{matrix} w_{pp}^{corr} (z_{j}, θ) = w_{pp}^{meas} (z_{j}, θ) \frac{{(Δ z)}_{meas}}{{(Δ z)}_{w_{sp}}} . \end{matrix}$ $\begin{aligned} { w}_{\rm pp}^\mathrm{corr}(z_j,\,\theta ) = { w}_{\rm pp}^\mathrm{meas}(z_j,\,\theta )\;\frac{(\Delta z)_{\rm meas}}{(\Delta z)_{{{ w}_{\rm sp}}}}\,. \end{aligned}$ (52)

In summary, the “full” correction accounts for the redshift evolution of the matter correlation function, the “partial” correction introduces additional approximations by assuming a simplified n(z), and the “width-only” correction applies when focusing solely on differences in bin widths while neglecting photo-z spreading. We illustrate these methods and apply them to clustering-redshifts in Sect. 4.8.

These two methods for correcting photometric galaxy bias share similarities with the method in van den Busch et al. (2020). We do not assume a bias model of the form B_p(z)∝(1 + z)^α, however. Instead, we directly measure and correct w_pp(z) by constraining the photo-z spreading using clustering-z for smaller bins. Theoretically, the two approaches could be combined, which would eliminate any systematic effects continuous on the redshift, without assuming a particular model.

3.3.3. Data vectors reduced to scalars with some weighting

Using Eq. (48), one can extract a measurement of n_p for different r_p (or equivalently θ). To optimise the sensitivity, it is standard to reduce all the data vectors to scalars with a scale weighting W_w(r_p),

$\begin{matrix} {\bar{w}}_{ab} (z_{i}) = \int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W_{w} (r_{p}) w_{ab} (r_{p}, z_{i}) . \end{matrix}$ $\begin{aligned} \overline{w}_{ab}(z_i) = \int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}} \mathrm{d} {r_{\rm p}} \;W_{ w}(r_{\rm p})\,{ w}_{ab}(r_{\rm p},\,z_i)\,. \end{aligned}$ (53)

We introduce the scale-weighted matter correlation

$\begin{matrix} {\bar{ξ}}_{m} (z) = \int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W_{w} (r_{p}) ξ_{m} (r_{p}, z) . \end{matrix}$ $\begin{aligned} \overline{\xi }_{\rm m}(z) = \int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}} \mathrm{d} {r_{\rm p}} \;W_{ w}(r_{\rm p})\,\xi _{\rm m}(r_{\rm p},\,z)\,. \end{aligned}$ (54)

It is easy to show that Eq. (48) or Eq. (49) are still valid with integrated quantities. The scale weighting is usually chosen as a power law W_w(r_p)∝r_p^γ, where the function is normalised to unity (e.g. Gatti et al. 2018; van den Busch et al. 2020; Naidoo et al. 2023). Choosing γ < 0 increases the impact of small scales. They are expected to be less noisy but also more sensitive to non-linearities. Inversely, the measurement is not biased at large scales (i.e. the linear bias is a good model), but uncertainties are usually larger. Another reason to use small scales instead of large ones is to keep the calibration independent from the two-point weak-lensing analysis: using smaller and different scales, one can safely neglect the covariance between calibration and the observables. Nonetheless, these small scales are not used for the cosmological analysis for physically motivated reasons (non-linearities), which in theory should apply to clustering-redshifts as well (Pandey et al. 2020). Hence, the scale range is an important parameter and we will investigate this scale-weighting procedure in detail in Sect. 4.

Another possibility of weighting is to combine the different n_p(z_i|r_p), with a particular weighting W_n(r_p):

$\begin{matrix} {\bar{n}}_{p} (z_{i}) = \int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W_{n} (r_{p}) n_{p} (z_{i} | r_{p}) . \end{matrix}$ $\begin{aligned} \overline{n}_{\rm p}(z_i) = \int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}} {\mathrm{d}r_{\rm p}} \;W_n(r_{\rm p})\,n_{\rm p}(z_i\vert r_{\rm p})\,. \end{aligned}$ (55)

In other words, we integrate directly n(z| r_p), which is w_sp for M0, $w_{sp} / \sqrt{w_{ss}}$ $\mathit{w}_{\mathrm{sp}}/\sqrt{\mathit{w}_{\mathrm{ss}}}$ for M1 or $w_{sp} / \sqrt{w_{ss} w_{pp}}$ $\mathit{w}_{\mathrm{sp}}/\sqrt{\mathit{w}_{\mathrm{ss}}\, \mathit{w}_{\mathrm{pp}}}$ for M2-3-4-5.

3.4. Estimating the correlations with pair counts

We evaluated position-position correlations by counting pairs separated by a certain distance divided by the expectation of uncorrelated samples (random) to properly normalise the correlation. We chose the Landy–Szalay (LS) estimator (Landy & Szalay 1993),

$\begin{matrix} w_{ab}^{data} (r_{p}) & = \frac{D_{a} D_{b} (r_{p}) - R_{a} D_{b} (r_{p}) - D_{a} R_{b} (r_{p}) + R_{a} R_{b} (r_{p})}{R_{a} R_{b} (r_{p})}, \end{matrix}$ $\begin{aligned} { w}_{ab}^\mathrm{data}({r_{\rm p}} )&=\frac{\mathrm{D}_a\mathrm{D}_b({r_{\rm p}})-\mathrm{R}_a\mathrm{D}_b({r_{\rm p}})-\mathrm{D}_a\mathrm{R}_b({r_{\rm p}})+\mathrm{R}_a\mathrm{R}_b({r_{\rm p}})}{\mathrm{R}_a\mathrm{R}_b({r_{\rm p}})}\,, \end{aligned}$ (56)

where D_aD_b, D_aR_b, R_aD_b, and R_aR_b are the standard data-data, data-random, random-data, random-random pair counts of samples a and b. We note that we omitted the galaxy number normalisation factors which correct for the randoms being ten times more numerous than data. Further details are given in Appendix C. As we were working with a simulation omitting observational systematics and masks, we generated random catalogues homogeneously over the footprint, taking 50 times more randoms than data.

3.4.1. Estimator weighting scheme

Given the weighting procedure Eq. (53), it is natural to weight the estimator as

$\begin{matrix} {\tilde{w}}_{ab}^{(1)} (z_{i}) = \int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W (r_{p}) \frac{(D_{a} - R_{a}) (D_{b} - R_{b}) (r_{p})}{R_{a} R_{b} (r_{p})}, \end{matrix}$ $\begin{aligned} \tilde{w}_{ab}^{(1)}(z_i) = \int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}}\mathrm{d} {r_{\rm p}}\; W({r_{\rm p}})\frac{(\mathrm{D}_a-\mathrm{R}_a)(\mathrm{D}_b-\mathrm{R}_b)({r_{\rm p}})}{\mathrm{R}_a\mathrm{R}_b({r_{\rm p}})}\,, \end{aligned}$ (57)

where the numerator is a concise notation for the numerator of Eq. (56). In the context of clustering redshifts, this weighted estimator was first introduced by Ménard et al. (2013), and later used by Cawthon et al. (2022).

Another weighting scheme was introduced by Schmidt et al. (2013), and was used by Davis et al. (2018), Gatti et al. (2018), van den Busch et al. (2020), and Naidoo et al. (2023). The principle is to scale-average data-data and random-random pair counts:

$\begin{matrix} {\tilde{w}}_{ab}^{(2)} (z_{i}) = \frac{\int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W (r_{p}) (D_{a} - R_{a}) (D_{b} - R_{b}) (r_{p})}{\int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W (r_{p}) R_{a} R_{b} (r_{p})} . \end{matrix}$ $\begin{aligned} \tilde{w}_{ab}^{(2)}(z_i) = \frac{\int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}}\mathrm{d} {r_{\rm p}} \;W({r_{\rm p}})\,(\mathrm{D}_a-\mathrm{R}_a)(\mathrm{D}_b-\mathrm{R}_b)({r_{\rm p}})}{\int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}}\mathrm{d} {r_{\rm p}}\; W({r_{\rm p}})\,\mathrm{R}_a\mathrm{R}_b({r_{\rm p}})}\,. \end{aligned}$ (58)

Schmidt’s motivation to do so was to assume that the ratio of the averaged quantities would lead to higher signal-to-noise (S/N) than the averaged ratio (which is expected for a constant ratio, which is not the case here). With this second estimator what we have is a re-weighting of the correlation functions with a new effective weighting, as

$\begin{matrix} {\tilde{w}}_{ab}^{(2)} (z_{i}) = & \int \int {d z}_{1} {d z}_{2} n_{a} (z_{1}) n_{b} (z_{2}) \\ \times & \frac{\int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W (r_{p}) r_{p} Δ r_{p} η_{mask} (r_{p}) {⟨ δ_{a} δ_{b} ⟩}_{r_{p}, z_{1}, z_{2}}}{\int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W (r_{p}) r_{p} Δ r_{p} η_{mask} (r_{p})}, \end{matrix}$ $\begin{aligned} \tilde{w}_{ab}^{(2)}(z_i) = &\int \int {\mathrm{d} z}_1\,{\mathrm{d} z}_2\; n_a(z_1)\,n_b(z_2) \\ \times&\frac{\int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}}\mathrm{d}{r_{\rm p}}\; W({r_{\rm p}})\,{r_{\rm p}}\,\Delta {r_{\rm p}} \,\eta _{\rm mask}({r_{\rm p}})\,\langle \delta _a\,\delta _b\rangle _{{r_{\rm p}},\,z_1,\,z_2}}{\int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}}\mathrm{d}{r_{\rm p}}\; W({r_{\rm p}})\,{r_{\rm p}}\,\Delta {r_{\rm p}}\,\eta _{\rm mask}({r_{\rm p}})}\,,\nonumber \end{aligned}$ (59)

where η_mask(r_p) is a geometrical factor due to the masking of some sky area (cf. Appendix C). In particular, the effective scale weighting is

$\begin{matrix} W_{eff} (r_{p}) & \propto W (r_{p}) r_{p} Δ r_{p} η_{mask} (r_{p}) \end{matrix}$ $\begin{aligned} W_{\rm eff}({r_{\rm p}})&\propto W({r_{\rm p}})\,{r_{\rm p}}\,\Delta {r_{\rm p}}\, \eta _{\rm mask}({r_{\rm p}}) \end{aligned}$ (60)

$\begin{matrix} \propto W (r_{p}) {r_{p}}^{2} η_{mask} for a logarithmic binning, \end{matrix}$ $\begin{aligned}&\propto W({r_{\rm p}})\,{r_{\rm p}}^2\, \eta _{\rm mask} \text{ for} \text{ a} \text{ logarithmic} \text{ binning,}\end{aligned}$ (61)

$\begin{matrix} \propto {r_{p}}^{γ + 2} η_{mask} for a power law weighting . \end{matrix}$ $\begin{aligned}&\propto {r_{\rm p}}^{\gamma +2}\, \eta _{\rm mask} \text{ for} \text{ a} \text{ power} \text{ law} \text{ weighting} . \end{aligned}$ (62)

This effective weighting is still re-normalised to unity as required by the denominator of Eq. (59). The mask incompleteness is not rigorously corrected (cf. Appendix C), except if η_mask is scale-independent⁷. By default, we consider the first estimator (Eq. 57), since understanding the weighting is straightforward, and the mask is better corrected, but we will test and compare the two in the results section⁸.

3.4.2. Covariance matrix

We used the jackknife covariance matrix as our fiducial covariance. For a measured data vector w, the covariance matrix is

$\begin{matrix} C_{w} = \frac{N_{Jkk} - 1}{N_{Jkk}} \sum_{k = 1}^{N_{Jkk}} (w^{k} - \tilde{w}) {(w^{k} - \tilde{w})}^{⊤}, \end{matrix}$ $\begin{aligned} \mathcal{C} _{ w}=\frac{N_{\rm Jkk}-1}{N_{\rm Jkk}}\sum _{k = 1}^{N_{\rm Jkk}} ({ w}^k-\tilde{w})({ w}^k-\tilde{w})^\top \,, \end{aligned}$ (64)

where N_Jkk is the number of jackknife regions, w^k is the data vector excluding the data from the k^th region, and $\tilde{w}$ $\tilde{w}$ is its mean value. Here, w can be, for example, the vector w_sp(r_p), the ratio of vectors $w_{sp} (r_{p}) / \sqrt{w_{ss} (r_{p})}$ $\mathit{w}_{\mathrm{sp}}({r_{\mathrm{p}}})/\sqrt{\mathit{w}_{\mathrm{ss}}({r_{\mathrm{p}}})}$ , the scalars ${\bar{w}}_{sp}$ $\overline{w}_{\mathrm{sp}}$ , or the ratio of scalars ${\bar{w}}_{sp} / \sqrt{{\bar{w}}_{ss}}$ $\overline{w}_{\mathrm{sp}}/\sqrt{\overline{w}_{\mathrm{ss}}}$ . In this article, we use a sufficiently large number of regions (200), with a small enough number of points (around a dozen), so that Hartlap (Hartlap et al. 2007) or Percival (Percival et al. 2022) corrections to the inverse of the covariance are at the percent level and can be safely neglected.

3.5. Extracting the unknown distribution moments

3.5.1. clustering-redshifts requirements for cosmic shear

The success of the Euclid mission requires achieving a particular precision in the characterisation of n(z) of the tomographic bins i. The requirements usually focus on the mean redshift, expressed as

$\begin{matrix} {⟨ z ⟩}_{i} = \int d z z n_{p_{i}} (z), \end{matrix}$ $\begin{aligned} \langle z \rangle _{i} = \int {\mathrm{d} z} \; z\, n_{\rm p_i}(z)\,, \end{aligned}$ (65)

as illustrated in works such as Gatti et al. (2018), van den Busch et al. (2020), and Gatti et al. (2022). The desired precision for the Euclid mission requires knowledge of the redshift standard deviation, however, which is denoted as

$\begin{matrix} σ_{n_{i}} = \sqrt{\int d z {(z - ⟨ z ⟩)}^{2} n_{p_{i}} (z)}, \end{matrix}$ $\begin{aligned} \sigma _{n_{i}}= \sqrt{\int {\mathrm{d} z} \; (z-\langle z\rangle )^2\, n_{\rm p_i}(z)}\,, \end{aligned}$ (66)

as outlined in Reischke (2024). The criteria are as follows: the mean redshift bias (Laureijs et al. 2011; Reischke 2024) and the redshift standard deviation (Reischke 2024) for every bin should be measured with precision

$\begin{matrix} σ ({⟨ z ⟩}_{i}) \leq 0.002 (1 + {⟨ z ⟩}_{i}), \end{matrix}$ $\begin{aligned}&\sigma (\langle z\rangle _i)\le 0.002\, (1+\langle z\rangle _i),\end{aligned}$ (67)

$\begin{matrix} σ (σ_{n_{i}}) \leq 0.1 σ_{n_{i}} . \end{matrix}$ $\begin{aligned}&\sigma (\sigma _{n_i})\le 0.1\, \sigma _{n_i}\,. \end{aligned}$ (68)

3.5.2. Discretisation of a distribution

The last step of a clustering-redshifts pipeline is to derive the n_p(z) moments from the discrete measurements n_p(z_i). The mean redshift of a bin is evaluated with

$\begin{matrix} ⟨ z ⟩ & \approx \sum_{j} Δ z z_{j} n_{p}^{meas} (z_{j}) . \end{matrix}$ $\begin{aligned} \langle z\rangle&\approx \sum _j \Delta z\, z_j \,n^\mathrm{meas}_{\rm p}(z_j)\,. \end{aligned}$ (69)

We are not directly measuring n_p(z_j) but rather its averaged value over ranges z_j ± Δz/2,

$\begin{matrix} n_{p}^{meas} (z_{j}) \approx \frac{1}{Δ z} \int_{- Δ z / 2}^{Δ z / 2} d z n_{p} (z_{j} + z) . \end{matrix}$ $\begin{aligned} n_{\rm p}^\mathrm{meas}(z_j)\approx \frac{1}{\Delta z}\int _{-\Delta z/2}^{\Delta z/2} {\mathrm{d} z}\; n_{\rm p}(z_j+z)\,. \end{aligned}$ (70)

As a consequence, Eq. (69) is an accurate approximation of the exact integral form as long as Δz is small compared to the tomographic bin width. As an example, the numerical difference between Eq. (69) and Eq. (65) is 𝒪(10⁻⁴) for our Euclid tomographic bins (cf. Sect. 2.2), and Δz = 0.02–0.5. Nonetheless, the recovered shape of the distribution is affected by Eq. (70) and for large Δz the measured distribution would be flattened. We illustrate this in Fig. 3, plotting a galaxy distribution (which in practice corresponds to a very thin slicing), and the same distribution with a Δz averaging as Eq. (70). We see that the flattening associated with the bin increases. These effects will also be tested in Sect. 4.

Fig. 3.

Galaxy distribution observed through different Δz averaging as Eq. (70). The redshift convolution modifies the shape of the distribution for larger slicing compared to the real slicing as estimated with a very thin slicing (grey). For this case, all slicings Δz ≤ 0.06 produce a good estimate.

3.5.3. Overall strategy

The uncertainty on the mean-z directly inferred with Eq. (69) would not fulfil the Euclid requirement Eq. (67). In doing this, we would be very conservative, allowing for any tomographic bin shape, whereas in reality, we know their shape a priori. Thus, we can define some priors on the n_p(z) properties to decrease the uncertainty. We describe two possibilities:

we assume a prior on the shape of the distribution, through a n(z) model;
we use Gaussian process formalism, to generate coherent realisations, assuming a kernel, which is related to how “fast” the distribution is evolving in redshift.

Thus in one case, we put a prior on the shape of the tomographic bin, and in the other, on the redshift variation.

3.5.4. Shifted-stretched model

For this approach, we evaluated a given model n^model(z) with some parameters {p₁, p₂…}, by minimising the likelihood,

$\begin{matrix} ln L & ({p_{1}, p_{2} \dots} | n, C_{n}) \\ = - \frac{1}{2} {[n (z_{i}) - n^{model} (z_{i})]}^{⊤} C_{n}^{- 1} [n (z_{i}) - n^{model} (z_{i})], \end{matrix}$ $\begin{aligned} \ln \mathcal{L}&(\{p_1,p_2\ldots \}\vert n, \mathcal{C} _n)\nonumber \\&=-\frac{1}{2} \left[ n(z_i)-n^\mathrm{model}(z_i)\right]^\top \mathcal{C} _n^{-1}\left[ n(z_i)-n^\mathrm{model}(z_i)\right]\,, \end{aligned}$ (71)

where {n_p(z_i)}_i, and the associated covariance 𝒞_n are the measured quantities. Then, using a Markov chain Monte Carlo⁹, the mean redshift and standard deviation were extracted from one thousand realisations of {n^model}.

For the n^model, we use the shifted-stretched model (SSM), which is commonly used for clustering redshifts; given a particular distribution n(z) with a mean redshift ⟨z⟩, two free parameters are used describing a redshift shift δz and a stretching s,

$\begin{matrix} n^{model} (z | s, δ z) = \frac{1}{s} n (\frac{z - ⟨ z ⟩ - δ z}{s} + ⟨ z ⟩) . \end{matrix}$ $\begin{aligned} n^\mathrm{model}(z \vert s,\delta z) = \frac{1}{s}n\left(\frac{z-\langle z\rangle -\delta z}{s}+\langle z\rangle \right)\,. \end{aligned}$ (72)

This model is particularly convenient because for a given δz, s, the bias in mean redshift and standard deviation with respect to the model are δz and s. In our case, as the model is the true redshift distribution, we have

$\begin{matrix} δ z = & ⟨ z ⟩ - {⟨ z ⟩}_{true}, \end{matrix}$ $\begin{aligned} \delta z=&\langle z\rangle -\langle z\rangle _{\rm true},\,\end{aligned}$ (73)

$\begin{matrix} s = & σ_{n} / σ_{n}^{true} . \end{matrix}$ $\begin{aligned} s=&\sigma _n/\sigma _n^\mathrm{true}\,. \end{aligned}$ (74)

As we use the true redshift distribution of the bin as a model (simulation only), the performance may be overestimated compared to a realistic scenario, where the model is taken from a Self-Organised-Map estimate (e.g. Hildebrandt et al. 2021), or a candidate from a simulation (e.g. Gatti et al. 2018, 2022; Cawthon et al. 2022). That is why we also consider a more direct method.

3.5.5. Suppressed Gaussian process

We introduce the suppressed-Gaussian-process model (SGP) developed by Naidoo et al. (2023); GP reconstruction from clustering-redshifts was also performed in Johnson et al. (2017). The main benefit of this approach is that it is non-parametric. The procedure is as follows:

we generate realisations of Gaussian Process (GP) where the multivariate normal distributions are the measurements, and model the correlation between the points through a specific kernel¹⁰;
we apply to the realisations a suppression function that damps signals in regions where the measurements are consistent with zero;
the realisations are renormalised to unity;
the moments of the true distribution and their uncertainties are estimated from this large sample of normalised SGP realisations.

For the GP covariance, we chose a 3/2 Matern kernel function, and assume the length-scale l to be l = Δz/2 where Δz corresponds to the redshift slicing¹¹. To validate these choices, we tested different kernels and length-scale, by comparing generated GP from discrete n(z_i) and covariance to true n(z); see, for example, the right panel of Fig. 4.

Fig. 4.

Left: n(z) distribution measurement, with 50 associated GP realisations (orange), and the true distribution (red). Middle: SNR of the GP realisation (orange), and their convolution (blue). The SNR threshold ( = 3) is reported with the dashed line. Right: Associated SGP realisations, renormalised to unity (blue), compared to the true distribution (red).

For the second step, given a GP realisation i, the associated SPG is

$\begin{matrix} n_{SGP}^{i} (z) = n_{GP}^{i} (z) S (x_{i}, k), \end{matrix}$ $\begin{aligned} n_{\rm SGP}^i (z) = n_{\rm GP}^i(z)\; S(x_i,k)\,, \end{aligned}$ (75)

where S is a suppression factor that removes the noisy part of the distribution. We chose the parametrisation

$\begin{matrix} S (x, k) = {\begin{matrix} 0 & if x < 0 \\ 1 - {(1 - x)}^{k} & if 0 < x < 1 \\ 1 & if x > 1 . \end{matrix} \end{matrix}$ $\begin{aligned} S(x,k) = {\left\{ \begin{array}{ll} 0&\text{ if} x<0 \\ 1-(1-x)^k&\text{ if} 0 < x < 1\\ 1&\text{ if} x>1\,. \end{array}\right.} \end{aligned}$ (76)

In the above, the x variable is a smoothing of the continuous signal-to-noise ratio (SNR)

$\begin{matrix} x_{i} = \frac{1}{{SNR}_{thresh}} G * \frac{n_{GP}^{i} (z)}{σ_{i} (z)} . \end{matrix}$ $\begin{aligned} x_i=\frac{1}{\mathrm{SNR}_{\rm thresh}}\mathcal{G} *\frac{n_{\rm GP}^i(z)}{\sigma _i(z)}\,. \end{aligned}$ (77)

Here 𝒢 is a Gaussian with a standard deviation Δz, * represents the convolution operation, which smooths the SNR between data points (and avoids non-physical cuts), and SNR_thresh is the SNR threshold below which the suppressed factor is applied. Thus, if SNR < SNR_thresh over a redshift range larger than Δz, the amplitude of the realisation n_GPⁱ(z) would be reduced by a factor k/3 × SNR. We fix k = 0.3 in Eq. (76), so that the suppressed factor rapidly drops for x < 1, but we ensured that the impact of this parameter was small.

In Fig. 4 we illustrate the whole procedure. We show for a measurement {n_p(z_i)}_i, 𝒞_n (not normalised to unity), GP realisations, their SNR n_GP/σ(z), and the associated normalised SGP realisations compared with the true distribution. It is clear that going from the left to the right panel, the overall procedure greatly improves the realisations. Nonetheless, the procedure could be optimised further in the future, since we did not rigorously explore all of the SGP parametrisation.

4. Results

In this section, we present the clustering-redshifts pipeline optimisation. In Fig. 5 we summarise the different ingredients entering the modelling of clustering redshifts, and the corresponding validation tests presented in this section. In Sect. 4.1, we describe the simulated data used. In Sect. 4.2, we compare the performance of the clustering estimator. In Sect. 4.3, we evaluate the hypothesis that the galaxy biases of cross-correlations and auto-correlations are the same at small scales. In Sect. 4.4, we determine the optimal scale range, and in Sect. 4.5, the optimal scale weighting. In Sect. 4.6, we test different spectroscopic slicing Δz for measurements affected by systematics, with the effects of these systematics tested individually in Sect. 4.7. In Sect. 4.8, we compare different galaxy bias correction methods. Finally, in Sect. 4.9, we investigate the impact of the tomographic bin definition on the redshift constraints.

Fig. 5.

Modelling the cross-correlation between spectroscopic samples in redshift slices and photometric bins, and reference to the corresponding tests we performed to optimise the pipeline.

4.1. Investigation strategy

Our investigation strategy consists of varying a set of analysis parameters to check the impact on clustering redshifts. We chose to use three tomographic bins covering the full redshift range of interest and corresponding to different types of galaxies.

The three tomographic bins: “low-z”, “mid-z”, and “high-z”, were generated within the Flagship simulation with the photo-z cuts:

$\begin{matrix} 0.35 < z_{p} < 0.47, \end{matrix}$ $\begin{aligned}&0.35<z_{\rm p} < 0.47\,,\end{aligned}$ (78)

$\begin{matrix} 0.86 < z_{p} < 1, \end{matrix}$ $\begin{aligned}&0.86<z_{\rm p} < 1\,,\end{aligned}$ (79)

$\begin{matrix} 1.30 < z_{p} < 1.6 . \end{matrix}$ $\begin{aligned}&1.30<z_{\rm p} < 1.6\,. \end{aligned}$ (80)

They corresponds to bin 2, 7, and 10 in Fig. 1. For reference samples, we used BOSS LOWZ and CMASS, DESI ELG, and Euclid NISP-S like samples. We report the redshift distribution of the tomographic bins and spectroscopic samples in Fig. 1.

To characterise the uncertainty from sample variance in our analysis, we split each tomographic bin into five independent Flagship2 sky patches¹² of 1000 deg². As the simulation is limited to 5000 deg², this is the maximum number of large independent patches that we can use. When investigating the optimal spectroscopic slicing we do not want our results to be biased because the points luckily cover the optimal part of the distribution. Thus, for every patch j = 1, …5, we varied the redshift slicing of the spectroscopic samples z_i; Δz, by shifting the slice centres by j × Δz/5. To summarise, we tested over five independent 1000 deg² sky patches for three tomographic bins, each cross-correlated with spectroscopic samples distributed into different slices for each patch (but with the same Δz).

4.2. Choice of the estimator weighting scheme

In this section, we compare the two different estimators of the angular correlation function, described in Sect. 3.4.1. We measured the auto-correlations of spectroscopic samples for a 1000 deg² sky patch, a single redshift bin of width Δz = 0.05, and the two estimators:

$w_{ss}^{(1)}$ ${\it w}_{\rm ss}^{(1)}$ defined by Eq. (57), where weighting is applied to the ratio of pair counts;
$w_{ss}^{(2)}$ ${\it w}_{\rm ss}^{(2)}$ defined by Eq. (58), where weighting is applied directly the pair counts.

Correlations were evaluated for the scale range 0.5– 5 Mpc, and with different weighting functions $W \propto {r_{p}}^{γ}$ $W\propto {r_{\mathrm{p}}}^{\gamma}$ . The results are reported in Fig. 6. We used logarithmic binning, and expect the measurements to be weighted by an effective function W_eff ∝ r_p^γ + 2 (cf. Eq. (60)), which is confirmed by the matched amplitudes of blue-yellow and purple-red points. Importantly, we do not observe significant differences in the associated uncertainties between the two estimators with the corresponding weights. Given that the weighting variation does not offer any practical advantage, and leads to unnecessary complexity in the weighting process (cf. Appendix C), we adopted the first estimator, Eq. (57), as our fiducial choice for this analysis, and we will evaluate the optimal value of γ.

Fig. 6.

Auto-correlations of two spectroscopic samples for a single redshift bin (Δz = 0.05). The yellow and red points are obtained using the first estimator $w_{ss}^{(1)}$ $\mathit{w}_{\mathrm{ss}}^{(1)}$ with γ ∈ { − 1, + 1}. The blue and purple points are obtained using the second one $w_{ss}^{(2)}$ $\mathit{w}_{\mathrm{ss}}^{(2)}$ with γ ∈ { − 3, − 1}. The consistency of the points with different weights illustrates the effective weighting due to the estimator definition.

4.3. Testing the bias-cancellation hypothesis

The scales used for clustering-redshifts calibration are typically very small [e.g. (0.1–5) Mpc] in order to minimise correlations between the calibration and the cosmological analysis, and to maximise the SNR. The associated nonlinear effects are often overlooked, however. Most authors assume that for two galaxy samples located within the same redshift bin, the two-point angular correlation function follows the relation:

$\begin{matrix} w_{ab} (r_{p}) \approx \sqrt{w_{aa} (r_{p}) w_{bb} (r_{p})} . \end{matrix}$ $\begin{aligned} { w}_{ab}({r_{\rm p}})\approx \sqrt{{ w}_{aa}({r_{\rm p}})\; { w}_{bb}({r_{\rm p}})}\,. \end{aligned}$ (81)

If we were using scalar product, instead of scale correlation function, this would be equivalent to the Cauchy–Schwarz equality, which implies that the density contrasts Δ_a and Δ_b are linearly dependent, and the Pearson correlation coefficient r_ab is 1, with

$\begin{matrix} r_{ab} : = \frac{{⟨ Δ_{a} Δ_{b} ⟩}_{r_{p}}}{\sqrt{{⟨ Δ_{a} Δ_{a} ⟩}_{r_{p}} {⟨ Δ_{b} Δ_{b} ⟩}_{r_{p}}}} = 1 . \end{matrix}$ $\begin{aligned} r_{ab}:=\frac{\langle \Delta _a\,\Delta _b\rangle _{{r_{\rm p}}}}{\sqrt{ \langle \Delta _a\,\Delta _a\rangle _{{r_{\rm p}}}\; \langle \Delta _b\,\Delta _b\rangle _{{r_{\rm p}}}}} = 1 \, . \end{aligned}$ (82)

At large scales (> 10 Mpc) galaxy samples act as linear tracers of the underlying dark matter density field. In this case, δ_a = b_b/b_a × δ_b, which means that galaxy field are linearly dependant. At smaller scales, the linearity between the dark matter field and galaxy statistics begins to break down. For a more complex biasing between matter field and galaxy field, however, we would still expect |r_ab|≤1¹³.

At even smaller scales (within the one-halo regime), the modelling of galaxy fields as random variables of the dark matter field breaks down (e.g. in Swanson et al. 2008, clustering properties between galaxy subpopulations was investigated), potentially impacting cosmological inference (e.g. Chaves-Montero et al. 2023, in the context of weak lensing). We illustrate this issue with an extreme example: one sample consists only of central galaxies, whilst the other consists only of satellite galaxies. If we assume there is at most one satellite galaxy per halo, the auto-correlation of central and satellite galaxies would be very small on scales smaller than the halo size, but still non-zero as we are not evaluating 3d clustering. The positive cross-correlation between the two would be strong, however. This would result in r_ab ≫ 1, which is not possible for random variables. The reason for this discrepancy is that the statistics of the satellite distribution depend on those of the central galaxy distribution. In typical halo occupation distribution models, the probability of hosting a satellite galaxy is conditional on the presence of a central galaxy. Alternatively, if two galaxy samples populate different haloes, the cross-correlation would be very small on scales smaller than the halo size, while the auto-correlations would remain significant, leading to r_ab ≪ 1.

The scale range mentioned earlier (referred to as “small” and “smaller”), and the behaviour of the r_ab coefficients are not clearly known. We decided to improve this aspect for our Euclid calibration method.

The small-scale effects that we have described are not a hypothetical framework, as this type of behaviour has already been observed in spectroscopic surveys. For instance, the first DESI data provides clear evidence that LRGs and ELGs exhibit low cross-correlation signals at small scales (a phenomenon so-called “conformity”; Rocher et al. 2023; Yuan et al. 2025; Gao et al. 2024). This occurs because these galaxies reside in different haloes, with their properties influenced by the history of the halo.

Given that the under-correlation between red and blue galaxies is well established, we aimed to test whether the Flagship simulation could reproduce this small-scale effect. Our strategy was to split the two samples into redshift slices (Δz = 0.05) using true-redshift, and to measure the ratio with angular correlation,

$\begin{matrix} r_{ab} (r_{p}, z_{i}) & = \frac{w_{a_{i} b_{i}} (r_{p})}{\sqrt{w_{a_{i} a_{i}} (r_{p}) w_{b_{i} b_{i}} (r_{p})}}, \end{matrix}$ $\begin{aligned} r_{ab}({r_{\rm p}},\,z_i)&=\frac{{ w}_{a_ib_i}({r_{\rm p}})}{\sqrt{{ w}_{a_ia_i}({r_{\rm p}})\,{ w}_{b_ib_i}({r_{\rm p}})}}\,, \end{aligned}$ (83)

which is expected to be 1 under the linear approximation.

In Fig. 7 we show r_ab(r_p, z) for the Flagship LRG and ELG-like samples. We recover the under-correlation at small scales, through r_ab(r_p, z) < 1. This ratio not only decreases with scales, but also with redshift. Since the recovered n(z) is degenerate with r, an evolving r with redshift would introduce a bias in the reconstructed n(z). This confirms the presence of one-halo behaviour in the Flagship simulation. However, since these behaviours were only recently identified and are still being actively studied, it is likely that their implementation in the Flagship simulation is incomplete or not fully accurate. A thorough study of this aspect of the simulation and one-halo physics is beyond the scope of this article.

Fig. 7.

Variation in the Pearson coefficients r_ab for Flagship LRGs with ELGs. This coefficient informs on the small-scale under-correlation of blue and red galaxies.

In Fig. 8 we report r_ab(r_p, z) for the three simulated photometric × spectroscopic bins (defined in Sect. 4.1). We observe significant scale and redshift evolution of the ratio, at least for the BOSS and DESI correlations, at scales smaller than 1 Mpc. Notably, we find r > 1, which we attribute to the physics of galaxy-halo connection and galaxy formation rather than non-linearities in the matter field (cf. the beginning of this subsection). Proving whether this is due to limitations in the HODmodelling or is a reliable prediction of the simulation is beyond the scope of our work. There is also a weak under-correlation for the low-z bin (and the mid-z) at scales (1–8) Mpc, but it does not appear to evolve with redshift, and is therefore not expected to introduce a bias into the n(z). This may correspond to the correction for non-linearities described above (r ≲ 1). For the high-z bin, we do not measure deviation from unity at any scale.

Fig. 8.

Pearson coefficient for our three simulated bins: low-z, mid-z, and high-z. This term is assumed to be one for clustering-redshifts calibration. We see different behaviours, r = 1, r < 1, and r > 1 for different scales and redshifts, except for the high-z bin, for which r = 1 (almost) everywhere.

Given these results, we are confident that using projected scales larger than 1.5 Mpc would avoid this one-halo effect¹⁴. As simulations improve in their modelling of galaxy halo properties, this test will need to be repeated. The ratio amplitude, scale, and redshift evolution are different depending on the tracers: under-correlation for DESI LRG and ELG in Fig. 7, and over-correlation for Euclid photo and DESI ELG in Fig. 8. We recommend to use only one type of spectroscopic tracer for the sample s when evaluating cross-correlations between photometric and spectroscopic samples at a specific redshift: w_sp(z_i), avoiding mixing ELGs and LRGs for example. If different spectroscopic tracers {s_j}_j are available for the same redshift z_i, we can then predict different n_p(z_i |s_j) and combine them (e.g. Hildebrandt et al. 2021). Small-scale effect could lead to incompatibility as we show in this section, so that this procedure is more likely to detect systematics, and more robust.

4.4. Choice of the scale range

In this section, we investigate the best scale range that minimises the deviation

$\begin{matrix} Δ n (z_{i} | r_{p}) = n_{p} (z_{i} | r_{p}) - n_{true} (z_{i}), \end{matrix}$ $\begin{aligned} \Delta n\,(z_i\vert \, {r_{\rm p}}) = n_{\rm p}(z_i\vert \,{r_{\rm p}})-n_{\rm true}(z_i)\,, \end{aligned}$ (84)

while maximising the SNR. We explored a wide range of scales, including those previously discarded in Sect. 4.3. Unlike the estimator of w in Eq. (57), which averages over scales, our estimator was evaluated for a single scale¹⁵ as in Eq. (48). Consequently, we refer to the distribution measured as n_p (z | r_p). We always normalised the measured distributions to unity. We employed three methods for correcting the bias, covering all the bias correction cases: no correction (i.e. M0), spectroscopic correction (i.e. M1), and full correction (i.e. M5)¹⁶. This approach allows us to assess how bias correction influences the optimal scale range.

We used three statistical indicators, as a function of the projected scale

the root mean square

$\begin{matrix} RMS [Δ n] (r_{p}) = \sqrt{\frac{1}{N_{z}} \sum_{z_{i}} Δ n {(z_{i} | r_{p})}^{2}} ; \end{matrix}$ $\begin{aligned} \mathrm{RMS}[\Delta n]\,({r_{\rm p}}) = \sqrt{\frac{1}{N_z}\sum _{z_i} \Delta n\,(z_i\vert \, {r_{\rm p}}) ^2 }\,; \end{aligned}$ (85)
the (mean) signal-to-noise-ratio

$\begin{matrix} SNR [n] (r_{p}) = \frac{1}{N_{z}} \sum_{z_{i}} \frac{| n_{p} (z_{i}) |}{σ_{n} (z_{i})} ; \end{matrix}$ $\begin{aligned} \mathrm{SNR}[n] ({r_{\rm p}}) = \frac{1}{N_z} \sum _{z_i} \frac{\vert n_{\rm p}(z_i)\vert }{\sigma _n (z_i)}\,; \end{aligned}$ (86)
the reduced χ²

$\begin{matrix} χ_{z}^{2} (r_{p}) = \frac{1}{N_{z}} {[Δ n (z_{i} | r_{p}]}_{z_{i}}^{⊤} C_{n | r_{p})} {[Δ n (z_{i} | r_{p})]}_{z_{i}}, \end{matrix}$ $\begin{aligned} \chi _z^2({r_{\rm p}}) = \frac{1}{N_z} \left[\Delta n\,(z_i\vert \, {r_{\rm p}}\right]^\top _{z_i}\,\;\mathcal{C} _{n\vert {r_{\rm p}})}\;\left[\Delta n\,(z_i\vert \, {r_{\rm p}})\right]_{z_i}\,, \end{aligned}$ (87)

with N_z the number of points (equivalently the number of redshift slices of the spectroscopic sample). These indicators, evaluated at scales ranging from (0.05–10) Mpc for the different bins and bias corrections, are reported in Fig. 9. The thin lines represent individual realisations of 1000 deg² patches, while the thicker lines indicate their means.

Fig. 9.

RMS of Δn, SNR of n, and reduced χ² of Δn (top to bottom) as a function of the projected scale r_p for the three redshift bins (left to right) for three bias corrections (colours). The fine lines show five independent realisations of 1000 deg², and the thick lines show their means.

We first discuss the impact of the bias correction. Whilst the no-bias correction model M0 shows a higher SNR for scales lower than 1 Mpc, its corresponding χ_z² value is very high, indicating that it is not accurate. Therefore, we focus exclusively on M1 and M5 for the remainder of this section. Both methods perform similarly for the SNR, but for mid-z and high-z, M5 provides a χ_z² closer to unity.

For the three tomographic bins, the RMS is increasing for smaller scales. This increase is associated with a lower SNR, and the χ_z² remains relatively constant. This suggests an overall consistency, and we can choose the scale range based on the SNR only. For the low-z bin, it peaks at 1 Mpc and then decreases. For the other two bins, it appears to peak around 2 Mpc, with a small decrease for larger scales.

In conclusion, the scale range (1.5–5) Mpc appears to be well suited for clustering redshifts; it is nearly optimal regarding the SNR; the χ_z² is almost minimal for the whole range; it avoids the r_p < 1.5 Mpc one-halo regime (excluded in Sect. 4.3); these scales are not used for cosmological analysis.

This scale range is the same as in Gatti et al. (2022) and Giannini et al. (2024), but significantly larger than in other works such as van den Busch et al. (2020), Cawthon et al. (2022), and Naidoo et al. (2023), which used scales of 0.1–1 Mpc h⁻¹ or 0.5–1.5 Mpc h⁻¹. Early clustering-redshifts studies even advocated for using smaller scales, down to the kpc range (Ménard et al. 2013; Schmidt et al. 2013). We note that we have chosen scales that avoid the one-halo term and the range of scales used for cosmology. One way to be conservative on the small-scales bias effects would be to use only large scales, overlapping those used in the 3 × 2 point analysis. In previous calibrations such as HSC, KIDS, and DES, scale choice was largely driven by the lack of spectroscopic sample (Hildebrandt et al. 2021), and the range was chosen to be at the peak of the SNR (Gatti et al. 2018). For Euclid, this will no longer be the case, and from Fig. 9 it is clear that larger scales can be used for calibration. However, it requires the generation of a more complex covariance to accurately propagate the correlation. This was proved to be informative through a Fisher formalism in Johnston et al. (2025), but never implemented in practice with simulated or real data, and is beyond the scope of thisarticle.

4.5. Choice of the scale weighting

In this section, we varied the weighting W(r_p) in Eq. 53 and evaluated the RMS and SNR defined in Eqs. (85) and (86). The scale range was fixed to (1.5–5) Mpc following the last section’s conclusion. We tested three power laws

$\begin{matrix} W (r_{p}) = \frac{{r_{p}}^{γ}}{\int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} {r_{p}}^{γ}} with γ \in {- 1, 0, 1}, \end{matrix}$ $\begin{aligned} W({r_{\rm p}}) = \frac{{r_{\rm p}}^\gamma }{\int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}}{\mathrm{d}r_{\rm p}}\; {r_{\rm p}} ^{\gamma }}\, \text{ with} \gamma \in \{-1,0,1\}\,, \end{aligned}$ (88)

and two different weighting schemes:

we scale-average the different correlation functions and compute their ratio, as in Eq. (53),

$\begin{matrix} {\tilde{w}}_{ab} (z_{i}) = \int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W_{w} (r_{p}) w_{ab} (r_{p}, z_{i}) ; \end{matrix}$ $\begin{aligned} \tilde{w}_{ab}(z_i) = \int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}}{\mathrm{d}r_{\rm p}}\; W_{ w} ({r_{\rm p}}) \,{ w}_{ab}({r_{\rm p}},\,z_i)\,; \end{aligned}$ (89)
we scale-average the estimated n(z_i| r_p) as in Eq. (55),

$\begin{matrix} n (z_{i}) = \int_{r_{p}^{\min}}^{r_{p}^{\max}} d r_{p} W_{n} (r_{p}) n (z_{i} | r_{p}) . \end{matrix}$ $\begin{aligned} {n}(z_i) = \int _{{r_{\rm p}^\mathrm{min}}}^{{r_{\rm p}^\mathrm{max}}}{\mathrm{d}r_{\rm p}}\; W_n ({r_{\rm p}})\,n(z_i\vert \, {r_{\rm p}})\,. \end{aligned}$ (90)

In Fig. 10 we show the comparison between the SNR and the RMS for the three tomographic bins, with three power laws, two weighting schemes, and two bias correction methods (M1 and M5). The small markers are the realisations for the five individual 1000 deg² patches, while the larger markers indicate their means. The region of interest covers the lower right part of the plots, as it corresponds to minimum deviation and high SNR. Notably, the RMS/SNR plot displays a linear trend, indicating that points with a smaller deviation are associated with a better SNR, which is consistent with the expectations for an unbiased measurement. For all three redshift bins, the optimal weighting is the inverse scale weighting (darker red marker), a similar result to Schmidt et al. (2013) and Ménard et al. (2013). The reason is that SNR is higher, while RMS is lower, for scales around 1.5 Mpc than 5 Mpc (cf. Fig. 9). W_w and W_n are similar for the RMS, but the first performs slightly better for the SNR. These findings hold for M1 and M5. Additionally, we confirm that correcting the photometric bias for the first redshift bin is not strictly necessary in terms of RMS, but it does lead to abetter SNR.

Fig. 10.

SNR and RMS for the three tomographic bins (left to right), with three power laws as weighting functions (colours), two weighting schemes, and two bias-correction methods (M1 and M5).

Thus, the best weighting scheme appears to be the inverse scale weighting applied to correlation functions, combined with a full bias correction approach.

4.6. Finding the optimal spectroscopic slicing

In the previous sections, we did not include systematics (e.g. magnification) in the measurement because we were questioning the optimal statistical choices on the two-point clustering itself, hence the use of statistical quantities such as RMS and SNR. In the rest of the article, we consider realistic measurements, including systematics, and evaluate their impact on the moments of the redshift distribution we aim to calibrate.

In this section, we varied the slicing of the spectroscopic sample Δz, cf. Sect. 3.2.2. On the one hand, narrow slices are more sensitive to corrections associated with bin modelling (cf. Sect. 3.2.3), magnification (cf. Sect. 3.1.4), or RSD (cf. Sect. 3.1.5). On the other hand, if the slices are too wide, there may not be enough points covering the distribution to properly measure the n(z), the shape would be flattened (cf. Sect. 3.5.2 and Fig. 3), and the assumption that galaxy biases and redshift distributions are constant may break (cf. Sect. 3.2.2). Therefore, we aim to find a compromise between narrow slices affected by systematic errors and large slices that are not statistically optimal, and for which the modelling might not be precise enough. The impacts of the systematic errors are tested one by one in the next Sect. 4.7.

We introduced the systematics using magnified sky coordinates and fluxes for the photometric and spectroscopic samples, and spectroscopic redshift (including peculiar velocity) for the spectroscopic sample. We used M5 as the bias correction method. For extracting the distribution moments from the {n(z_i)} we employed the SGP n(z) reconstruction introduced in Sect. 3.5.5. We did so because, even with few points, fitting the true distribution (as with the SSM, cf. Sect. 3.5.4) is likely to yield good yet unrealistic results, whereas the SGP provides weaker yet realistic constraints.

We measured the mean (absolute) biases on the mean |⟨z⟩−⟨z⟩_true| and standard deviation |σ_z/σ_z^true − 1| of the n_p(z) for different slicing, where the mean is evaluated over the five independent patches. We reported the mean values and their associated mean uncertainties in Fig. 11. We also created a similar plot using the SSM and obtain comparable results except for the uncertainties, which remain constant across the different slicing. This confirms our previous statement and justifies the choice of the SGP.

Fig. 11.

Mean redshift and shape reconstruction with SGP, for the three tomographic bins. The different markers and colours are associated with different slicing Δz. The sandy region corresponds to the Euclid requirements, and the diagonal line shows a coherent deviation of 1σ.

First of all, the uncertainties (y axis) on mean redshift and shape increase with the slicing Δz, and we also observe larger deviation for wider slices. Regarding the shape reconstruction (related to σ_z), narrow slices are not associated with larger deviations, except for the mid-z bin. For the mean redshift, small slicing is associated with similar or larger deviations than Δz ∼ 0.05, but not larger uncertainties. The optimum slicing appears to correspond to 0.04 ≤ Δz ≤ 0.06.

Consequently, we decided to use a redshift slicing of Δz = 0.05 for every bin, for the rest of the study. This is larger than the slicing used for DES (Gatti et al. 2022; Giannini et al. 2024), but the same as the previous Euclid clustering-redshifts paper (Naidoo et al. 2023). As illustrated by the sandy region in Fig. 11, we predict that the systematic errors for this redshift slicing will not shift the data points outside the Euclid requirement. Since the Δz = 0.05 points are predominantly on the diagonal (except for the mid-z mean-z), this suggests that the corrections are fairly small for this slicing: the parameters are not biased more than 1σ away from the truth. The mid-z is the only case where we might fail to fulfil the requirement if we do not correct for the systematic errors. We investigated the origin of this deviation in detail in the next section.

4.7. Evaluating the (mis-)modelling of the clustering, magnification, and RSD

For clustering redshifts, we measure the cross-correlation of a wide photometric tomographic sample with several spectroscopic slices, and we assume that only the photometric galaxies within the redshift range of the spec-z cuts contribute to a non-zero correlation. There can be three corrections:

galaxies in the nearby redshift (nearby bins) can correlate as well (we refer to this as the one-bin approximation);
galaxies not covering the same redshift range can be correlated through magnification processes;
spectroscopic redshifts can be different than true redshifts, leading to correction on the n_s at the boundary of the bin.

Two of these effects can be understood as bin border effects, and for larger slices they are expected to decrease in proportion. For wide spectroscopic slices, the assumption that n(z) and b(z) are constant within the bin might break down. We also note that spectroscopic interlopers were not included in the Euclid spectroscopic samples.

In the previous Sect. 4.6, we demonstrated that the compromise between having enough points n_p(z_i) and limiting these effects, corresponds to slicings Δz = 0.04–0.06, regardless of the tomographic bin. To assess the influence of each effect, we used the idealised bias correction M5 to avoid the impact of galaxy bias, and proceed as follows:

To test the impact of the one-bin approximation, we compared the cross-correlation w_sp(z_i) with spectroscopic slices and the tomographic bin for two cases. In the first one, referred to as the “full bin”, the photometric sample is the fiducial one used in this paper. For the second case, referred to as the “independent bins”, we used idealised tomographic bins. To generate these idealised tomographic bins, for every z_i, we keep only the photometric galaxies within the same redshift range as the spectroscopic galaxies (using true-z), and then complete this sample with randoms to maintain the total number of galaxies as in the full tomographic bin. This ensures that only the galaxies within the redshift slice of the spectroscopic sample contribute to the correlation, for each spectroscopic slice.
To test the magnification effect, we compared the results using the magnified variables for position and fluxes (over which we apply the samples cuts), or the unmagnifiedones.
To test the RSD effect, for the spectroscopic samples we used either the Flagship true redshifts, or the observed ones, the latter corresponding to a spectroscopic-like redshift estimate (including velocities).
We also tested all these effects together, referred to as “Total”.

In Fig. 12 we show the deviations associated with the different systematic effects for different slicing, for the SGP. The same test for SSM methods is presented in Fig. 13. The corresponding values for the two procedures are listed in Table D.1 for Δz = 0.05, in Appendix D. For RSD and magnification, we have to consider the full tomographic bin, which by default comes with the one-bin approximation. Therefore, the full bin assumption has to be compared with “independent bins”, and the RSD or magnification has to be compared with the full bin.

Fig. 12.

Deviations on the mean-z (top) and standard deviation (bottom), with SGP fitting, for the three tomographic bins (left, middle, and right), with three Δz slicings (colours) and different implemented systematic effects (marker). The shadowed bands correspond to the Euclid requirement on both parameters. The dark and light error bars indicate 68.3% and 95.5% uncertainties

Fig. 13.

Same as Fig. 12 for the two parameters of the SSM model, which are equivalent to ⟨z⟩−⟨z⟩_true and σ_z/σ_ztrue − 1.

For the SSM and SGP, with Δz = 0.05, the constraints on the redshift standard deviation σ are within the requirements independently from the effects. Only a large slicing of Δz = 0.08 leads to some measurements out of the requirements because not enough points cover the distribution, and with larger slicing, the distribution appears flattened (σ_z/σ_z^true > 1) as shown in Fig. 3.

Regarding the mean redshift bias, all the “independent bin” measurements are consistent with no deviation at the 1 to 2σ level with the SGP and SSM. This suggests that our choice of scale and weighting is sufficient to limit the small-scale effects (cf. Sect. 4.4). In general, for the three tomographic bins, all the systematics lead to some deviations compared to the independent bins, but distinguishing these variations from statistical fluctuations is complicated. It is clear that the SGP procedure is more sensitive to these effects. The reason for this is that with SSM we compare the measurements with a model free from these effects. As this model is, in our case, the truth, its results may be optimistic. Nevertheless, with a SOM estimate as a model, we might expect a similar low sensitivity to these effects. For smaller bin sizes, the error bars are smaller, but the deviation due to systematic effects is at least as large as for other sizes. Wider slices are associated with greater uncertainty, and the balance between accuracy and precision is achieved for Δz = 0.05. For this bin size, the systematic effects do not produce deviations that exceed the requirements. However, for the mid-z bin, with SGP these systematic effects result in a deviation right at the limit of the budget on the mean-z, as the bias is 0.0038 ± 0.0017 (the requirements being 0.0038). For the optimistic case (SSM), the deviation remains within the requirements: the bias in mean redshift being 0.0022 ± 0.008. The effect causing this large deviation for SGP seems to be the one-bin approximation.

In Fig. 14, we report the true redshift distribution of three bins are shown alongside clustering-redshifts measurements for three slicing values: Δz = 0.02, 0.05, and 0.08, excluding the effects of RSD and magnification. We observe increasing deviations with finer slicing, whilst the shape reconstruction becomes less precise for coarser slicing (cf. Fig. 3). The small slicing deviations are higher for the mid-z and high-z bin, as one can expect since η_± increase with redshift. We also see that large slices lead to worst shape estimate for the low-z bin, as its width is narrower.

Fig. 14.

True-redshift distributions of tomographic bins in black, and the clustering-redshifts measurements with M5 for three spectroscopic slicings: Δz = 0.02, 0.05, 0.08 (orange, red, and purple).

In Appendix E we explore an alternative modelling including correlation with neighbours slices ( cf. Sect. 3.2.3). In short, despite our effort we did not manage to improve the accuracy of the modelling with extra-terms.

4.8. Comparison of the different bias-correction methods

In this section, we explore the different possibilities for breaking the degeneracy between n_p(z) and b_p b_s from w_sp (cf. Eq. (47)). In the first subsection, we focus on measuring the photometric bias alone, since this is the challenging aspect.

4.8.1. Measuring the photometric galaxy bias evolution

In Sect. 3.3.2, we proposed to improve existing photometric bias corrections (van den Busch et al. 2020; Cawthon et al. 2022) with two methods to measure b_p(z), applicable to real data, specifically for photometric galaxies with z_photo < 1.6 (i.e. fully covered by spectroscopic samples). The larger binning method M3 is applied in Fig. 15 and the smaller binning method M4 is applied in Fig. 16. They are represented by orange dashed lines. For both figures we also include the results using either the partial correction (blue dashed) or width correction (green dashed), for comparisons with previous analyses (van den Busch et al. 2020; Cawthon et al. 2022; Giannini et al. 2024). Additionally, we present the best fit degree-3 polynomial for the partial and the full corrections (solid lines) from a standard χ² minimisation. The grey line shows the measurement using true-z, which we want to recover after the corrections.

Fig. 15.

True w_pp for Δz = 0.05 in grey and measurement with corrections factors including M3 as dashed green, blue, and orange lines for photo-z binning Δz = 0.1. The best fits from degree-3 polynomials are reported with solid lines. The w_pp used for M3 is the best-fit model of the full correction (solid red line).

Fig. 16.

Same as Fig. 15, but for M4, with a smaller photo-z binning Δz = 0.02. The w_pp used for M4 is the best-fit model of the full correction (solid red line).

We find that assuming the photo-z as the true-z (width-only correction), neither the amplitude nor the redshift evolution is well measured. For the partial correction, whilst the redshift evolution appears well-recovered, differences in amplitude persist. Finally, our full correction method significantly outperforms the others: in both figures, we recover the amplitude and redshift evolution of the bias more accurately. Notably, the bias evolves weakly for z < 1, suggesting that correcting the photometric bias at low redshift may not be necessary.

Since the partial and full corrections both give a good trend for the redshift evolution of the galaxy bias, one could argue their performance are identical in the context of clustering redshifts, as the measured n_p(z) can always be latterly renormalised. With the correct amplitude for the galaxy bias, evaluating whether the n_p(z) measured is directly normalised to unity can provide valuable information. Deviations from unity could indicate

a contamination from systematics, such as the small scale effect illustrated in Sect. 4.3;
the photometric sample is not fully covered by spec (and what is the fraction);
a problem with the spectroscopic redshifts, such as interlopers.

In that sense, the full correction performs significantly better than the partial one. The performance of our bias measurement may depend on the specific photo-z code and the quality of the available photometry, and data implementation will have to address this question.

4.8.2. Comparison of the methods

In this section, we compare the different bias correction methods introduced in Sect. 3.3.1, including M3 and M4. We aim to evaluate the bias correction performance, thus we work in an idealised framework with independent bins, no RSD, and no magnification (i.e. the “independent bin” test of Sect. 4.7). We employ a redshift slicing Δz = 0.05. We report in colours the deviations on the mean redshift and standard deviation for the six methods M0–M5, in Figs. 17 (for SSM) and 18 (for SGP). For each method, we have five realisations by varying the sky patch. We also give the values for the biases on mean-z and standard deviation in Table D.1, in Appendix D.

Fig. 17.

Deviations with the SSM fitting on the mean redshift (upper panel) and standard deviation (lower panel) for the six methods M0–M5 in colour, for five sky patches (same colour). The requirements are represented by the shaded area. The dark and light error bars indicate 68.3% and 95.5% uncertainties.

Fig. 18.

Same as Fig. 17 with the SGP fitting.

We can see that at low-z it is not necessary to correct for the photometric bias, and spectroscopic correction M1 produces good estimates. At mid-z it is necessary to correct at least for the spectroscopic bias, and ideally for both with M3 or M4. At high-z, it is essential to correct both biases. We observe that M3 performs significantly better than M4 for SSM and SGP; indeed, the redshift evolution is better captured in Fig. 15 than in Fig. 16. This difference may be attributed to the fact that smaller slices are more sensitive to systematic effects, as discussed in Sect. 4.6. Overall, M3 performs similarly to the idealistic M5 and is consistently the best bias correction method, exhibiting performance akin to M1 for low-z (though with slightly larger errorbars).

In conclusion, using M3 for every bin yields unbiased estimates of the mean redshift and standard deviation across all bins, with uncertainties that fall within the requirements for both methods. The values presented in these two figures are reported in Table D.2.

4.9. Tomographic bin uncertainties

We defined the tomographic bins as equi-populated, based on the assumption that the uncertainty scales with the total number of galaxies in each bin. For the 10 spectroscopic samples, we used 1-DESI-BGs, 2-BOSS, 3-DESI-LRGs, 4-DESI-LRGs, 5-DESI-LRGs, 6-DESI-ELGs, 7-DESI-ELGs, 8-Euclid-NISP, 9-Euclid-NISP, 10-Euclid-NISP.

In Fig. 19 we plot in red the uncertainty on the mean redshift σ(⟨z⟩) for every tomographic sample, and its standard deviation σ(σ(⟨z⟩)), evaluated for the five 1000 deg² sky-patches with jackknife covariance, with SSM fitting. Interestingly, we find that the uncertainty is not constant. It remains the same for the first two samples, then decreases across the next six samples, before increasing again for the final ones. We also report for the first two samples, the case of 2500 deg² sky patches in dark red, and observe a mean-z uncertainty reduction of 40%.

Fig. 19.

Uncertainties on mean-z with SSM fitting for five sky realisations in red. We also plot the best-fit uncertainty model in blue.

McQuinn & White (2013) predicted that “the linear bias times number of objects in a redshift bin generally can be constrained with cross-correlations to fractional error $\approx \sqrt{10^{2} N_{Δ z} {(d N_{s} / d z)}^{- 1}}$ $\approx \sqrt{10^{2} N_{\Delta z}({\mathrm{d} N_{\mathrm{s}}/\mathrm{d} z})^{-1}}$ ”, where N_Δz is the number of spectroscopic redshift slices, and N_s the number of spectroscopic galaxies¹⁷. This expression describes how well the redshift distribution can be measured, whereas we are primarily interested in its moments. We assume that, at first order, the scaling should behave similarly, but with a different coefficient, which is complicated to predict because of the complexity of the mean-z fitting procedure. As a consequence, the uncertainty should depend on the number of spectroscopic galaxies, rather than on the volume or the number of photometric galaxies. However, for the low-z bins, the spectroscopic density is sufficiently high, especially with DESI-BG-like sample, that we might enter a cosmic variance (CV) regime. In this case, uncertainties are no longer sample-dominated (i.e. the shot noise 1/n_s is subdominant). Thus, we introduce a maximum spectroscopic density to reach the cosmic variance limit n_cv, which in principle depends on z, but that we assume to be constant. Our uncertainty model is

$\begin{matrix} U (z) = A \sqrt{\frac{N_{Δ z}}{V_{TB} min (n_{spec}, n_{cv})}}, \end{matrix}$ $\begin{aligned} U(z) = A \;\sqrt{\frac{N_{\Delta z}}{V_{\rm TB}\,\min \, (n_{\rm spec}, \, n_{\rm cv})}}\,, \end{aligned}$ (91)

where A is a constant independent of the tomographic bin, and V_TB is the volume of the tomographic bin.

The equation above also indicates that the uncertainty scales with the inverse of the square root of the tomographic volume. This explains why in Fig. 8 of Naidoo et al. (2023), the uncertainty scales as a power law of the survey area A^γ, and the best-fit power law is close to γ = −1/2.

We constrained the A and n_cv coefficients with the set of tomographic bin measurements using a χ² procedure, and report in Fig. 19 the best-fit model in blue for 1000 and 2500 deg². We observe a good agreement, particularly given that we neglected complications arising from galaxy biases and the mean-redshift fitting procedure. The uncertainty is largest for the low-z tomographic bins where the survey volume is much smaller and precision is limited by the CV regime. For the high-z samples, the uncertainty increases due to the reduced density of spectra, which is not sufficiently offset by the larger volume. The best-fit n_cv roughly corresponds to the DESI-BGS density divided by a factor of four (and the BOSS density by a factor of 1.8), and it only plays a role for the first two bins.

As a result, to achieve consistent uncertainty across bins with clustering-z, the tomographic bin definition would need to be adjusted according to an uncertainty model like Eq. (91). However, the tomographic bin definition is primarily influenced by cosmological constraining power (e.g. Euclid Collaboration: Pocino et al. 2021), and our results alone do not provide sufficient grounds to recommend a change in binning strategy. However, the precision of the mean redshift is one of the key limiting factors in cosmic shear analyses (see, e.g., Reischke 2024), and the impact of bin definitions on the accuracy of redshift calibration is often overlooked.

5. Application to the ten Euclid tomographic bins

In this section, we applied our pipeline to calibrate the ten tomographic bins described in Sect. 2.2. For the spectroscopic samples, following our recommendation (see Sect. 4.3) we only correlated a tomographic bin with one spectroscopic tracer at a time. We used the following tracers for the different bins: 1-DESI-BGs, 2-BOSS, 3-DESI-LRGs, 4-DESI-LRGs, 5-DESI-LRGs, 6-DESI-ELGs, 7-DESI-ELGs, 8-Euclid-NISP, 9-Euclid-NISP, 10-Euclid-NISP. For simplicity, we did not consider the joint constraint of several spectroscopic tracers covering the same bin, which would provide additional information. For each bin, we produced several independent realisations by selecting distinct patches in Flagship 2. For the first two tomographic bins, we generated two 2500 deg⁻² patches, while for the remaining eight bins, we used five 1000 deg⁻² patches. This approach ensures similar uncertainties across bins, as explained in Sect. 4.9.

The true tomographic bin distribution, and the clustering-redshifts measurements are reported in Fig. 20. The measurements are visually indistinguishable from the truth. To quantify how good they are, we show the (optimistic) SSM calibration of the mean-z and standard deviations in Fig. 21. The results with the conservative SGP calibration are shown in Fig. 22. For comparison, we also plot the previous results of Naidoo et al. (2023) for the mean redshift (with Flagship1 samples). Our results using the SSM method are fully successful; for every bin and patch realisation, we meet the Euclid requirements for the mean redshift and shape, unlike prior results. However, as expected, the "conservative" SGP method yields larger uncertainties and deviations. In particular, for some bins (especially bins 1, 3, and 7), we observe slightly larger deviations and uncertainties than allowed by the mean redshift requirement. We tested the effect of increasing the patch sizes by a factor of two and found that the mean-z bias was reduced on average by 40%, which was sufficient to meet the requirement for all bins.

Fig. 20.

True-redshift distribution of the simulated Flagship2 ten tomographic bins with z_p < 1.6 (solid lines) and their clustering-redshifts measurements with our pipeline, including all systematics and realistic bias correction M3. The first two bin measurements are realised with 2500 deg² sky patches, and the other eight are realised with 1000 deg² sky patches. The dark and light error bars indicate 68.3% and 95.5% uncertainties, but are barely visible by eye.

Fig. 21.

Deviations in the mean redshift (top) and the redshift standard deviation (bottom) for the ten Flagship2 tomographic bins relative to the true-redshift distributions using the SSM method. The deviations are shown for each bin and sky patch. We consider patches of 2500 deg² for the first two bins, and 1000 deg² for the others. The Euclid requirements on the uncertainty are highlighted by the shaded area centred around zero. The uncertainties are consistent across all the bins for the mean redshift and we achieve a clear improvement from previous work shown in red. For the redshift standard deviations, the uncertainties are slightly underestimated, but this is within an acceptable range given the requirements.

Fig. 22.

Deviations in the mean redshift (top) and the redshift standard deviation (bottom) using the SGP method for the same bin configuration as in Fig. 21. The Euclid requirements on the uncertainty are highlighted by the shaded area centred around zero. For some bins (particularly bins 1, 3, and 7), we observe slightly larger deviations and uncertainties than permitted on the mean redshift. However, a clear improvement compared to previous work (shown in red) is still evident. The redshift standard deviations are unbiased and fall within the required range (shaded sandy area).

Part of the improvement to Naidoo et al. (2023) is due to the use of a larger cosmic volume with Flagship2 instead of Flagship1. Nevertheless, with our pipeline we also got significantly lower mean redshift biases than Naidoo et al. (2023). The main differences between our pipeline and the previous one are our estimator (cf. Sect. 4.2), the scale range (cf. Sect. 4.4), the measurements being evaluated for each spectroscopic tracer (cf. Sect. 4.3), the bias correction method (cf. Sect. 4.8), and some improvement in the implementation of SSM and SGP (cf. Sect. 3.5). We note that we did include the effect of magnification contrary to Naidoo et al. (2023). In conclusion, we predict that clustering-redshifts will be able to satisfy the Euclid requirements.

6. Conclusion

We optimised and evaluated the performance of the clustering-redshifts method for calibrating the redshift distributions of Euclid tomographic bins.

Using the Flagship2 simulation, we generated realistic spectroscopic and photometric mock samples based on BOSS, DESI, and Euclid. The photometric sample was divided into ten equally populated tomographic bins, with a maximum photo-z of 1.6. We used up to five 1000 deg² patches to fully exploit the volume of the simulation.

We measured the angular auto- and cross-correlations of these samples under various methodological choices and investigated alternative approximations to model the clustering.

From these clustering measurements, we calibrated the redshift distributions via two distinct methods. One method involved fitting the data points using a model n(z| p₁, …) with free parameters. In practice, a complementary calibration, such as the self-organising map can be used to define the model (e.g., Hildebrandt et al. 2021; Gatti et al. 2022). The other method relies on Gaussian processes to directly fit the data without assuming a specific model. With the latter, the SOM and clustering-redshifts calibrations can both be run independently, or be combined as in the case of DES (Myles et al. 2021; Gatti et al. 2022).

In Sect. 4 we investigated and concluded the following about the clustering-redshifts technique and angular clustering.

In Sect. 4.2 we argued that the integrated pair-count estimator w⁽¹⁾ is a better choice than w⁽²⁾ for measuring the integrated clustering (i.e. integrating the ratio DD/RR instead of taking the ratio of the integrated DD and RR).
In Sect. 4.3 we explained a potential issue with very small scales that arises from the one-halo properties of the clustering. We concluded that scales should always be larger than 1.5 Mpc to limit this problem. We also recommended separating the clustering-redshifts measurements for the different spectroscopic tracers.
In Sects. 4.4 and 4.5, we ran an optimisation of the scale range choice and weighting. With the Flagship simulation, we achieved a good compromise with 1.5 < r_p < 5 Mpc with an inverse scale weighting.
In Sect. 4.6 we showed that the optimal spectroscopic slicing given our tomographic bins and the different systematics was Δz ≈ 0.05.
In Sect. 4.7 we showed that approximating ∫dz n²(z)⋯ (Limber-full-z) with some n²(z_i)∫dz⋯ expressions (Limber-1-bin) is the main systematic effect and that magnification has some limited impact on the n_p(z) inference.
In Sect. 4.8 we compared different bias corrections and introduced a way to measure the photometric galaxy bias by splitting the TB into smaller bins and interpolating these effective biases over redshift (i.e. M3). The results of this method were comparable to the ideal case M5, in which photometric bias is measured using true redshifts. It should be used as the fiducial correction scheme.
Finally, we investigated in Sect. 4.9 the dependence of the mean-z error bars on the clustering-redshifts parameters, and confirmed that they are mainly driven by the number of spectroscopic galaxies that is available in the footprint.

In Sect. 5 we applied the optimised pipeline to calibrate the Euclid simulated tomographic bins. The calibration performance using the SSM model met or exceeded the required accuracy, while the SGP method yielded comparable results, but they were slightly below the requirements in certain cases. It is important to note that these results were derived from limited sky patches with correction methods that can still be improved in future work. These results represent a significant improvement over the previous Euclid clustering-redshifts study (Naidoo et al. 2023) overall and demonstrated that the clustering-redshifts technique is robust.

Additionally, as discussed in Sect. 2, we assumed that the H α sample had a purity of 100%. In reality, however, when we bin the sample into small redshift intervals, we expect that some of these galaxies may be interlopers, that is, they might be located at different redshifts due to noise or a misidentification of a spectral lines. This situation might significantly impact clustering-redshifts measurements because photometric galaxies at a similar redshifts as the interlopers would contribute to the clustering signal, which would bias the inferred n(z). It is therefore crucial to determine the fraction of these interlopers and their redshift distribution to correct for this effect. The clustering-redshifts technique can indeed be used to constrain the properties of interlopers by cross-correlating the NISP sample with other spectroscopic data or with photometric data with priors on their n(z).

As already mentioned in Sect. 4.3, the galaxy-halo connection implemented in the Flagship simulation might not be accurate, and our statistical estimator might be optimistic. It might therefore still be beneficial to use larger scales even though they are not statistically optimal. These larger scales will be used for the 3 × 2 analysis, which necessitates the generation of a more complex covariance to accurately propagate the correlation with the calibration (e.g. the work of Johnston et al. 2025). Although this approach is more complicated, the advantage is that the distribution would be free of small-scale potential bias, which may prove worthwhile.

Finally, we restricted the calibration to galaxies with a maximum photo-z of 1.6 because we lack dense spectroscopic samples above redshift 1.8, which sets a strict upper limit. Clustering-z for HSC, KIDS, and DES was even more limited than Euclid, with a maximum redshift of z ∼ 1. In the far future numerous spectroscopic samples from Lyman-break galaxies are expected to be observed by spectroscopic experiments such as DESI-II, MUST, and WST (if funded). In a much shorter time frame, QSOs might be used one could utilise QSOs to cover the redshift range 1.5 < z < 3. These constraints alone would most likely not fulfil the Euclid requirements because QSOs have a low-density, however, but they would still provide valuable information.

¹

https://github.com/wdassignies/Clustering_z.git

²

In the linear regime, RSD contribute to the projected galaxy density contrast through the apparent large-scale flow, of galaxies across the redshift boundaries of the samples. The redshift z we use to parametrise the D and μ terms includes the peculiar velocity, and is referred to as observed redshift.

³

Our Universe and its observation is assumed to be isotropic. Although the real isotropy of our Universe seems a reasonable hypothesis, observation may not always support it. For example, closer to the Galactic plane, there is more star contamination, leading to angular anisotropy. The Euclid observation strategy might also induce some angular anisotropy. This could potentially introduce bias on the cosmological parameters, especially for experiments such as Euclid or LSST (Baleato Lizancos & White 2023). Indeed, clustering-redshifts may be a valuable tool to investigate this issue by splitting the samples into independent sky patches.

⁴

The magnification of the two samples by the lower-z matter is negligible as it is a smaller correction (μ × μ cf. Sect. 3.1.2).

⁵

Positive convergence is induced by galaxy over-density. Correlation with voids would probe the negative convergence regime.

⁶

https://github.com/LSSTDESC/CCL

⁷

For scales that are neither too small nor too large, it should be a valid hypothesis, at least to first order.

⁸

Of course, in practice, all the previous integrals are discrete, with

$\begin{matrix} \int d r_{p} ⟶ \sum_{i} Δ {r_{p}}^{i} . \end{matrix}$ $\begin{aligned} \int {\mathrm{d}r_{\rm p}}\longrightarrow \sum _i \Delta {r_{\rm p}}^i\,. \end{aligned}$ (63)

⁹

We use the python package emcee (Foreman-Mackey et al. 2013).

¹⁰

We use the python package Georges (Ambikasaran et al. 2015).

¹¹

The length-scale represents the characteristic distance over which the function values are correlated.

¹²

We applied cuts on the RA coordinates, with a width of ΔRA = 17.5 deg.

¹³

As shown in Appendix A for quadratic non-linearities (second-order effects), we expect r_ab ≈ 1, with the correction arising from third-order terms. Pushing further the perturbative development, we can show that indeed r_ab ≤ 1, analogously to the Cauchy–Schwarz inequality.

¹⁴

We note that we are using angular correlation, so most of the 3-D separation between galaxies is much larger than this 1.5 Mpc.

¹⁵

In practice, it is evaluated for a small scale bin.

¹⁶

We expect the results with M2-3-4 to be similar to that of M5.

¹⁷

We have slightly adjusted the convention to remain consistent with the rest of our article.

¹⁸

There are different and confusing conventions in the literature. For instance, sometimes α is given as s, 2.5s, or 5s − 2.

Acknowledgments

The authors thank Alex Alarcon, Carles Sánchez, Elisa Chisari, and Santi Ávila for enlightening discussions, as well as Krishna Naidoo for his help, code, and results sharing. The authors also acknowledge Mathias Urbano for his code support. This work was supported by the MICINN projects PID2019-111317GB-C32, PID2021-123012NB-C41, PID2022-141079NB-C32, CNS2023-144328 (Carteu) as well as predoctoral program AGAUR-FI ajuts (2024 FI-1 00692) Joan Oró. IFAE is partially funded by the CERCA program of the Generalitat de Catalunya. This work has made use of CosmoHub, developed by PIC (maintained by IFAE and CIEMAT) in collaboration with ICE-CSIC. It received funding from the Spanish government (grant EQC2021-007479-P funded by MCIN/AEI/10.13039/501100011033), the EU NextGeneration/PRTR (PRTR-C17.I1), and the Generalitat de Catalunya. The Euclid Consortium acknowledges the European Space Agency and a number of agencies and institutes that have supported the development of Euclid, in particular the Agenzia Spaziale Italiana, the Austrian Forschungsförderungsgesellschaft funded through BMIMI, the Belgian Science Policy, the Canadian Euclid Consortium, the Deutsches Zentrum für Luft- und Raumfahrt, the DTU Space and the Niels Bohr Institute in Denmark, the French Centre National d’Etudes Spatiales, the Fundação para a Ciência e a Tecnologia, the Hungarian Academy of Sciences, the Ministerio de Ciencia, Innovación y Universidades, the National Aeronautics and Space Administration, the National Astronomical Observatory of Japan, the Netherlandse Onderzoekschool Voor Astronomie, the Norwegian Space Agency, the Research Council of Finland, the Romanian Space Agency, the State Secretariat for Education, Research, and Innovation (SERI) at the Swiss Space Office (SSO), and the United Kingdom Space Agency. A complete and detailed list is available on the Euclid web site (https://www.euclid-ec.org/).

References

Adame, A. G., Aguilar, J., Ahlen, S., et al. 2024, AJ, 167, 62 [NASA ADS] [CrossRef] [Google Scholar]
Addison, G. E., Bennett, C. L., Jeong, D., Komatsu, E., & Weiland, J. L. 2019, ApJ, 879, 15 [NASA ADS] [CrossRef] [Google Scholar]
Ambikasaran, S., Foreman-Mackey, D., Greengard, L., Hogg, D. W., & O’Neil, M. 2015, IEEE Trans. Pattern Anal. Mach. Intell., 38, 252 [Google Scholar]
Baleato Lizancos, A., & White, M. 2023, JCAP, 07, 044 [Google Scholar]
Behroozi, P. S., Wechsler, R. H., & Wu, H.-Y. 2013, ApJ, 762, 109 [NASA ADS] [CrossRef] [Google Scholar]
Blas, D., Lesgourgues, J., & Tram, T. 2011, JCAP, 07, 034 [CrossRef] [Google Scholar]
Bordoloi, R., Lilly, S. J., & Amara, A. 2010, MNRAS, 406, 881 [NASA ADS] [Google Scholar]
Campos, A., Yin, B., Dodelson, S., et al. 2024, ArXiv e-prints [arXiv:2408.00922] [Google Scholar]
Carretero, J., Tallada, P., Casals, J., et al. 2017, in Proceedings of the EuropeanPhysical Society Conference on High Energy Physics. 5-12 July, 488 [Google Scholar]
Cawthon, R., Elvin-Poole, J., Porredon, A., et al. 2022, MNRAS, 513, 5517 [NASA ADS] [CrossRef] [Google Scholar]
Chaves-Montero, J., Angulo, R. E., & Contreras, S. 2023, MNRAS, 521, 937 [Google Scholar]
Chisari, N. E., Alonso, D., Krause, E., et al. 2019, ApJS, 242, 2 [Google Scholar]
Contarini, S., Verza, G., Pisani, A., et al. 2022, A&A, 667, A162 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Davis, M., & Peebles, P. J. E. 1983, ApJ, 267, 465 [Google Scholar]
Davis, C., Rozo, E., Roodman, A., et al. 2018, MNRAS, 477, 2196 [Google Scholar]
Dawson, K. S., Schlegel, D. J., Ahn, C. P., et al. 2013, AJ, 145, 10 [Google Scholar]
DESI Collaboration (Aghamousa, A., et al.) 2016, ArXiv e-prints [arXiv:1611.00037] [Google Scholar]
Elvin-Poole, J., MacCrann, N., Everett, S., et al. 2023, MNRAS, 523, 3649 [NASA ADS] [CrossRef] [Google Scholar]
Eriksen, M., Alarcon, A., Cabayol, L., et al. 2020, MNRAS, 497, 4565 [CrossRef] [Google Scholar]
Euclid Collaboration (Blanchard, A., et al.) 2020, A&A, 642, A191 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Euclid Collaboration (Desprez, G., et al.) 2020, A&A, 644, A31 [EDP Sciences] [Google Scholar]
Euclid Collaboration (Ilbert, O., et al.) 2021, A&A, 647, A117 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Euclid Collaboration (Pocino, A., et al.) 2021, A&A, 655, A44 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Euclid Collaboration (Castander, F., et al.) 2025, A&A, 697, A5 [Google Scholar]
Euclid Collaboration (Cropper, M., et al.) 2025, A&A, 697, A2 [Google Scholar]
Euclid Collaboration (Jahnke, K., et al.) 2025, A&A, 697, A3 [Google Scholar]
Euclid Collaboration (Le Brun, V., et al.) 2025, A&A, submitted [Google Scholar]
Euclid Collaboration (Mellier, Y., et al.) 2025, A&A, 697, A1 [Google Scholar]
Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]
Gao, H., Jing, Y. P., Xu, K., et al. 2024, ApJ, 961, 74 [Google Scholar]
Gatti, M., Vielzeuf, P., Davis, C., et al. 2018, MNRAS, 477, 1664 [Google Scholar]
Gatti, M., Giannini, G., Bernstein, G. M., et al. 2022, MNRAS, 510, 1223 [Google Scholar]
Giannini, G., Alarcon, A., Gatti, M., et al. 2024, MNRAS, 527, 2010 [Google Scholar]
Gwyn, S., McConnachie, A. W., Cuillandre, J. C., et al. 2025, ArXiv e-prints [arXiv:2503.13783] [Google Scholar]
Hartlap, J., Simon, P., & Schneider, P. 2007, A&A, 464, 399 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Hartley, W. G., Chang, C., Samani, S., et al. 2020, MNRAS, 496, 4769 [Google Scholar]
Hildebrandt, H. 2016, MNRAS, 455, 3943 [NASA ADS] [CrossRef] [Google Scholar]
Hildebrandt, H., van den Busch, J. L., Wright, A. H., et al. 2021, A&A, 647, A124 [EDP Sciences] [Google Scholar]
Huterer, D., Takada, M., Bernstein, G., & Jain, B. 2006, MNRAS, 366, 101 [Google Scholar]
Jarvis, M. 2015, Astrophysics Source Code Library [record ascl:1508.007] [Google Scholar]
Johnson, A., Blake, C., Amon, A., et al. 2017, MNRAS, 465, 4118 [Google Scholar]
Johnston, H., Elisa Chisari, N., Joudaki, S., et al. 2025, A&A, 699, A127 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Krause, E., Fang, X., Pandey, S., et al. 2021, ArXiv e-prints [arXiv:2105.13548] [Google Scholar]
Landy, S. D., & Szalay, A. S. 1993, ApJ, 412, 64 [Google Scholar]
Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193] [Google Scholar]
Lepori, F., Schulz, S., Tutusaus, I., et al. 2025, A&A, 694, A321 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Pandey, S., Sánchez, C., Jain, B., & LSST Dark Energy Science Collaboration 2025, Phys. Rev. D, 111, 023527 [Google Scholar]
LSST Science Collaboration (Abell, P. A., et al.) 2009, ArXiv e-prints [arXiv:0912.0201] [Google Scholar]
McQuinn, M., & White, M. 2013, MNRAS, 433, 2857 [Google Scholar]
Ménard, B., Scranton, R., Schmidt, S., et al. 2013, ArXiv e-prints [arXiv:1303.4722] [Google Scholar]
Morrison, C. B., Hildebrandt, H., Schmidt, S. J., et al. 2017, MNRAS, 467, 3576 [Google Scholar]
Myles, J., Alarcon, A., Amon, A., et al. 2021, MNRAS, 505, 4249 [NASA ADS] [CrossRef] [Google Scholar]
Naidoo, K., Johnston, H., Joachimi, B., et al. 2023, A&A, 670, A149 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Newman, J. A. 2008, ApJ, 684, 88 [Google Scholar]
Nicola, A., Hadzhiyska, B., Findlay, N., et al. 2024, JCAP, 02, 015 [Google Scholar]
Palanque-Delabrouille, N., Magneville, C., Yèche, C., et al. 2016, A&A, 587, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Pandey, S., Krause, E., Jain, B., et al. 2020, Phys. Rev. D, 102, 123522 [CrossRef] [Google Scholar]
Percival, W. J., Friedrich, O., Sellentin, E., & Heavens, A. 2022, MNRAS, 510, 3207 [NASA ADS] [CrossRef] [Google Scholar]
Rau, M. M., Dalal, R., Zhang, T., et al. 2023, MNRAS, 524, 5109 [NASA ADS] [CrossRef] [Google Scholar]
Reischke, R. 2024, MNRAS, 530, 4412 [NASA ADS] [CrossRef] [Google Scholar]
Rocher, A., Ruhlmann-Kleider, V., Burtin, E., et al. 2023, JCAP, 10, 016 [Google Scholar]
Roster, W., Wright, A. H., Hildebrandt, H., et al. 2025, ApJ submitted [arXiv:2508.02779] [Google Scholar]
Salvato, M., Ilbert, O., & Hoyle, B. 2019, Nat. Astron., 3, 212 [NASA ADS] [CrossRef] [Google Scholar]
Schmidt, S. J., Ménard, B., Scranton, R., Morrison, C., & McBride, C. K. 2013, MNRAS, 431, 3307 [Google Scholar]
Scottez, V., Benoit-Lévy, A., Coupon, J., Ilbert, O., & Mellier, Y. 2018, MNRAS, 474, 3921 [Google Scholar]
Simon, P. 2007, A&A, 473, 711 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Stölzner, B., Joachimi, B., Korn, A., Hildebrandt, H., & Wright, A. H. 2021, A&A, 650, A148 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Swanson, M. E. C., Tegmark, M., Blanton, M., & Zehavi, I. 2008, MNRAS, 385, 1635 [Google Scholar]
Takahashi, R., Sato, M., Nishimichi, T., Taruya, A., & Oguri, M. 2012, ApJ, 761, 152 [Google Scholar]
Tallada, P., Carretero, J., Casals, J., et al. 2020, Astron. Comput., 32, 100391 [Google Scholar]
Tanaka, M., Coupon, J., Hsieh, B.-C., et al. 2018, PASJ, 70, S9 [Google Scholar]
van den Busch, J. L., Hildebrandt, H., Wright, A. H., et al. 2020, A&A, 642, A200 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Verdier, A., Rocher, A., Bandi, B., et al. 2025, ArXiv e-prints [arXiv:2508.07311] [Google Scholar]
Wright, A. H., Hildebrandt, H., van den Busch, J. L., & Heymans, C. 2020a, A&A, 637, A100 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Wright, A. H., Hildebrandt, H., van den Busch, J. L., et al. 2020b, A&A, 640, L14 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
Wright, A. H., Hildebrandt, H., van den Busch, J. L., et al. 2025, A&A, submitted [arXiv:2503.19440] [Google Scholar]
Yèche, C., Palanque-Delabrouille, N., Claveau, C.-A., et al. 2020, Res. Notes Am. Astron. Soc., 4, 179 [Google Scholar]
Yuan, S., Wechsler, R. H., Wang, Y., et al. 2025, MNRAS, 538, 1216 [Google Scholar]

Appendix A: Higher-order development of the galaxy bias

We introduced a first-order bias expansion in Sect. 3.1.3

$\begin{matrix} δ_{x} = b_{x}^{(1)} δ_{m} + O (δ_{m}) . \end{matrix}$ $\begin{aligned} \delta _x=b_{x}^{(1)}\,\delta _{\rm m} +{{\mathcal{O} }\left(\delta _{\rm m}\right)}\,. \end{aligned}$ (A.1)

The two-point correlation between two samples a and b localised at the same redshift is

$\begin{matrix} ξ_{ab} (θ) : = {⟨ δ_{a} δ_{b} ⟩}_{θ} = b_{a} b_{b} {⟨ δ_{m} δ_{m} ⟩}_{θ} + O (⟨ δ_{m} δ_{m} ⟩) . \end{matrix}$ $\begin{aligned} \xi _{ab}(\theta ):=\langle \delta _a\,\delta _b\rangle _{\theta }= b_a\,b_b\,\langle \delta _{\rm m}\delta _{\rm m}\rangle _{\theta }+{\mathcal{O} }\left(\langle \delta _{\rm m}\delta _{\rm m}\rangle \right)\,. \end{aligned}$ (A.2)

We are using the ratio of correlations to infer n(z), as it is equal to unity at main order (cf. Sect. 3.3),

$\begin{matrix} \frac{ξ_{ab} (θ)}{\sqrt{ξ_{aa} (θ) ξ_{bb} (θ)}} = 1 + O (1) . \end{matrix}$ $\begin{aligned} \frac{\xi _{ab}(\theta )}{\sqrt{\xi _{aa}(\theta )\,\xi _{bb}(\theta )}} = 1+{\mathcal{O} }\left(1\right)\,. \end{aligned}$ (A.3)

In this appendix we investigate the impact of second-order terms in the bias expansion on the ratio Eq. (A.3). We can express the galaxy overdensity δ_g perturbatively up to second-order with

$\begin{matrix} δ_{g} & = b_{g}^{(1)} δ_{m} + \sum_{j} b_{j, g}^{(2)} O_{j} + O (\sum_{j} O_{j}), \end{matrix}$ $\begin{aligned} \delta _{\rm g}&=b_{\rm g}^{(1)}\,\delta _{\rm m} +\sum _{j} b_{j,\,\mathrm{g}}^{(2)}\,\mathcal{O} _j + {\mathcal{O} }\left(\textstyle \sum _j \mathcal{O} _j\right)\,, \end{aligned}$ (A.4)

where 𝒪_j are a set of second-order fields, with ⟨𝒪_j⟩ = 0. We note that we do not consider the stochastic contribution, as we are working with real-space correlations. For example, in Pandey et al. (2020) and Nicola et al. (2024) they used

$\begin{matrix} \sum_{j} b_{j, g}^{(2)} O_{j} = \frac{b_{g}^{(2)}}{2} (δ_{m}^{2} - ⟨ δ_{m}^{2} ⟩) + \frac{s_{g}}{2} (s^{2} - ⟨ s^{2} ⟩) + d_{g} \nabla^{2} δ_{m}, \end{matrix}$ $\begin{aligned} \sum _{j} b_{j,\,\mathrm{g}}^{(2)}\,\mathcal{O} _j =\frac{b_{\rm g}^{(2)}}{2}(\delta _{\rm m}^2-\langle \delta _{\rm m}^2\rangle )+\frac{s_{\rm g}}{2}(s^2-\langle s^2\rangle )+d_{\rm g}\,\nabla ^2\delta _{\rm m}\,, \end{aligned}$ (A.5)

where $b_{g}^{(2)}$ $b_{\mathrm{g}}^{(2)}$ is the quadratic bias, s_g is the tidal bias, s² is the trace squared of the tidal tensor, and d_gal is the non-local bias. The correlation between a and b (with possibly a = b) at second order is:

$\begin{matrix} ξ_{ab} & = b_{a}^{(1)} b_{b}^{(1)} ⟨ δ_{m} δ_{m} ⟩ + \sum_{j} (b_{j, a}^{(2)} b_{b}^{(1)} + b_{a}^{(1)} b_{j, b}^{(2)}) ⟨ δ_{m} O_{j} ⟩ + O (\sum ⟨ δ_{m} O_{j} ⟩) \end{matrix}$ $\begin{aligned} \xi _{ab}&= b_{a}^{(1)}\,b_{b}^{(1)}\langle \delta _{\rm m} \delta _{\rm m} \rangle +\sum _j \left(b_{j,\,a}^{(2)}\,b_{b}^{(1)}+b_{a}^{(1)}\,b_{j,\,b}^{(2)}\right)\langle \delta _{\rm m}\, \mathcal{O} _j\rangle +{\mathcal{O} }\left(\textstyle \sum \langle \delta _{\rm m}\,\mathcal{O} _j\rangle \right)\end{aligned}$ (A.6)

$\begin{matrix} = b_{a}^{(1)} b_{b}^{(1)} ⟨ δ_{m} δ_{m} ⟩ (1 + \sum_{j} (\frac{b_{j, a}^{(2)}}{b_{a}^{(1)}} + \frac{b_{j, b}^{(2)}}{b_{b}^{(1)}}) \frac{⟨ δ_{m} O_{j} ⟩}{⟨ δ_{m} δ_{m} ⟩} + O (\sum \frac{⟨ δ_{m} O_{j} ⟩}{⟨ δ_{m} δ_{m} ⟩})) . \end{matrix}$ $\begin{aligned}&= b_{a}^{(1)}\,b_{b}^{(1)}\langle \delta _{\rm m} \delta _{\rm m} \rangle \left(1+\sum _j\left(\frac{b_{j,\,a}^{(2)}}{b_{a}^{(1)}}+\frac{b_{j,\,b}^{(2)}}{b_{b}^{(1)}}\right)\frac{\langle \delta _{\rm m}\,\mathcal{O} _j\rangle }{\langle \delta _{\rm m}\delta _{\rm m}\rangle }+{\mathcal{O} }\left(\textstyle \sum \frac{\langle \delta _{\rm m}\,\mathcal{O} _j\rangle }{\langle \delta _{\rm m}\delta _{\rm m}\rangle }\right)\right)\,. \end{aligned}$ (A.7)

We note we did drop the θ index for clarity. Using this relation for the product of the auto-correlation, we get at second order,

$\begin{matrix} {(ξ_{aa} ξ_{bb})}^{- 1 / 2} & = {(b_{a}^{(1)} b_{b}^{(1)} ⟨ δ_{m} δ_{m} ⟩)}^{- 1} {(1 + \sum_{j} (2 \frac{b_{j, a}^{(2)}}{b_{a}^{(1)}} + 2 \frac{b_{j, b}^{(2)}}{b_{b}^{(1)}}) \frac{⟨ δ_{m} O_{j} ⟩}{⟨ δ_{m} δ_{m} ⟩} + O (\sum \frac{⟨ δ_{m} O_{j} ⟩}{⟨ δ_{m} δ_{m} ⟩}))}^{- 1 / 2} \end{matrix}$ $\begin{aligned} \left(\xi _{aa}\,\xi _{bb}\right)^{-1/2}&=\left( b_{a}^{(1)}b_{b}^{(1)}\langle \delta _{\rm m} \delta _{\rm m}\rangle \right)^{-1} \left(1+\sum _j\left(2\frac{b_{j,\,a}^{(2)}}{b_{a}^{(1)}}+2\frac{b_{j,\,b}^{(2)}}{b_{b}^{(1)}}\right)\frac{\langle \delta _{\rm m}\,\mathcal{O} _j\rangle }{\langle \delta _{\rm m}\delta _{\rm m}\rangle }+{\mathcal{O} }\left(\textstyle \sum \frac{\langle \delta _{\rm m}\,\mathcal{O} _j\rangle }{\langle \delta _{\rm m}\delta _{\rm m}\rangle }\right)\right)^{-1/2}\end{aligned}$ (A.8)

$\begin{matrix} = {(b_{a}^{(1)} b_{b}^{(1)} ⟨ δ_{m} δ_{m} ⟩)}^{- 1} (1 - \sum_{j} (\frac{b_{j, a}^{(2)}}{b_{a}^{(1)}} + \frac{b_{j, b}^{(2)}}{b_{b}^{(1)}}) \frac{⟨ δ_{m} O_{j} ⟩}{⟨ δ_{m} δ_{m} ⟩} + O (\sum \frac{⟨ δ_{m} O_{j} ⟩}{⟨ δ_{m} δ_{m} ⟩})) . \end{matrix}$ $\begin{aligned}&=\left(b_{a}^{(1)}b_{b}^{(1)}\langle \delta _{\rm m} \delta _{\rm m}\rangle \right)^{-1} \left(1-\sum _j\left(\frac{b_{j,\,a}^{(2)}}{b_{a}^{(1)}}+\frac{b_{j,\,b}^{(2)}}{b_{b}^{(1)}}\right)\frac{\langle \delta _{\rm m}\,\mathcal{O} _j\rangle }{\langle \delta _{\rm m}\delta _{\rm m}\rangle }+{\mathcal{O} }\left(\textstyle \sum \frac{\langle \delta _{\rm m}\,\mathcal{O} _j\rangle }{\langle \delta _{\rm m}\delta _{\rm m}\rangle }\right)\right)\,.\end{aligned}$ (A.9)

$\ \ \$ (A.10)

Appendix B: Magnification modelling

Here we describe the modelling of the magnification terms cf. Eq. (34). The impact of the magnification change in flux depends on a parameter s_μ related to the selection function applied to the sample (Elvin-Poole et al. 2023). In the case of a magnitude threshold, it is the well-known derivative of the cumulative number count $N_{μ}^{c}$ $N_{\mu}^{\mathrm{c}}$ , evaluated at the maximum magnitude,

$\begin{matrix} s_{μ} = \frac{d}{d m} {[ln N_{μ}^{c}]}_{m_{\max}} . \end{matrix}$ $\begin{aligned} s_{\mu }= \frac{\mathrm{d}}{\mathrm{d} m}[\ln {N^\mathrm{c}_\mu }]_{m_{\rm max}}\,. \end{aligned}$ (B.1)

Usually, changes in flux and solid angle are concatenated through the coefficient¹⁸α = 2.5s_μ − 1 .

Evaluating the αs is in practice very challenging, since we are rarely considering only one magnitude cut as in Eq. (B.1), and it requires dedicated works (e.g. cf. the case of two magnitude cuts in Lepori et al. 2025), but is necessary for weak lensing (Hildebrandt 2016; Elvin-Poole et al. 2023).

The spectroscopic sample is magnified by the matter traced by low-z photometric galaxies, and the matter traced by spectroscopic galaxies is magnifying high-z photometric galaxies. Neglecting RSD, using the conventions from Krause et al. (2021) and the Limber approximation, we have

$\begin{matrix} w_{sp}^{full} (θ) & = \sum_{i j \in {DD, D μ, μ D}} c_{s}^{i} c_{p}^{j} \int d χ \frac{W_{i}^{s} W_{j}^{p}}{χ^{2}} \sum_{ℓ} \frac{2 ℓ + 1}{4 π} L_{ℓ} (cos θ) P_{m} (k (χ, ℓ), z (χ)), \end{matrix}$ $\begin{aligned} { w}_{\rm sp}^\mathrm{full}(\theta )&=\sum _{ij\in \{\mathrm{DD},\,\mathrm{D}\mu ,\, \mu \mathrm{D}\}}c^i_{\rm s}\,c^j_{\rm p}\int \mathrm{d} \chi \; \frac{\mathcal{W} _i^\mathrm{s}\mathcal{W} _j^\mathrm{p}}{\chi ^2}\,\sum _\ell \frac{2\ell +1}{4\pi }\mathcal{L} _\ell (\cos \theta )\,P_{\rm m}\left(k(\chi ,\ell ),\,z(\chi )\right)\,, \end{aligned}$ (B.2)

where $c_{a}^{D} = b_{a}$ $c_{a}^{\mathrm{D}}=b_{a}$ is the galaxy bias, c_a^μ = α_a is the magnification coefficient, ℒ_ℓ is the Legendre polynomials of order ℓ, P_m is the matter power spectrum, $k (χ, ℓ) = \frac{ℓ + 0.5}{χ}$ $k(\chi,\ell) = \frac{\ell+0.5}{\chi}$ , and

$\begin{matrix} W_{D}^{a} = n_{a} (z) \frac{d z}{d χ} and W_{μ}^{a} = \frac{3 Ω_{m} H_{0}^{2}}{2 c^{2}} \int_{χ}^{\infty} d χ^{'} n_{a} (χ^{'}) \frac{χ}{a (χ)} \frac{χ^{'} - χ}{χ^{'}} . \end{matrix}$ $\begin{aligned}&\mathcal{W} _{\rm D}^a=n_a(z)\frac{\mathrm{d} z}{\mathrm{d}\chi } \text{ and} \mathcal{W} _\mu ^a=\frac{3\, \Omega _{\rm m}\,H_0^2}{2\,c^2}\int _\chi ^{\infty }\mathrm{d} \chi ^\prime \;n_a(\chi ^{\prime })\frac{\chi }{a(\chi )}\frac{\chi ^{\prime }-\chi }{\chi ^{\prime }}\,. \end{aligned}$ (B.3)

These expressions are simplified if the spectroscopic sample is localised in a small redshift slice (Δz small enough to neglect redshift variations) centred in z_s. In that case, we can model the distribution of the photometric sample by a step function

$\begin{matrix} n_{p} (z) = \sum_{i} n_{p} (z_{i}) H (Δ z / 2 - | z - z_{i} |), \end{matrix}$ $\begin{aligned} n_{\rm p}(z) = \sum _i n_{\rm p}(z_i)\; \mathrm{H}( \Delta z/2-\vert z-z_i\vert )\,, \end{aligned}$ (B.4)

where H is the Heaviside function, equal to unity for Δz/2 − |z − z_i|> 0 and 0 otherwise. We then have

$\begin{matrix} w_{sp}^{D D} (θ) \approx b_{s} (z_{s}) b_{p} (z_{s}) n_{p} (z_{s}) ξ_{m} (χ_{s}), \end{matrix}$ $\begin{aligned}&{ w}_{\rm sp}^{\mathrm{D}\mathrm{D}}(\theta ) \approx b_{\rm s}(z_{\rm s})\,b_{\rm p}(z_{\rm s})\, n_{\rm p}(z_{\rm s})\,\xi _{\rm m}(\chi _{\rm s})\,,\end{aligned}$ (B.5)

$\begin{matrix} w_{sp}^{D μ} (θ) = \frac{3 Ω_{m} H_{0}}{c} b_{s} (z_{i}) n_{s} (z_{i}) \frac{H_{0} (1 + z_{i})}{H (z_{i})} ξ_{m} (z_{i}, θ) \sum_{j = i + 1, . . .} Δ z α_{p} (z_{j}) n_{p} (z_{j}) \frac{χ (z_{j}) - χ (z_{i})}{χ (z_{j})} χ (z_{i}), \end{matrix}$ $\begin{aligned}&{ w}_{\rm sp}^{\mathrm{D}\mu }(\theta ) = \frac{3\,\Omega _{\rm m}\,H_0 }{c} b_{\rm s}(z_i)\,n_{\rm s}(z_i)\,\frac{H_0\,(1+z_i)}{H(z_i)}\, \xi _{\rm m}(z_i,\,\theta )\sum _{j=i+1,...} \Delta z \,\alpha _{\rm p}(z_j)\, n_{\rm p}(z_j)\,\frac{\chi (z_j)-\chi (z_i)}{\chi (z_j)}\,\chi (z_i)\,,\end{aligned}$ (B.6)

$\begin{matrix} w_{sp}^{μ D} (θ) = \frac{3 Ω_{m} H_{0}}{c} \sum_{j = 0, . . i - 1} Δ z b_{p} (z_{j}) n_{p} (z_{j}) \frac{H_{0} (1 + z_{j})}{H (z_{j})} ξ_{m} (z_{j}, θ) α_{s} (z_{j}) \frac{χ (z_{i}) - χ (z_{j})}{χ (z_{i})} χ (z_{j}) . \end{matrix}$ $\begin{aligned}&{ w}_{\rm sp}^{\mu \mathrm{D}}(\theta ) = \frac{3\,\Omega _{\rm m}\,H_0 }{c}\sum _{j = 0,..i-1} \Delta z\, b_{\rm p}(z_j)\,n_{\rm p}(z_j)\,\frac{H_0\,(1+z_j)}{H(z_j)}\, \xi _{\rm m}(z_j,\,\theta )\,\alpha _{\rm s}(z_j)\, \frac{\chi (z_i)-\chi (z_j)}{\chi (z_i)}\,\chi (z_j)\,. \end{aligned}$ (B.7)

We can rewrite our magnification term as

$\begin{matrix} \begin{matrix} M (θ) = α_{s} (z_{s}) \sum_{j =, . . ., i - 1} & b_{p} (z_{j}) n_{p} (z_{j}) D (z_{j}, z_{i}, θ) + b_{s} (z_{i}) \sum_{j = i + 1, . . .} α_{p} (z_{j}) n_{p} (z_{j}) D (z_{i}, z_{j}, θ), \end{matrix} \end{matrix}$ $\begin{aligned} \begin{aligned} M(\theta ) = \alpha _{\rm s}(z_{\rm s})\sum _{j=,...,i-1} \,\,&{b_{\rm p}}(z_j)\,n_{\mathrm{p}}(z_{j})\,D(z_j,\,z_{ i},\,\theta )+{b_{\rm s}} (z_i)\sum _{j=i+1,...}\,\alpha _{\mathrm{p}}(z_j)\,n_{\mathrm{p}}(z_j)\,D(z_i,\,z_j,\,\theta )\,, \end{aligned} \end{aligned}$ (B.8)

with

$\begin{matrix} D (z_{k}, z_{l}, θ) = \frac{3 H_{0} Ω_{m}}{c} Δ z \frac{H_{0}}{H (z_{k})} (1 + z_{k}) ξ_{m} (z_{k}, θ) \frac{χ (z_{l}) - χ (z_{k})}{χ (z_{l})} χ (z_{k}) . \end{matrix}$ $\begin{aligned} D(z_k,\,z_l,\,\theta ) = \frac{3\,H_0\,\Omega _{\rm m}}{c}\,\Delta z\,\frac{H_0}{H(z_k)}\,(1+z_k)\;\xi _{\rm m}( z_k,\,\theta )\,\frac{\chi (z_l)-\chi (z_k)}{\chi (z_l)}\,\chi (z_k) \,. \end{aligned}$ (B.9)

Our final expression contains several differences with Gatti et al. (2022). We are taking into account the redshift variation of spectroscopic and photometric biases, and α. Our conventions and the sum indices are slightly different (we try to be more explicit).

We illustrate the impact of magnification on angular correlation in Fig. B.1 using toy models. Two tomographic bins are introduced, with Gaussian redshift distributions centred at z = 0.5 with widths σ = 0.08 and σ = 0.16, respectively. The clustering and the two magnification terms are evaluated using spectroscopic samples divided into 20 redshift slices with Δz = 0.05. We adopt b_s = b_p = 1, α_s = 1, and α_p = −1.

Fig. B.1.

Top panel: The redshift distributions of the photometric sample (blue) and five out of the 20 spectroscopic samples (red) are shown. The photometric distribution on the left follows a Gaussian with parameters (μ, σ) = (0.5, 0.08), while on the right, it follows (μ, σ) = (0.5, 0.16). Bottom panel: The angular correlation is displayed for clustering only (green), magnification only (red and purple), and their sum (orange). The values of the coefficients b and α are provided in the legend. Magnification impacts the shape of the angular correlation (sum vs. clustering only) and may influence the mean redshift, particularly for larger tomographic bins (right vs. left).

A noticeable difference is observed between clustering alone and the sum of the three effects, indicating that magnification is not entirely negligible. Since different α values are used for the spectroscopic and photometric samples, the combined effect appears to be shifted towards lower redshifts, potentially introducing a bias in the mean redshift. Finally, the impact is very limited for the narrower tomographic bin (left) but becomes more pronounced for the wider bin (right).

This highlights the importance of accounting for magnification effects when using broad tomographic bins. With UNION and LSST photometry, our bins are projected to be narrower and more localised than DES ones, and thus magnification may be negligible for Euclid clustering redshifts, even if it was not for DES.

Appendix C: Pair counts

The pair-count between two samples is theoretically given by

$\begin{matrix} D_{a} D_{b} (r_{p}) & = \int \int {d z}_{1} {d z}_{2} {⟨ N_{a} N_{b} ⟩}_{r_{p}} = \int \int {d z}_{1} {d z}_{2} N_{a} (z_{1}) N_{b} (z_{2}) {⟨ (1 + δ_{a}) (1 + δ_{b}) ⟩}_{r_{p}}^{data}, \end{matrix}$ $\begin{aligned} \mathrm{D}_a\mathrm{D}_b({r_{\rm p}})&=\int \int {\mathrm{d} z}_1{\mathrm{d} z}_2\, \langle N_a N_b \rangle _{{r_{\rm p}} }=\int \int {\mathrm{d} z}_1{\mathrm{d} z}_2\,N_a(z_1)\,N_b(z_2)\,\langle (1+\delta _a)(1+\delta _b)\rangle _{{r_{\rm p}} }^\mathrm{data}\,, \end{aligned}$ (C.1)

where N(z) = N_tot n(z) is the number galaxy distribution. We use galaxy weights to correct selection effects, anisotropies due to observation conditions or background etc.

C.1. From continuous to discrete

In practice, the pair count is evaluated for a certain distance range r_p ± Δr_p. It means that for a galaxy of sample a, we are counting all the sample b galaxies within an annulus with radius r_pⁱ and width Δr_pⁱ. For instance, we use logarithmic binning, for which Δr_pⁱ ∝ r_pⁱ. Thus, the correlation (including galaxy weights) of Eq. (C.1) is related to the theoretical one through,

$\begin{matrix} {⟨ (1 + δ_{a}) (1 + δ_{b}) ⟩}_{r_{p}^{i}}^{data} = \int_{0}^{2 π} d θ \int_{r_{p}^{i} \pm Δ r_{p}^{i} / 2} d r_{p} {⟨ (1 + δ_{a}) (1 + δ_{b}) ⟩}_{r_{p}, θ} \approx 2 π r_{p}^{i} Δ r_{p}^{i} {⟨ (1 + δ_{a}) (1 + δ_{b}) ⟩}_{r_{p}^{i}} . \end{matrix}$ $\begin{aligned} \langle (1+\delta _a)(1+\delta _b)\rangle _{r_{\rm p}^i}^\mathrm{data}= \int _0^{2\pi }\mathrm{d} \theta \, \int _{r_{\rm p}^i\pm \Delta r_{\rm p}^i/2}\mathrm{d} r_{\rm p}\langle (1+\delta _a)(1+\delta _b)\rangle _{r_{\rm p},\, \theta } \approx 2\pi \, r_{\rm p}^i\,\Delta r_{\rm p}^i\,\langle (1+\delta _a)(1+\delta _b)\rangle _{r_{\rm p}^i}\,. \end{aligned}$ (C.2)

C.2. Mask and estimator

There is an additional geometrical effect to Eq. (C.2) which is the mask: we do not observe the full sky. The annuli are not perfect and there is a geometrical correction

$\begin{matrix} {⟨ (1 + δ_{a}) (1 + δ_{b}) ⟩}_{r_{p}^{i}}^{data} \approx 2 π r_{p}^{i} Δ r_{p}^{i} η_{mask} (r_{p}) {⟨ (1 + δ_{a}) (1 + δ_{b}) ⟩}_{r_{p}^{i}} . \end{matrix}$ $\begin{aligned} \langle (1+\delta _a)(1+\delta _b)\rangle _{r_{\rm p}^i}^\mathrm{data}\approx 2\pi \, r_{\rm p}^i\,\Delta r_{\rm p}^i\,\eta _{\rm mask}({r_{\rm p}})\,\langle (1+\delta _a)(1+\delta _b)\rangle _{r_{\rm p}^i}\,. \end{aligned}$ (C.3)

To correct it, we compare the pair count of the correlated samples, with the ones of uncorrelated samples, submitted to the same mask, the so-called “randoms”,

$\begin{matrix} R_{a} R_{b} (r_{p}) = N_{a}^{rand} N_{b}^{rand} 2 π r_{p}^{i} Δ r_{p}^{i} η_{mask} (r_{p}) . \end{matrix}$ $\begin{aligned} \mathrm{R}_a\mathrm{R}_b({r_{\rm p}}) = N_{a}^\mathrm{rand}N_{b}^\mathrm{rand}\,2\pi \, r_{\rm p}^i\,\Delta r_{\rm p}^i\,\eta _{\rm mask}({r_{\rm p}})\,. \end{aligned}$ (C.4)

Comparing different pair counts, we always correct the amplitude, which scales as the number of objects. We usually use 10 times more randoms than objects, so we multiply DD/RR by 100. This step is done by default in Treecorr.

Computing random-galaxy pair counts, DR [which scales as 2π r_pⁱ Δr_pⁱ η_mask(r_p) as well] is optimal for variance reduction (Landy & Szalay 1993). That is why our fiducial analysis involves the full Landy–Szalay estimator,

$\begin{matrix} w_{ab}^{data} (r_{p}) & = \frac{D_{a} D_{b} - R_{a} D_{b} - D_{a} R_{b} + R_{a} R_{b}}{R_{a} R_{b}} (r_{p}) = {⟨ (1 + δ_{a}) (1 + δ_{b}) - (1 + δ_{a}) - (1 + δ_{b}) + 1 ⟩}_{r_{p}^{i}} = {⟨ δ_{a} δ_{b} ⟩}_{r_{p}^{i}}, \end{matrix}$ $\begin{aligned} { w}_{ab}^\mathrm{data}({r_{\rm p}} )&=\frac{\mathrm{D}_a\mathrm{D}_b-\mathrm{R}_a\mathrm{D}_b-\mathrm{D}_a\mathrm{R}_b+\mathrm{R}_a\mathrm{R}_b}{\mathrm{R}_a\mathrm{R}_b}({r_{\rm p}}) = \langle (1+\delta _a)(1+\delta _b)-(1+\delta _a)-(1+\delta _b)+1\rangle _{r_{\rm p}^i}=\langle \delta _a\,\delta _b\rangle _{r_{\rm p}^i}\,, \end{aligned}$ (C.5)

where we omitted the galaxy number normalisation factors introduced below. A common choice for clustering-redshifts is to use the Davis and Peebles estimator

$\begin{matrix} w_{ab} (r_{p}) \approx \frac{D_{a} D_{b} - R_{a} D_{b}}{R_{a} D_{b}} (r_{p}), \end{matrix}$ $\begin{aligned} { w}_{ab}({r_{\rm p}} )\approx \frac{\mathrm{D}_a\mathrm{D}_b-\mathrm{R}_a\mathrm{D}_b}{\mathrm{R}_a\mathrm{D}_b}({r_{\rm p}})\,, \end{aligned}$ (C.6)

mainly because there is no need for a random catalogue for the second sample (Davis & Peebles 1983; Gatti et al. 2018; van den Busch et al. 2020). For Euclid, a random catalogue would be created for the different analyses, so there is no motivation for using Eq. (C.6) rather than Eq. (C.5).

Appendix D: Systematic errors and bias correction

In this appendix, we provide additional tables to Sects. 4.7 and 4.8. In Table D.1, we report the values of the deviations on the mean redshift and shape due to the systematic effects. The values are the means over the five sky patches, and correspond to Δz = 0.05 and the M5 bias correction. We found similar results with other bias corrections. Finally in Table D.2, we report the values of the deviations on the mean redshift and standard deviation associated with the bias correction, in the absence of systematic effects.

Table D.1.

Mean redshift and standard deviation bias for the different systematic effects (including none, and all) are reported; Req. refers to the requirement that needs to be fulfilled.

Table D.2.

Mean redshift and standard deviation for the different bias corrections M0–M5, for SSM and SGP fitting, without systematics (independent case in Sect. 4.7); Req. refers to the requirement to fulfil.

Appendix E: Mis-modelling of clustering-redshifts with the one-bin approximation

In Sect. 3.2.3, we introduced an improvement to the modelling to expand beyond this one-bin approximation, which in the context of M5 bias correction is,

$\begin{matrix} \frac{w_{sp}}{Δ z \sqrt{w_{ss} w_{pp}}} & (z_{i}) \approx n_{b} (z_{i}) \times (1 + \frac{n_{b} (z_{i} + Δ z)}{n_{b} (z_{i})} η_{+} (z_{i}) + \frac{n_{b} (z_{i} - Δ z)}{n_{b} (z_{i})} η_{-} (z_{i})) . \end{matrix}$ $\begin{aligned} \frac{{ w}_{\rm sp}}{\Delta z\sqrt{{ w}_{\rm ss}\, { w}_{\rm pp}}}&(z_i)\approx n_b(z_i)\times \left( 1+\frac{n_b(z_i+\Delta z)}{n_b(z_i)}\,\eta _+(z_i)+\frac{n_b(z_i-\Delta z)}{n_b(z_i)}\,\eta _-(z_i) \right)\,.\nonumber \end{aligned}$

Since η_± > 0, we expect to measure n(z) with a norm higher than unity. The global offset should be approximately

$\begin{matrix} \sum_{z_{i}} \frac{w_{sp}}{Δ z \sqrt{w_{ss} w_{pp}}} (z_{i}) \approx 1 + η_{+} (⟨ z ⟩) + η_{-} (⟨ z ⟩) . \end{matrix}$ $\begin{aligned} \sum _{z_i}\frac{{ w}_{\rm sp}}{\Delta z\sqrt{{ w}_{\rm ss}\, { w}_{\rm pp}}}(z_i)\approx 1+\eta _+(\langle z \rangle )+\eta _-(\langle z \rangle )\,. \end{aligned}$ (E.1)

In Fig. E.1 we show the ratio of the measured n_p(z_i) to the true distribution, for three slicing Δz, and the five sky patches. We also report the offset predicted from Eq. (E.1) for each Δz. We observe consistency between the points and the rough prediction. Concatenating the measurements from the different redshift slices, we can express the correlation as

Fig. E.1.

Deviation of the measured n_p(z_i) to the true distribution, for different slicing Δz (points in colours), and the global offset predicted by our model, which includes correlation at the edge of the slices (solid lines in colours). The points correspond to the measurements for the five patches, hence the difference in redshift between them is not Δz.

$\begin{matrix} w_{sp} (θ) = B_{s} B_{p} Λ n, \end{matrix}$ $\begin{aligned} {w}_{\rm sp}(\theta ) = B_{\rm s}\,B_{\rm p}\,\Lambda \, {n}\,, \end{aligned}$ (E.2)

where ⃗n = {n(z_i)}_{i = 1, ..,n}, B_x are diagonal matrices representing the galaxy biases, and Λ is a tridiagonal matrix, easily invertible, with unity on its diagonal and smaller terms (corrections) on the first off-diagonals. We attempted to implement Eq. (E.2) for the inference of n_p(z) by replacing the one-bin modelling with three-bins. We did not observe a significant improvement in the bias on the mean redshift compared to the standard renormalisation procedure, however. In Fig. E.1, we note that points from different patches at similar redshifts (visible as groups of 5 points corresponding to the 5 patches in the high-z bin) are spread over a wide range, despite the true distributions being very similar across the patches. This behaviour could be attributed to substantial sample variance, which might explain why three-bins perform similarly to one-bin. Additionally, we observe that three-bins still exhibit significant deviations in Fig. 2, highlighting the challenges of improving one-bin modelling despite our efforts. We note that in Fig. 14, the measurements for 0.75 < z < 0.85 are systematically under the true n(z) for Δz = 0.02, 0.05, which might cause the positive shift in mean redshift. This under-correlation is also visible in Fig. E.1. We did not identify its cause.

All Tables

Table D.1.

Mean redshift and standard deviation bias for the different systematic effects (including none, and all) are reported; Req. refers to the requirement that needs to be fulfilled.

In the text

Table D.2.

Mean redshift and standard deviation for the different bias corrections M0–M5, for SSM and SGP fitting, without systematics (independent case in Sect. 4.7); Req. refers to the requirement to fulfil.

In the text

All Figures

	Fig. 1. Top: Simulated true-redshift distribution of each of the first ten tomographic bins (z_photo < 1.6). Bottom: Simulated redshift distribution of the spectroscopic samples from the BOSS, DESI, and Euclid surveys. The lack of spectroscopic samples for z_spec > 1.8 imposes the z_photo < 1.6 condition for clustering-redshifts calibration.
In the text

	Fig. 2. Top: Redshift distribution of sample b in blue, and four out of the ten samples a_i in red. Bottom: Deviation of approximated w_{a_ib} with respect to the exact one for the ten cross-correlations.
In the text

	Fig. 3. Galaxy distribution observed through different Δz averaging as Eq. (70). The redshift convolution modifies the shape of the distribution for larger slicing compared to the real slicing as estimated with a very thin slicing (grey). For this case, all slicings Δz ≤ 0.06 produce a good estimate.
In the text

	Fig. 4. Left: n(z) distribution measurement, with 50 associated GP realisations (orange), and the true distribution (red). Middle: SNR of the GP realisation (orange), and their convolution (blue). The SNR threshold ( = 3) is reported with the dashed line. Right: Associated SGP realisations, renormalised to unity (blue), compared to the true distribution (red).
In the text

	Fig. 5. Modelling the cross-correlation between spectroscopic samples in redshift slices and photometric bins, and reference to the corresponding tests we performed to optimise the pipeline.
In the text

Fig. 6.

Auto-correlations of two spectroscopic samples for a single redshift bin (Δz = 0.05). The yellow and red points are obtained using the first estimator $w_{ss}^{(1)}$ $\mathit{w}_{\mathrm{ss}}^{(1)}$ with γ ∈ { − 1, + 1}. The blue and purple points are obtained using the second one $w_{ss}^{(2)}$ $\mathit{w}_{\mathrm{ss}}^{(2)}$ with γ ∈ { − 3, − 1}. The consistency of the points with different weights illustrates the effective weighting due to the estimator definition.

In the text

	Fig. 7. Variation in the Pearson coefficients r_ab for Flagship LRGs with ELGs. This coefficient informs on the small-scale under-correlation of blue and red galaxies.
In the text

	Fig. 8. Pearson coefficient for our three simulated bins: low-z, mid-z, and high-z. This term is assumed to be one for clustering-redshifts calibration. We see different behaviours, r = 1, r < 1, and r > 1 for different scales and redshifts, except for the high-z bin, for which r = 1 (almost) everywhere.
In the text

	Fig. 9. RMS of Δn, SNR of n, and reduced χ² of Δn (top to bottom) as a function of the projected scale r_p for the three redshift bins (left to right) for three bias corrections (colours). The fine lines show five independent realisations of 1000 deg², and the thick lines show their means.
In the text

	Fig. 10. SNR and RMS for the three tomographic bins (left to right), with three power laws as weighting functions (colours), two weighting schemes, and two bias-correction methods (M1 and M5).
In the text

	Fig. 11. Mean redshift and shape reconstruction with SGP, for the three tomographic bins. The different markers and colours are associated with different slicing Δz. The sandy region corresponds to the Euclid requirements, and the diagonal line shows a coherent deviation of 1σ.
In the text

	Fig. 12. Deviations on the mean-z (top) and standard deviation (bottom), with SGP fitting, for the three tomographic bins (left, middle, and right), with three Δz slicings (colours) and different implemented systematic effects (marker). The shadowed bands correspond to the Euclid requirement on both parameters. The dark and light error bars indicate 68.3% and 95.5% uncertainties
In the text

	Fig. 13. Same as Fig. 12 for the two parameters of the SSM model, which are equivalent to ⟨z⟩−⟨z⟩_true and σ_z/σ_ztrue − 1.
In the text

	Fig. 14. True-redshift distributions of tomographic bins in black, and the clustering-redshifts measurements with M5 for three spectroscopic slicings: Δz = 0.02, 0.05, 0.08 (orange, red, and purple).
In the text

	Fig. 15. True w_pp for Δz = 0.05 in grey and measurement with corrections factors including M3 as dashed green, blue, and orange lines for photo-z binning Δz = 0.1. The best fits from degree-3 polynomials are reported with solid lines. The w_pp used for M3 is the best-fit model of the full correction (solid red line).
In the text

	Fig. 16. Same as Fig. 15, but for M4, with a smaller photo-z binning Δz = 0.02. The w_pp used for M4 is the best-fit model of the full correction (solid red line).
In the text

	Fig. 17. Deviations with the SSM fitting on the mean redshift (upper panel) and standard deviation (lower panel) for the six methods M0–M5 in colour, for five sky patches (same colour). The requirements are represented by the shaded area. The dark and light error bars indicate 68.3% and 95.5% uncertainties.
In the text

	Fig. 18. Same as Fig. 17 with the SGP fitting.
In the text

	Fig. 19. Uncertainties on mean-z with SSM fitting for five sky realisations in red. We also plot the best-fit uncertainty model in blue.
In the text

Fig. 20.

True-redshift distribution of the simulated Flagship2 ten tomographic bins with z_p < 1.6 (solid lines) and their clustering-redshifts measurements with our pipeline, including all systematics and realistic bias correction M3. The first two bin measurements are realised with 2500 deg² sky patches, and the other eight are realised with 1000 deg² sky patches. The dark and light error bars indicate 68.3% and 95.5% uncertainties, but are barely visible by eye.

In the text

Fig. 21.

Deviations in the mean redshift (top) and the redshift standard deviation (bottom) for the ten Flagship2 tomographic bins relative to the true-redshift distributions using the SSM method. The deviations are shown for each bin and sky patch. We consider patches of 2500 deg² for the first two bins, and 1000 deg² for the others. The Euclid requirements on the uncertainty are highlighted by the shaded area centred around zero. The uncertainties are consistent across all the bins for the mean redshift and we achieve a clear improvement from previous work shown in red. For the redshift standard deviations, the uncertainties are slightly underestimated, but this is within an acceptable range given the requirements.

In the text

Fig. 22.

Deviations in the mean redshift (top) and the redshift standard deviation (bottom) using the SGP method for the same bin configuration as in Fig. 21. The Euclid requirements on the uncertainty are highlighted by the shaded area centred around zero. For some bins (particularly bins 1, 3, and 7), we observe slightly larger deviations and uncertainties than permitted on the mean redshift. However, a clear improvement compared to previous work (shown in red) is still evident. The redshift standard deviations are unbiased and fall within the required range (shaded sandy area).

In the text

Fig. B.1.

Top panel: The redshift distributions of the photometric sample (blue) and five out of the 20 spectroscopic samples (red) are shown. The photometric distribution on the left follows a Gaussian with parameters (μ, σ) = (0.5, 0.08), while on the right, it follows (μ, σ) = (0.5, 0.16). Bottom panel: The angular correlation is displayed for clustering only (green), magnification only (red and purple), and their sum (orange). The values of the coefficients b and α are provided in the legend. Magnification impacts the shape of the angular correlation (sum vs. clustering only) and may influence the mean redshift, particularly for larger tomographic bins (right vs. left).

In the text

	Fig. E.1. Deviation of the measured n_p(z_i) to the true distribution, for different slicing Δz (points in colours), and the global offset predicted by our model, which includes correlation at the edge of the slices (solid lines in colours). The points correspond to the measurements for the five patches, hence the difference in redshift between them is not Δz.
In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] Adame, A. G., Aguilar, J., Ahlen, S., et al. 2024, AJ, 167, 62 [NASA ADS] [CrossRef] [Google Scholar]

[2] Addison, G. E., Bennett, C. L., Jeong, D., Komatsu, E., & Weiland, J. L. 2019, ApJ, 879, 15 [NASA ADS] [CrossRef] [Google Scholar]

[3] Ambikasaran, S., Foreman-Mackey, D., Greengard, L., Hogg, D. W., & O’Neil, M. 2015, IEEE Trans. Pattern Anal. Mach. Intell., 38, 252 [Google Scholar]

[4] Baleato Lizancos, A., & White, M. 2023, JCAP, 07, 044 [Google Scholar]

[5] Behroozi, P. S., Wechsler, R. H., & Wu, H.-Y. 2013, ApJ, 762, 109 [NASA ADS] [CrossRef] [Google Scholar]

[6] Blas, D., Lesgourgues, J., & Tram, T. 2011, JCAP, 07, 034 [CrossRef] [Google Scholar]

[7] Bordoloi, R., Lilly, S. J., & Amara, A. 2010, MNRAS, 406, 881 [NASA ADS] [Google Scholar]

[8] Campos, A., Yin, B., Dodelson, S., et al. 2024, ArXiv e-prints [arXiv:2408.00922] [Google Scholar]

[9] Carretero, J., Tallada, P., Casals, J., et al. 2017, in Proceedings of the EuropeanPhysical Society Conference on High Energy Physics. 5-12 July, 488 [Google Scholar]

[10] Cawthon, R., Elvin-Poole, J., Porredon, A., et al. 2022, MNRAS, 513, 5517 [NASA ADS] [CrossRef] [Google Scholar]

[11] Chaves-Montero, J., Angulo, R. E., & Contreras, S. 2023, MNRAS, 521, 937 [Google Scholar]

[12] Chisari, N. E., Alonso, D., Krause, E., et al. 2019, ApJS, 242, 2 [Google Scholar]

[13] Contarini, S., Verza, G., Pisani, A., et al. 2022, A&A, 667, A162 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[14] Davis, M., & Peebles, P. J. E. 1983, ApJ, 267, 465 [Google Scholar]

[15] Davis, C., Rozo, E., Roodman, A., et al. 2018, MNRAS, 477, 2196 [Google Scholar]

[16] Dawson, K. S., Schlegel, D. J., Ahn, C. P., et al. 2013, AJ, 145, 10 [Google Scholar]

[17] DESI Collaboration (Aghamousa, A., et al.) 2016, ArXiv e-prints [arXiv:1611.00037] [Google Scholar]

[18] Elvin-Poole, J., MacCrann, N., Everett, S., et al. 2023, MNRAS, 523, 3649 [NASA ADS] [CrossRef] [Google Scholar]

[19] Eriksen, M., Alarcon, A., Cabayol, L., et al. 2020, MNRAS, 497, 4565 [CrossRef] [Google Scholar]

[20] Euclid Collaboration (Blanchard, A., et al.) 2020, A&A, 642, A191 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[21] Euclid Collaboration (Desprez, G., et al.) 2020, A&A, 644, A31 [EDP Sciences] [Google Scholar]

[22] Euclid Collaboration (Ilbert, O., et al.) 2021, A&A, 647, A117 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[23] Euclid Collaboration (Pocino, A., et al.) 2021, A&A, 655, A44 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[24] Euclid Collaboration (Castander, F., et al.) 2025, A&A, 697, A5 [Google Scholar]

[25] Euclid Collaboration (Cropper, M., et al.) 2025, A&A, 697, A2 [Google Scholar]

[26] Euclid Collaboration (Jahnke, K., et al.) 2025, A&A, 697, A3 [Google Scholar]

[27] Euclid Collaboration (Le Brun, V., et al.) 2025, A&A, submitted [Google Scholar]

[28] Euclid Collaboration (Mellier, Y., et al.) 2025, A&A, 697, A1 [Google Scholar]

[29] Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306 [Google Scholar]

[30] Gao, H., Jing, Y. P., Xu, K., et al. 2024, ApJ, 961, 74 [Google Scholar]

[31] Gatti, M., Vielzeuf, P., Davis, C., et al. 2018, MNRAS, 477, 1664 [Google Scholar]

[32] Gatti, M., Giannini, G., Bernstein, G. M., et al. 2022, MNRAS, 510, 1223 [Google Scholar]

[33] Giannini, G., Alarcon, A., Gatti, M., et al. 2024, MNRAS, 527, 2010 [Google Scholar]

[34] Gwyn, S., McConnachie, A. W., Cuillandre, J. C., et al. 2025, ArXiv e-prints [arXiv:2503.13783] [Google Scholar]

[35] Hartlap, J., Simon, P., & Schneider, P. 2007, A&A, 464, 399 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[36] Hartley, W. G., Chang, C., Samani, S., et al. 2020, MNRAS, 496, 4769 [Google Scholar]

[37] Hildebrandt, H. 2016, MNRAS, 455, 3943 [NASA ADS] [CrossRef] [Google Scholar]

[38] Hildebrandt, H., van den Busch, J. L., Wright, A. H., et al. 2021, A&A, 647, A124 [EDP Sciences] [Google Scholar]

[39] Huterer, D., Takada, M., Bernstein, G., & Jain, B. 2006, MNRAS, 366, 101 [Google Scholar]

[40] Jarvis, M. 2015, Astrophysics Source Code Library [record ascl:1508.007] [Google Scholar]

[41] Johnson, A., Blake, C., Amon, A., et al. 2017, MNRAS, 465, 4118 [Google Scholar]

[42] Johnston, H., Elisa Chisari, N., Joudaki, S., et al. 2025, A&A, 699, A127 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[43] Krause, E., Fang, X., Pandey, S., et al. 2021, ArXiv e-prints [arXiv:2105.13548] [Google Scholar]

[44] Landy, S. D., & Szalay, A. S. 1993, ApJ, 412, 64 [Google Scholar]

[45] Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, ArXiv e-prints [arXiv:1110.3193] [Google Scholar]

[46] Lepori, F., Schulz, S., Tutusaus, I., et al. 2025, A&A, 694, A321 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[47] Pandey, S., Sánchez, C., Jain, B., & LSST Dark Energy Science Collaboration 2025, Phys. Rev. D, 111, 023527 [Google Scholar]

[48] LSST Science Collaboration (Abell, P. A., et al.) 2009, ArXiv e-prints [arXiv:0912.0201] [Google Scholar]

[49] McQuinn, M., & White, M. 2013, MNRAS, 433, 2857 [Google Scholar]

[50] Ménard, B., Scranton, R., Schmidt, S., et al. 2013, ArXiv e-prints [arXiv:1303.4722] [Google Scholar]

[51] Morrison, C. B., Hildebrandt, H., Schmidt, S. J., et al. 2017, MNRAS, 467, 3576 [Google Scholar]

[52] Myles, J., Alarcon, A., Amon, A., et al. 2021, MNRAS, 505, 4249 [NASA ADS] [CrossRef] [Google Scholar]

[53] Naidoo, K., Johnston, H., Joachimi, B., et al. 2023, A&A, 670, A149 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[54] Newman, J. A. 2008, ApJ, 684, 88 [Google Scholar]

[55] Nicola, A., Hadzhiyska, B., Findlay, N., et al. 2024, JCAP, 02, 015 [Google Scholar]

[56] Palanque-Delabrouille, N., Magneville, C., Yèche, C., et al. 2016, A&A, 587, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[57] Pandey, S., Krause, E., Jain, B., et al. 2020, Phys. Rev. D, 102, 123522 [CrossRef] [Google Scholar]

[58] Percival, W. J., Friedrich, O., Sellentin, E., & Heavens, A. 2022, MNRAS, 510, 3207 [NASA ADS] [CrossRef] [Google Scholar]

[59] Rau, M. M., Dalal, R., Zhang, T., et al. 2023, MNRAS, 524, 5109 [NASA ADS] [CrossRef] [Google Scholar]

[60] Reischke, R. 2024, MNRAS, 530, 4412 [NASA ADS] [CrossRef] [Google Scholar]

[61] Rocher, A., Ruhlmann-Kleider, V., Burtin, E., et al. 2023, JCAP, 10, 016 [Google Scholar]

[62] Roster, W., Wright, A. H., Hildebrandt, H., et al. 2025, ApJ submitted [arXiv:2508.02779] [Google Scholar]

[63] Salvato, M., Ilbert, O., & Hoyle, B. 2019, Nat. Astron., 3, 212 [NASA ADS] [CrossRef] [Google Scholar]

[64] Schmidt, S. J., Ménard, B., Scranton, R., Morrison, C., & McBride, C. K. 2013, MNRAS, 431, 3307 [Google Scholar]

[65] Scottez, V., Benoit-Lévy, A., Coupon, J., Ilbert, O., & Mellier, Y. 2018, MNRAS, 474, 3921 [Google Scholar]

[66] Simon, P. 2007, A&A, 473, 711 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[67] Stölzner, B., Joachimi, B., Korn, A., Hildebrandt, H., & Wright, A. H. 2021, A&A, 650, A148 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[68] Swanson, M. E. C., Tegmark, M., Blanton, M., & Zehavi, I. 2008, MNRAS, 385, 1635 [Google Scholar]

[69] Takahashi, R., Sato, M., Nishimichi, T., Taruya, A., & Oguri, M. 2012, ApJ, 761, 152 [Google Scholar]

[70] Tallada, P., Carretero, J., Casals, J., et al. 2020, Astron. Comput., 32, 100391 [Google Scholar]

[71] Tanaka, M., Coupon, J., Hsieh, B.-C., et al. 2018, PASJ, 70, S9 [Google Scholar]

[72] van den Busch, J. L., Hildebrandt, H., Wright, A. H., et al. 2020, A&A, 642, A200 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[73] Verdier, A., Rocher, A., Bandi, B., et al. 2025, ArXiv e-prints [arXiv:2508.07311] [Google Scholar]

[74] Wright, A. H., Hildebrandt, H., van den Busch, J. L., & Heymans, C. 2020a, A&A, 637, A100 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[75] Wright, A. H., Hildebrandt, H., van den Busch, J. L., et al. 2020b, A&A, 640, L14 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]

[76] Wright, A. H., Hildebrandt, H., van den Busch, J. L., et al. 2025, A&A, submitted [arXiv:2503.19440] [Google Scholar]

[77] Yèche, C., Palanque-Delabrouille, N., Claveau, C.-A., et al. 2020, Res. Notes Am. Astron. Soc., 4, 179 [Google Scholar]

[78] Yuan, S., Wechsler, R. H., Wang, Y., et al. 2025, MNRAS, 538, 1216 [Google Scholar]

Euclid: Photometric redshift calibration performance with the clustering-redshifts technique in the Flagship 2 simulation⋆

1. Introduction

2. Simulated data

2.1. The Flagship simulation and survey mocks

2.2. Euclid photometric sample

2.3. Euclid NISP sample

2.4. BOSS LOWZ and CMASS samples

2.5. DESI samples

3. Theory and method

3.1. Angular galaxy clustering

3.1.1. Galaxy density contrast

3.1.2. Angular two-point correlation function

3.1.3. Moving to matter statistics with galaxy bias

3.1.4. Magnification

3.1.5. RSD

3.2. Impact of the galaxy redshift distribution and its modelling

3.2.1. Ideal case of a Dirac galaxy distribution

3.2.2. Realistic case: Spectroscopic sample into small redshift slices

3.2.3. Comparison of the correlation modelling

3.3. Application to clustering redshifts

3.3.1. Correcting the photometric galaxy bias

3.3.2. More details about the M3 and M4 corrections

3.3.3. Data vectors reduced to scalars with some weighting

3.4. Estimating the correlations with pair counts

3.4.1. Estimator weighting scheme

3.4.2. Covariance matrix

3.5. Extracting the unknown distribution moments

3.5.1. clustering-redshifts requirements for cosmic shear

3.5.2. Discretisation of a distribution

3.5.3. Overall strategy

3.5.4. Shifted-stretched model

3.5.5. Suppressed Gaussian process

4. Results

4.1. Investigation strategy

4.2. Choice of the estimator weighting scheme

4.3. Testing the bias-cancellation hypothesis

4.4. Choice of the scale range

4.5. Choice of the scale weighting

4.6. Finding the optimal spectroscopic slicing

4.7. Evaluating the (mis-)modelling of the clustering, magnification, and RSD

4.8. Comparison of the different bias-correction methods

4.8.1. Measuring the photometric galaxy bias evolution

4.8.2. Comparison of the methods

4.9. Tomographic bin uncertainties

5. Application to the ten Euclid tomographic bins

6. Conclusion

Acknowledgments

References

Appendix A: Higher-order development of the galaxy bias

Appendix B: Magnification modelling

Appendix C: Pair counts

C.1. From continuous to discrete

C.2. Mask and estimator

Appendix D: Systematic errors and bias correction

Appendix E: Mis-modelling of clustering-redshifts with the one-bin approximation

All Tables

All Figures

Euclid: Photometric redshift calibration performance with the clustering-redshifts technique in the Flagship 2 simulation^⋆