Open Access
Issue
A&A
Volume 703, November 2025
Article Number A194
Number of page(s) 10
Section Numerical methods and codes
DOI https://doi.org/10.1051/0004-6361/202555224
Published online 14 November 2025

© The Authors 2025

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe to Open model. Subscribe to A&A to support open access publication.

1 Introduction

Due to the recent advances in the astronomical instrumentation installed on both ground- and space-based observatories, as well as the increased complexity of the data reduction and analysis techniques, it is evermore common for science to be based on observations acquired by large international consortia and exploited not only by the collaboration members but by the whole scientific community. There is growing awareness of the importance of open science, and public astronomical archives provide an unprecedented wealth of measurements across the entire electromagnetic spectrum. Their pivotal role in modern astronomy is emphasised in the most recent U.S. Decadal Survey for Astronomy and Astrophysics (National Academies of Sciences, Engineering, and Medicine 2023), as well as an analysis of the publication record of the European Space Agency (ESA) missions (De Marchi & Parmar 2024). In both cases, it is pointed out that the scientific publications derived from archival data surpass in number those produced by the researchers involved in the original programmes, achieving comparable impact in terms of citations.

However, the exploitation of these complex datasets often requires significant expertise in the specific characteristics of each instrument, its calibration procedures, and data reduction pipelines. Despite efforts to increase data accessibility, there remains a pressing need for tools that enable scientific analysis without requiring advanced technical skills (Goodman 2012). This is particularly relevant in the context of multi-wavelength (and multi-messenger) astronomy that involves the analysis and interpretation of extremely heterogeneous datasets.

One important first step in this direction is the definition of a universal standard for storing and distributing astronomical observations. The Flexible Image Transport System (FITS) format (Wells et al. 1981) and the representation of celestial co-ordinates (Calabretta & Greisen 2002) in the World Coordinate System (WCS; Greisen & Calabretta 2002) have been instrumental in this respect. For large data volumes, the Hierarchical Progressive Survey (HiPS) format (Fernique et al. 2015), developed by the Centre de Données astronomiques de Strasbourg (CDS) and the International Virtual Observatory Alliance (IVOA), offers a promising solution to visualise and distribute astronomical observations based on the HTTP protocol. HiPS organises full-sky data into a multi-resolution tiling scheme based on HEALPix tessellation (Górski et al. 2005). It has been adopted by major astronomical institutions such as ESA and ESO, and it is supported by the Aladin software suite (Bonnarel et al. 2000), as well as web applications such as ESASky (Giordano et al. 2018), derived from Aladin Lite (Boch & Fernique 2014).

Although the HiPS format was conceived with scientific analysis in mind, it has often been used primarily for visualisation based on compressed raster images (e.g. PNG, JPEG), which lack the fidelity and metadata necessary for scientific measurements. In this approach, researchers seeking quantitative analysis must still retrieve and interpret raw instrument-specific FITS files from archival repositories. Nevertheless, the number of datasets that are also available in the FITS format is steadily increasing, and here we advocate for the complementary approach of leveraging these skymaps for scientific analysis in addition to the interactive visualisation enabled by raster images.

Combining datasets with varying resolutions, sensitivities, and spectral coverages undoubtedly presents significant challenges, but their integration also enables new scientific insights that transcend individual datasets. Our goal is not to replace detailed studies requiring deep knowledge about a particular instrument (or set of instruments), but to complement them with accessible, reproducible, multi-wavelength analyses that may be carried out by a much broader segment of the astronomical community. To that end, we provide a set of tools to retrieve, display, and interact with HiPS datasets from local or remote sources, and to perform basic scientific manipulation (aperture photometry) on a list of targets specified by the user. In order to gauge the quality of the publicly available skymaps and test the feasibility of recovering accurate broadband photometry by means of a generic pipeline that is not optimised for any particular instrument, we used the Herschel Reference Survey (HRS; Boselli et al. 2010) of nearby galaxies as a realistic test case to benchmark our approach.

Section 2 summarises the main features of the HRS sample, the flux measurements quoted in the official catalogue, and the HiPS skymaps that are used in the present work. Section 3 describes the AstroBrowser software suite, its public interface, and its technical details, as well as the data analysis performed on the HiPS skymaps, including the retrieval of the required metadata and the automatic procedure to carry out aperture photometry, background subtraction and error estimates. A quantitative comparison between our results and the original HRS measurements is presented in Section 4, and the main implications in the broader context of the scientific exploitation of HiPS skymaps are briefly discussed in Section 5.

2 Astronomical observations

2.1 Herschel Reference Sample

The HRS is a volume-limited sample of galaxies with distances between 15 and 25 megaparsecs. This sample includes 261 late-type galaxies with a 2MASS K-band magnitude of KS, tot ≤ 12 mag, and 62 early-type galaxies with KS, tot ≤ 8.7 mag. Detailed information on the initial selection process is available in Boselli et al. (2010), with updated morphological classifications and distance estimates provided by Cortese et al. (2012). Aperture photometry in different bands across the electromagnetic spectrum has been published by Cortese et al. (2012, 2014); Ciesla et al. (2012). Apertures are generally re-scaled on the optical radius of each target to make sure that the total flux of each source is recovered. For more details on the procedure, we point the reader to the relevant papers.

The vast majority of galaxies in the HRS sample are clearly spatially resolved from the ultraviolet to the far-infrared regime, and the approach adopted by the HRS team was to treat each object individually, rather than resorting to automatic tools, in order to maximise the accuracy of the multi-wavelength photometric measurements. Specifically, photometry across the GALEX and SDSS bands was obtained by first fitting surface brightness profiles in elliptical annuli, defined primarily from the SDSS i-band images. The sky background was manually determined in each band by identifying sky-free regions and masking background and foreground sources. Asymptotic magnitudes were then estimated through a linear fit of the growth curve in each individual band. Details on the whole procedure are presented in Cortese et al. (2012). The approach followed for the Herschel bands was slightly different, primarily due to the additional complexity of the far-infrared observations. In the SPIRE bands, integrated flux densities for unresolved sources were determined via a timeline-based point-spread function fitting. For extended sources, aperture photometry was employed, with the size of the aperture rescaled to ensure it encapsulated the entire emission. This practically implies a morphology-dependent aperture size, with 1.4 times the optical radius (defined as the 25 mag/arcsec2 isophotal radius in the B-band) for late-type galaxies, 0.8 times for lenticulars, and 0.3 for ellipticals. More details are presented in Ciesla et al. (2012). For PACS data, photometry was carried out within the same apertures used for SPIRE, except for unresolved SPIRE sources that, being generally resolved in PACS, needed the definition of an elliptical aperture following the same criteria used for SPIRE. More details can be found in Cortese et al. (2014).

This sample is ideally suited to test the quality of the available data in HiPS format for several reasons. First, it makes use of a representative selection of widely used surveys across a broad wavelength range, from the ultraviolet to the far infrared regime. It is also a volume-limited sample, selected in terms of distance and luminosity, and therefore it includes galaxies of diverse physical properties, such as masses, morphologies, star formation rates, gas and dust content, and environments. It is composed of nearby, spatially resolved targets, with reliable measurements up to large galactocentric distances in all spectral bands, which facilitates the analysis of very heterogeneous instruments by a common, generic pipeline. The sample size is small enough to enable the traditional approach based on human expertise, but it perfectly exemplifies the advantages of a more automated procedure; the level of detail and human intervention used more than a decade ago to produce the HRS measurements makes this sample a fantastic test case with which to assess the performance of an automatic pipeline.

2.2 HiPS skymaps

The Galaxy Evolution Explorer (GALEX; Martin et al. 2005) was a NASA Small Explorer mission designed to study the evolution of star formation in galaxies from the early Universe to the present day. Using microchannel plate detectors, GALEX captured direct images and low-resolution spectra in the near-ultraviolet (NUV; λ ~ 1344–1786 Å) and far-ultraviolet (FUV; λ ~ 1771–2831 Å) wavelengths. Launched in 2003, the mission conducted several surveys, including the All-Sky Imaging Survey (AIS), Medium Imaging Survey (MIS), and Deep Imaging Survey (DIS). While the FUV camera ceased operations in 2009 due to hardware failure, the NUV instrument continued collecting data until 2012. The GALEX HiPS skymaps were constructed from the GR6/GR7 data releases (Bianchi et al. 2017), including all the FUV1 and NUV2 images produced by the pipeline described in Morrissey et al. (2007), which reads the lists of photon positions and arrival times recorded by the detectors, corrects instrumental effects, projects onto sky co-ordinates, and performs flux calibration, background subtraction, and object detection.

The Sloan Digital Sky Survey (SDSS; York et al. 2000) is a multi-filter imaging and spectroscopic survey conducted with a dedicated 2.5-meter telescope at Apache Point Observatory. The SDSS imaging camera (Gunn et al. 1998) took its first science-quality data the night of 19 September 1998, and it covered a footprint of 14 055 square degrees until its last night of science-quality data on 18 November 2009. Stoughton et al. (2002) described the main aspects of the data analysis pipeline, including basic reduction (e.g. correction for bias, flat field, and data defects such as bad columns or cosmic rays), astrometric and photometric calibration, and object detection. Starting from the seventh data release (DR7; Abazajian et al. 2009), the ubercal algorithm was used to simultaneously solve for the photometric calibration parameters and relative stellar fluxes using overlapping observations within the SDSS. The HiPS skymaps in the g3, r4, and i5 bands used in the present work were derived from the ninth data release (DR9; Ahn et al. 2012), whose main difference with respect to the DR7 imaging pipeline is an improvement in the astrometric calibration. In terms of photometry, the data should be almost identical to the SDSS observations used by Cortese et al. (2012).

The Herschel Space Observatory (Pilbratt et al. 2010) was an ESA mission that observed the far-infrared and submillimeter spectrum from 14 May 2009 to 29 April 2013 using three different instruments: PACS (Photodetector Array Camera and Spectrometer; Poglitsch et al. 2010), SPIRE (Spectral and Photometric Imaging Receiver; Griffin et al. 2010), and HIFI (Heterodyne Instrument for the Far-Infrared; de Graauw et al. 2010). The HRS catalogue reports photometric measurements in the ‘green’ and ‘red’ PACS bands at 1006 and 1607 μm, as well as the three SPIRE bands (2508, 3509, and 50010 μm). The raw data from both instruments underwent extensive processing through the Herschel Interactive Processing Environment (HIPE; Ott 2010), including astrometry, flux calibration, and projection of the scans associated to the different observing modes onto sky co-ordinates. The HiPS maps were constructed from all the publicly available Level 2.5 data (or Level 2, if no Level 2.5 was available) in the Herschel Science Archive11. For PACS, these images were produced by the Jscanam algorithm (Graciá-Carpio et al. 2015), and they are suitable for the scientific analysis of both point and extended sources. For SPIRE, the extended-source calibrated maps were used, whose zero-level is determined by comparison with specially produced Planck HFI maps at 545 and 857 GHz.

3 Data analysis

Nowadays, several platforms provide flexible frameworks for accessing and analysing astronomical data. The services they offer are often designed to abstract away mission-specific details, enabling users to focus on scientific goals such as multi-wavelength analysis without needing to address differences in data format or instrumentation. Key examples include metadata services like SIMBAD12 (Wenger et al. 2000) and NED13 (Helou & Madore 1991), which allow researchers to query source information by name or co-ordinates. The CDS Sesame service14 integrates both, along with additional resources. For catalogue access, the VizieR service15 (Ochsenbein et al. 2000) is an essential reference, hosting a vast collection of published astronomical catalogues widely used by the community. In terms of scientific visualisation, Aladin Desktop (Bonnarel et al. 2000) offers advanced exploration and analysis of HiPS maps, including cut-out services (see also the hips2fits16 web service and associated APIs, also provided by the CDS) and catalogue overlays. Its web-based counterpart, Aladin Lite (Boch & Fernique 2014), is extensively used in portals such as ESASky17, the ESO Science Archive18, and JAXA’s JUDO219. NASA/IPAC’s IRSA Viewer20, built upon the Firefly21 UI library, provides similar functionalities with an emphasis on infrared data.

Despite the availability of these established tools, our scientific goals required a platform with greater flexibility and extensibility, tailored to the specific requirements of our study. To this end, we developed AstroBrowser, a lightweight yet versatile framework that allows fine control over data handling, seamless integration with analysis workflows, and support for scientific interoperability, as well as a set of analysis scripts to carry out background subtraction, aperture photometry, and the estimation of measurement errors.

3.1 AstroBrowser

All the data used in this study were retrieved using AstroBrowser, a modular suite of tools for the acquisition, visualisation, and scientific analysis of astronomical datasets stored in the HiPS format. The system is designed to interface directly with public archives, streamlining access while maintaining consistency with IVOA standards. The version presented in this work represents the first implementation of a broader, long-term vision aimed at browser-based, supervised collaborative scientific analysis of intermediate-size samples, of the order of hundreds of targets to a few thousand. The current version implements the core functionality of the framework, and it consists of a web interface22, a backend, and three independent libraries, as is illustrated in Fig. 1 and detailed in the subsections below. AstroBrowser leverages IVOA standards such as the Table Access Protocol (TAP) (Dowler et al. 2011) for metadata access, the Astronomical Data Query Language (ADQL) (Ortiz et al. 2011) for querying external services, and HiPS for the multi-resolution representation of skymaps. At a lower level, it supports well-established formats and conventions, including FITS for data storage and the WCS (Greisen & Calabretta 2002) for spatial referencing. The entire source code has been released under the GNU General Public License v3 and is open to community contributions.

The main repository hosts both the web interface and the backend. For the WCSLight, jsFITSio, and HEALPixjs library, both the source code23,24,25 and nodejs packages26,27,28 are provided.

thumbnail Fig. 1

AstroBrowser components diagram.

3.1.1 Web interface

The web interface of AstroBrowser represents the primary user interaction point. It consists of two different modules:

HiPS Explorer, developed in Vanilla JavaScript and WebGL, is the core component that provides an intuitive and visual portal to access all the AstroBrowser functionality (see Fig. 2). It handles data visualisation (e.g. FITS and HiPS maps), and it overlays geometrical metadata representations (e.g. source catalogues and observation footprints) onto an active HiPS map. The list of HiPS maps available in the HiPS Explorer is derived from the Hipslist file29 provided by the CDS and ESASky HiPS nodes. The retrieval of catalogues and observation footprint metadata is performed using ADQL queries on ESASky and ESO TAP service providers.

The Data Explorer (Fig. 3) enables a more detailed exploration and manipulation of the data that are currently displayed in the HiPS Explorer, and it grants a user-friendly access to the cutout service exposed in the backend via the API. Its key features include:

  • Cut-out extraction: Users can input the central right ascension (RA), declination (Dec), pixel size, and radius of the requested field of view to extract a cut-out from the active HiPS map.

    thumbnail Fig. 2

    Herschel SPCS 500 catalogue overlay on M31 superimposed on SPIRE 500 HiPS. The shape size is mapped to the flux metadata column, and the shape hue is mapped to the snrnoise metadata column.

    thumbnail Fig. 3

    Interactive DataExplorer panel view on GALEX NUV HiPS targeting M31.

  • Visual inspection: By simply hovering the mouse, users can explore in real time the physical co-ordinates (i, j), astronomical co-ordinates (RA, Dec), and pixel values of the cut-out image.

  • Customisation: The colour map and scale function can be adjusted at will to improve visualisation and interactively adapt to the dynamical range of the displayed data.

  • Download: A postcard image (PNG) and the actual data in FITS format can be downloaded for further analysis or archival purposes.

3.1.2 Backend

The web interface described above relies on a backend to query external metadata providers, retrieve observational data, and perform various file and data manipulation tasks. The backend helps one to circumvent cross-origin resource sharing issues when accessing external providers (e.g. HiPS servers and TAP services). More importantly, it exposes the application programming interface (API) component that facilitates seamless programmatic access to the AstroBrowser functionality. The API consists of a single HTTP entry point that can be called from any program or web service. Here is an example of how to directly call the cut-out service from a web browser using the API:

http://astrobrowser.ft.uam.es/api/cutout?hipsbaseuri=https://alasky.cds.unistra.fr/GALEX/GALEXGR6_7_NUV/&radeg=10.7&decdeg=41.3&radiusasec=7000&pxsizeasec=15,

where

  • radiusasec is the desired radius, in arc-seconds;

  • pxsizeasec the desired pixel size, in arc-seconds;

  • radeg the central RA, in degrees;

  • decdeg the central Dec, in degrees;

  • hipsbaseuri the base URL of a HiPS map, where the properties file is stored.

Table 1

Scientific calibration parameters of each skymap.

3.1.3 Stand-alone libraries

The most common low-level, astronomy-specific operations that are carried out by the different AstroBrowser modules have been implemented as three independent libraries that are publicly distributed to help other developers of astronomical software. They can all be run embedded in a web page or imported as a dependency in any Node.js program.

jsFITSio is a TypeScript library that handles I/O operations and data manipulation on standard FITS files. This library represents the foundational system level in AstroBrowser, enabling the reading and writing of standard FITS files.

WCSLight is a TypeScript library designed to manipulate coordinate projections according to the FITS standard. It relies on jsFITSio to read, write, and download FITS files, and it implements the WCS functions pix2world and world2pix, as well as the low-level cut-out functionality on single FITS files and the entire HiPS.

HEALPixjs is a library that partially ports the HEALPix C++ library to Vanilla JavaScript. Since AstroBrowser uses NEST tile organisation for HiPS, only the NEST tessellation code has been ported to HEALPixjs. This library is used by WCSLight for operations on HiPS projections and in the AstroBrowser web interface for visualising HiPS maps and supporting the algorithms that allow the selection of footprints overlaid on a HiPS sphere.

3.2 Analysis pipeline

For every galaxy in the HRS, we used the AstroBrowser web API exposed by the backend to download cut-outs in each band, and then carried out aperture photometry on the retrieved images. Our analysis pipeline, publicly available as a Python notebook30, can be described in terms of two separate steps:

3.2.1 Calibration

To carry out a meaningful scientific analysis, it is necessary to supplement the information available in the properties file of the different skymaps with some additional data; most notably, the spatial resolution of the original images and the physical units of the intensity values. In this work, we adopt the parameters quoted on Table 1.

The maximum resolution of the HiPS maps can always be inferred from the compulsory hips_order keyword, and it is most often explicitly provided as hips_pixel_scale. However, it is common that they oversample the original observations (i.e. they feature smaller pixel size) in order to preserve all the information present in the original data. This comes, of course, at the cost of losing statistical independence, and one must take into account the original pixel size when computing the number of independent degrees of freedom to estimate the uncertainties, as is explained below.

In the maps considered here, the optional HiPS keyword s_pixel_scale is provided for most instruments and bands. For GALEX, though, we had to manually retrieve the information from Morrissey et al. (2007), while for the i band of the SDSS we decided to assume the same spatial resolution as for the original images in the g and r bands (0.39564 arcsec, consistent with the parameters of the SDSS photometric camera; Gunn et al. 1998). Note that in this case the pixel size of the HiPS maps is slightly larger, and therefore it can be safely assumed that all these measurements are statistically independent. For the Herschel space observatory, the original pixel sizes reported in the s_pixel_scale keyword of the HiPS properties files are fully consistent with the official description of the PACS (Poglitsch et al. 2010) and SPIRE (Griffin et al. 2010) instruments, as well as the online documentation31. Nevertheless, it is important to highlight that the actual pixel size of the original PACS products at 100 μm depends on the Astronomical Observation Template (AOT): 1.6 arcsec, as is quoted in Table 1, for the prime mode, and 3.2 arcsec for parallel mode.

In most cases, the original pixel size, s0, is significantly smaller than the true spatial resolution of the observations, usually characterised in terms of the full width at half maximum (FWHM) of the point spread function (PSF). Determining the precise shape of the PSF is a complex task, since it depends on both the instrument configuration and the observing conditions of each target and exposure, and there is no straightforward way to define an ‘average’ PSF for a HiPS skymap. No information in this regard is provided in the properties file, and therefore we simply estimated here the representative values of the typical FWHM of the original observations based on the GALEX (Morrissey et al. 2007), SDSS32, and Hershel (Poglitsch et al. 2010; Griffin et al. 2010) documentation. The GALEX PSF is dominated by instrumental effects (most notably, the source position within the detector), whereas in SDSS it is dominated by atmospheric seeing. The effective beam of the PACS instrument is rather asymmetric, and it depends on the scan speed; since the HiPS skymaps are derived from all the observations available in the archive, the numbers quoted on Table 1 are merely meant to be representative (i.e. an approximate median) of the precise values reported in Poglitsch et al. (2010) for each configuration. For SPIRE, we take the mean values of the FWHM provided in Griffin et al. (2010).

Finding out the precise nature and physical units of the measurements stored in each map also required some manual literature research. GALEX data contain a hips_bunit keyword in the properties files. However, such a parameter is not part of the standard recommendation for the HiPS format put forward by the IVOA33, and the reported values do not coincide with our estimates, quoted in Table 1, which assume zero points of 18.82 (FUV) and 20.08 (NUV) in the AB system (Oke & Gunn 1983), based on Morrissey et al. (2007) and the instrument documentation available online34. For SDSS and Herschel, flux calibration information was also retrieved from external sources. SDSS photometry (Padmanabhan et al. 2008), also intended to be on the AB system, is expressed in terms of ‘nanommaggies’, which we approximated as 3.631 μJy. According to the Herschel documentation, the standard product generation pipeline provides flux-calibrated PACS images in units of janskys, whereas SPIRE measurements are given in megajanskys per steradian.

Note that only the SPIRE maps contain intensity values. In all the other cases, the quantity stored in the HiPS maps corresponds to flux per original pixel (s_pixel_scale s0), regardless of the spatial resolution (hips_pixel_scale spix) of the HiPS maps. Therefore, precise knowledge of the original pixel area, A0=s02$\[A_{0}=s_{0}^{2}\]$, is required in order to convert the map values to intensities per unit solid angle and derive meaningful photometry. This poses a significant problem for the PACS 100 μm band, where original images with different resolutions (1.6 and 3.2 arcsec per pixel) have been combined without adequate recalibration.

3.2.2 Aperture photometry

For each galaxy in the HRS sample, we made use of the AstroBrowser backend to download the image cut-outs in every band and compute the average intensity, ⟨I⟩, within the elliptical aperture defined by the RA and Dec (α0, δ0) of the galaxy, its semi-major and semi-minor axes (a, b), and the position angle, θ, of the major axis, as provided in the HRS catalogues. All of our cut-outs were centred around the location of the target, and the field of view was set to eight times the reported semi-major axis, a.

To determine the flux received from the galaxy, one must subtract the potential contribution from any large-scale background, be it of astrophysical or instrumental origin. To do so, we masked an elliptical area three times larger than the region of interest and computed the median and 16–84 percentiles over the remaining pixels, Iijout$\[I_{i j}^{\text {out}}\]$. Then, we adopted the median intensity, I50out$\[I_{50}^{\text {out}}\]$, as indicative of the image background, and our flux estimate is simply F=πab(IijinI50out),$\[F=\pi a b\left(\left\langle I_{i j}^{\text {in}}\right\rangle-I_{50}^{\text {out}}\right),\]$(1)

where the average Iijin$\[\langle I_{i j}^{\mathrm{in}}\rangle\]$ was performed over the original region of interest (i.e. the ellipse with semi-major and semi-minor axes a and b).

No attempt was made to deconvolve the PSF of the different instruments, since most HRS galaxies are well resolved, and therefore the vast majority of the flux is expected to be contained within the reported elliptical apertures. The smallest semi-minor axis in the optical/UV (Cortese et al. 2012) is 9 arcsec, significantly larger than the PSF of SDSS and GALEX observations. In the FUV, all semi-minor axes (Ciesla et al. 2012; Cortese et al. 2014) are greater than 20 arcsec, which also suffices to fully enclose the PACS PSF35. SPIRE is the only instrument that is not able to spatially resolve all the galaxies in our sample. As a matter of fact, 10, 12, and 11 objects were treated as pointlike sources by Ciesla et al. (2012) in the 250, 350, and 500 μm bands, respectively. Thus, only a small fraction (about 5 percent) of our targets would require a detailed modelling of the PSF.

To estimate the measurement uncertainty, ΔF=N(ΔI)2+(ΔB)2,$\[\Delta F=N \sqrt{(\Delta I)^2+(\Delta B)^2},\]$(2)

we added in quadrature the error term (ΔI)2 associated with the mean intensity within the region of interest and the estimated uncertainty of the background (ΔB)2, including both systematic and statistical errors, and then multiplied by the total number, N, of pixels in the region of interest.

For the mean target intensity, we estimated (ΔI)2=(I50inI16in)2n1$\[(\Delta I)^2=\frac{\left(I_{50}^{\mathrm{in}}-I_{16}^{\mathrm{in}}\right)^2}{n-1}\]$(3)

from the median and the 16 percentile of the intensity within the target elliptical aperture, where n=Nmin(spix2s02,1)πabmax(spix2,s02)$\[n=N \min (\frac{s_{\mathrm{pix}}^{2}}{s_{0}^{2}}, 1) \simeq \frac{\pi a b}{\max (s_{\mathrm{pix}}^{2}, s_{0}^{2})}\]$ denotes the number of statistically independent degrees of freedom, based on the number, N, of pixels, their area, Apix=spix2$\[A_{\mathrm{pix}}=s_{\mathrm{pix}}^{2}\]$, and the resolution, A0=s02$\[A_{0}=s_{0}^{2}\]$, of the original images (see Table 1).

In order to account for possible systematic uncertainties associated with our background estimate, I50out$\[I_{50}^{\text {out}}\]$, we made an approximate model of large-scale diffuse emission in the following way. First, we excluded, in addition to the enlarged elliptical mask, all pixels Iijout>I50out+1.5(I84outI16out)$\[I_{i j}^{\text {out}}>I_{50}^{\text {out}}+1.5~(I_{84}^{\text {out}}-I_{16}^{\text {out}})\]$ above a three-sigma threshold in order to discard nearby bright sources. We thus obtained a binary mask, mij, and a masked image, bij = mijIij, over the whole field of view that isolates the backgrounddominated area. In order to interpolate the contribution of the diffuse background within the region of interest, we carried out a weighted Gaussian smoothing with radius R=a+b4$\[R=\frac{a\text{+}b}{4}\]$: Bij=GR(bij)GR(mij),$\[B_{i j}=\frac{G_R\left(b_{i j}\right)}{G_R\left(m_{i j}\right)},\]$(4)

where GR denotes the Gaussian smoothing operation with kernel radius R. Based on this spatially varying model, (ΔB)2=(I50outI16out)2n+(Bijin I50out )2$\[(\Delta B)^2=\frac{\left(I_{50}^{out}-I_{16}^{out}\right)^2}{n}+\left\langle\left(B_{i j}^{\text {in }}-I_{50}^{\text {out }}\right)^2\right\rangle\]$(5)

attempts to account for the statistical uncertainty in the estimate of the mean background intensity (first term) as well as the systematic errors arising from fluctuations of the background model throughout the region of interest (second term).

thumbnail Fig. 4

Comparison between the flux measured from the HiPS skymaps and the values reported in the HRS catalogues for GALEX (Cortese et al. 2012, top left), SDSS (Cortese et al. 2012, bottom left), and Herschel (Ciesla et al. 2012; Cortese et al. 2014, top and bottom right for the PACS and SPIRE instruments, respectively). Error bars in the vertical axis represent the uncertainties returned by our pipeline (they do not include HRS errors); measurements above the median S/N are highlighted in black. Horizontal dotted lines and grey bands denote the median and 16–84 percentiles of the flux ratio (see Figure 5 for the full probability distribution).

4 Results

As can be seen in Figure 4, there is overall good agreement between the values obtained by the procedure above, applied to the publicly available HiPS maps, and the official values provided by the HRS catalogue, carefully supervised by human experts. The most notable exception is the PACS 100 μm band, where a clearly visible systematic error is affecting a fraction of the data. As is noted in Section 3.2.1, we believe that the source of the observed offset may be related to the ingestion of original images with different pixel size, but further investigation is required in order to fully clarify this issue. In all other cases, the values recovered automatically from the HiPS maps are fairly close, for the vast majority of galaxies, to the fluxes derived from a thorough analysis by experts. We have verified by visual inspection that most catastrophic failures (discrepancies larger than a factor 1.5) are due to nearby sources (e.g. foreground stars), extended galactic emission, or complex backgrounds near the edge of the region of interest. Background estimation is particularly relevant at a low signal-to-noise ratio (S/N), usually associated with faint objects, but we also noticed a few instrumental artefacts in some cases, especially in the PACS160 band, which may affect bright targets as well. In the SPIRE maps at 500 μm, one can see that significant flux (sometimes more than 50%) may be missed due to the extended wings of the PSF for a few objects with b ≲ 30 arcsec. For the other two SPIRE bands at 250 and 350 μm, the effect is only seen for point-like sources (b = 20 arcsec in the catalogue), and it typically amounts to 15–20%.

A more quantitative assessment of the photometric accuracy of the public HiPS skymaps is provided by the probability distribution of the ratio between the fluxes obtained by our pipeline and the official ones. Cumulative fractions and differential probability densities for every band are shown in Figure 5.

The median and 16–84 percentiles, shown by the dotted lines and grey bands in Figure 4, are quite robust to the presence of outliers discussed above, which represent a small fraction of the sample and therefore only affect the extremes of the distribution. From our analysis, one may conclude that, barring the systematic effects associated to the PACS instrument, the automatic photometry based on the HiPS maps is consistent with the expert analysis within a relative accuracy of a few percent.

With the aforementioned exception, the probability distribution is fairly symmetric for all datasets, and there are very few outliers where the discrepancies are larger than 20 percent. The median ratio, p50, is always close to one, and we do not find evidence of strong systematic biases. The largest offsets are found for both Herschel PACS bands, but even in this case there is no obvious trend with respect to object brightness. The width of the distribution is consistent with the typical S/N of the observations (black dots in Figure 4). It is also quite stable as a function of observed flux, especially for high-S/N data. However, our measurements tend to be larger than the HRS values in the optical and the UV, and lower in the FUV. These trends are responsible for the median offsets and asymmetric tails in the probability distributions shown in Figure 5. They are much more evident at low S/N, but they seem to affect the whole sample, and we conjecture that they arise from the subtle differences between our analysis procedure and the prescriptions followed by the HRS team.

Finally, we evaluated the accuracy of the uncertainties returned by our algorithm by studying the probability distribution of the normalised residual χ=FHiPSFHRSσHiPS,$\[\chi=\frac{F_{\mathrm{HiPS}}-F_{\mathrm{HRS}}}{\sigma_{\mathrm{HiPS}}},\]$(6)

where FHRS denote the catalogued fluxes, whereas FHiPS and σHiPS are the fluxes and errors returned by our pipeline, respectively. Ideally, the normalised residual, χ, should follow a random normal distribution as closely as possible. The results shown in Figure 6 and the percentile values reported in Table 2 suggest that this is arguably not a very good approximation, but the uncertainties derived from the HiPS maps are representative of the true errors.

thumbnail Fig. 5

Probability distribution of the ratio between the flux measured from the HiPS skymaps and the values reported in the HRS catalogues. Cumulative fractions and differential probability densities are shown in the top and bottom panels, respectively.

thumbnail Fig. 6

Cumulative fraction (top) and probability density (bottom) of the normalised residual χ defined in Equation (6). The dash-dotted black line in the bottom panel corresponds to a standard normal distribution.

Table 2

Percentiles of the probability distribution of the normalised residual χ, defined in Equation (6).

5 Discussion and conclusions

The analysis presented in this work demonstrates the potential of publicly available skymaps in a HiPS format to serve not only as a visualisation tool to retrieve the original data but also as a valuable resource for scientific analysis. In order to quantify this statement, we compared the fluxes measured by a simple automatic pipeline from the GALEX, SDSS, and Herschel HiPS skymaps with those reported in the HRS catalogue, derived from a careful analysis of a relatively small sample of 323 galaxies carried out by human experts. Our results reveal an overall good agreement, with discrepancies generally within a few percent.

On the other hand, they also highlight the need for a few improvements in order to guarantee the reliability and optimise the scientific exploitation of these datasets:

To be fully leveraged in scientific applications, it is essential that the metadata of the HiPS skymaps include a better description of the measurements and their physical units. While there is no obvious prescription to do so in the properties file, we note that the HiPS standard defined in the official IVOA recommendation allows for the provision of a metadata file in FITS format where this information can be clearly stated in the header. We encourage map creators to actively make use of these mechanism, preferably through the universal content descriptors and the BUNIT keyword. Given the hierarchical (multi-resolution) nature of the HiPS format, the potential variety of use cases, and the fact the target users are the broad scientific community, we strongly recommend reporting the proper surface brightness (flux per unit solid angle) rather than the flux ‘per beam’, ‘per [original] pixel’, or any other unit that requires specific knowledge about the instrument.

There are several other aspects that are relevant in many scientific applications. Regarding the energy response of the instrument, we recommend storing the filter ID of the SVO filter service in the FILTER keyword. To provide some information about the PSF, one could follow the convention, widely adopted in radio astronomy, of using the keywords BMIN, BMAJ, and BPA to specify its shape (FWHM) and orientation. Although our results show that an approximate estimate of the measurement uncertainties can be derived from the data themselves, a more rigorous statistical analysis could be carried out if additional skymaps describing the uncertainties and their covariance were available.

All these aspects impose an additional burden on the map creators in terms of both work and responsibility, but we sincerely think that they would greatly increase the scientific and legacy value of the HiPS skymaps. In order to facilitate the calibration process, we suggest the definition of reference datasets to benchmark the quality of both the skymaps and their metadata, as we have done here based on the HRS. Such a system would provide users with an objective measure to help them evaluate whether the publicly available skymaps are suitable for a particular science case.

To that end, the details of map creation, its associated uncertainties, usage instructions, and recommendations, a discussion of potential caveats and limitations of the data, and the results of quality assessment against a recognised reference sample should all be well documented. In our opinion, formal validation based on peer review would be extremely helpful, and we strongly advocate for the publication of manuscripts dedicated entirely to discussing these issues, without any further scientific content.

In conclusion, we claim that the HiPS standard does not only provide an excellent means to optimise the visualisation of astronomical observations, but a robust, scalable framework to distribute science-grade data. With the exception of the PACS maps at 100 μm, all the other maps considered in the present work are perfectly suitable for photometric studies that do not critically depend on an accuracy better than ~10%. While no automated procedure will ever be able to achieve the same quality standards as a careful analysis by seasoned human experts, HiPS skymaps represent an invaluable resource for the broad astronomical community, especially in the context of multi-wavelength and multi-messenger studies that must combine measurements carried out by a wide variety of instruments.

Acknowledgements

We thank the anonymous referee for a constructive report that has helped us improve the presentation of our motivation and the discussion of the results. Y. Ascasibar received financial support from the Spanish State Research Agency (AEI/10.13039/501100011033) during the inital stages of this work through grant PID2019-107408GB-C42. A significant fraction of the project was carried out during a research visit to the International Centre for Radio Astronomy Research – University of Western Australia (ICRAR-UWA) funded from the mobility programme Ayudas de recualificación del profesorado universitario funionario o contratado (CA2/RSUE/2021-00817) of the Universidad Autónoma de Madrid (UAM), within the framework of the Plan de Recuperación, Transformación y Resiliencia of the Ministerio de Universidades (Spain). This work made use of Astropy36: a community-developed core Python package and an ecosystem of tools and resources for astronomy (Astropy Collaboration 2013, 2018, 2022), as well as the Numpy (Harris et al. 2020) and Scipy (Virtanen et al. 2020) Python libraries. Funding for SDSS-III has been provided by the Alfred P. Sloan Foundation, the Participating Institutions, the National Science Foundation, and the U.S. Department of Energy Office of Science. SDSS-III37 is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS-III Collaboration including the University of Arizona, the Brazilian Participation Group, Brookhaven National Laboratory, Carnegie Mellon University, University of Florida, the French Participation Group, the German Participation Group, Harvard University, the Instituto de Astrofisica de Canarias, the Michigan State/Notre Dame/JINA Participation Group, Johns Hopkins University, Lawrence Berkeley National Laboratory, Max Planck Institute for Astrophysics, Max Planck Institute for Extraterrestrial Physics, New Mexico State University, New York University, Ohio State University, Pennsylvania State University, University of Portsmouth, Princeton University, the Spanish Participation Group, University of Tokyo, University of Utah, Vanderbilt University, University of Virginia, University of Washington, and Yale University.

References

  1. Abazajian, K. N., Adelman-McCarthy, J. K., Agüeros, M. A., et al. 2009, ApJS, 182, 543 [Google Scholar]
  2. Ahn, C. P., Alexandroff, R., Allende Prieto, C., et al. 2012, ApJS, 203, 21 [Google Scholar]
  3. Astropy Collaboration (Robitaille, T. P., et al.) 2013, A&A, 558, A33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Astropy Collaboration (Price-Whelan, A. M., et al.) 2018, AJ, 156, 123 [Google Scholar]
  5. Astropy Collaboration (Price-Whelan, A. M., et al.) 2022, ApJ, 935, 167 [NASA ADS] [CrossRef] [Google Scholar]
  6. Bianchi, L., Shiao, B., & Thilker, D. 2017, ApJS, 230, 24 [Google Scholar]
  7. Boch, T., & Fernique, P. 2014, in Astronomical Society of the Pacific Conference Series, 485, Astronomical Data Analysis Software and Systems XXIII, eds. N. Manset & P. Forshay, 277 [Google Scholar]
  8. Bonnarel, F., Fernique, P., Bienaymé, O., et al. 2000, A&AS, 143, 33 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  9. Boselli, A., Eales, S., Cortese, L., et al. 2010, PASP, 122, 261 [Google Scholar]
  10. Calabretta, M. R., & Greisen, E. W. 2002, A&A, 395, 1077 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  11. Ciesla, L., Boselli, A., Smith, M. W. L., et al. 2012, A&A, 543, A161 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  12. Cortese, L., Boissier, S., Boselli, A., et al. 2012, A&A, 544, A101 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  13. Cortese, L., Fritz, J., Bianchi, S., et al. 2014, MNRAS, 440, 942 [Google Scholar]
  14. de Graauw, T., Helmich, F. P., Phillips, T. G., et al. 2010, A&A, 518, L6 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  15. De Marchi, G., & Parmar, A. N. 2024, arXiv e-prints [arXiv:2402.12818] [Google Scholar]
  16. Dowler, P., Rixon, G., & Tody, D. 2011, IVOA Recommendation [Google Scholar]
  17. Fernique, P., Allen, M. G., Boch, T., et al. 2015, ApJ, 578, A114 [Google Scholar]
  18. Giordano, F., Racero, E., Norman, H., et al. 2018, Astron. Comput., 24, 97 [NASA ADS] [CrossRef] [Google Scholar]
  19. Goodman, A. A. 2012, Astron. Nachr., 333, 505 [Google Scholar]
  20. Górski, K. M., Hivon, E., Banday, A. J., et al. 2005, ApJ, 622, 759 [Google Scholar]
  21. Graciá-Carpio, J., Wetzstein, M., & Roussel, H. 2015, arXiv e-prints [arXiv:1512.03252] [Google Scholar]
  22. Greisen, E. W., & Calabretta, M. R. 2002, A&A, 395, 1061 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  23. Griffin, M. J., Abergel, A., Abreu, A., et al. 2010, A&A, 518, L3 [EDP Sciences] [Google Scholar]
  24. Gunn, J. E., Carr, M., Rockosi, C., et al. 1998, AJ, 116, 3040 [NASA ADS] [CrossRef] [Google Scholar]
  25. Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357 [NASA ADS] [CrossRef] [Google Scholar]
  26. Helou, G., & Madore, B. F. 1991, Bull. Am. Astron. Soc., 23, 909 [Google Scholar]
  27. Martin, D. C., Fanson, J., Schiminovich, D., et al. 2005, ApJ, 619, L1 [Google Scholar]
  28. Morrissey, P., Conrow, T., Barlow, T. A., et al. 2007, ApJS, 173, 682 [Google Scholar]
  29. National Academies of Sciences, Engineering, and Medicine 2023, Pathways to Discovery in Astronomy and Astrophysics for the 2020s (Washington, DC: The National Academies Press) [Google Scholar]
  30. Ochsenbein, F., Bauer, P., & Marcout, J. 2000, A&AS, 143, 23 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  31. Oke, J. B., & Gunn, J. E. 1983, ApJ, 266, 713 [NASA ADS] [CrossRef] [Google Scholar]
  32. Ortiz, I., et al. 2011, IVOA Recommendation [Google Scholar]
  33. Ott, S. 2010, in Astronomical Society of the Pacific Conference Series, 434, Astronomical Data Analysis Software and Systems XIX, eds. Y. Mizumoto, K. I. Morita, & M. Ohishi, 139 [Google Scholar]
  34. Padmanabhan, N., Schlegel, D. J., Finkbeiner, D. P., et al. 2008, ApJ, 674, 1217 [NASA ADS] [CrossRef] [Google Scholar]
  35. Pilbratt, G. L., Riedinger, J. R., Passvogel, T., et al. 2010, A&A, 518, L1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  36. Poglitsch, A., Waelkens, C., Geis, N., et al. 2010, A&A, 518, L2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  37. Stoughton, C., Lupton, R. H., Bernardi, M., et al. 2002, AJ, 123, 485 [Google Scholar]
  38. Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nat. Methods, 17, 261 [Google Scholar]
  39. Wells, D. C., Greisen, E. W., & Harten, R. H. 1981, A&AS, 44, 363 [NASA ADS] [Google Scholar]
  40. Wenger, M., Ochsenbein, F., Egret, D., et al. 2000, A&AS, 143, 9 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. York, D. G., Adelman, J., Anderson, Jr., J. E., et al. 2000, AJ, 120, 1579 [NASA ADS] [CrossRef] [Google Scholar]

35

Let us note that the values reported in Table 1 correspond to the FWHM of the PSF, and therefore they should be compared to the full minor axis 2b of the region of interest.

All Tables

Table 1

Scientific calibration parameters of each skymap.

Table 2

Percentiles of the probability distribution of the normalised residual χ, defined in Equation (6).

All Figures

thumbnail Fig. 1

AstroBrowser components diagram.

In the text
thumbnail Fig. 2

Herschel SPCS 500 catalogue overlay on M31 superimposed on SPIRE 500 HiPS. The shape size is mapped to the flux metadata column, and the shape hue is mapped to the snrnoise metadata column.

In the text
thumbnail Fig. 3

Interactive DataExplorer panel view on GALEX NUV HiPS targeting M31.

In the text
thumbnail Fig. 4

Comparison between the flux measured from the HiPS skymaps and the values reported in the HRS catalogues for GALEX (Cortese et al. 2012, top left), SDSS (Cortese et al. 2012, bottom left), and Herschel (Ciesla et al. 2012; Cortese et al. 2014, top and bottom right for the PACS and SPIRE instruments, respectively). Error bars in the vertical axis represent the uncertainties returned by our pipeline (they do not include HRS errors); measurements above the median S/N are highlighted in black. Horizontal dotted lines and grey bands denote the median and 16–84 percentiles of the flux ratio (see Figure 5 for the full probability distribution).

In the text
thumbnail Fig. 5

Probability distribution of the ratio between the flux measured from the HiPS skymaps and the values reported in the HRS catalogues. Cumulative fractions and differential probability densities are shown in the top and bottom panels, respectively.

In the text
thumbnail Fig. 6

Cumulative fraction (top) and probability density (bottom) of the normalised residual χ defined in Equation (6). The dash-dotted black line in the bottom panel corresponds to a standard normal distribution.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.