-
Euclid preparation. Measuring detailed galaxy morphologies for Euclid with Machine Learning
Authors:
Euclid Collaboration,
B. Aussel,
S. Kruk,
M. Walmsley,
M. Huertas-Company,
M. Castellano,
C. J. Conselice,
M. Delli Veneri,
H. Domínguez Sánchez,
P. -A. Duc,
U. Kuchner,
A. La Marca,
B. Margalef-Bentabol,
F. R. Marleau,
G. Stevens,
Y. Toba,
C. Tortora,
L. Wang,
N. Aghanim,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
M. Baldi,
S. Bardelli
, et al. (233 additional authors not shown)
Abstract:
The Euclid mission is expected to image millions of galaxies with high resolution, providing an extensive dataset to study galaxy evolution. We investigate the application of deep learning to predict the detailed morphologies of galaxies in Euclid using Zoobot a convolutional neural network pretrained with 450000 galaxies from the Galaxy Zoo project. We adapted Zoobot for emulated Euclid images, g…
▽ More
The Euclid mission is expected to image millions of galaxies with high resolution, providing an extensive dataset to study galaxy evolution. We investigate the application of deep learning to predict the detailed morphologies of galaxies in Euclid using Zoobot a convolutional neural network pretrained with 450000 galaxies from the Galaxy Zoo project. We adapted Zoobot for emulated Euclid images, generated based on Hubble Space Telescope COSMOS images, and with labels provided by volunteers in the Galaxy Zoo: Hubble project. We demonstrate that the trained Zoobot model successfully measures detailed morphology for emulated Euclid images. It effectively predicts whether a galaxy has features and identifies and characterises various features such as spiral arms, clumps, bars, disks, and central bulges. When compared to volunteer classifications Zoobot achieves mean vote fraction deviations of less than 12% and an accuracy above 91% for the confident volunteer classifications across most morphology types. However, the performance varies depending on the specific morphological class. For the global classes such as disk or smooth galaxies, the mean deviations are less than 10%, with only 1000 training galaxies necessary to reach this performance. For more detailed structures and complex tasks like detecting and counting spiral arms or clumps, the deviations are slightly higher, around 12% with 60000 galaxies used for training. In order to enhance the performance on complex morphologies, we anticipate that a larger pool of labelled galaxies is needed, which could be obtained using crowdsourcing. Finally, our findings imply that the model can be effectively adapted to new morphological labels. We demonstrate this adaptability by applying Zoobot to peculiar galaxies. In summary, our trained Zoobot CNN can readily predict morphological catalogues for Euclid images.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Euclid preparation: TBD. The pre-launch Science Ground Segment simulation framework
Authors:
Euclid Collaboration,
S. Serrano,
P. Hudelot,
G. Seidel,
J. E. Pollack,
E. Jullo,
F. Torradeflot,
D. Benielli,
R. Fahed,
T. Auphan,
J. Carretero,
H. Aussel,
P. Casenove,
F. J. Castander,
J. E. Davies,
N. Fourmanoit,
S. Huot,
A. Kara,
E. Keihanen,
S. Kermiche,
K. Okumura,
J. Zoubian,
A. Ealet,
A. Boucaud,
H. Bretonniere
, et al. (251 additional authors not shown)
Abstract:
The European Space Agency's Euclid mission is one of the upcoming generation of large-scale cosmology surveys, which will map the large-scale structure in the Universe with unprecedented precision. The development and validation of the SGS pipeline requires state-of-the-art simulations with a high level of complexity and accuracy that include subtle instrumental features not accounted for previous…
▽ More
The European Space Agency's Euclid mission is one of the upcoming generation of large-scale cosmology surveys, which will map the large-scale structure in the Universe with unprecedented precision. The development and validation of the SGS pipeline requires state-of-the-art simulations with a high level of complexity and accuracy that include subtle instrumental features not accounted for previously as well as faster algorithms for the large-scale production of the expected Euclid data products. In this paper, we present the Euclid SGS simulation framework as applied in a large-scale end-to-end simulation exercise named Science Challenge 8. Our simulation pipeline enables the swift production of detailed image simulations for the construction and validation of the Euclid mission during its qualification phase and will serve as a reference throughout operations. Our end-to-end simulation framework starts with the production of a large cosmological N-body & mock galaxy catalogue simulation. We perform a selection of galaxies down to I_E=26 and 28 mag, respectively, for a Euclid Wide Survey spanning 165 deg^2 and a 1 deg^2 Euclid Deep Survey. We build realistic stellar density catalogues containing Milky Way-like stars down to H<26. Using the latest instrumental models for both the Euclid instruments and spacecraft as well as Euclid-like observing sequences, we emulate with high fidelity Euclid satellite imaging throughout the mission's lifetime. We present the SC8 data set consisting of overlapping visible and near-infrared Euclid Wide Survey and Euclid Deep Survey imaging and low-resolution spectroscopy along with ground-based. This extensive data set enables end-to-end testing of the entire ground segment data reduction and science analysis pipeline as well as the Euclid mission infrastructure, paving the way to future scientific and technical developments and enhancements.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Euclid Preparation XXXIII. Characterization of convolutional neural networks for the identification of galaxy-galaxy strong lensing events
Authors:
Euclid Collaboration,
L. Leuzzi,
M. Meneghetti,
G. Angora,
R. B. Metcalf,
L. Moscardini,
P. Rosati,
P. Bergamini,
F. Calura,
B. Clément,
R. Gavazzi,
F. Gentile,
M. Lochner,
C. Grillo,
G. Vernardos,
N. Aghanim,
A. Amara,
L. Amendola,
S. Andreon,
N. Auricchio,
S. Bardelli,
C. Bodendorf,
D. Bonino,
E. Branchini,
M. Brescia
, et al. (194 additional authors not shown)
Abstract:
Forthcoming imaging surveys will potentially increase the number of known galaxy-scale strong lenses by several orders of magnitude. For this to happen, images of tens of millions of galaxies will have to be inspected to identify potential candidates. In this context, deep learning techniques are particularly suitable for the finding patterns in large data sets, and convolutional neural networks (…
▽ More
Forthcoming imaging surveys will potentially increase the number of known galaxy-scale strong lenses by several orders of magnitude. For this to happen, images of tens of millions of galaxies will have to be inspected to identify potential candidates. In this context, deep learning techniques are particularly suitable for the finding patterns in large data sets, and convolutional neural networks (CNNs) in particular can efficiently process large volumes of images. We assess and compare the performance of three network architectures in the classification of strong lensing systems on the basis of their morphological characteristics. We train and test our models on different subsamples of a data set of forty thousand mock images, having characteristics similar to those expected in the wide survey planned with the ESA mission \Euclid, gradually including larger fractions of faint lenses. We also evaluate the importance of adding information about the colour difference between the lens and source galaxies by repeating the same training on single-band and multi-band images. Our models find samples of clear lenses with $\gtrsim 90\%$ precision and completeness, without significant differences in the performance of the three architectures. Nevertheless, when including lenses with fainter arcs in the training set, the three models' performance deteriorates with accuracy values of $\sim 0.87$ to $\sim 0.75$ depending on the model. Our analysis confirms the potential of the application of CNNs to the identification of galaxy-scale strong lenses. We suggest that specific training with separate classes of lenses might be needed for detecting the faint lenses since the addition of the colour information does not yield a significant improvement in the current analysis, with the accuracy ranging from $\sim 0.89$ to $\sim 0.78$ for the different models.
△ Less
Submitted 26 January, 2024; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Euclid preparation XXVI. The Euclid Morphology Challenge. Towards structural parameters for billions of galaxies
Authors:
Euclid Collaboration,
H. Bretonnière,
U. Kuchner,
M. Huertas-Company,
E. Merlin,
M. Castellano,
D. Tuccillo,
F. Buitrago,
C. J. Conselice,
A. Boucaud,
B. Häußler,
M. Kümmel,
W. G. Hartley,
A. Alvarez Ayllon,
E. Bertin,
F. Ferrari,
L. Ferreira,
R. Gavazzi,
D. Hernández-Lang,
G. Lucatelli,
A. S. G. Robotham,
M. Schefer,
L. Wang,
R. Cabanac,
H. Domínguez Sánchez
, et al. (193 additional authors not shown)
Abstract:
The various Euclid imaging surveys will become a reference for studies of galaxy morphology by delivering imaging over an unprecedented area of 15 000 square degrees with high spatial resolution. In order to understand the capabilities of measuring morphologies from Euclid-detected galaxies and to help implement measurements in the pipeline, we have conducted the Euclid Morphology Challenge, which…
▽ More
The various Euclid imaging surveys will become a reference for studies of galaxy morphology by delivering imaging over an unprecedented area of 15 000 square degrees with high spatial resolution. In order to understand the capabilities of measuring morphologies from Euclid-detected galaxies and to help implement measurements in the pipeline, we have conducted the Euclid Morphology Challenge, which we present in two papers. While the companion paper by Merlin et al. focuses on the analysis of photometry, this paper assesses the accuracy of the parametric galaxy morphology measurements in imaging predicted from within the Euclid Wide Survey. We evaluate the performance of five state-of-the-art surface-brightness-fitting codes DeepLeGATo, Galapagos-2, Morfometryka, Profit and SourceXtractor++ on a sample of about 1.5 million simulated galaxies resembling reduced observations with the Euclid VIS and NIR instruments. The simulations include analytic Sérsic profiles with one and two components, as well as more realistic galaxies generated with neural networks. We find that, despite some code-specific differences, all methods tend to achieve reliable structural measurements (10% scatter on ideal Sérsic simulations) down to an apparent magnitude of about 23 in one component and 21 in two components, which correspond to a signal-to-noise ratio of approximately 1 and 5 respectively. We also show that when tested on non-analytic profiles, the results are typically degraded by a factor of 3, driven by systematics. We conclude that the Euclid official Data Releases will deliver robust structural parameters for at least 400 million galaxies in the Euclid Wide Survey by the end of the mission. We find that a key factor for explaining the different behaviour of the codes at the faint end is the set of adopted priors for the various structural parameters.
△ Less
Submitted 28 November, 2022; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Euclid preparation. XXV. The Euclid Morphology Challenge -- Towards model-fitting photometry for billions of galaxies
Authors:
Euclid Collaboration,
E. Merlin,
M. Castellano,
H. Bretonnière,
M. Huertas-Company,
U. Kuchner,
D. Tuccillo,
F. Buitrago,
J. R. Peterson,
C. J. Conselice,
F. Caro,
P. Dimauro,
L. Nemani,
A. Fontana,
M. Kümmel,
B. Häußler,
W. G. Hartley,
A. Alvarez Ayllon,
E. Bertin,
P. Dubath,
F. Ferrari,
L. Ferreira,
R. Gavazzi,
D. Hernández-Lang,
G. Lucatelli
, et al. (196 additional authors not shown)
Abstract:
The ESA Euclid mission will provide high-quality imaging for about 1.5 billion galaxies. A software pipeline to automatically process and analyse such a huge amount of data in real time is being developed by the Science Ground Segment of the Euclid Consortium; this pipeline will include a model-fitting algorithm, which will provide photometric and morphological estimates of paramount importance fo…
▽ More
The ESA Euclid mission will provide high-quality imaging for about 1.5 billion galaxies. A software pipeline to automatically process and analyse such a huge amount of data in real time is being developed by the Science Ground Segment of the Euclid Consortium; this pipeline will include a model-fitting algorithm, which will provide photometric and morphological estimates of paramount importance for the core science goals of the mission and for legacy science. The Euclid Morphology Challenge is a comparative investigation of the performance of five model-fitting software packages on simulated Euclid data, aimed at providing the baseline to identify the best suited algorithm to be implemented in the pipeline. In this paper we describe the simulated data set, and we discuss the photometry results. A companion paper (Euclid Collaboration: Bretonnière et al. 2022) is focused on the structural and morphological estimates. We created mock Euclid images simulating five fields of view of 0.48 deg2 each in the $I_E$ band of the VIS instrument, each with three realisations of galaxy profiles (single and double Sérsic, and 'realistic' profiles obtained with a neural network); for one of the fields in the double Sérsic realisation, we also simulated images for the three near-infrared $Y_E$, $J_E$ and $H_E$ bands of the NISP-P instrument, and five Rubin/LSST optical complementary bands ($u$, $g$, $r$, $i$, and $z$). To analyse the results we created diagnostic plots and defined ad-hoc metrics. Five model-fitting software packages (DeepLeGATo, Galapagos-2, Morfometryka, ProFit, and SourceXtractor++) were compared, all typically providing good results. (cut)
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
Probabilistic segmentation of overlapping galaxies for large cosmological surveys
Authors:
Hubert Bretonnière,
Alexandre Boucaud,
Marc Huertas-Company
Abstract:
Encoder-Decoder networks such as U-Nets have been applied successfully in a wide range of computer vision tasks, especially for image segmentation of different flavours across different fields. Nevertheless, most applications lack of a satisfying quantification of the uncertainty of the prediction. Yet, a well calibrated segmentation uncertainty can be a key element for scientific applications suc…
▽ More
Encoder-Decoder networks such as U-Nets have been applied successfully in a wide range of computer vision tasks, especially for image segmentation of different flavours across different fields. Nevertheless, most applications lack of a satisfying quantification of the uncertainty of the prediction. Yet, a well calibrated segmentation uncertainty can be a key element for scientific applications such as precision cosmology. In this on-going work, we explore the use of the probabilistic version of the U-Net, recently proposed by Kohl et al (2018), and adapt it to automate the segmentation of galaxies for large photometric surveys. We focus especially on the probabilistic segmentation of overlapping galaxies, also known as blending. We show that, even when training with a single ground truth per input sample, the model manages to properly capture a pixel-wise uncertainty on the segmentation map. Such uncertainty can then be propagated further down the analysis of the galaxy properties. To our knowledge, this is the first time such an experiment is applied for galaxy deblending in astrophysics.
△ Less
Submitted 6 December, 2021; v1 submitted 30 November, 2021;
originally announced November 2021.
-
Euclid preparation: XIII. Forecasts for galaxy morphology with the Euclid Survey using Deep Generative Models
Authors:
Euclid Collaboration,
H. Bretonnière,
M. Huertas-Company,
A. Boucaud,
F. Lanusse,
E. Jullo,
E. Merlin,
D. Tuccillo,
M. Castellano,
J. Brinchmann,
C. J. Conselice,
H. Dole,
R. Cabanac,
H. M. Courtois,
F. J. Castander,
P. A. Duc,
P. Fosalba,
D. Guinet,
S. Kruk,
U. Kuchner,
S. Serrano,
E. Soubrie,
A. Tramacere,
L. Wang,
A. Amara
, et al. (171 additional authors not shown)
Abstract:
We present a machine learning framework to simulate realistic galaxies for the Euclid Survey. The proposed method combines a control on galaxy shape parameters offered by analytic models with realistic surface brightness distributions learned from real Hubble Space Telescope observations by deep generative models. We simulate a galaxy field of $0.4\,\rm{deg}^2$ as it will be seen by the Euclid vis…
▽ More
We present a machine learning framework to simulate realistic galaxies for the Euclid Survey. The proposed method combines a control on galaxy shape parameters offered by analytic models with realistic surface brightness distributions learned from real Hubble Space Telescope observations by deep generative models. We simulate a galaxy field of $0.4\,\rm{deg}^2$ as it will be seen by the Euclid visible imager VIS and show that galaxy structural parameters are recovered with similar accuracy as for pure analytic Sérsic profiles. Based on these simulations, we estimate that the Euclid Wide Survey will be able to resolve the internal morphological structure of galaxies down to a surface brightness of $22.5\,\rm{mag}\,\rm{arcsec}^{-2}$, and $24.9\,\rm{mag}\,\rm{arcsec}^{-2}$ for the Euclid Deep Survey. This corresponds to approximately $250$ million galaxies at the end of the mission and a $50\,\%$ complete sample for stellar masses above $10^{10.6}\,\rm{M}_\odot$ (resp. $10^{9.6}\,\rm{M}_\odot$) at a redshift $z\sim0.5$ for the wide (resp. deep) survey. The approach presented in this work can contribute to improving the preparation of future high-precision cosmological imaging surveys by allowing simulations to incorporate more realistic galaxies.
△ Less
Submitted 10 January, 2022; v1 submitted 25 May, 2021;
originally announced May 2021.