The Large-Scale Structure (LSS) of the Universe describes the distribution of galaxies and matter on scales much larger than individual galaxies or clusters. This cosmic web forms an intricate network of voids, sheets, filaments, and knots. Its morphology encodes key information about structure formation and cosmological parameters. Classifying these structures with high precision and accuracy is therefore essential for modern cosmological analysis. In this work, we present a reproducible pipeline based on 3D Deep Learning to classify LSS into these four classes. We used N-body simulations from the Quijote Simulations suite. In particular, we analyzed a cubic volume with a side length of 1 h−1Gpc at redshift z = 0. By applying the Piece- wise Cubic Spline (PCS) mass assignment method, we distributed the particles from the raw snapshots into a 1283 voxel grid. From this grid, we computed the velocity, density and overdensity fields needed to develop the following methods. To train our supervised model, we generated ground-truth labels using the Hoffman method, which uses the eigenvalues of the shear tensor to produce the classification. We implemented a 3D U-Net architecture, which is widely used for image segmentation, to capture the multi-scale spatial correlations of the cosmic web. Before training the model, we applied data augmentation with random 90-degree rotations and axis flips. The U-Net begins with 64 initial filters, descending through four downsampling blocks to a bottleneck of 1024 channels, followed by four corresponding upsampling blocks. Due to the heavy class imbalance present in the data, in fact, voids occupy the majority of the volume, we optimized the model using a hybrid loss function composed of the classic cross-entropy and the Dice loss. Moreover, we applied a weighting scheme based on the effective number of samples per class to avoid biasing toward the majority classes. During inference on the test set, the U-Net achieved a mean Dice coefficient of 0.788, with per-class scores ranging from 0.931 for voids to 0.633 for knots. Despite the strong class imbalance, it is clear that the class balance strategy is effective in maintaining high segmentation fidelity across all scales. The strong point of this work is the physical consistency of the network’s predictions, which is underlined by the robust agreement between the Volume and Mass Filling Fractions (VFF, MFF) computed from the U-Net predictions with the ground-truth Hoffman classification, with differences below 10−3, highlighting that the model successfully captured the underlying density-morphology relation. This confirms the 3D U-Net as a robust, physically-consistent tool for high-resolution cosmic web segmentation.
The Large-Scale Structure (LSS) of the Universe describes the distribution of galaxies and matter on scales much larger than individual galaxies or clusters. This cosmic web forms an intricate network of voids, sheets, filaments, and knots. Its morphology encodes key information about structure formation and cosmological parameters. Classifying these structures with high precision and accuracy is therefore essential for modern cosmological analysis. In this work, we present a reproducible pipeline based on 3D Deep Learning to classify LSS into these four classes. We used N-body simulations from the Quijote Simulations suite. In particular, we analyzed a cubic volume with a side length of 1 h−1Gpc at redshift z = 0. By applying the Piece- wise Cubic Spline (PCS) mass assignment method, we distributed the particles from the raw snapshots into a 1283 voxel grid. From this grid, we computed the velocity, density and overdensity fields needed to develop the following methods. To train our supervised model, we generated ground-truth labels using the Hoffman method, which uses the eigenvalues of the shear tensor to produce the classification. We implemented a 3D U-Net architecture, which is widely used for image segmentation, to capture the multi-scale spatial correlations of the cosmic web. Before training the model, we applied data augmentation with random 90-degree rotations and axis flips. The U-Net begins with 64 initial filters, descending through four downsampling blocks to a bottleneck of 1024 channels, followed by four corresponding upsampling blocks. Due to the heavy class imbalance present in the data, in fact, voids occupy the majority of the volume, we optimized the model using a hybrid loss function composed of the classic cross-entropy and the Dice loss. Moreover, we applied a weighting scheme based on the effective number of samples per class to avoid biasing toward the majority classes. During inference on the test set, the U-Net achieved a mean Dice coefficient of 0.788, with per-class scores ranging from 0.931 for voids to 0.633 for knots. Despite the strong class imbalance, it is clear that the class balance strategy is effective in maintaining high segmentation fidelity across all scales. The strong point of this work is the physical consistency of the network’s predictions, which is underlined by the robust agreement between the Volume and Mass Filling Fractions (VFF, MFF) computed from the U-Net predictions with the ground-truth Hoffman classification, with differences below 10−3, highlighting that the model successfully captured the underlying density-morphology relation. This confirms the 3D U-Net as a robust, physically-consistent tool for high-resolution cosmic web segmentation.
Extracting Cosmological Information from Large-Scale Structure Morphology using Deep Learning
BOCCANERA, EUGENIA
2025/2026
Abstract
The Large-Scale Structure (LSS) of the Universe describes the distribution of galaxies and matter on scales much larger than individual galaxies or clusters. This cosmic web forms an intricate network of voids, sheets, filaments, and knots. Its morphology encodes key information about structure formation and cosmological parameters. Classifying these structures with high precision and accuracy is therefore essential for modern cosmological analysis. In this work, we present a reproducible pipeline based on 3D Deep Learning to classify LSS into these four classes. We used N-body simulations from the Quijote Simulations suite. In particular, we analyzed a cubic volume with a side length of 1 h−1Gpc at redshift z = 0. By applying the Piece- wise Cubic Spline (PCS) mass assignment method, we distributed the particles from the raw snapshots into a 1283 voxel grid. From this grid, we computed the velocity, density and overdensity fields needed to develop the following methods. To train our supervised model, we generated ground-truth labels using the Hoffman method, which uses the eigenvalues of the shear tensor to produce the classification. We implemented a 3D U-Net architecture, which is widely used for image segmentation, to capture the multi-scale spatial correlations of the cosmic web. Before training the model, we applied data augmentation with random 90-degree rotations and axis flips. The U-Net begins with 64 initial filters, descending through four downsampling blocks to a bottleneck of 1024 channels, followed by four corresponding upsampling blocks. Due to the heavy class imbalance present in the data, in fact, voids occupy the majority of the volume, we optimized the model using a hybrid loss function composed of the classic cross-entropy and the Dice loss. Moreover, we applied a weighting scheme based on the effective number of samples per class to avoid biasing toward the majority classes. During inference on the test set, the U-Net achieved a mean Dice coefficient of 0.788, with per-class scores ranging from 0.931 for voids to 0.633 for knots. Despite the strong class imbalance, it is clear that the class balance strategy is effective in maintaining high segmentation fidelity across all scales. The strong point of this work is the physical consistency of the network’s predictions, which is underlined by the robust agreement between the Volume and Mass Filling Fractions (VFF, MFF) computed from the U-Net predictions with the ground-truth Hoffman classification, with differences below 10−3, highlighting that the model successfully captured the underlying density-morphology relation. This confirms the 3D U-Net as a robust, physically-consistent tool for high-resolution cosmic web segmentation.| File | Dimensione | Formato | |
|---|---|---|---|
|
Boccanera_Eugenia.pdf
accesso aperto
Dimensione
4.6 MB
Formato
Adobe PDF
|
4.6 MB | Adobe PDF | Visualizza/Apri |
The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License
https://hdl.handle.net/20.500.12608/107349