Tensor Networks (TN) methods have successfully been applied to many different machine learning (ML) tasks. Notably, Tree-Tensor Networks (TTN) have proven efficient and rapid predictors in image classification and object detection, showcasing prowess in devising high-performance algorithms.  However, these linear models exhibit limitations in encoding long-distance correlations prevalent in 2D datasets. On the other hand, Multi-scale Entanglement Renormalization Ansatz (MERA) excels in capturing correlations at different scales but is algorithmically complex to optimize within an ML framework. This study aims to harness the strength of the two architectures for building a hybrid image classification algorithm by studying the application of augmented-TTN classifier and optimizing those with state-of-the-art techniques (i.e. auto-differentiation, Riemannian optimization..). The hybrid model will be tested on standard image classification problems: Fashion-MNIST; CIFAR10; ImageNet; and benchmarked against linear TN models and CNN. Finally, we will explore how the explainability features present in TTN can be extended or reinterpreted to hybrid MERA-TTN approaches. This comprehensive analysis seeks to provide insights into the synergies between hierarchical and linear tensor network architectures, paving the way for enhanced multi-scale image classification methodologies.

Tensor Networks (TN) methods have successfully been applied to many different machine learning (ML) tasks. Notably, Tree-Tensor Networks (TTN) have proven efficient and rapid predictors in image classification and object detection, showcasing prowess in devising high-performance algorithms.  However, these linear models exhibit limitations in encoding long-distance correlations prevalent in 2D datasets. On the other hand, Multi-scale Entanglement Renormalization Ansatz (MERA) excels in capturing correlations at different scales but is algorithmically complex to optimize within an ML framework. This study aims to harness the strength of the two architectures for building a hybrid image classification algorithm by studying the application of augmented-TTN classifier and optimizing those with state-of-the-art techniques (i.e. auto-differentiation, Riemannian optimization..). The hybrid model will be tested on standard image classification problems: Fashion-MNIST; CIFAR10; ImageNet; and benchmarked against linear TN models and CNN. Finally, we will explore how the explainability features present in TTN can be extended or reinterpreted to hybrid MERA-TTN approaches. This comprehensive analysis seeks to provide insights into the synergies between hierarchical and linear tensor network architectures, paving the way for enhanced multi-scale image classification methodologies.

Augmented Tensor Network approaches for multi-scale image classification

COLOMBO, MASSIMO
2023/2024

Abstract

Tensor Networks (TN) methods have successfully been applied to many different machine learning (ML) tasks. Notably, Tree-Tensor Networks (TTN) have proven efficient and rapid predictors in image classification and object detection, showcasing prowess in devising high-performance algorithms.  However, these linear models exhibit limitations in encoding long-distance correlations prevalent in 2D datasets. On the other hand, Multi-scale Entanglement Renormalization Ansatz (MERA) excels in capturing correlations at different scales but is algorithmically complex to optimize within an ML framework. This study aims to harness the strength of the two architectures for building a hybrid image classification algorithm by studying the application of augmented-TTN classifier and optimizing those with state-of-the-art techniques (i.e. auto-differentiation, Riemannian optimization..). The hybrid model will be tested on standard image classification problems: Fashion-MNIST; CIFAR10; ImageNet; and benchmarked against linear TN models and CNN. Finally, we will explore how the explainability features present in TTN can be extended or reinterpreted to hybrid MERA-TTN approaches. This comprehensive analysis seeks to provide insights into the synergies between hierarchical and linear tensor network architectures, paving the way for enhanced multi-scale image classification methodologies.
2023
Augmented Tensor Network approaches for multi-scale image classification
Tensor Networks (TN) methods have successfully been applied to many different machine learning (ML) tasks. Notably, Tree-Tensor Networks (TTN) have proven efficient and rapid predictors in image classification and object detection, showcasing prowess in devising high-performance algorithms.  However, these linear models exhibit limitations in encoding long-distance correlations prevalent in 2D datasets. On the other hand, Multi-scale Entanglement Renormalization Ansatz (MERA) excels in capturing correlations at different scales but is algorithmically complex to optimize within an ML framework. This study aims to harness the strength of the two architectures for building a hybrid image classification algorithm by studying the application of augmented-TTN classifier and optimizing those with state-of-the-art techniques (i.e. auto-differentiation, Riemannian optimization..). The hybrid model will be tested on standard image classification problems: Fashion-MNIST; CIFAR10; ImageNet; and benchmarked against linear TN models and CNN. Finally, we will explore how the explainability features present in TTN can be extended or reinterpreted to hybrid MERA-TTN approaches. This comprehensive analysis seeks to provide insights into the synergies between hierarchical and linear tensor network architectures, paving the way for enhanced multi-scale image classification methodologies.
Tensor Network
Machine Learning
MPS
MERA
Image Classification
File in questo prodotto:
File Dimensione Formato  
Colombo_Massimo.pdf

accesso riservato

Dimensione 3.37 MB
Formato Adobe PDF
3.37 MB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/75518