This thesis explores the application of archetypal analysis (AA) as a tool to analyze single-cell RNA sequencing (scRNAseq) dataset, with the objective of assessing its validity and effec- tiveness in capturing significant biological patterns. The study tests archetypal analysis on two scRNAseq datasets to evaluate its performance. The introductory section sets the biological context, discussing the fundamental principles of cellular biology, scRNAseq sequencing and analysis. Subsequently, a comprehensive introduction to archetypal analysis is provided, detail- ing its mathematical formulation and various computational approaches, including convex least squares, penalized nonnegative least squares, and projected gradient methods. The FurthestSum initialization method is also discussed, a technique used to improve the accuracy and computa- tion times of AA. Then the available software implementations for AA are introduced, with a particular focus on the archetypal package in the R programming language, which is used in this thesis. After having described the dataset used for testing, in the result section are presented the conclusions obtained from the AA alongside an overview on the computational burden of the method. The result obtained demonstrates that AA can capture significant biological trends and patterns in both tested datasets, suggesting its potential as a valuable tool in the analysis of scRNAseq data.

This thesis explores the application of archetypal analysis (AA) as a tool to analyze single-cell RNA sequencing (scRNAseq) dataset, with the objective of assessing its validity and effec- tiveness in capturing significant biological patterns. The study tests archetypal analysis on two scRNAseq datasets to evaluate its performance. The introductory section sets the biological context, discussing the fundamental principles of cellular biology, scRNAseq sequencing and analysis. Subsequently, a comprehensive introduction to archetypal analysis is provided, detail- ing its mathematical formulation and various computational approaches, including convex least squares, penalized nonnegative least squares, and projected gradient methods. The FurthestSum initialization method is also discussed, a technique used to improve the accuracy and computa- tion times of AA. Then the available software implementations for AA are introduced, with a particular focus on the archetypal package in the R programming language, which is used in this thesis. After having described the dataset used for testing, in the result section are presented the conclusions obtained from the AA alongside an overview on the computational burden of the method. The result obtained demonstrates that AA can capture significant biological trends and patterns in both tested datasets, suggesting its potential as a valuable tool in the analysis of scRNAseq data.

Archetypal analysis for single-cell RNA sequencing data

ANDRIOLO, MATTEO
2023/2024

Abstract

This thesis explores the application of archetypal analysis (AA) as a tool to analyze single-cell RNA sequencing (scRNAseq) dataset, with the objective of assessing its validity and effec- tiveness in capturing significant biological patterns. The study tests archetypal analysis on two scRNAseq datasets to evaluate its performance. The introductory section sets the biological context, discussing the fundamental principles of cellular biology, scRNAseq sequencing and analysis. Subsequently, a comprehensive introduction to archetypal analysis is provided, detail- ing its mathematical formulation and various computational approaches, including convex least squares, penalized nonnegative least squares, and projected gradient methods. The FurthestSum initialization method is also discussed, a technique used to improve the accuracy and computa- tion times of AA. Then the available software implementations for AA are introduced, with a particular focus on the archetypal package in the R programming language, which is used in this thesis. After having described the dataset used for testing, in the result section are presented the conclusions obtained from the AA alongside an overview on the computational burden of the method. The result obtained demonstrates that AA can capture significant biological trends and patterns in both tested datasets, suggesting its potential as a valuable tool in the analysis of scRNAseq data.
2023
Archetypal analysis for single-cell RNA sequencing data
This thesis explores the application of archetypal analysis (AA) as a tool to analyze single-cell RNA sequencing (scRNAseq) dataset, with the objective of assessing its validity and effec- tiveness in capturing significant biological patterns. The study tests archetypal analysis on two scRNAseq datasets to evaluate its performance. The introductory section sets the biological context, discussing the fundamental principles of cellular biology, scRNAseq sequencing and analysis. Subsequently, a comprehensive introduction to archetypal analysis is provided, detail- ing its mathematical formulation and various computational approaches, including convex least squares, penalized nonnegative least squares, and projected gradient methods. The FurthestSum initialization method is also discussed, a technique used to improve the accuracy and computa- tion times of AA. Then the available software implementations for AA are introduced, with a particular focus on the archetypal package in the R programming language, which is used in this thesis. After having described the dataset used for testing, in the result section are presented the conclusions obtained from the AA alongside an overview on the computational burden of the method. The result obtained demonstrates that AA can capture significant biological trends and patterns in both tested datasets, suggesting its potential as a valuable tool in the analysis of scRNAseq data.
archetypal analysis
single-cell RNA-seq
data analysis
bioinformatics
File in questo prodotto:
File Dimensione Formato  
Andriolo_Matteo.pdf

accesso riservato

Dimensione 6.61 MB
Formato Adobe PDF
6.61 MB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/69261