In recent years, the advent of single-cell RNA sequencing has revolutionized the transcriptomics research field providing a deeper understanding of cells’ diversity and dynamics in complex tissues and organisms. One of the most critical steps in scRNAseq data analysis workflow is the annotation of cell types, which involves the categorization of cells within datasets into distinct classes, based on some known phenotypes or molecular signatures. The accurate definition of cells’ identities is a complex task: if conducted manually, it can be extremely laborious, time-consuming and prone to errors since it relies on partially subjective decision-making. Moreover, the growing high-dimensionality of data with thousands of cells per experiment, poses new challenges in developing specialized bioinformatics tools. For these reasons, many automated supervised or semi-supervised classification approaches have been adapted to ensure reproducible and consistent results. However, despite the diversity in their basic functioning algorithms, there is a lack of extensive and comprehensive comparisons of such methods, which further complicates the selection of the most appropriate tool. The ultimate goal of this project is to provide a benchmark of some of the existing approaches for cell labeling, in order to systematically compare and evaluate their overall performance across publicly available scRNAseq datasets.

In recent years, the advent of single-cell RNA sequencing has revolutionized the transcriptomics research field providing a deeper understanding of cells’ diversity and dynamics in complex tissues and organisms. One of the most critical steps in scRNAseq data analysis workflow is the annotation of cell types, which involves the categorization of cells within datasets into distinct classes, based on some known phenotypes or molecular signatures. The accurate definition of cells’ identities is a complex task: if conducted manually, it can be extremely laborious, time-consuming and prone to errors since it relies on partially subjective decision-making. Moreover, the growing high-dimensionality of data with thousands of cells per experiment, poses new challenges in developing specialized bioinformatics tools. For these reasons, many automated supervised or semi-supervised classification approaches have been adapted to ensure reproducible and consistent results. However, despite the diversity in their basic functioning algorithms, there is a lack of extensive and comprehensive comparisons of such methods, which further complicates the selection of the most appropriate tool. The ultimate goal of this project is to provide a benchmark of some of the existing approaches for cell labeling, in order to systematically compare and evaluate their overall performance across publicly available scRNAseq datasets.

Benchmark of bioinformatics tools for cell type classification in single-cell RNA sequencing data

CARLONE, ANNAMARIA
2023/2024

Abstract

In recent years, the advent of single-cell RNA sequencing has revolutionized the transcriptomics research field providing a deeper understanding of cells’ diversity and dynamics in complex tissues and organisms. One of the most critical steps in scRNAseq data analysis workflow is the annotation of cell types, which involves the categorization of cells within datasets into distinct classes, based on some known phenotypes or molecular signatures. The accurate definition of cells’ identities is a complex task: if conducted manually, it can be extremely laborious, time-consuming and prone to errors since it relies on partially subjective decision-making. Moreover, the growing high-dimensionality of data with thousands of cells per experiment, poses new challenges in developing specialized bioinformatics tools. For these reasons, many automated supervised or semi-supervised classification approaches have been adapted to ensure reproducible and consistent results. However, despite the diversity in their basic functioning algorithms, there is a lack of extensive and comprehensive comparisons of such methods, which further complicates the selection of the most appropriate tool. The ultimate goal of this project is to provide a benchmark of some of the existing approaches for cell labeling, in order to systematically compare and evaluate their overall performance across publicly available scRNAseq datasets.
2023
Benchmark of bioinformatics tools for cell type classification in single-cell RNA sequencing data
In recent years, the advent of single-cell RNA sequencing has revolutionized the transcriptomics research field providing a deeper understanding of cells’ diversity and dynamics in complex tissues and organisms. One of the most critical steps in scRNAseq data analysis workflow is the annotation of cell types, which involves the categorization of cells within datasets into distinct classes, based on some known phenotypes or molecular signatures. The accurate definition of cells’ identities is a complex task: if conducted manually, it can be extremely laborious, time-consuming and prone to errors since it relies on partially subjective decision-making. Moreover, the growing high-dimensionality of data with thousands of cells per experiment, poses new challenges in developing specialized bioinformatics tools. For these reasons, many automated supervised or semi-supervised classification approaches have been adapted to ensure reproducible and consistent results. However, despite the diversity in their basic functioning algorithms, there is a lack of extensive and comprehensive comparisons of such methods, which further complicates the selection of the most appropriate tool. The ultimate goal of this project is to provide a benchmark of some of the existing approaches for cell labeling, in order to systematically compare and evaluate their overall performance across publicly available scRNAseq datasets.
Transcriptomics
Cell type
Machine Learning
scRNA-seq
File in questo prodotto:
File Dimensione Formato  
Carlone_Annamaria.pdf

accesso riservato

Dimensione 12.83 MB
Formato Adobe PDF
12.83 MB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/64086