Custom Taxonomy Text Classification for Enriched Granularity

ORTIZ BENITEZ, CARMEN ROCIO
2022/2023

Abstract

Taxonomic text classification is a branch of Natural Language Processing (NLP) that organizes textual data into a structured schema. Its applications span document management, content recommendation systems, and market research. By integrating custom sub-taxonomies into existing taxonomies, this research enables more granular categorization tailored to specific use cases. The project combines several topic modeling techniques with a semi-supervised approach to improve flexibility and performance, supporting feature extraction and a more nuanced understanding of textual information. Performance is assessed on both English and Italian text, broadening the scope of the experiments. Through experimentation and comparative analysis, the study identifies the strengths and weaknesses of the current model and provides insights for future investigations.
2022
Keywords: Text classification, NLP, Automated pipeline
Files in this item:
File: dissertation.pdf (restricted access)
Size: 4.34 MB
Format: Adobe PDF

The text of this website © Università degli studi di Padova. Full texts are published under a non-exclusive license. Metadata are under a CC0 license.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.12608/52275