Taxonomic text classification is a branch of Natural Language Processing that focuses on organizing textual data into a structured schema. Its applications span various domains, including document management, content recommendation systems, and market research. By integrating custom sub-taxonomies into the existing taxonomies, the research enables a more granular categorization tailored to specific use cases. Leveraging diverse topic modeling techniques and a semi-supervised approach, the project enhances flexibility and performance, facilitating a nuanced understanding of textual information and feature extraction. Performance assessment covers both English and Italian languages, broadening the scope of the experiments. Through rigorous experimentation and comparative analysis, this study identifies the strengths and weaknesses of the current model, providing valuable insights for future investigations.
Custom Taxonomy Text Classification for Enriched Granularity
ORTIZ BENITEZ, CARMEN ROCIO
2022/2023
Abstract
Taxonomic text classification is a branch of Natural Language Processing that focuses on organizing textual data into a structured schema. Its applications span various domains, including document management, content recommendation systems, and market research. By integrating custom sub-taxonomies into the existing taxonomies, the research enables a more granular categorization tailored to specific use cases. Leveraging diverse topic modeling techniques and a semi-supervised approach, the project enhances flexibility and performance, facilitating a nuanced understanding of textual information and feature extraction. Performance assessment covers both English and Italian languages, broadening the scope of the experiments. Through rigorous experimentation and comparative analysis, this study identifies the strengths and weaknesses of the current model, providing valuable insights for future investigations.File | Dimensione | Formato | |
---|---|---|---|
dissertation.pdf
accesso riservato
Dimensione
4.34 MB
Formato
Adobe PDF
|
4.34 MB | Adobe PDF |
The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License
https://hdl.handle.net/20.500.12608/52275