This thesis focuses on the collection, preparation, and structuring of a curated code dataset centered around Csound projects, aimed at enabling future tasks such as code translation and analysis. We detail the process of selecting clean, well-documented repositories, preprocessing codebases to reduce noise, and organizing data for downstream tasks. We designed a system architecture and pipeline to support querying, response handling, and evaluation.
LLMs for Understanding and Preserving Historical Musical Codes : Evaluation and Benchmarking
TAHVILDARI, ALI
2024/2025
Abstract
This thesis focuses on the collection, preparation, and structuring of a curated code dataset centered around Csound projects, aimed at enabling future tasks such as code translation and analysis. We detail the process of selecting clean, well-documented repositories, preprocessing codebases to reduce noise, and organizing data for downstream tasks. We designed a system architecture and pipeline to support querying, response handling, and evaluation.File in questo prodotto:
| File | Dimensione | Formato | |
|---|---|---|---|
|
Tahvildari_Ali.pdf
Accesso riservato
Dimensione
1.17 MB
Formato
Adobe PDF
|
1.17 MB | Adobe PDF |
The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License
Utilizza questo identificativo per citare o creare un link a questo documento:
https://hdl.handle.net/20.500.12608/99281