This thesis focuses on the collection, preparation, and structuring of a curated code dataset centered around Csound projects, aimed at enabling future tasks such as code translation and analysis. We detail the process of selecting clean, well-documented repositories, preprocessing codebases to reduce noise, and organizing data for downstream tasks. We designed a system architecture and pipeline to support querying, response handling, and evaluation.

LLMs for Understanding and Preserving Historical Musical Codes : Evaluation and Benchmarking

TAHVILDARI, ALI
2024/2025

Abstract

This thesis focuses on the collection, preparation, and structuring of a curated code dataset centered around Csound projects, aimed at enabling future tasks such as code translation and analysis. We detail the process of selecting clean, well-documented repositories, preprocessing codebases to reduce noise, and organizing data for downstream tasks. We designed a system architecture and pipeline to support querying, response handling, and evaluation.
2024
LLMs for Understanding and Preserving Historical Musical Codes : Evaluation and Benchmarking
Large Language Model
Prompt Engineering
Model Evaluation
File in questo prodotto:
File Dimensione Formato  
Tahvildari_Ali.pdf

Accesso riservato

Dimensione 1.17 MB
Formato Adobe PDF
1.17 MB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/99281