This thesis focuses on the development of a chatbot designed to facilitate interaction with a BI platform using natural language commands. The primary objective was to create a system capable of interpreting user requests and translating them into database operations, thereby simplifying data access and management. The project utilized the LangChain library in Python to build an agent capable of executing queries and commands on a database. An incremental development approach was adopted, starting with a foundational model and progressively enhancing it with advanced functionalities to align with the structure of Statwolf's status queries. The chatbot was developed during an internship at Statwolf, specifically tailored to interact with their datasources. It was designed to transform natural language queries into a structured dictionary-like format compatible with Statwolf's data query system. Additional functionalities were incorporated to support the creation of new metrics and dimensions through natural language commands, which were then translated into ClickHouse query definitions.
This thesis focuses on the development of a chatbot designed to facilitate interaction with a BI platform using natural language commands. The primary objective was to create a system capable of interpreting user requests and translating them into database operations, thereby simplifying data access and management. The project utilized the LangChain library in Python to build an agent capable of executing queries and commands on a database. An incremental development approach was adopted, starting with a foundational model and progressively enhancing it with advanced functionalities to align with the structure of Statwolf's status queries. The chatbot was developed during an internship at Statwolf, specifically tailored to interact with their datasources. It was designed to transform natural language queries into a structured dictionary-like format compatible with Statwolf's data query system. Additional functionalities were incorporated to support the creation of new metrics and dimensions through natural language commands, which were then translated into ClickHouse query definitions.
Natural Language Interaction for a BI Platform and Data Integration:Task-Oriented, Retrieval-Augmented Agent Using LLMs
FRANCARIO, FELICE
2024/2025
Abstract
This thesis focuses on the development of a chatbot designed to facilitate interaction with a BI platform using natural language commands. The primary objective was to create a system capable of interpreting user requests and translating them into database operations, thereby simplifying data access and management. The project utilized the LangChain library in Python to build an agent capable of executing queries and commands on a database. An incremental development approach was adopted, starting with a foundational model and progressively enhancing it with advanced functionalities to align with the structure of Statwolf's status queries. The chatbot was developed during an internship at Statwolf, specifically tailored to interact with their datasources. It was designed to transform natural language queries into a structured dictionary-like format compatible with Statwolf's data query system. Additional functionalities were incorporated to support the creation of new metrics and dimensions through natural language commands, which were then translated into ClickHouse query definitions.File | Dimensione | Formato | |
---|---|---|---|
FINAL_Data_Science_MsC_Thesis____Felice.pdf
accesso riservato
Dimensione
868.46 kB
Formato
Adobe PDF
|
868.46 kB | Adobe PDF |
The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License
https://hdl.handle.net/20.500.12608/81803