Retrieval-Augmented Generation: Strengthening Answer Confidence through Source Referencing

CECCATO, ANDREA
2023/2024

Abstract

This thesis presents the design of a Retrieval-Augmented Generation (RAG) system and proposes an evaluation strategy to assess its performance on open-domain Question Answering (QA). The study explores how different settings, including prompt design, deduplication strategies, generator model temperature, and context document order, affect the quality of the generated responses. Performance is measured with metrics that assess answer correctness, fluency, and citation quality. The experiments, conducted as part of the TREC 2024 RAG track, reveal significant trade-offs among these factors: longer prompts improved fluency and correctness but hurt citation quality, while deduplication often discarded useful context and thereby diminished overall answer quality. Changes in generator temperature and in the order of the context documents had minimal impact on the results. The proposed evaluation strategy provides a structured approach to assessing the system’s performance and enables consistent comparison across settings. The work also discusses limitations, such as the evaluation method’s inability to penalize unnecessary citations and the computational inefficiency of deduplication. Future research should focus on alternative evaluation metrics, efficient retrieval systems, and improved citation strategies.
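To make the studied settings concrete, the sketch below shows how a single RAG answering step can expose prompt verbosity, deduplication, context-document order, and generator temperature as explicit parameters. It is a minimal, illustrative Python sketch only, not the system built in this thesis: the names (`Document`, `dedupe`, `build_prompt`, `answer`, `echo_generator`) and the stub generator are assumptions introduced here for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Document:
    doc_id: str
    text: str


def dedupe(docs: List[Document]) -> List[Document]:
    """Drop passages whose text already appeared earlier in the ranking.

    A deliberately naive exact-match strategy; the thesis notes that
    deduplication can also remove context that was actually useful.
    """
    seen, kept = set(), []
    for doc in docs:
        key = doc.text.strip().lower()
        if key not in seen:
            seen.add(key)
            kept.append(doc)
    return kept


def build_prompt(question: str, docs: List[Document], verbose: bool = False) -> str:
    """Assemble the generator prompt; `verbose` toggles a longer instruction,
    mirroring a short-vs-long prompt setting."""
    instruction = (
        "Answer the question using only the numbered context passages. "
        "Cite every claim with the passage number in square brackets, e.g. [1]."
    )
    if verbose:
        instruction += (
            " Write a fluent, well-structured answer of a few sentences, "
            "avoid repeating the question, and do not cite passages you did not use."
        )
    context = "\n".join(f"[{i + 1}] ({d.doc_id}) {d.text}" for i, d in enumerate(docs))
    return f"{instruction}\n\nContext:\n{context}\n\nQuestion: {question}\nAnswer:"


def answer(
    question: str,
    ranked_docs: List[Document],
    generate: Callable[[str, float], str],
    *,
    use_dedupe: bool = True,
    reverse_context: bool = False,
    temperature: float = 0.0,
    verbose_prompt: bool = False,
) -> str:
    """Run one configuration: optional deduplication, optional reversal of the
    retrieved-document order, a prompt variant, and a generator temperature."""
    docs = dedupe(ranked_docs) if use_dedupe else list(ranked_docs)
    if reverse_context:
        docs = list(reversed(docs))
    prompt = build_prompt(question, docs, verbose=verbose_prompt)
    return generate(prompt, temperature)


if __name__ == "__main__":
    # Stub generator so the sketch runs without any model or API access.
    def echo_generator(prompt: str, temperature: float) -> str:
        return f"(temperature={temperature}) would answer from a prompt of {len(prompt)} chars"

    docs = [
        Document("d1", "Padua is a city in northern Italy."),
        Document("d2", "Padua is a city in northern Italy."),  # duplicate passage
        Document("d3", "The University of Padua was founded in 1222."),
    ]
    print(answer("When was the University of Padua founded?", docs, echo_generator,
                 verbose_prompt=True, temperature=0.2))
```

In this framing, each experimental configuration would correspond to one combination of these parameters (for example, a verbose prompt with deduplication disabled), which is what makes the settings directly comparable under the same correctness, fluency, and citation-quality metrics.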
Keywords: RAG, Retrieval, Generation, Citations
File: Ceccato_Andrea.pdf (restricted access), 2.28 MB, Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.12608/75155