Nowadays most people use keyword search to retrieve information and surf the web wherever they are. Ease of use given, this technology has been introduced, in the last decade, also for searching structured data such as RDF graphs, which non-expert users accustomed to web search cannot easily get access to. Indeed, users would need to use tools like SPARQL that require a knowledge of the structure of the data it is being searched. In order to apply keyword search over structured data, virtual documents approaches and text retrieval technologies have been proposed. In particular, in this work, we refer to a previous study on the topological semantical aggregator and virtual document pruning over RDF datasets: these two technologies produce state-of-the-art results in terms of efficiency and effectiveness and ensure scalability over millions of triples. This work expands the previous study on RDF keyword search systems modifying and developing new systems for aggregating documents during the offline phase. The additional rankings produced by these systems are then merged following a round-robin rule.

Nowadays most people use keyword search to retrieve information and surf the web wherever they are. Ease of use given, this technology has been introduced, in the last decade, also for searching structured data such as RDF graphs, which non-expert users accustomed to web search cannot easily get access to. Indeed, users would need to use tools like SPARQL that require a knowledge of the structure of the data it is being searched. In order to apply keyword search over structured data, virtual documents approaches and text retrieval technologies have been proposed. In particular, in this work, we refer to a previous study on the topological semantical aggregator and virtual document pruning over RDF datasets: these two technologies produce state-of-the-art results in terms of efficiency and effectiveness and ensure scalability over millions of triples. This work expands the previous study on RDF keyword search systems modifying and developing new systems for aggregating documents during the offline phase. The additional rankings produced by these systems are then merged following a round-robin rule.

A ranking fusion approach for RDF keyword-based search systems

MAINO, NICOLA
2022/2023

Abstract

Nowadays most people use keyword search to retrieve information and surf the web wherever they are. Ease of use given, this technology has been introduced, in the last decade, also for searching structured data such as RDF graphs, which non-expert users accustomed to web search cannot easily get access to. Indeed, users would need to use tools like SPARQL that require a knowledge of the structure of the data it is being searched. In order to apply keyword search over structured data, virtual documents approaches and text retrieval technologies have been proposed. In particular, in this work, we refer to a previous study on the topological semantical aggregator and virtual document pruning over RDF datasets: these two technologies produce state-of-the-art results in terms of efficiency and effectiveness and ensure scalability over millions of triples. This work expands the previous study on RDF keyword search systems modifying and developing new systems for aggregating documents during the offline phase. The additional rankings produced by these systems are then merged following a round-robin rule.
2022
A ranking fusion approach for RDF keyword-based search systems
Nowadays most people use keyword search to retrieve information and surf the web wherever they are. Ease of use given, this technology has been introduced, in the last decade, also for searching structured data such as RDF graphs, which non-expert users accustomed to web search cannot easily get access to. Indeed, users would need to use tools like SPARQL that require a knowledge of the structure of the data it is being searched. In order to apply keyword search over structured data, virtual documents approaches and text retrieval technologies have been proposed. In particular, in this work, we refer to a previous study on the topological semantical aggregator and virtual document pruning over RDF datasets: these two technologies produce state-of-the-art results in terms of efficiency and effectiveness and ensure scalability over millions of triples. This work expands the previous study on RDF keyword search systems modifying and developing new systems for aggregating documents during the offline phase. The additional rankings produced by these systems are then merged following a round-robin rule.
RDF
keyword search
ranking fusion
File in questo prodotto:
File Dimensione Formato  
Maino_Nicola.pdf

accesso riservato

Dimensione 458.71 kB
Formato Adobe PDF
458.71 kB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/43343