The Sheffield and Basque Country Universities Entry to CHiC: Using Random Walks and Similarity to Access Cultural Heritage

Agirre, Eneko, Clough, Paul, Fernando, Samuel, Hall, Mark, Otegi, Arantxa and Stevenson, Mark (2012) The Sheffield and Basque Country Universities Entry to CHiC: Using Random Walks and Similarity to Access Cultural Heritage. CLEF 2012 Evaluation Labs and Workshop, 17-20 September 2012, Rome.

[img]
Preview
PDF
agirreetal2012.pdf

Download (133kB)

Abstract

The Cultural Heritage in CLEF 2012 (CHiC) pilot evaluation included these tasks: ad-hoc retrieval, semantic enrichment and variability tasks. At CHiC 2012, the University of She�eld and the University of the Basque Country submitted a joint entry, attempting the three English monolingual tasks. For the ad-hoc task, the baseline approach used the Indri Search engine. Query expansion approaches used random walks using Personalised Page Rank over graphs constructed from Wikipedia and WordNet, and also by �nding similar articles within Wikipedia. For the semantic enrichment task, random walks using Personalised Page Rank were again used. Additionally links to Wikipedia were added and further approaches used this information to �nd enrichment terms. Finally for the variability task, TF-IDF scores were calculated from text and meta-data �elds. The �final results were selected using MMR (Maximal Marginal Relevance) and cosine similarity.

Item Type: Conference or Workshop Item (Other)
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Divisions: Computing and Information Systems
Related URLs:
Date Deposited: 06 Nov 2013 14:11
URI: http://repository.edgehill.ac.uk/id/eprint/5713

Archive staff only

Item control page Item control page