Selecting query terms to build a specialised corpus from a restricted-access database

Gabrielatos, Costas (2007) Selecting query terms to build a specialised corpus from a restricted-access database. ICAME Journal, 31. pp. 5-44.

Item not available from this archive. (Request a copy)

Abstract

This paper proposes an accessible measure of the relevance of additional terms to a given query, describes and comments on the steps leading to its develop-ment, and discusses its utility. The measure, termed relative query term rele-vance (RQTR), draws on techniques used in information retrieval, and can becombined with a technique used in creating corpora from the world wide web,namely keyword analysis. It is independent of reference corpora, and does notrequire knowledge of the number of (relevant) documents in the database. Although it does not make use of user/expert judgements of document relevance,it does allow for subjective decisions. However, subjective decisions are triangu-lated against two objective indicators: keyness and, mainly, RQTR.

Item Type: Article
Uncontrolled Keywords: corpora ; corpus building ; text database ; query expansion ; query term relevance ; keywords
Subjects: P Language and Literature > P Philology. Linguistics
Divisions: English Language & Literature
Date Deposited: 30 Aug 2012 11:44
URI: http://repository.edgehill.ac.uk/id/eprint/4125

Archive staff only

Item control page Item control page