Developing Asian language corpora: standards and practice

Xiao, R., McEnery, T., Baker, P. and Hardie, A. (2004) Developing Asian language corpora: standards and practice. 4th Workshop on Asian Language Resources, 25 March, Sanya, Hainan.

Item not available from this archive.


This paper first discusses standards for developing Asian language corpora so as to facilitate international data exchange. Following this, we present two corpora of Asian languages developed at Lancaster University - the EMILLE Corpus, which contains 14 South Asian languages, and the Lancaster Corpus of Mandarin Chinese. Finally, we will demonstrate how to explore these corpora using Xara and other corpus tools.

Item Type: Conference or Workshop Item (Paper)
Subjects: P Language and Literature > P Philology. Linguistics
P Language and Literature > PE English
P Language and Literature > PI Oriental languages and literatures
Divisions: English Language & Literature
Date Deposited: 23 Dec 2010 12:37

Archive staff only

Item control page Item control page