Adding typology to lexicostatistics: A combined approach to language classification

Bakker, Dik, Muller, Andre, Velupillai, Viveka, Wichmann, Soren, Brown, Cecil H, Brown, Pamela, Egorov, Dmitry, Mailhammer, Robert, Grant, Anthony and Holman, Eric W (2009) Adding typology to lexicostatistics: A combined approach to language classification. Linguistic Typology, 13 (1). pp. 169-181. ISSN 1430-0532 DOI

Item not available from this archive. (Request a copy)


The ASJP project aims at establishing relationships between languages on the basis of the Swadesh word list. For this purpose, lists have been collected and phonologically transcribed for almost 3,500 languages. Using a method based on the algorithm proposed by Levenshtein (Cybernetics and Control Theory 10: 707–710, 1966), a custom-made computer program calculates the distances between all pairs of languages in the database. Standard software is used to express the relationships between languages graphically. The current article compares the results of our lexicon-based approach with the results of a similar exercise that takes the typological variables contained in the WALS database as a point of departure. We establish that the latter approach leads to even better results than the lexicon-based one. The best result in terms of correspondence with some well-established genetic and areal classifications, however, is attained when the lexical and typological methods are combined, especially if we select both the most stable Swadesh items and the most stable WALS variables.

Item Type: Article
Subjects: P Language and Literature > P Philology. Linguistics
Divisions: English Language & Literature
Date Deposited: 05 Sep 2012 10:37

Archive staff only

Item control page Item control page