Corpus-driven Bantu Lexicography Part 2: Lemmatisation and Rulers for Lusoga

  • Gilles-Maurice de Schryver BantUGent, Department of Languages and Cultures, Ghent University, Ghent, Belgium; and Department of African Languages, University of Pretoria, Pretoria, South Africa
  • Minah Nabirye BantUGent, Department of Languages and Cultures, Ghent University, Ghent, Belgium; and Department of Teacher Education and Development Studies, Kyambogo University, Kampala, Uganda
Keywords: Bantu, Lusoga, corpus lexicography, lemmatisation, lemmatised frequency list, part-of-speech ruler, alphabetical ruler, multidimensional lexicographic ruler, dictionary planning, dictionary-writing system, TLex, TshwaneLex

Abstract

This article is the second in a trilogy that deals with corpus-driven Bantu lexicography, which is illustrated for Lusoga. The focus here is on the macrostructure and in particular on the building of a lemmatised frequency list directly within a dictionary-writing system. The programming code for the parts of the lemmatisation that may be automated is included as addenda. A second focus is on the embedded part-of-speech and alphabetical rulers, for which it is shown how these may be used to plan the actual compilation of the dictionary entries.
Published
2018-12-17
How to Cite
de Schryver, G.-M., & Nabirye, M. (2018). Corpus-driven Bantu Lexicography Part 2: Lemmatisation and Rulers for Lusoga. Lexikos, 28(1). https://doi.org/10.5788/28-1-1458
Section
Artikels/Articles