Corpus-driven Bantu Lexicography Part 1: Organic Corpus Building for Lusoga

Gilles-Maurice de Schryver, Minah Nabirye


This article is the first in a trilogy that deals with corpus-driven Bantu lexicography, which is illustrated for Lusoga. The focus here is on the building of a so-called 'organic corpus' from scratch, while the next two instalments will deal with the use of that corpus on the macrostructural and microstructural levels, respectively. Not many detailed descriptions of corpus-building efforts exist for Bantu languages, so each and every step is discussed in detail, paying particular attention to the parameters that have to be taken into account, while not losing sight of the need to log the metadata either.


Bantu; Lusoga; corpus building; organic corpus; oral; written; source; period; genre; topic; metadata

Full Text:




  • There are currently no refbacks.

ISSN 2224-0039 (online); ISSN 1684-4904 (print)

Creative Commons License CC BY 4.0

Powered by OJS and hosted by Stellenbosch University Library and Information Service since 2011.


This journal is hosted by the SU LIS on request of the journal owner/editor. The SU LIS takes no responsibility for the content published within this journal, and disclaim all liability arising out of the use of or inability to use the information contained herein. We assume no responsibility, and shall not be liable for any breaches of agreement with other publishers/hosts.

SUNJournals Help