Abstract
In this document we describe the scheme used for labelling the open-source Ukwabelana Zulu corpus as well as the rules employed for the Part-of-speech (POS) tagger used to assign POS to morphologically analysed words. A detailed description of the Zulu morphology, the corpus itself and its generation is given in Spiegler et al. (2010). All resources can be downloaded from http://www.cs.bris.ac.uk/Research/MachineLearning/Morphology/Resources/.
Translated title of the contribution | Additional material for the Ukwabelana Zulu corpus |
---|---|
Original language | English |
Publisher | University of Bristol |
Publication status | Published - 2010 |