Additional material for the Ukwabelana Zulu corpus

Sebastian Spiegler, Andrew van der Spuy, Peter A. Flach

Research output: Working paper

Abstract

In this document we describe the scheme used for labelling the open-source Ukwabelana Zulu corpus, as well as the rules employed by the part-of-speech (POS) tagger that assigns POS tags to morphologically analysed words. A detailed description of Zulu morphology, the corpus itself and its generation is given in Spiegler et al. (2010). All resources can be downloaded from http://www.cs.bris.ac.uk/Research/MachineLearning/Morphology/Resources/.
Translated title of the contribution: Additional material for the Ukwabelana Zulu corpus
Original language: English
Publisher: University of Bristol
Publication status: Published - 2010

Bibliographical note

Other identifier: 2001225
