Morphology-based investigation of differences between spoken and written isiZulu

Laurette Marais,Ilana Wilken,,
DOI: https://doi.org/10.55492/dhasa.v3i01.3860
2021-01-01
Journal of the Digital Humanities Association of Southern Africa (DHASA)
Abstract:Research attempting to describe and quantify the differences between spoken and written language has been done for languages such as English, but not for isiZulu. In this paper, we present a quantitative investigation into such differences by considering the morphology of tokens in a transcribed spoken isiZulu corpus and a written isiZulu corpus. We use morpheme tags as a proxy for features that typically differ between spoken and written language, and calculate relative differences of the occurrence of specific morpheme tags from analyses produced by ZulMorph, a finite-state morphological analyser for isiZulu. This analysis presents information that could inform the development of voice-enabled computer applications for isiZulu.
What problem does this paper attempt to address?