Knowledge-Based Approaches to the Segmentation of Oral History Interviews

Pengyi Zhang
2006-01-01
Abstract:This paper applies discourse knowledge to the segmentation of speech transcripts. The paper reviews literature on discourse structure, as well as approaches used in text segmentation and speech segmentation, identifies what features are used and how the features are combined in these approaches. After reviewing the literature, a three-part study is conducted to answer the following three research questions: • Are discourse-markers indicators of segment boundaries in oral history interviews? • Are questions good indicators of segment boundaries? Could questions be used as segment boundary or segment continuation indicators? • Do the discourse structures proposed by Labov and Waletzky (1967, 1997) and Stein and Glenn (1979) hold for oral history interviews? How could this knowledge be used in automatic segmentation? Methodology, results and analysis of each part of the study are described. Major findings include trends in segmentation and answers to these questions. Limitation of the study is discussed. The paper also suggests future research topic relates to segmentation and discourse analysis. Pengyi Zhang Segmentation of Oral History Interviews 3
What problem does this paper attempt to address?