Extracting Arguments from Korean Question and Command: An Annotated Corpus for Structured Paraphrasing

Won Ik Cho,Young Ki Moon,Woo Hyun Kang,Nam Soo Kim
DOI: https://doi.org/10.48550/arXiv.1810.04631
2019-07-09
Abstract:Intention identification is a core issue in dialog management. However, due to the non-canonicality of the spoken language, it is difficult to extract the content automatically from the conversation-style utterances. This is much more challenging for languages like Korean and Japanese since the agglutination between morphemes make it difficult for the machines to parse the sentence and understand the intention. To suggest a guideline for this problem, and to merge the issue flexibly with the neural paraphrasing systems introduced recently, we propose a structured annotation scheme for Korean question/commands and the resulting corpus which are widely applicable to the field of argument mining. The scheme and dataset are expected to help machines understand the intention of natural language and grasp the core meaning of conversation-style instructions.
Computation and Language
What problem does this paper attempt to address?