A large synchronous corpus as monitoring corpus: Some comparative content analysis of Chinese and Japanese language developments

Benjamin K. Tsou,Andy C. Chin
DOI: https://doi.org/10.1109/IUCS.2010.5666763
2010-01-01
Abstract:Appropriate and large corpora are uncommon but they can provide important resources for wide ranging efforts in natural language processing, ranging from contextualized or localized speech and text input to automatic patent translation. They also provide lesser known rich resources for human and automatic content analysis such as sentiment analysis of texts and product reviews. Furthermore they can function as a monitoring corpus and enhance the human centered communication environment by allowing more substantive introspection and comparison of content rather than the linguistic form in communication.
What problem does this paper attempt to address?