The Comprehensive Language Knowledge Base and Its Prospect

YU Shiwen,SUI Zhifang,ZHU Xuefeng
DOI: https://doi.org/10.3969/j.issn.1003-0077.2011.06.002
2011-01-01
Abstract:Since 1986,the Institute of Computational Linguistics at Peking University has been working on the Comprehensive Language Knowledge Base(CLKB),which consists of 6 language knowledge bases,10 specifications and standards,4 application systems and a software tool kit.These components provide support for each other and integrate into CLKB to describe linguistic knowledge on morphological,syntactic and semantic levels.The language data that have been collected include words,phrases,sentences and discourse in Chinese and many other languages,which occur in specific fields as well as the general domain.After 25 years of development,significant progress in CLKB has been made,and it is still growing.This paper gives an introduction to CLKB and explores its potential in the future.
What problem does this paper attempt to address?