Abstract: The scale and quality of a knowledge base determine the success or failure of a natural language processing system. Over 18 years of diligent work, the Institute of Computational Linguistics at Peking University has accumulated a series of language data resources of good quality and considerable scale: the grammatical knowledge-base of contemporary Chinese, the large-scale POS-tagged corpus of contemporary Chinese, the Semantics Knowledge-base of Contemporary Chinese (SKCC), the Chinese Concept Dictionary (CCD), a bilingual parallel corpus aligned at different unit levels, term banks for different disciplines, the phrase structure knowledge-base of contemporary Chinese, and a corpus of ancient Chinese poems. The present research will integrate these language data resources into one unified, comprehensive language knowledge-base. In incorporating these different resources, the gaps between them must be filled. The planned comprehensive language knowledge-base will provide not only a user-friendly interface and a convenient application program interface but also various software tools supporting knowledge mining. The research thus helps the existing language data resources develop steadily from primary products into deeply processed products. It will also set up diversified mechanisms for knowledge dissemination and information service, offering comprehensive, multi-level support to language information processing, traditional linguistics research, and language teaching.

The Construction and Utilization of A Comprehensive Language Knowledge-base

The Comprehensive Language Knowledge Base and Its Prospect

Comprehensive Language Knowledge Base and Its Applications in Language Teaching

Building a situation-based language knowledge base

The Construction of Language Resource and Knowledge Base for Chinese Language Computing

The Rationale of Building the Comprehensive Language Knowledge-base and the Significance of Its Achievements

Development and Evaluation of Task-Specific NLP Framework in China.

Research on Chinese Lexical Semantic and the construction of Lexical Knowledge Base

The Chinese-English Contrastive Language Knowledge Base And Its Applications

Chinese idiom knowledge base for chinese information processing

New Progress of the Grammatical Knowledge-base of Contemporary Chinese

Building a Large-scale Chinese Event Knowledge Base

Construction of Multilingual Terminology Bank of Computational Linguistics.

Towards automatic construction of knowledge bases from Chinese online resources

On Theoretical Issues in Building A Knowledge Database of Chinese Constructions

Semantic Computing and Language Knowledge Bases

Bridge Knowledge and Languages: the Application of Computational Linguistics

A Brief Introduction to Computational Linguistics

Construction of Chinese Idiom Knowledge-base and Its Applications.

Study on the Construction of CCD

Development of Translation Database based on Chinese-English parallel corpora