Abstract: Purpose: Terminology is the set of technical words or expressions used in specific contexts; it denotes the core concepts of a formal discipline and is widely used in machine translation, information retrieval, information extraction, and text categorization. Bilingual terminology extraction plays an important role in applications such as bilingual dictionary compilation, bilingual ontology construction, machine translation, and cross-language information retrieval. This paper addresses monolingual terminology extraction and bilingual term alignment based on multi-level termhood. Design/methodology/approach: A method based on multi-level termhood is proposed. The method computes the termhood of each terminology candidate, as well as of the sentence that contains it, by corpus comparison. Since terminologies and general words usually have different distributions in a corpus, termhood can also be used as a constraint to improve term alignment when aligning bilingual terms on a parallel corpus. In this paper, bilingual term alignment based on termhood constraints is presented. Findings: Experimental results show that multi-level termhood performs better than existing methods for terminology extraction. When termhood is used as a constraining factor, the performance of bilingual term alignment is also improved.
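The idea of scoring termhood by corpus comparison, at both the word and the sentence level, can be illustrated with a minimal sketch. This is not the formula used in the paper, which the abstract does not give. It assumes a simple smoothed log-ratio of relative frequencies between a domain corpus and a general corpus, and takes the sentence score as the mean of its word scores. The names `build_scorer` and `sentence_termhood` and the toy corpora are hypothetical.

```python
import math
from collections import Counter


def build_scorer(domain_tokens, general_tokens):
    """Return a function scoring a word's termhood by corpus comparison.

    Assumption: termhood is the log-ratio of add-one-smoothed relative
    frequencies in a domain corpus versus a general corpus. This is an
    illustrative stand-in, not the paper's definition.
    """
    domain_counts = Counter(domain_tokens)
    general_counts = Counter(general_tokens)
    domain_total = sum(domain_counts.values())
    general_total = sum(general_counts.values())
    vocab_size = len(set(domain_counts) | set(general_counts))

    def score(word):
        # Add-one smoothing keeps unseen words from producing log(0).
        p_domain = (domain_counts[word] + 1) / (domain_total + vocab_size)
        p_general = (general_counts[word] + 1) / (general_total + vocab_size)
        return math.log(p_domain / p_general)

    return score


def sentence_termhood(sentence_tokens, score):
    """Sentence-level termhood, assumed here to be the mean word score."""
    if not sentence_tokens:
        return 0.0
    return sum(score(w) for w in sentence_tokens) / len(sentence_tokens)


# Toy corpora, invented for illustration only.
domain = "the termhood of a term candidate is computed by corpus comparison the termhood".split()
general = "the cat sat on the mat and the dog sat by the door".split()

score = build_scorer(domain, general)
print(score("termhood"), score("the"))
print(sentence_termhood(["the", "termhood"], score))
```

Under these assumptions, a word that is frequent in the domain corpus and rare in the general corpus gets a positive score, while a common function word scores below zero. A score of this kind could then serve as a filter or weight when candidate term pairs are aligned on a parallel corpus.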

Measuring Termhood in Automatic Terminology Extraction

A Novel Topic Model for Automatic Term Extraction

Research on Automatic Chinese Multi-word Term Extraction Based on Term Component

Parsing-based Automatic Chinese Term Extraction

Bilingual Terminology Extraction Using Multi-level Termhood

Research on Automatic Chinese Multi-word Term Extraction Based on Integration of Web Information and Term Component

Automatic Extraction of Domain-Specific Terms

Automatic Recognition of Chinese Scientific and Technological Terms Using Integrated Linguistic Knowledge

A Survey of Term Recognition and Extraction for Domain-specific Chinese Text Information Processing

Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach

Can cross-domain term extraction benefit from cross-lingual transfer and nested term labeling?

Termhood-based Comparability Metrics of Comparable Corpus in Special Domain

An interactive approach to term relation extraction and term extraction

[Automatic labeling and extraction of terms in natural language processing in acupuncture clinical literature]

DRTE: A Term Extraction Method for K12 Education

Bilingual Terminology Extraction from Comparable E-Commerce Corpora

Unsupervised Technical Domain Terms Extraction using Term Extractor

Two-Character Chinese Word Extraction Based on Hybrid of Internal and Contextual Measures

Hybrid extraction of multi-word terms: an application on vibration-based condition monitoring technique

Chinese technical terminology extraction based on DC-value and information entropy

Web-Based Terminology Translation Mining