Abstract:The models of linguistic networks and their analytical tools constitute a potential methodology for investigating the formation of structural patterns in actual language use. Research with this methodology has just started, which can hopefully shed light on the emergent nature of linguistic structure. This study attempts to employ linguistic networks to investigate the formation of modern Chinese two-character words (as structural units based on the chunking of their component characters) in the actual use of modern Chinese, which manifests itself as continuous streams of Chinese characters. Network models were constructed based on authentic Chinese language data, with Chinese characters as nodes, their co-occurrence relations as directed links, and the co-occurrence frequencies as link weights. Quantitative analysis of the network models has shown that a Chinese two-character word can highlight itself as a two-node island, i.e., a cohesive sub-network with its two component characters co-occurring more frequently than they co-occur with the other characters. This highlighting mechanism may play a vital role in the formation and acquisition of two-character words in actual language use. Moreover, this mechanism may also throw some light on the emergence of other structural phenomena (with the chunking of specific linguistic units as their basis).

Internet-Oriented New Words Identification

New Word Identification in Social Network Text Based on Time Series Information

Internet-oriented Chinese New Words Detection

Research on algorithm for networks new words identification

An Effective Method for Distinguishing Photograph and Graphics

New Words Recognition Algorithm and Application Based on Micro-Blog Hot

Effective Approach to Deep Web Entries Identification

Research on Intelligent Construction of China English Network New Words Database Based on Adjacent Entropy Recognition Algorithm

A study on the classification of stylistic and formal features in English based on corpus data testing

New Cyber Word Discovery Using Chinese Word Segmentation

Domain-Specific New Words Detection in Chinese.

Related Words Acquisition and Analysis

Linguistic emergence from a networks approach: The case of modern Chinese two-character words.

Detecting new Chinese words from massive domain texts with word embedding

New Word Extraction from Chinese Financial Documents.

New words discovery in microblog content

New Word Detection For Sentiment Analysis

The sources of new words and expressions in the Chinese Internet language and the ways by which they enter the Internet language

New Word Detection Using BiLSTM+CRF Model with Features

Incorporating user behaviors in new word detection

SVM-based Hybrid Pattern for New Word Discovery