Spectral methods cluster words of the same class in a syntactic dependency network

Ramon Ferrer i Cancho, Andrea Capocci, Guido Caldarelli
DOI: https://doi.org/10.48550/arXiv.cond-mat/0504165
2005-04-07
Abstract: We analyze a particular kind of linguistic network in which vertices represent words and edges stand for syntactic relationships between words. The statistical properties of these networks have recently been studied, and various features such as the small-world phenomenon and a scale-free degree distribution have been found. Our work focuses on four classes of words: verbs, nouns, adverbs and adjectives. We use spectral methods to sort the vertices and show that the resulting ordering clusters words of the same class. For nouns and verbs, the cluster size distribution clearly follows a power law that cannot be explained by a null hypothesis. Long-range correlations are found between vertices in the ordering provided by the spectral method. The findings support the use of spectral methods for detecting community structure.
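To make the idea of sorting vertices with a spectral method concrete, here is a minimal sketch. It assumes one common choice, ordering vertices by the Fiedler vector (the eigenvector of the second-smallest eigenvalue) of the graph Laplacian. The abstract does not say which matrix or eigenvector the authors use, so this may differ from their procedure. The function name `fiedler_order` and the six-vertex toy graph are hypothetical and are not taken from the paper's data.

```python
import math

def fiedler_order(n, edges, iters=2000):
    """Return vertices 0..n-1 sorted by their Fiedler-vector component.

    Assumption: the graph is connected and undirected. The Fiedler vector
    is approximated by power iteration on (shift*I - L), where L = D - A,
    after projecting out the constant eigenvector of L.
    """
    adj = [[0.0] * n for _ in range(n)]
    for u, v in edges:
        adj[u][v] = adj[v][u] = 1.0
    deg = [sum(row) for row in adj]
    shift = 2.0 * max(deg)  # upper bound on the largest eigenvalue of L

    x = [float(i) for i in range(n)]  # deterministic, non-constant start
    for _ in range(iters):
        mean = sum(x) / n
        x = [xi - mean for xi in x]  # remove the constant component
        # y = (shift*I - L) x = (shift - deg) * x + A x
        y = [(shift - deg[i]) * x[i]
             + sum(adj[i][j] * x[j] for j in range(n))
             for i in range(n)]
        norm = math.sqrt(sum(yi * yi for yi in y))
        x = [yi / norm for yi in y]
    return sorted(range(n), key=lambda i: x[i])

# Toy graph: two triangles {0,1,2} and {3,4,5} joined by the edge (2,3).
toy_edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
order = fiedler_order(6, toy_edges)
print(order)  # each triangle's vertices occupy consecutive positions
```

In this toy case the two densely connected groups end up as contiguous runs in the ordering, which is the kind of clustering along a one-dimensional ordering that the abstract reports for word classes.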
Statistical Mechanics