Abstract:Several complex systems are characterized by presenting intricate characteristics taking place at several scales of time and space. These multiscale characterizations are used in various applications, including better understanding diseases, characterizing transportation systems, and comparison between cities, among others. In particular, texts are also characterized by a hierarchical structure that can be approached by using multi-scale concepts and methods. The multiscale properties of texts constitute a subject worth further investigation. In addition, more effective approaches to text characterization and analysis can be obtained by emphasizing words with potentially more informational content. The present work aims at developing these possibilities while focusing on mesoscopic representations of networks. More specifically, we adopt an extension to the mesoscopic approach to represent text narratives, in which only the recurrent relationships among tagged parts of speech (subject, verb and direct object) are considered to establish connections among sequential pieces of text (e.g., paragraphs). The characterization of the texts was then achieved by considering scale-dependent complementary methods: accessibility, symmetry and recurrence signatures. In order to evaluate the potential of these concepts and methods, we approached the problem of distinguishing between literary genres (fiction and non-fiction). A set of 300 books organized into the two genres was considered and were compared by using the aforementioned approaches. All the methods were capable of differentiating to some extent between the two genres. The accessibility and symmetry reflected the narrative asymmetries, while the recurrence signature provided a more direct indication about the non-sequential semantic connections taking place along the narrative.

Distinguishing Fact from Fiction: Pattern Recognition in Texts Using Complex Networks

Probing the topological properties of complex networks modeling short written texts

Identification of Literary Movements Using Complex Networks to Represent Texts

Text characterization based on recurrence networks

Using Complex Networks to Quantify Consistency in the Use of Words

Social Networks Analysis in Discovering the Narrative Structure of Literary Fiction

Comparison study of using semantic and syntactic network characteristics to do text clustering

Fingerprint Matrices: Uncovering the dynamics of social networks in prose literature

A Novel Discrimination Structure for Assessing Text Semantic Similarity

Representation of texts as complex networks: a mesoscopic approach

Finding Common Features in Multilingual Fake News: a Quantitative Clustering Approach

A complex network approach to stylometry

Language Clustering with Word Co-Occurrence Networks Based on Parallel Texts

Measuring Information Propagation in Literary Social Networks

Estimating the Influence of Sequentially Correlated Literary Properties in Textual Classification: A Data-Centric Hypothesis-Testing Approach

Complex network analysis of literary and scientific texts

Complicating the Social Networks for Better Storytelling: An Empirical Study of Chinese Historical Text and Novel

Modeling texts with networks: comparing five approaches to sentence representation

Characterizing the community structure of complex networks

Topological properties and organizing principles of semantic networks

Exploiting Textual Information for Fake News Detection