Text feature-based copyright recognition method for comics

De Li, Hong Xin, Xun Jin
DOI: https://doi.org/10.1016/j.engappai.2024.107925
IF: 8
2024-01-27
Engineering Applications of Artificial Intelligence
Abstract: With the rise of the digital comic industry, pirated comic works have also emerged. Textual plagiarism is gradually increasing, mainly in character language and, more broadly, in plot and character description. In this paper, we propose a novel copyright recognition framework based on the language features of comic characters to identify and certify textual infringement of digital comic works. The framework consists of an internal and an external copyright recognition method. The internal method analyzes the writing style of a comic and extracts text style features to detect abnormal chapters and determine whether the comic is suspicious. The external method matches the text of the suspicious comic against other original comic texts using semantic and document features. Comic copyright recognition is then achieved based on the similarities between comics. We also collect comic works from different domains and construct an original comic corpus and a pirated comic corpus at the chapter level. The experimental results show that the proposed framework can detect abnormal chapters and plagiarized documents in comic text. After integrating several types of plagiarism, the recognition accuracy of the proposed method is about 98%, higher than that of state-of-the-art models.
automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary
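
The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the style, semantic, or document features, so the three toy style features, the z-score threshold, and the bag-of-words cosine similarity below are all stand-in assumptions, and every function name is hypothetical.

```python
import math
import re
from collections import Counter

WORD_RE = re.compile(r"[a-z']+")


def style_features(text):
    """Toy style vector (assumed): mean sentence length, type-token ratio, punctuation rate."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = WORD_RE.findall(text.lower())
    if not words or not sentences:
        return [0.0, 0.0, 0.0]
    punctuation = sum(1 for ch in text if ch in ",;:!?")
    return [
        len(words) / len(sentences),
        len(set(words)) / len(words),
        punctuation / len(words),
    ]


def abnormal_chapters(chapters, threshold=1.5):
    """Internal check: indices of chapters whose style deviates from the comic's average."""
    if not chapters:
        return []
    feats = [style_features(c) for c in chapters]
    flagged = set()
    for j in range(len(feats[0])):
        column = [f[j] for f in feats]
        mean = sum(column) / len(column)
        std = math.sqrt(sum((x - mean) ** 2 for x in column) / len(column))
        if std == 0:
            continue  # every chapter agrees on this feature
        for i, x in enumerate(column):
            if abs(x - mean) / std > threshold:
                flagged.add(i)
    return sorted(flagged)


def cosine_similarity(text_a, text_b):
    """Bag-of-words cosine similarity, a stand-in for semantic/document features."""
    a = Counter(WORD_RE.findall(text_a.lower()))
    b = Counter(WORD_RE.findall(text_b.lower()))
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def best_match(suspicious_text, originals):
    """External check: (name, score) of the most similar original text."""
    scored = ((name, cosine_similarity(suspicious_text, text))
              for name, text in originals.items())
    return max(scored, key=lambda pair: pair[1])
```

In this sketch a comic would be treated as suspicious when `abnormal_chapters` returns any index, and the flagged chapters would then be passed to `best_match` against a dictionary of original texts. A real system would replace both feature extractors with the learned features the paper describes.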