English Vocabulary Learning Aid System Using Digital Twin Wasserstein Generative Adversarial Network Optimized With Jelly Fish Optimization Algorithm

Fei Wu
DOI: https://doi.org/10.1080/08839514.2024.2327908
IF: 2.777
2024-04-13
Applied Artificial Intelligence
Abstract: Natural Language Processing (NLP) is a technology that enables computers to recognize human languages. Words are the fundamental unit of analysis in deep-level grammatical and semantic analysis. The main goal of NLP is typically word segmentation. Machine learning techniques cannot be applied directly in a multi-modal context, because the data modalities differ significantly in structure. This paper proposes an English Vocabulary Learning Aid System using a Digital Twin Wasserstein Generative Adversarial Network optimized with the Jelly Fish Optimization Algorithm (EVLS-DtwinWGAN-NLP). The heterogeneity of multi-modal data is handled by the Parameterized Local Maximum Synchrosqueezing Transform, which extracts features such as phonetic features, sentence length, word embeddings, part-of-speech tags, word frequencies, and N-grams. The Digital Twin Wasserstein Generative Adversarial Network then classifies English vocabulary into easy, intermediate, and difficult words. The proposed EVLS-DtwinWGAN-NLP approach attains 3.101%, 7.12%, and 7.73% higher accuracy; 24.13%, 13.04%, and 29.51% lower computation time; and 2.292%, 5.365%, and 1.551% higher AUC compared with existing methods such as Feature extraction and analysis of natural language processing for deep learning English language (EVLS-BiLSTM-NLP) and State of art for semantic analysis of natural language processing (EVLS-SA-NLP), respectively.
computer science, artificial intelligence; engineering, electrical & electronic
What problem does this paper attempt to address?
This paper addresses English vocabulary learning with a Digital Twin Wasserstein Generative Adversarial Network (DtwinWGAN) optimized by the Jellyfish Optimization Algorithm (JFOA). According to the paper, when multimodal data differ significantly in structure, traditional machine learning techniques in Natural Language Processing (NLP) segment words poorly and require long training times. The proposed system uses the Parameterized Local Maximum Synchrosqueezing Transform (PLMSST) to extract features such as phonetic features, sentence length, word embeddings, part-of-speech tags, word frequencies, and N-grams. The DtwinWGAN then classifies English vocabulary into easy, intermediate, and difficult words. Compared with existing methods such as EVLS-BiLSTM-NLP and the state-of-the-art semantic analysis method (EVLS-SA-NLP), the proposed method reports higher accuracy, lower computation time, and higher AUC (Area Under the Curve).
The main contributions of this paper include:
1. Using PLMSST to address the heterogeneity of multimodal data by dynamically allocating weights, selecting informative modalities, and filtering noise.
2. Introducing DtwinWGAN for vocabulary difficulty classification, which addresses the long-range dependency issue and improves training efficiency and accuracy.
3. Showing that the EVLS-DtwinWGAN-NLP method outperforms the compared methods for English vocabulary learning and provides a personalized, accurate classification system.
The paper also reviews related studies, presents the workflow of the proposed method, and compares its performance with other methods. Finally, it describes in detail the steps of data acquisition, preprocessing, feature extraction, classification, and optimization, including how JFOA optimizes the weight parameters of DtwinWGAN.
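To make the feature list above more concrete, the following is a minimal Python sketch of three of the named features: sentence length, word frequency, and N-grams. It is not the paper's PLMSST-based extractor, which the summary does not specify in enough detail to reproduce. The function names (`char_ngrams`, `extract_features`), the toy corpus, the added `word_length` field, and the choice of character bigrams are all assumptions made for illustration. Phonetic features, word embeddings, and part-of-speech tags are omitted because they require external resources.

```python
from collections import Counter


def char_ngrams(word, n=2):
    """Return the character n-grams of a word (empty list if the word is shorter than n)."""
    return [word[i:i + n] for i in range(len(word) - n + 1)]


def extract_features(sentence, corpus_counts):
    """Build a per-word feature dict from a sentence and corpus word counts.

    Hypothetical stand-in for the paper's feature extraction step.
    """
    tokens = sentence.lower().split()
    total = sum(corpus_counts.values())
    features = {}
    for tok in tokens:
        features[tok] = {
            # Number of whitespace-separated tokens in the sentence.
            "sentence_length": len(tokens),
            # Word length is an illustrative extra, not a feature named in the paper.
            "word_length": len(tok),
            # Counter returns 0 for unseen words, so rare words get frequency 0.0.
            "relative_frequency": corpus_counts[tok] / total if total else 0.0,
            # Character bigrams as one possible reading of "N-grams".
            "bigrams": char_ngrams(tok, 2),
        }
    return features


# Toy corpus (assumption): real systems would use a large reference corpus.
corpus = "the cat sat on the mat while the ubiquitous cat slept".split()
corpus_counts = Counter(corpus)
feats = extract_features("The ubiquitous cat", corpus_counts)
```

A downstream classifier, such as the DtwinWGAN the paper proposes, would consume numeric encodings of features like these. In this toy corpus, "the" has a much higher relative frequency than "ubiquitous", which is the kind of signal a difficulty classifier could use.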