Abstract:Identifying the language spoken in an audio source is the difficult task of automatic language identification (LID) in speech processing. Short audio segments pose a significant challenge in language identification because they contain limited contextual information and fewer distinguishing features compared to longer audio samples. This lack of context makes it difficult to accurately identify the language, as the model has less data to analyse. By addressing the challenge of short-duration audio, the research aims to develop more robust and versatile language identification systems that can operate effectively even with minimal input. Another objective of the research is to address the specific challenge of identifying Indian languages accurately and efficiently from short-duration audio segments using CNNs and spectrogram representations in Python. The methodology involves several key steps: initially, audio data undergoes pre-processing to normalize the signals and reduce noise, ensuring consistency across the dataset. Subsequently, the audio signals are converted into spectrograms, which offer a visual depiction of the frequency spectrum, capturing both temporal and frequency characteristics essential for language discrimination. A CNN model is then built and trained using these spectrograms, with a specific architecture designed to extract significant features from the spectrograms. The system's performance is evaluated on a custom dataset consisting of three Indian languages: Hindi, Tamil, and Malayalam. The experimental findings show that a 98.9% accuracy rate is attained by the CNN-based model, surpassing the performance of existing models. The proposed method has potential applications in areas such as automatic speech recognition and speaker identification, where accurate and efficient language identification is crucial.

Factorized Recurrent Neural Network with Attention for Language Identification and Content Detection

Recurrent Neural Unit with Frequency Attention for Specific Emitter Identification

Is Attention always needed? A Case Study on Language Identification from Speech

Speaker-based language identification for Ethio-Semitic languages using CRNN and hybrid features

Joint Language Identification of Code-Switching Speech using Attention based E2E Network

An Attention Based Neural Network for Code Switching Detection: English & Roman Urdu

Deep Neural Network with Attention Model for Scene Text Recognition.

Convolutional neural network based language identification system: A spectrogram based approach

Multi-Lingual Attention based Multi-Intent Detection in Dialogue System

Phonetic Temporal Neural Model for Language Identification

Phone-aware Neural Language Identification

Language Identification with a Reciprocal Rank Classifier

Hate Speech Detection and Classification in Amharic Text with Deep Learning

LIDE: Language Identification from Text Documents

A Fast, Compact, Accurate Model for Language Identification of Codemixed Text

Deep Learning Detection Method for Large Language Models-Generated Scientific Content

Evaluating Input Representation for Language Identification in Hindi-English Code Mixed Text

BharatBhasaNet-A Unified Framework to Identify Indian Code Mix Languages

Recurrent Neural Network based Part-of-Speech Tagger for Code-Mixed Social Media Text

LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages

Transformer-based Model for Word Level Language Identification in Code-mixed Kannada-English Texts