Abstract:Call Centers have huge amount of audio data which can be used for achieving valuable business insights and transcription of phone calls is manually tedious task. An effective Automated Speech Recognition system can accurately transcribe these calls for easy search through call history for specific context and content allowing automatic call monitoring, improving QoS through keyword search and sentiment analysis. ASR for Call Center requires more robustness as telephonic environment are generally noisy. Moreover, there are many low-resourced languages that are on verge of extinction which can be preserved with help of Automatic Speech Recognition Technology. Urdu is the $10^{th}$ most widely spoken language in the world, with 231,295,440 worldwide still remains a resource constrained language in ASR. Regional call-center conversations operate in local language, with a mix of English numbers and technical terms generally causing a "code-switching" problem. Hence, this paper describes an implementation framework of a resource efficient Automatic Speech Recognition/ Speech to Text System in a noisy call-center environment using Chain Hybrid HMM and CNN-TDNN for Code-Switched Urdu Language. Using Hybrid HMM-DNN approach allowed us to utilize the advantages of Neural Network with less labelled data. Adding CNN with TDNN has shown to work better in noisy environment due to CNN's additional frequency dimension which captures extra information from noisy speech, thus improving accuracy. We collected data from various open sources and labelled some of the unlabelled data after analysing its general context and content from Urdu language as well as from commonly used words from other languages, primarily English and were able to achieve WER of 5.2% with noisy as well as clean environment in isolated words or numbers as well as in continuous spontaneous speech.

Classification techniques for automatic speech recognition (ASR) algorithms used with real time speech translation

Arabic Speech Recognition: Advancement and Challenges

Study of Various Machine Learning Algorithms for use with Automatic Speech Recognition

Bilingual Automatic Speech Recognition: A Review, Taxonomy and Open Challenges

Automatic Speech Recognition for Hindi

An investigation and analysis on automatic speech recognition systems

Using Voice Technologies to Support Disabled People

Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey

The Impact of ASR on Speech-to-Speech Translation Performance.

Real-Time Statistical Speech Translation

Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech

Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN

ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs

An intelligent system to help deaf students learn Arabic Sign Language

Arabic Language Learning Assisted by Computer, based on Automatic Speech Recognition

Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques

An Evaluation on Speech Recognition Technology based on Machine Learning

Automatic Dialect Detection in Arabic Broadcast Speech

Dialectal Coverage And Generalization in Arabic Speech Recognition

Automated Dysarthria Severity Classification: A Study on Acoustic Features and Deep Learning Techniques

SignsWorld; Deeping Into the Silence World and Hearing Its Signs (State of the Art)