A Proposal to Use GAN For Speech Recognition in Natural Language Processing

Vijetha Ringu,Aileni Eenaja
Abstract:: Usage of advanced technology in our daily routine has been increased. NLP has been more needful to humans. Automatic speech recognition, translating of spoken words into text, is still a challenging task due to the high viability in speech signals. So here we are discussing the Deep Learning usage in the future with TTS. NLP, and speech applications in many areas (including Finance, Healthcare, and Government) there is a growing need for one comprehensive resource. Deep Learning for NLP and Speech Recognition explains recent deep learning methods applicable to NLP and speech, provides state-of-the-art approaches, and offers real-world case studies. NLP has various tools that are used in speech recognition process such as speech tagging, sentiment analysis, semantics and general understanding of the speech. NLP enables the classification and location of entities into various categories, while processing the language, using the named-entity technology Even IT industries have transformed the way they perform speech recognition process with the use of natural language processing. NLP has allowed the industries to process more accurate and automated understanding of speech and text. Natural Language Processing plays a critical role in supporting machine-human interactions. We propose to use Generative Adversarial Network (GAN) along with the idea of ”Professor Forcing” in training. A discriminator in GAN is jointly trained to equalize the difference between real and the predicted data. The idea with GAN is to have a Generator and Discriminator that are adversaries in a learning game. The concept applies to any type of problem for which a generator and a discriminator can be constructed. In NLP there are many types of Generators that could be considered, depending on the problem to be solved . As more research is being carried in this field, we expect to see more breakthroughs
Computer Science
What problem does this paper attempt to address?