Aikyam: A Video Conferencing Utility for Deaf and Dumb

Kshitij Deshpande,Varad Mashalkar,Kaustubh Mhaisekar,Amaan Naikwadi,Archana Ghotkar
DOI: https://doi.org/10.48550/arXiv.2312.05962
2023-12-11
Abstract:With the advent of the pandemic, the use of video conferencing platforms as a means of communication has greatly increased and with it, so have the remote opportunities. The deaf and dumb have traditionally faced several issues in communication, but now the effect is felt more severely. This paper proposes an all-encompassing video conferencing utility that can be used with existing video conferencing platforms to address these issues. Appropriate semantically correct sentences are generated from the signer's gestures which would be interpreted by the system. Along with an audio to emit this sentence, the user's feed is also used to annotate the sentence. This can be viewed by all participants, thus aiding smooth communication with all parties involved. This utility utilizes a simple LSTM model for classification of gestures. The sentences are constructed by a t5 based model. In order to achieve the required data flow, a virtual camera is used.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper aims to solve the communication barriers faced by deaf - mutes in video conferences. Specifically, the paper proposes a video - conference - assistance tool named "Aikyam", which can be integrated with existing video - conference platforms (such as Google Meet, Teams, etc.) to help deaf - mutes communicate effectively through sign language. Aikyam recognizes users' body movements and converts them into text and voice, enabling deaf - mutes to participate in video conferences more smoothly, and other participants can also understand the expressions of deaf - mutes. This tool uses a lightweight LSTM model to classify gestures and a T5 - based model to generate semantically correct sentences. In addition, in order to process data streams, this tool also uses virtual camera technology.