Challenges in real-time-embedded IoT Command Recognition

Hazem Younis,John H.L. Hansen
DOI: https://doi.org/10.1109/wf-iot51360.2021.9595903
2021-06-14
Abstract:Automatic speech recognition (ASR) is an effective means of communicating information for command and control of IoT based electronic devices. Voice enabled electronic devices offer the prospect of greater user control and increased system support for deployed platforms. The internet of things (IoT) is an internet-based architecture that allows for data transmission over wireless networks for control and monitoring. By combining previously stated architectures, we can perform many daily tasks remotely, eliminating the need for physical presence. Some challenges associated with command recognition include command word perplexity as well as noise levels/distortion in the environmental setting. In this study, system formulation consisted of (i) a Raspberry-pi low resource microprocessor with fixed computing capacity, and (ii) a ReSpeaker microphone system, which provided an integrated hardware setup for algorithm development to combine aspects of ASR and IoT, allowing users to dictate a variety of commands remotely. The platform eliminates the need for physical interaction with the device by simply listening to commands and performing actions to fulfill the command. The platform is suitable for several settings including residential and commercial spaces due to its low computational resource model. Trade-offs in machine learning based CNN network design and training are explored. Also, challenges and factors that affect speech-command recognition for IoT related applications are considered. A summary of phonetic and computational limiting factors for success in command recognition devices also discussed. The study highlights several ASR based neural net architectures which are integrated within a low-resource computing platform, along with benchmark performance evaluations.
What problem does this paper attempt to address?