Age Estimation from Speech Using Tuned CNN Model on Edge Devices

Laxmi Kantham Durgam, Ravi Kumar Jatoth
DOI: https://doi.org/10.1007/s11265-024-01929-4
2024-11-03
Journal of Signal Processing Systems
Abstract: A speaker's emotions, age, and gender have all been estimated through innovative research. This information can be applied to communications and to common applications such as biometric identification and human-machine interaction. A tiny model trained with the Edge Impulse framework identifies the speaker's age from speech attributes, so the speaker's age can be inferred from the voice. Using an external microphone connected to the Jetson Nano and the MP34DT05 digital microphone on the Arduino Nano BLE 33 device, speech can be recorded and the speaker's age determined in real-time applications. The fundamental goal of speech recognition is an effective human-machine interface for practical applications. The Arduino Nano BLE 33 uses its integrated RGB LED to indicate whether the speaker is a child or an adult: a red LED signifies a child speaker, while a blue LED signifies an adult speaker. The proposed tuned deep convolutional neural networks outperform the more commonly used convolutional neural networks when test results are compared against training results. The proposed tuned 1D CNN with MFCC speech features also outperforms existing traditional methods. The Nvidia Jetson Nano and Nano BLE 33 microcontrollers are well suited to applications needing speaker age detection because of their low power consumption, ease of use, small size, and excellent computational performance.
computer science, information systems; engineering, electrical & electronic
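
The LED indication described in the abstract (red for a child speaker, blue for an adult speaker) can be illustrated with a minimal sketch. The class labels, the score dictionary, and the function name `led_colour` are hypothetical and chosen only for illustration; the paper's actual firmware interface is not given here.

```python
# Hypothetical sketch: map a two-class age prediction to the LED colours
# named in the abstract (red = child, blue = adult).

# Assumed label names; the abstract only distinguishes child and adult.
LED_FOR_LABEL = {"child": "red", "adult": "blue"}


def led_colour(scores):
    """Return the LED colour for the highest-scoring class.

    `scores` maps a class label to a model confidence value. The labels
    and the dictionary layout are assumptions, not the paper's interface.
    """
    label = max(scores, key=scores.get)  # class with the largest score
    return LED_FOR_LABEL[label]


print(led_colour({"child": 0.82, "adult": 0.18}))  # red
```

On a real device the returned colour would drive the RGB LED pins; that step depends on the board library and is left out of this sketch.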