Responding to Challenge Call of Machine Learning Model Development in Diagnosing Respiratory Disease Sounds

negin melek
DOI: https://doi.org/10.2139/ssrn.4127047
2022-01-01
SSRN Electronic Journal
Abstract:The normal and abnormal sounds arising from the respiratory system shed great light on the medical science world by revealing the quality, diseases and changes in the lungs of people. In medicine, this invasive and easy old method, which is realized by the stethoscope, facilitates the diagnosis of diseases by specialists. This manual method can sometimes lead to wrong decisions in terms of sound detection due to different audibility. In lung diseases with a high mortality rate, detailed sound analysis is very important for obtaining accurate detection. As technology advances, the development of automated approaches based on machine learning is of great interest as they provide modern and highly accurate analysis. In today’s most popular topic, i.e., the COVID-19 disaster, the conflict of early detection of respiratory disease and machine learning for sound signal processing is of great interest. In this study, a machine learning model was developed for automatically detecting respiratory system sounds such as sneezing and coughing in disease diagnosis. The automatic model and approach development of breath sounds, which carry valuable information, results in early diagnosis and treatment. A successful machine learning model was developed in this study, which was a strong response to the challenge called the 'Pfizer digital medicine challenge' on the 'OSFHOME' open access platform. 'Environmental sound classification' called ESC-50 and AudioSet sound files were used to prepare the dataset. In this dataset, which consisted of three parts, features that effectively showed coughing and sneezing sound analysis were extracted from training, testing and validating samples. Based on the Mel frequency cepstral coefficients (MFCC) feature extraction method, mathematical and statistical features were prepared. Three different classification techniques were considered to perform successful respiratory sound classification in the dataset containing more than 3800 different sounds.
What problem does this paper attempt to address?