A Study on the Effect of Preprocessing and Normalization on Classification of Plant Samples in Machine Learning Assisted Laser-Induced Breakdown Spectroscopy

Muhammad Haider Zaman, Fahad Rehman, Muhammad Shoaib Tahir, Muhammad Faheem, Yasir Jamil
DOI: https://doi.org/10.1007/s13369-024-08716-0
IF: 2.807
2024-02-09
Arabian Journal for Science and Engineering
Abstract: Classification of herbal medicinal plants remains a major challenge. Almost one-fourth of the world's population uses medicines extracted directly from plants. Plant parts such as seeds, roots, bark, and leaves have long been used in traditional ayurvedic medicine, yet there is still no rapid multi-elemental identification technique for testing medicinal plant samples. This research focused on the rapid elemental analysis and classification of Azadirachta indica grown in clean and polluted atmospheres using machine learning assisted laser-induced breakdown spectroscopy (LIBS). The experiment used a Q-switched Nd:YAG laser operating at its second harmonic (532 nm) as the excitation source, and an Avantes spectrometer with 0.06 nm resolution and a spectral range of 250–880 nm recorded the emission spectra. The following elements were found in the leaves and bark of Azadirachta indica: Fe, Si, Mg, Ca, Ba, Na, Li, N, K, O, Al, Sr, Ti, C, and H. However, conventional LIBS analysis could not clearly distinguish the spectral differences between leaves and bark. Therefore, an unsupervised machine learning algorithm, principal component analysis (PCA), and various supervised algorithms, namely decision trees, Naïve Bayes, ensemble methods, artificial neural networks (ANN), support vector machines (SVM), and k-nearest neighbors (KNN), were used to classify the samples. Furthermore, to minimize spectral discreteness and improve the classification accuracy of machine learning assisted LIBS in the pharmaceutical industry and other areas, data preprocessing steps such as baseline correction and normalization were studied, and the results were compared.
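The preprocessing chain named in the abstract (baseline correction, normalization, then PCA) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the spectra are synthetic, the baseline is estimated with a low-order polynomial fit, normalization divides by total absolute intensity, and the function names (`subtract_polynomial_baseline`, `normalize_total_intensity`, `pca_scores`, `make_synthetic_spectra`) are hypothetical. The authors' actual methods and parameters may differ.

```python
import numpy as np


def subtract_polynomial_baseline(wavelengths, spectrum, degree=3):
    """Fit a low-order polynomial to the spectrum and subtract it.

    This is a crude baseline estimate chosen for illustration only.
    """
    # Rescale wavelengths to [-1, 1] so the polynomial fit is well conditioned.
    span = wavelengths.max() - wavelengths.min()
    x = 2.0 * (wavelengths - wavelengths.min()) / span - 1.0
    coeffs = np.polyfit(x, spectrum, degree)
    return spectrum - np.polyval(coeffs, x)


def normalize_total_intensity(spectrum):
    """Divide by the summed absolute intensity to reduce shot-to-shot gain differences."""
    total = np.sum(np.abs(spectrum))
    return spectrum / total if total > 0 else spectrum


def pca_scores(X, n_components=2):
    """Project mean-centred rows of X onto the leading principal components (via SVD)."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T


def make_synthetic_spectra(n_per_class=20, n_points=500, seed=0):
    """Generate two classes of synthetic spectra (assumed data, not measurements)."""
    rng = np.random.default_rng(seed)
    wl = np.linspace(250.0, 880.0, n_points)  # spectral range quoted in the abstract
    line_a = np.exp(-0.5 * ((wl - 400.0) / 2.0) ** 2)  # hypothetical emission line
    line_b = np.exp(-0.5 * ((wl - 600.0) / 2.0) ** 2)  # hypothetical emission line
    spectra, labels = [], []
    for label, (a, b) in enumerate([(1.0, 0.4), (0.4, 1.0)]):
        for _ in range(n_per_class):
            gain = rng.uniform(0.5, 2.0)  # shot-to-shot intensity fluctuation
            baseline = rng.uniform(0.1, 0.5) + rng.uniform(0.0, 0.001) * (wl - 250.0)
            noise = rng.normal(0.0, 0.01, n_points)
            spectra.append(gain * (a * line_a + b * line_b) + baseline + noise)
            labels.append(label)
    return wl, np.array(spectra), np.array(labels)


wl, raw, labels = make_synthetic_spectra()
processed = np.array(
    [normalize_total_intensity(subtract_polynomial_baseline(wl, s)) for s in raw]
)
scores = pca_scores(processed, n_components=2)
print(scores.shape)
```

The resulting `scores` array could then be passed to any of the supervised classifiers listed in the abstract; comparing classification accuracy on `raw` versus `processed` spectra mirrors, in spirit, the comparison the abstract describes.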
multidisciplinary sciences