Abstract:Chinese Herbal Medicine (CHM) classification is a promising research issue in Intelligent Medicine. However, the small available Chinese Herbal datasets and the traditional CHM classification model lead to huge challenge for obtaining the promising classification results. To tackle the above challenges, a novel large CHM classification (CHMC) dataset has been firstly established, which includes 100 classes with about 10,000 samples. This dataset contains a wide range of medicinal materials and natural background. Further, the promising EfficientNetB4 model is proposed to perform the CHM classification. EfficientNet can uniformly scales up the depth, width and resolution of the model, which will obtain better accuracy as it balance all dimensions of the network, including depth, width, and resolution, respectively. To validate the superiority of the EfficientNet and the effectiveness of CHMC dataset, extensive experiments have been conducted, verifying that the EfficientNetB4 is optimal for CHM classification, with 5% improvement of the existing model. In addition, this model has achieved state-of-the art CHM classification performance, with TOP-1 accuracy of 83.1%, and TOP-5 accuracy of 92.50%.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the challenges of Chinese Herbal Medicine (CHM) classification in intelligent healthcare. Specifically, the existing Chinese herbal medicine datasets are small in scale, and the performance of traditional classification models is poor, making it difficult to obtain ideal classification results. To solve these problems, the authors made the following efforts:
1. **Establish a new large - scale Chinese herbal medicine classification dataset**:
- Constructed a large - scale Chinese herbal medicine classification dataset (CHMC) containing 100 categories and approximately 10,000 samples, covering a variety of medicinal materials and natural backgrounds.
- The dataset includes plant - based medicines (such as rice sprouts, cockscomb flowers, cassia twigs, sappan wood), animal - based medicines (such as cuttlefish bone, sea dragon, ground beetle, centipede) and mineral - based medicines (such as red halloysite, alum), as well as medicinal materials from different parts such as roots, bark and seeds.
2. **Propose a classification method based on EfficientNet**:
- Adopted the EfficientNetB4 model for Chinese herbal medicine classification. EfficientNet can be evenly expanded in three dimensions: depth, width and resolution, thereby improving classification accuracy while balancing all dimensions of the network.
- The experimental results show that EfficientNetB4 performs excellently in the Chinese herbal medicine classification task, improving the classification accuracy by 5% compared with the existing models, and achieving a TOP - 1 accuracy rate of 83.1% and a TOP - 5 accuracy rate of 92.50%.
### Formula Representation
EfficientNet consistently expands the width, depth and resolution of the network through the compound coefficient \(\phi\):
\[
\text{width\_scale} : w=\beta^\phi
\]
\[
\text{depth\_scale} : d = \alpha^\phi
\]
\[
\text{resolution\_scale} : r=\gamma^\phi
\]
Among them, \(\alpha, \beta, \gamma\) are constants, and \(\phi\) is a user - specified coefficient used to determine the increase in available resources. These parameters satisfy the following conditions:
\[
\beta^2\cdot\alpha^2\cdot\gamma\approx 2
\]
\[
\beta\geq 1,\quad\alpha\geq 1,\quad\gamma\geq 1
\]
In this way, EfficientNet can achieve better classification performance under different resource conditions.
### Summary
The main contribution of this paper lies in constructing a large - scale Chinese herbal medicine classification dataset and applying the EfficientNet model for classification for the first time, significantly improving the classification effect. This provides new ideas and technical support for the intelligent identification and processing of Chinese herbal medicine.