Data Splitting Method of Distance Metric Learning Based on Gaussian Mixed Model

ZHENG Dezhong,YANG Yuanyuan,XIE Zhe,NI Yangfan,LI Wentao
DOI: https://doi.org/10.16183/j.cnki.jsjtu.2020.082
2021-01-01
Abstract:Aimed at the problem of instability and deviation of multiple training model in limited samples, this paper proposes a method of distance metric learning based on the Gaussian mixture model, which can solve this problem more reasonably by dividing the dataset. Distance metric learning relies on the excellent feature extraction capabilities of deep neural networks to embed the original data into the new metric space. Then, based on the deep features, the Gaussian mixture model is used to cluster the analyzer and estimate the sample distribution in this new metric space. Finally, according to the characteristics of sample distribution, stratified sampling is used to reasonably divide the data. The research shows that the method proposed can better understand the characteristics of data distribution and obtain a more reasonable data division, thereby improving the accuracy and generalization of the model.
What problem does this paper attempt to address?