Multi-kernel multi-instance learning based diabetic retinopathy diagnosis
Fulong Ren,Peng Cao,Jinzhu Yang,Chao Wan,Dazhe Zhao
DOI: https://doi.org/10.11834/jig.170470
2018-01-01
Journal of Image and Graphics
Abstract:Objective Diabetic retinopathy (DR) is one of the complications of diabetes and causes severe vision loss and blindness in severe cases if left untreated.A regular eye examination is important for initial diagnosis and early treatment.The change in the blood vessels of the retina is the leading cause of DR.The form of red lesions,such as hemorrhage/microaneurysm (HMA),is the first explicit sign and an important symptom of DR.Hence,in the traditional DR diagnosis system,the accuracy of HMA lesion detection determines the final diagnosis performance.The diagnosis method produces a large number of false positive samples for high sensitivity,and the supervised classification model is ineffective in removing false positives because the dataset does not label the lesion area.A new algorithm based on multi-kernel and multi-instance learning is proposed to solve the problem of supervised learning in DR diagnosis.Method First,a multi-scale morphological top-hat transformation is employed to enhance blood vessels and red lesions on the green channel image,then the main vessels of the retina are segmented by thresholding technique on the mask image obtained by binarizing the enhanced image.All regions of interest are generated by subtracting the main vessels from the mask image,and a connected-component labeling technique based on region growing is conducted to detect the suspicious HMA.The detected HMA areas are considered instances,and the entire image is considered a bag.Thus,the problem of DR diagnosis is considered a multi-instance learning problem.Second,a 37D feature based on color,texture,and shape is extracted for each candidate HMA to describe the instance in multi-instance learning.Numerous suspected HMAs are generally obtained to ensure high sensitivity in the initial detection of the lesions,but many HMAs would produce a negative effect on the performance of the multi-instance learning.An extreme learning machine (ELM)-based classifier is accordingly constructed to filter irrelevant instances for improving the multi-instance learning performance.Nevertheless,no such database that contains both the ground truth of DR diagnosis at an image label and HMA segmentation at a lesion label is publicly available.For example,the MESSIDOR dataset contains diagnosis information for DR but not the ground-truth location of HMAs,whereas the E-ophtha dataset contains the location information of HMAs without the diagnosis label.Consequently,the ELM-based classifier trained on the E-ophtha dataset cannot be applied directly to the MESSIDOR dataset due to the difference between datasets.A threshold on the output probability value of the ELM-based classifier is designed to filter the irrelevant instances,and the best threshold can be obtained by cross validation on the training set.Finally,a multi-instance learning method,mi-Graph,which assumes that the instances in a bag are not independently and identically distributed,combined with a multi-kernel learning framework,is adopted for DR diagnosis.The method implicitly constructs graphs by deriving affinity matrices and defines an efficient graph kernel considering clique information.The kernel in the multi-kernel learning is defined as a linear combination of multiple kernels,including Gaussian,polynomial,and linear kernels.As a result,a multi-instance learning model based on multi-kernel graph is constructed to classify the input retinal image into DR or no-DR status.Result The evaluation is implemented on 1 200 images from the publicly available MESSIDOR dataset,which provides the DR diagnosis results.We verify the effectiveness of the proposed method and the irrelevant instance filtration method.The contributions of different features to DR diagnosis in a multi-kernel learning framework are analyzed.We compare our method with other multi-instance learning methods,such as iterative axis parallel rectangle,expectation-maximization diverse density,citation-k-nearest neighbor,and multiple-instance support vector machine.Our method and other DR diagnosis methods on the MESSIDOR dataset are also compared.Our proposed method achieved an accuracy of 90.1%,sensitivity of 92.4%,specificity of 91.4%,and area under the receiver operating characteristic curve of 0.932.Results show that the proposed method performs better than the other methods and is comparable to previous methods.Conclusion A multi-instance learning algorithm is introduced into DR diagnosis in this study.The detected HMAs and the entire image are considered instances and a bag of multi-instance learning,respectively.The relationship among the instances in a bag is established by using a kernel graph.A multi-kernel learning framework is adopted to enhance the generalization classification performance.Consequently,a multi-instance learning model based on multi-kernel graph is constructed for DR diagnosis.The experimental results indicate that the proposed approach can be used to diagnose DR efficiently without label information of suspicious lesions to avoid the time-consuming effort of labeling the lesions by specialists and false positive reduction.