Analysing Large Biological Data Sets with an Improved Algorithm for MIC.

Shuliang Wang,Yiping Zhao
DOI: https://doi.org/10.1504/ijdmb.2015.071548
2015-01-01
International Journal of Data Mining and Bioinformatics
Abstract:The computational framework used the traditional similarity measures to find out the significant relationships in biological annotations. But its prerequisites that the biological annotations do not cooccur with each other is particular. To overcome it, in this paper a new method Improved Algorithm for Maximal Information Coefficient (IAMIC) is suggested to discover the hidden regularities between biological annotations. IAMIC approximates a novel similarity coefficient on maximal information coefficient with generality and equitability, by bettering axis partition through quadratic optimisation instead of violence search. The experimental results show that IAMIC is more appropriate for identifying the associations between biological annotations, and further extracting the novel associations hidden in collected data sets than other similarity measures.
What problem does this paper attempt to address?