DGDFS: Dependence Guided Discriminative Feature Selection for Predicting Adverse Drug-Drug Interaction

Jiajing Zhu, Yongguo Liu, Chuanbiao Wen, Xindong Wu
DOI: https://doi.org/10.1109/tkde.2020.2978055
IF: 9.235
2020-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: Adverse drug-drug interaction (ADDI) refers to the unpleasant or adverse effects caused by the co-administration of two drugs, and it has become a significant public health problem. With the increasing availability of healthcare data, many methods have been proposed for ADDI prediction. However, these methods usually work in a "nondiscriminatory" manner, i.e., they treat all features alike and incorporate them equally into the predictive models. In practice, only a few features are discriminative and relevant to ADDIs. In this paper, we propose a Dependence Guided Discriminative Feature Selection (DGDFS) model for ADDI prediction. In DGDFS, two drug attributes, molecular structure and side effect, are adopted to model the adverse interactions among drugs, and $\ell_{2,0}$-norm equality constraints are introduced to select discriminative molecular substructures and side effects for ADDI prediction. In addition, three dependence guided terms, i.e., the dependence between molecular structure and ADDI, the dependence between side effect and ADDI, and the dependence between molecular structure and side effect, are designed to guide feature selection. An iterative algorithm based on the alternating direction method of multipliers is developed for optimization. Experimental results indicate the effectiveness of DGDFS compared with fourteen baselines and its three variants.
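To make the role of the l2,0-norm constraint concrete: under the standard definition, the l2,0-norm of a weight matrix counts its nonzero rows, so fixing it to k amounts to selecting exactly k features. The following is a minimal Python sketch of that idea only, assuming a small hypothetical weight matrix `W` with one row per feature; the function names are illustrative, and this is not the authors' ADMM-based algorithm.

```python
import math

def row_norms(W):
    """Euclidean (l2) norm of each row of W, given as a list of lists."""
    return [math.sqrt(sum(x * x for x in row)) for row in W]

def l20_norm(W, tol=1e-12):
    """l2,0-norm: the number of rows whose l2 norm is nonzero."""
    return sum(1 for n in row_norms(W) if n > tol)

def keep_top_k_rows(W, k):
    """Zero all but the k rows with the largest l2 norm."""
    norms = row_norms(W)
    order = sorted(range(len(W)), key=lambda i: norms[i], reverse=True)
    top = set(order[:k])
    return [list(row) if i in top else [0.0] * len(row)
            for i, row in enumerate(W)]

# Hypothetical weights: 4 features (rows) x 2 outputs (columns).
W = [[0.9, -0.4], [0.0, 0.0], [0.05, 0.02], [-0.7, 0.6]]

# Enforce an l2,0-norm of 2, i.e. keep exactly two features.
W_sel = keep_top_k_rows(W, k=2)

# Indices of the features that survive the selection.
selected = [i for i, n in enumerate(row_norms(W_sel)) if n > 0]
```

Keeping the rows with the largest norms is one common way to satisfy a row-sparsity constraint of this kind; how DGDFS handles the constraint inside its optimization is not specified in the abstract.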
computer science, information systems, artificial intelligence, engineering, electrical & electronic
What problem does this paper attempt to address?