Analysis of Protein and Fat in Milk Using Multiwavelength Gradient-Boosted Regression Tree

Tao Sheng,Shengzhe Shi,Yuanyang Zhu,Debao Chen,Sheng Liu
DOI: https://doi.org/10.1109/tim.2022.3165298
IF: 5.6
2022-01-01
IEEE Transactions on Instrumentation and Measurement
Abstract:Traditional chemical measurement methods for the milk composition are not only time-consuming and laborious but also highly polluting. This has necessitated the development of a new method to facilitate fast, easy, and real-time determination of milk composition. This article presents the use of a multichannel infrared spectral sensor and broadband infrared (IR) light source to obtain multiwavelength feature data simultaneously. Furthermore, the gradient-boosted regression tree (GBRT) algorithm was used to develop a method for accurate milk content determination under different conditions. To this end, we developed a near-infrared (NIR) light-strength-acquisition device and accompanying software, compared the effectiveness of different machine learning algorithms, and established an optimal prediction model. Subsequently, the optimal prediction network was selected depending on the milk composition, thereby realizing the highest prediction accuracy. The results obtained in this study revealed that the milk protein and fat contents could be determined from the NIR absorption multispectra based on machine learning of the corresponding samples with coefficients of determination ( <span class="mjpage"><svg xmlns:xlink="http://www.w3.org/1999/xlink" width="2.818ex" height="2.509ex" style="vertical-align: -0.338ex;" viewBox="0 -934.9 1213.4 1080.4" role="img" focusable="false" xmlns="http://www.w3.org/2000/svg"><g stroke="currentColor" fill="currentColor" stroke-width="0" transform="matrix(1 0 0 -1 0 0)"> <use xlink:href="#MJMATHI-52" x="0" y="0"></use> <use transform="scale(0.707)" xlink:href="#MJMAIN-32" x="1074" y="513"></use></g></svg></span> ) values of 0.949 and 0.996, respectively. The corresponding root-mean-squared estimation errors of the prediction were 0.058 and 0.085, respectively. These experimental results indicate that the proposed milk quality evaluation system can be used to obtain real-time results. Moreover, it is simple, fast, affordable, and environmentally friendly.<svg xmlns="http://www.w3.org/2000/svg" style="display: none;"><defs id="MathJax_SVG_glyphs"><path stroke-width="1" id="MJMATHI-52" d="M230 637Q203 637 198 638T193 649Q193 676 204 682Q206 683 378 683Q550 682 564 680Q620 672 658 652T712 606T733 563T739 529Q739 484 710 445T643 385T576 351T538 338L545 333Q612 295 612 223Q612 212 607 162T602 80V71Q602 53 603 43T614 25T640 16Q668 16 686 38T712 85Q717 99 720 102T735 105Q755 105 755 93Q755 75 731 36Q693 -21 641 -21H632Q571 -21 531 4T487 82Q487 109 502 166T517 239Q517 290 474 313Q459 320 449 321T378 323H309L277 193Q244 61 244 59Q244 55 245 54T252 50T269 48T302 46H333Q339 38 339 37T336 19Q332 6 326 0H311Q275 2 180 2Q146 2 117 2T71 2T50 1Q33 1 33 10Q33 12 36 24Q41 43 46 45Q50 46 61 46H67Q94 46 127 49Q141 52 146 61Q149 65 218 339T287 628Q287 635 230 637ZM630 554Q630 586 609 608T523 636Q521 636 500 636T462 637H440Q393 637 386 627Q385 624 352 494T319 361Q319 360 388 360Q466 361 492 367Q556 377 592 426Q608 449 619 486T630 554Z"></path><path stroke-width="1" id="MJMAIN-32" d="M109 429Q82 429 66 447T50 491Q50 562 103 614T235 666Q326 666 387 610T449 465Q449 422 429 383T381 315T301 241Q265 210 201 149L142 93L218 92Q375 92 385 97Q392 99 409 186V189H449V186Q448 183 436 95T421 3V0H50V19V31Q50 38 56 46T86 81Q115 113 136 137Q145 147 170 174T204 211T233 244T261 278T284 308T305 340T320 369T333 401T340 431T343 464Q343 527 309 573T212 619Q179 619 154 602T119 569T109 550Q109 549 114 549Q132 549 151 535T170 489Q170 464 154 447T109 429Z"></path></defs></svg>
What problem does this paper attempt to address?