Evaluation of the reliability of measuring lower back muscles cross-sectional area based on manual segmentation within multi-level MRI images

A A Alhulail,M S Alshuhri,D F Al-Jolifiy,M N Al-Nuwaybit,A S Al-Tamimi,N M Al-Qahtani,R A Al-Hakami,M A Almanaa,A M Jabour
DOI: https://doi.org/10.1016/j.radi.2024.10.002
Abstract:Introduction: Manual segmentation of paraspinal muscle cross-sectional area (CSA) is widely used to assess related health disorders. This study aimed to evaluate the reliability of this segmentation process for each paraspinal region of interest across the three intervertebral levels commonly used for segmentation (L3/4, L4/5, and L5/S1). Methods: Axial-T2-weighted MRI images for 238 patients were divided among five raters (47 ± 1 cases each). To conduct the intra-rater reliability study, the CSA of each paraspinal lumber muscle (psoas major (PM), multifidus (MF), and erector spinae (ES)) and the intervertebral disc (ID) were manually segmented twice on all targeted levels before being assessed for each region per rater. The Inter-rater reliability was determined by comparing the results of different readers who segmented the same dataset. The Intraclass Correlation (ICC) and Coefficient-of-Variation percentage (CV%) were reported for each analysis. Results: Low intra- and inter-rater variability (CV%<11) was found for each reader in each region and intervertebral level. The inter-rater reliability was excellent (ICC>0.9) for the PM, ID, and ES at L3/4. However, it was very good for MF at all levels and ES at L4/5, L5/S1 (ICC range: 0.82-0.88) affected by the fat-infiltration nature of the ES and MF muscles, their proximity to each other, and their smaller size (correlation between muscle size and ICC = 0.6, P < 0.01). The ID segmentation has the lowest CV (<3 %) and excellent ICC (>0.93). Conclusion: Manual paraspinal muscle segmentation using axial-T2-weighted MRI is reliable at all commonly segmented intervertebral levels. However, the reliability level can be degraded by the presence of high-fat infiltrate, unclear muscle boundaries, and muscle size. Implications for practice: Following consistent guidelines can help improve segmentation results. The IDs can be used as reliable internal references.
What problem does this paper attempt to address?