Willers, Corin; Bauman, Grzegorz; Andermatt, Simon; Santini, Francesco; Sandkühler, Robin; Ramsey, Kathryn A.; Cattin, Philippe C; Bieri, Oliver; Pusterla, Orso; Latzin, Philipp (2021). The impact of segmentation on whole-lung functional MRI quantification: Repeatability and reproducibility from multiple human observers and an artificial neural network. Magnetic resonance in medicine, 85(2), pp. 1079-1092. Wiley-Liss 10.1002/mrm.28476
Text
The impact of segmentation on whole-lung functional MRI quantification_Repeatability and reproducibility from multiple human observers and an artificial neural network.pdf - Published Version Restricted to registered users only Available under License Publisher holds Copyright. Download (2MB) |
||
|
Text
Impact_of_segmentation_on_functional_lung_MRI_WillersC_20200713_MRMbiblio_revised_clean_with_Fig.pdf - Accepted Version Available under License Publisher holds Copyright. Download (1MB) | Preview |
PURPOSE
To investigate the repeatability and reproducibility of lung segmentation and their impact on the quantitative outcomes from functional pulmonary MRI. Additionally, to validate an artificial neural network (ANN) to accelerate whole-lung quantification.
METHOD
Ten healthy children and 25 children with cystic fibrosis underwent matrix pencil decomposition MRI (MP-MRI). Impaired relative fractional ventilation (RFV ) and relative perfusion (RQ ) from MP-MRI were compared using whole-lung segmentation performed by a physician at two time-points (At1 and At2 ), by an MRI technician (B), and by an ANN (C). Repeatability and reproducibility were assess with Dice similarity coefficient (DSC), paired t-test and Intraclass-correlation coefficient (ICC).
RESULTS
The repeatability within an observer (At1 vs At2 ) resulted in a DSC of 0.94 ± 0.01 (mean ± SD) and an unsystematic difference of -0.01% for RFV (P = .92) and +0.1% for RQ (P = .21). The reproducibility between human observers (At1 vs B) resulted in a DSC of 0.88 ± 0.02, and a systematic absolute difference of -0.81% (P < .001) for RFV and -0.38% (P = .037) for RQ . The reproducibility between human and the ANN (At1 vs C) resulted in a DSC of 0.89 ± 0.03 and a systematic absolute difference of -0.36% for RFV (P = .017) and -0.35% for RQ (P = .002). The ICC was >0.98 for all variables and comparisons.
CONCLUSIONS
Despite high overall agreement, there were systematic differences in lung segmentation between observers. This needs to be considered for longitudinal studies and could be overcome by using an ANN, which performs as good as human observers and fully automatizes MP-MRI post-processing.