Rizzo, Rudy; Dziadosz, Martyna; Kyathanahally, Sreenath P; Shamaei, Amirmohammad; Kreis, Roland (2023). Quantification of MR spectra by deep learning in an idealized setting: Investigation of forms of input, network architectures, optimization by ensembles of networks, and training bias. Magnetic resonance in medicine, 89(5), pp. 1707-1727. Wiley-Liss 10.1002/mrm.29561
Full text (PDF): Published Version, available under License Creative Commons: Attribution-Noncommercial (CC-BY-NC).
PURPOSE
The aims of this work are (1) to explore deep learning (DL) architectures, spectroscopic input types, and learning designs toward optimal quantification in MR spectroscopy of simulated pathological spectra; and (2) to demonstrate accuracy and precision of DL predictions in view of inherent bias toward the training distribution.
METHODS
Simulated 1D spectra and 2D spectrograms that mimic an extensive range of pathological in vivo conditions are used to train and test 24 different DL architectures. Active learning through altered training and testing data distributions is probed to optimize quantification performance. Ensembles of networks are explored to improve DL robustness and reduce the variance of estimates. A set of scores compares performances of DL predictions and traditional model fitting (MF).
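The variance-reduction idea behind ensembles can be illustrated with a toy simulation. This is a minimal sketch, not the networks or data of the study: each "model" is a hypothetical estimator that returns the true value plus independent Gaussian noise, and the names `simulate_predictions` and `ensemble_mean` are invented for illustration. The independence assumption is idealized; networks trained on the same data usually have correlated errors, so real gains are likely smaller.

```python
import random
import statistics


def simulate_predictions(true_value, noise_sd, n_models, rng):
    """Return one noisy prediction per hypothetical model (independent errors assumed)."""
    return [true_value + rng.gauss(0.0, noise_sd) for _ in range(n_models)]


def ensemble_mean(predictions):
    """Combine the model outputs by plain averaging."""
    return sum(predictions) / len(predictions)


rng = random.Random(0)  # fixed seed so the run is reproducible
true_value, noise_sd, n_models, n_trials = 1.0, 0.2, 5, 2000

single = []    # predictions of one model alone
ensemble = []  # averaged predictions of all models
for _ in range(n_trials):
    preds = simulate_predictions(true_value, noise_sd, n_models, rng)
    single.append(preds[0])
    ensemble.append(ensemble_mean(preds))

single_sd = statistics.pstdev(single)
ensemble_sd = statistics.pstdev(ensemble)
print(f"single model SD: {single_sd:.3f}")
print(f"ensemble SD:     {ensemble_sd:.3f}")
```

Under these assumptions the spread of the averaged estimate shrinks roughly with the square root of the number of models, while any bias shared by all models is left unchanged.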
RESULTS
Ensembles of heterogeneous networks that combine 1D frequency-domain spectra and 2D time-frequency domain spectrograms as input perform best. Dataset augmentation with active learning can improve performance, but gains are limited. MF is more accurate, although DL appears to be more precise at low SNR. However, this apparent gain in precision originates from a strong bias, in cases with high uncertainty, toward the dataset the network has been trained with: predictions tend toward its average value.
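The trade-off between precision and bias toward the training average can be made concrete with a textbook Gaussian toy model. This is a sketch under stated assumptions, not the analysis performed in the study: the training distribution is assumed Gaussian with mean `prior_mean` and variance `prior_var`, the measurement noise is assumed Gaussian with variance `noise_var`, and the DL predictor is idealized as the minimum mean-squared-error estimator for that setting, which is a weighted average of the measurement and the training mean. All function and variable names are hypothetical.

```python
def shrinkage_weight(prior_var, noise_var):
    """Weight given to the measurement; the rest goes to the training mean."""
    return prior_var / (prior_var + noise_var)


def estimator_stats(true_value, prior_mean, prior_var, noise_var):
    """Bias and SD of x_hat = w * y + (1 - w) * prior_mean, with y = true_value + noise."""
    w = shrinkage_weight(prior_var, noise_var)
    bias = (1.0 - w) * (prior_mean - true_value)  # pull toward the training mean
    sd = w * noise_var ** 0.5                     # noise is scaled down by w
    return bias, sd


prior_mean, prior_var = 1.0, 0.25  # assumed training distribution
true_value = 2.0                   # an outlier, two prior SDs above the mean

# High SNR (small noise variance) versus low SNR (large noise variance).
bias_high, sd_high = estimator_stats(true_value, prior_mean, prior_var, noise_var=0.01)
bias_low, sd_low = estimator_stats(true_value, prior_mean, prior_var, noise_var=1.0)

# An unbiased estimator that returns the measurement itself has bias 0 and SD sqrt(noise_var).
unbiased_sd_low = 1.0 ** 0.5

print(f"high SNR: bias={bias_high:+.3f}, SD={sd_high:.3f}")
print(f"low SNR:  bias={bias_low:+.3f}, SD={sd_low:.3f} (unbiased SD={unbiased_sd_low:.3f})")
```

In this toy setting the low-SNR estimate has a much smaller spread than the unbiased one, but only because it is pulled strongly toward the training mean, which is the pattern described above for outlier cases.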
CONCLUSION
MF mostly performs better than the faster DL approach. Potential intrinsic biases toward the training set are dangerous in a clinical context that requires the algorithm to be unbiased for outliers (i.e., pathological data). Active learning and ensembles of networks are good strategies to improve prediction performance. However, data quality (sufficient SNR) has proven to be a bottleneck for adequate unbiased performance, as in the case of MF.