Divide and Conquer: Stratifying Training Data by Tumor Grade Improves Deep Learning-Based Brain Tumor Segmentation

Rebsamen, Michael; Knecht, Urspeter; Reyes, Mauricio; Wiest, Roland; Meier, Raphael; McKinley, Richard (2019). Divide and Conquer: Stratifying Training Data by Tumor Grade Improves Deep Learning-Based Brain Tumor Segmentation. Frontiers in neuroscience, 13, p. 1182. Frontiers Research Foundation 10.3389/fnins.2019.01182

fnins-13-01182.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).

Download (1MB) | Preview

It is a general assumption in deep learning that more training data leads to better performance, and that models will learn to generalize well across heterogeneous input data as long as that variety is represented in the training set. Segmentation of brain tumors is a well-investigated topic in medical image computing, owing primarily to the availability of a large publicly-available dataset arising from the long-running yearly Multimodal Brain Tumor Segmentation (BraTS) challenge. Research efforts and publications addressing this dataset focus predominantly on technical improvements of model architectures and less on properties of the underlying data. Using the dataset and the method ranked third in the BraTS 2018 challenge, we performed experiments to examine the impact of tumor type on segmentation performance. We propose to stratify the training dataset into high-grade glioma (HGG) and low-grade glioma (LGG) subjects and train two separate models. Although we observed only minor gains in overall mean dice scores by this stratification, examining case-wise rankings of individual subjects revealed statistically significant improvements. Compared to a baseline model trained on both HGG and LGG cases, two separately trained models led to better performance in 64.9% of cases (p < 0.0001) for the tumor core. An analysis of subjects which did not profit from stratified training revealed that cases were missegmented which had poor image quality, or which presented clinically particularly challenging cases (e.g., underrepresented subtypes such as IDH1-mutant tumors), underlining the importance of such latent variables in the context of tumor segmentation. In summary, we found that segmentation models trained on the BraTS 2018 dataset, stratified according to tumor type, lead to a significant increase in segmentation performance. Furthermore, we demonstrated that this gain in segmentation performance is evident in the case-wise ranking of individual subjects but not in summary statistics. We conclude that it may be useful to consider the segmentation of brain tumors of different types or grades as separate tasks, rather than developing one tool to segment them all. Consequently, making this information available for the test data should be considered, potentially leading to a more clinically relevant BraTS competition.

Item Type:

Journal Article (Original Article)


10 Strategic Research Centers > ARTORG Center for Biomedical Engineering Research
04 Faculty of Medicine > Department of Radiology, Neuroradiology and Nuclear Medicine (DRNN) > Institute of Diagnostic and Interventional Neuroradiology
04 Faculty of Medicine > Pre-clinic Human Medicine > Institute for Surgical Technology & Biomechanics ISTB [discontinued]

Graduate School:

Graduate School for Cellular and Biomedical Sciences (GCB)

UniBE Contributor:

Rebsamen, Michael Andreas, Knecht, Urspeter, Reyes Aguirre, Mauricio Antonio, Wiest, Roland Gerhard Rudi, Meier, Raphael, McKinley, Richard


500 Science > 570 Life sciences; biology
600 Technology > 610 Medicine & health




Frontiers Research Foundation




Martin Zbinden

Date Deposited:

11 Dec 2019 15:54

Last Modified:

02 Mar 2023 23:32

Publisher DOI:


PubMed ID:






Actions (login required)

Edit item Edit item
Provide Feedback