Benchmarking Deep Learning Models for Tooth Structure Segmentation.

Schneider, L; Arsiwala-Scheppach, L; Krois, J; Meyer-Lueckel, H; Bressem, K K; Niehues, S M; Schwendicke, F (2022). Benchmarking Deep Learning Models for Tooth Structure Segmentation. Journal of dental research, 101(11), pp. 1343-1349. Sage 10.1177/00220345221100169

[img]
Preview
Text
00220345221100169.pdf - Published Version
Available under License Creative Commons: Attribution-Noncommercial (CC-BY-NC).

Download (2MB) | Preview

A wide range of deep learning (DL) architectures with varying depths are available, with developers usually choosing one or a few of them for their specific task in a nonsystematic way. Benchmarking (i.e., the systematic comparison of state-of-the art architectures on a specific task) may provide guidance in the model development process and may allow developers to make better decisions. However, comprehensive benchmarking has not been performed in dentistry yet. We aimed to benchmark a range of architecture designs for 1 specific, exemplary case: tooth structure segmentation on dental bitewing radiographs. We built 72 models for tooth structure (enamel, dentin, pulp, fillings, crowns) segmentation by combining 6 different DL network architectures (U-Net, U-Net++, Feature Pyramid Networks, LinkNet, Pyramid Scene Parsing Network, Mask Attention Network) with 12 encoders from 3 different encoder families (ResNet, VGG, DenseNet) of varying depth (e.g., VGG13, VGG16, VGG19). On each model design, 3 initialization strategies (ImageNet, CheXpert, random initialization) were applied, resulting overall into 216 trained models, which were trained up to 200 epochs with the Adam optimizer (learning rate = 0.0001) and a batch size of 32. Our data set consisted of 1,625 human-annotated dental bitewing radiographs. We used a 5-fold cross-validation scheme and quantified model performances primarily by the F1-score. Initialization with ImageNet or CheXpert weights significantly outperformed random initialization (P < 0.05). Deeper and more complex models did not necessarily perform better than less complex alternatives. VGG-based models were more robust across model configurations, while more complex models (e.g., from the ResNet family) achieved peak performances. In conclusion, initializing models with pretrained weights may be recommended when training models for dental radiographic analysis. Less complex model architectures may be competitive alternatives if computational resources and training time are restricting factors. Models developed and found superior on nondental data sets may not show this behavior for dental domain-specific tasks.

Item Type:

Journal Article (Original Article)

Division/Institute:

04 Faculty of Medicine > School of Dental Medicine > Department of Preventive, Restorative and Pediatric Dentistry

UniBE Contributor:

Meyer-Lückel, Hendrik

Subjects:

600 Technology > 610 Medicine & health

ISSN:

0022-0345

Publisher:

Sage

Language:

English

Submitter:

Pubmed Import

Date Deposited:

13 Jun 2022 08:17

Last Modified:

05 Dec 2022 16:20

Publisher DOI:

10.1177/00220345221100169

PubMed ID:

35686357

Uncontrolled Keywords:

artificial intelligence computer vision neural networks segmentation tooth structures transfer learning

BORIS DOI:

10.48350/170584

URI:

https://boris.unibe.ch/id/eprint/170584

Actions (login required)

Edit item Edit item
Provide Feedback