DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

Ghamsarian, Negin; Taschwer, Mario; Sznitman, Raphael; Schoeffmann, Klaus (2022). DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos. In: Medical Image Computing and Computer Assisted Intervention – MICCAI 2022. Lecture Notes in Computer Science: Vol. 13435 (pp. 276-286). Springer 10.1007/978-3-031-16443-9_27

[img] Text
MICCAI2022_1080.pdf - Accepted Version
Restricted to registered users only
Available under License Publisher holds Copyright.

Download (8MB) | Request a copy
[img] Text
978-3-031-16443-9_27.pdf - Published Version
Restricted to registered users only
Available under License Publisher holds Copyright.

Download (2MB) | Request a copy

Semantic segmentation in cataract surgery has a wide range of applications contributing to surgical outcome enhancement and clinical risk reduction. However, the varying issues in segmenting the different relevant structures in these surgeries make the designation of a unique network quite challenging. This paper proposes a semantic segmentation network, termed DeepPyramid, that can deal with these challenges using three novelties: (1) a Pyramid View Fusion module which provides a varying-angle global view of the surrounding region centering at each pixel position in the input convolutional feature map; (2) a Deformable Pyramid Reception module which enables a wide deformable receptive field that can adapt to geometric transformations in the object of interest; and (3) a dedicated Pyramid Loss that adaptively supervises multi-scale semantic feature maps. Combined, we show that these modules can effectively boost semantic segmentation performance, especially in the case of transparency, deformability, scalability, and blunt edges in objects. We demonstrate that our approach performs at a state-of-the-art level and outperforms a number of existing methods with a large margin (3.66% overall improvement in intersection over union compared to the best rival approach).

Item Type:

Conference or Workshop Item (Paper)

Division/Institute:

10 Strategic Research Centers > ARTORG Center for Biomedical Engineering Research
10 Strategic Research Centers > ARTORG Center for Biomedical Engineering Research > ARTORG Center - AI in Medical Imaging Laboratory

UniBE Contributor:

Ghamsarian, Negin, Sznitman, Raphael

Subjects:

500 Science > 570 Life sciences; biology
600 Technology > 610 Medicine & health

ISBN:

978-3-031-16443-9

Series:

Lecture Notes in Computer Science

Publisher:

Springer

Funders:

[UNSPECIFIED] Haag Streit Foundation, Switzerland ; Organisations 0 not found.

Language:

English

Submitter:

Negin Ghamsarian

Date Deposited:

17 Nov 2023 09:05

Last Modified:

26 Nov 2023 02:26

Publisher DOI:

10.1007/978-3-031-16443-9_27

BORIS DOI:

10.48350/189044

URI:

https://boris.unibe.ch/id/eprint/189044

Actions (login required)

Edit item Edit item
Provide Feedback