Surgical Phase Recognition: From Public Datasets to Real-World Data

Kirtac, Kadir; Nizamettin, Aydin; Lavanchy, Joël L.; Beldi, Guido; Marco, Smit; Woods, Michael S.; Aspart, Florian (2022). Surgical Phase Recognition: From Public Datasets to Real-World Data. Applied Sciences, 12(17), p. 8746. MDPI 10.3390/app12178746

[img]
Preview
Text
Publication.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).

Download (964kB) | Preview

Automated recognition of surgical phases is a prerequisite for computer-assisted analysis of surgeries. The research on phase recognition has been mostly driven by publicly available datasets of laparoscopic cholecystectomy (Lap Chole) videos. Yet, videos observed in real-world settings might contain challenges, such as additional phases and longer videos, which may be missing in curated public datasets. In this work, we study (i) the possible data distribution discrepancy between videos observed in a given medical center and videos from existing public datasets, and (ii) the potential impact of this distribution difference on model development. To this end, we gathered a large, private dataset of 384 Lap Chole videos. Our dataset contained all videos, including emergency surgeries and teaching cases, recorded in a continuous time frame of five years. We observed strong differences between our dataset and the most commonly used public dataset for surgical phase recognition, Cholec80. For instance, our videos were much longer, included additional phases, and had more complex transitions between phases. We further trained and compared several state-of-the-art phase recognition models on our dataset. The models’ performances greatly varied across surgical phases and videos. In particular, our results highlighted the challenge of recognizing extremely under- represented phases (usually missing in public datasets); the major phases were recognized with at least 76 percent recall. Overall, our results highlighted the need to better understand the distribution of the video data phase that recognition models are trained on.

Item Type:

Journal Article (Original Article)

Division/Institute:

04 Faculty of Medicine > Department of Gastro-intestinal, Liver and Lung Disorders (DMLL) > Clinic of Visceral Surgery and Medicine > Visceral Surgery
04 Faculty of Medicine > Department of Gastro-intestinal, Liver and Lung Disorders (DMLL) > Clinic of Visceral Surgery and Medicine

UniBE Contributor:

Lavanchy, Joël Lukas, Beldi, Guido Jakob Friedrich

Subjects:

600 Technology > 610 Medicine & health
000 Computer science, knowledge & systems

ISSN:

2076-3417

Publisher:

MDPI

Language:

English

Submitter:

Joël Lukas Lavanchy

Date Deposited:

07 Dec 2022 11:34

Last Modified:

07 May 2024 07:59

Publisher DOI:

10.3390/app12178746

BORIS DOI:

10.48350/172586

URI:

https://boris.unibe.ch/id/eprint/172586

Actions (login required)

Edit item Edit item
Provide Feedback