Kurmann, Thomas; Márquez-Neila, Pablo; Allan, Max; Wolf, Sebastian; Sznitman, Raphael (2021). Mask then classify: multi-instance segmentation for surgical instruments. International journal of computer assisted radiology and surgery, 16(7), pp. 1227-1236. Springer 10.1007/s11548-021-02404-2
|
Text
Kurmann2021_Article_MaskThenClassifyMulti-instance.pdf - Published Version Available under License Creative Commons: Attribution (CC-BY). Download (8MB) | Preview |
PURPOSE
The detection and segmentation of surgical instruments has been a vital step for many applications in minimally invasive surgical robotics. Previously, the problem was tackled from a semantic segmentation perspective, yet these methods fail to provide good segmentation maps of instrument types and do not contain any information on the instance affiliation of each pixel. We propose to overcome this limitation by using a novel instance segmentation method which first masks instruments and then classifies them into their respective type.
METHODS
We introduce a novel method for instance segmentation where a pixel-wise mask of each instance is found prior to classification. An encoder-decoder network is used to extract instrument instances, which are then separately classified using the features of the previous stages. Furthermore, we present a method to incorporate instrument priors from surgical robots.
RESULTS
Experiments are performed on the robotic instrument segmentation dataset of the 2017 endoscopic vision challenge. We perform a fourfold cross-validation and show an improvement of over 18% to the previous state-of-the-art. Furthermore, we perform an ablation study which highlights the importance of certain design choices and observe an increase of 10% over semantic segmentation methods.
CONCLUSIONS
We have presented a novel instance segmentation method for surgical instruments which outperforms previous semantic segmentation-based methods. Our method further provides a more informative output of instance level information, while retaining a precise segmentation mask. Finally, we have shown that robotic instrument priors can be used to further increase the performance.