Why Deep Surgical Models Fail?:
Revisiting Surgical Action Triplet Recognition through the Lens of Robustness
Yanqi Cheng1
Lihao Liu1
Shujun Wang1
Yueming Jin2
Carola-Bibiane Schönlieb1
Angelica I. Aviles-Rivero1
1DAMTP, University of Cambridge and 2WEISS, UCL

affiliations

[Paper]
[Presentation]
The image provides a quick visualisation of the results we obtained.
This is a website made for Why Deep Surgical Models Fail?: Revisiting Surgical Action Triplet Recognition through the Lens of Robustness.

Abstract

Surgical action triplet recognition provides a better understanding of the surgical scene. This task is of high relevance as it provides the surgeon with context-aware support and safety. The current go-to strategy for improving performance is the development of new network mechanisms. However, the performance of current state-of-the-art techniques is substantially lower than other surgical tasks. Why is this happening? This is the question that we address in this work. We present the first study to understand the failure of existing deep learning models through the lens of robustness and explainability. Firstly, we study current existing models under weak and strong δ−perturbations via an adversarial optimisation scheme. We then analyse the failure modes via feature based explanations. Our study reveals that the key to improving performance and increasing reliability is in the core and spurious attributes. Our work opens the door to more trustworthy and reliable deep learning models in surgical data science.


Demo


Method

We consider the current three SOTA techniques for our study: Tripnet, Attention Tripnet and Rendezvous; and extensively investigate the repercussion of deep features using four widely used backbones ResNet-18, ResNet-50, DenseNet-121 and Swin Transformer. Based on the networks output, we then evaluate robustness via adversarial optimisation to analyse the feature based explanations.


Paper and Supplementary Material


Yanqi Cheng, Lihao Liu, Shujun Wang, Yueming Jin, Carola-Bibiane Schönlieb and Angelica I. Aviles-Rivero.
Why Deep Surgical Models Fail?: Revisiting Surgical Action Triplet Recognition through the Lens of Robustness.

International Conference in Learning Representation (ICLR23) Workshop


[Paper]   [Bibtex]



Acknowledgements

YC and AIAR greatly acknowledge support from a C2D3 Early Career Research Seed Fund and CMIH EP/T017961/1, University of Cambridge. CBS acknowledges support from the Philip Leverhulme Prize, the Royal Society Wolfson Fellowship, the EPSRC advanced career fellowship EP/V029428/1, EPSRC grants EP/S026045/1 and EP/T003553/1, EP/N014588/1, EP/T017961/1, the Wellcome Innovator Awards 215733/Z/19/Z and 221633/Z/20/Z, the European Union Horizon 2020 research and innovation programme under the Marie Skodowska-Curie grant agreement No. 777826 NoMADS, the Cantab Capital Institute for the Mathematics of Information and the Alan Turing Institute.