Vu, Minh H.; Löfstedt, Tommy; Nyholm, Tufve; Sznitman, Raphael (2020). A Question-Centric Model for Visual Question Answering in Medical Imaging. IEEE transactions on medical imaging, 39(9), pp. 2856-2868. Institute of Electrical and Electronics Engineers IEEE 10.1109/TMI.2020.2978284
Text
09024133.pdf - Published Version Restricted to registered users only Available under License Publisher holds Copyright. Download (2MB) |
Deep learning methods have proven extremely effective at performing a variety of medical image analysis tasks. With their potential use in clinical routine, their lack of transparency has however been one of their few weak points, raising concerns regarding their behavior and failure modes. While most research to infer model behavior has focused on indirect strategies that estimate prediction uncertainties and visualize model support in the input image space, the ability to explicitly query a prediction model regarding its image content offers a more direct way to determine the behavior of trained models. To this end, we present a novel Visual Question Answering approach that allows an image to be queried by means of a written question. Experiments on a variety of medical and natural image datasets show that by fusing image and question features in a novel way, the proposed approach achieves an equal or higher accuracy compared to current methods.
Item Type: |
Journal Article (Original Article) |
---|---|
Division/Institute: |
10 Strategic Research Centers > ARTORG Center for Biomedical Engineering Research > ARTORG Center - AI in Medical Imaging Laboratory |
UniBE Contributor: |
Sznitman, Raphael |
Subjects: |
000 Computer science, knowledge & systems 600 Technology 600 Technology > 610 Medicine & health |
ISSN: |
0278-0062 |
Publisher: |
Institute of Electrical and Electronics Engineers IEEE |
Language: |
English |
Submitter: |
Raphael Sznitman |
Date Deposited: |
13 May 2020 11:47 |
Last Modified: |
05 Dec 2022 15:37 |
Publisher DOI: |
10.1109/TMI.2020.2978284 |
BORIS DOI: |
10.7892/boris.141690 |
URI: |
https://boris.unibe.ch/id/eprint/141690 |