Bridging AI and Clinical Practice: Integrating Automated Sleep Scoring Algorithm with Uncertainty-Guided Physician Review.

Bechny, Michal; Monachino, Giuliana; Fiorillo, Luigi; van der Meer, Julia; Schmidt, Markus H; Bassetti, Claudio L. A.; Tzovara, Athina; Faraci, Francesca D (2024). Bridging AI and Clinical Practice: Integrating Automated Sleep Scoring Algorithm with Uncertainty-Guided Physician Review. Nature and science of sleep, 16, pp. 555-572. Dove Medical Press 10.2147/NSS.S455649

Preview

Text
NSS-455649-bridging-ai-and-clinical-practice--integrating-automated-sle.pdf - Published Version
Available under License Creative Commons: Attribution-Noncommercial (CC-BY-NC).
Download (3MB) | Preview

PURPOSE

This study aims to enhance the clinical use of automated sleep-scoring algorithms by incorporating an uncertainty estimation approach to efficiently assist clinicians in the manual review of predicted hypnograms, a necessity due to the notable inter-scorer variability inherent in polysomnography (PSG) databases. Our efforts target the extent of review required to achieve predefined agreement levels, examining both in-domain (ID) and out-of-domain (OOD) data, and considering subjects' diagnoses.

PATIENTS AND METHODS

A total of 19,578 PSGs from 13 open-access databases were used to train U-Sleep, a state-of-the-art sleep-scoring algorithm. We leveraged a comprehensive clinical database of an additional 8832 PSGs, covering a full spectrum of ages (0-91 years) and sleep-disorders, to refine the U-Sleep, and to evaluate different uncertainty-quantification approaches, including our novel confidence network. The ID data consisted of PSGs scored by over 50 physicians, and the two OOD sets comprised recordings each scored by a unique senior physician.

RESULTS

U-Sleep demonstrated robust performance, with Cohen's kappa (K) at 76.2% on ID and 73.8-78.8% on OOD data. The confidence network excelled at identifying uncertain predictions, achieving AUROC scores of 85.7% on ID and 82.5-85.6% on OOD data. Independently of sleep-disorder status, statistical evaluations revealed significant differences in confidence scores between aligning vs discording predictions, and significant correlations of confidence scores with classification performance metrics. To achieve κ ≥ 90% with physician intervention, examining less than 29.0% of uncertain epochs was required, substantially reducing physicians' workload, and facilitating near-perfect agreement.

CONCLUSION

Inter-scorer variability limits the accuracy of the scoring algorithms to ~80%. By integrating an uncertainty estimation with U-Sleep, we enhance the review of predicted hypnograms, to align with the scoring taste of a responsible physician. Validated across ID and OOD data and various sleep-disorders, our approach offers a strategy to boost automated scoring tools' usability in clinical settings.

Item Type:	Journal Article (Original Article)
Division/Institute:	08 Faculty of Science > Institute of Computer Science (INF) 04 Faculty of Medicine > Department of Head Organs and Neurology (DKNS) > Clinic of Neurology
UniBE Contributor:	Bechny, Michal, Monachino, Giuliana, van der Meer, Julia, Schmidt, Markus Helmut, Bassetti, Claudio L.A., Tzovara, Athina
Subjects:	000 Computer science, knowledge & systems 600 Technology > 610 Medicine & health 500 Science > 510 Mathematics
ISSN:	1179-1608
Publisher:	Dove Medical Press
Language:	English
Submitter:	Pubmed Import
Date Deposited:	03 Jun 2024 15:43
Last Modified:	03 Jun 2024 15:52
Publisher DOI:	10.2147/NSS.S455649
PubMed ID:	38827394
Uncontrolled Keywords:	automated sleep scoring explainable AI polysomnography sleep medicine uncertainty quantification
BORIS DOI:	10.48350/197517
URI:	https://boris.unibe.ch/id/eprint/197517

Actions (login required)

Edit item

Bridging AI and Clinical Practice: Integrating Automated Sleep Scoring Algorithm with Uncertainty-Guided Physician Review.

Interest & Impact

Downloads

Citations

Search

Services

Actions (login required)

Item Type:

Division/Institute:

UniBE Contributor:

Subjects:

ISSN:

Publisher:

Language:

Submitter:

Date Deposited:

Last Modified:

Publisher DOI:

PubMed ID:

Uncontrolled Keywords:

BORIS DOI:

URI:

Actions (login required)