Federated learning for predicting clinical outcomes in patients with COVID-19
Journal
Nature Medicine
Journal Volume
27
Journal Issue
10
Pages
1735-1743
Date Issued
2021
Author(s)
Dayan I
Roth H.R
Zhong A
Harouni A
Gentili A
Abidin A.Z
Liu A
Costa A.B
Wood B.J
Tsai C.-S
Wang C.-H
Hsu C.-N
Lee C.K
Ruan P
Xu D
Wu D
Huang E
Kitamura F.C
Lacey G
de Antônio Corradi G.C
Nino G
Shin H.-H
Obinata H
Ren H
Crane J.C
Tetreault J
Guan J
Garrett J.W
Kaggie J.D
Park J.G
Dreyer K
Juluru K
Kersten K
Rockenbach M.A.B.C
Linguraru M.G
Haider M.A
AbdelMaseeh M
Rieke N
Damasceno P.F
e Silva P.M.C
Wang P
Xu S
Kawano S
Sriswasdi S
Park S.Y
Grist T.M
Buch V
Jantarabenjakul W
Tak W.Y
Li X
Lin X
Kwon Y.J
Quraini A
Feng A
Priest A.N
Turkbey B
Glicksberg B
Bizzo B
Kim B.S
Tor-Díez C
Lee C.-C
Hsu C.-J
Lin C
Lai C.-L
Hess C.P
Compas C
Bhatia D
Oermann E.K
Leibovitz E
Sasaki H
Mori H
Yang I
Sohn J.H
Murthy K.N.K
de Mendonça M.R.F
Fralick M
Kang M.K
Adil M
Gangai N
Vateekul P
Elnajjar P
Hickman S
Majumdar S
McLeod S.L
Reed S
Gräf S
Harmon S
Kodama T
Puthanakit T
Mazzulli T
de Lavor V.L
Rakvongthai Y
Lee Y.R
Wen Y
Gilbert F.J
Flores M.G
Li Q.
Abstract
Federated learning (FL) is a method used for training artificial intelligence models with data from multiple sources while maintaining data anonymity, thus removing many barriers to data sharing. Here we used data from 20 institutes across the globe to train a FL model, called EXAM (electronic medical record (EMR) chest X-ray AI model), that predicts the future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays. EXAM achieved an average area under the curve (AUC) >0.92 for predicting outcomes at 24 and 72 h from the time of initial presentation to the emergency room, and it provided 16% improvement in average AUC measured across all participating sites and an average increase in generalizability of 38% when compared with models trained at a single site using that site’s data. For prediction of mechanical ventilation treatment or death at 24 h at the largest independent test site, EXAM achieved a sensitivity of 0.950 and specificity of 0.882. In this study, FL facilitated rapid data science collaboration without data exchange and generated a model that generalized across heterogeneous, unharmonized datasets for prediction of clinical outcomes in patients with COVID-19, setting the stage for the broader use of FL in healthcare. ? 2021, The Author(s), under exclusive licence to Springer Nature America, Inc.
Federated learning (FL) is a method used for training artificial intelligence models with data from multiple sources while maintaining data anonymity, thus removing many barriers to data sharing. Here we used data from 20 institutes across the globe to train a FL model, called EXAM (electronic medical record (EMR) chest X-ray AI model), that predicts the future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays. EXAM achieved an average area under the curve (AUC) >0.92 for predicting outcomes at 24 and 72 h from the time of initial presentation to the emergency room, and it provided 16% improvement in average AUC measured across all participating sites and an average increase in generalizability of 38% when compared with models trained at a single site using that site’s data. For prediction of mechanical ventilation treatment or death at 24 h at the largest independent test site, EXAM achieved a sensitivity of 0.950 and specificity of 0.882. In this study, FL facilitated rapid data science collaboration without data exchange and generated a model that generalized across heterogeneous, unharmonized datasets for prediction of clinical outcomes in patients with COVID-19, setting the stage for the broader use of FL in healthcare. © 2021, The Author(s), under exclusive licence to Springer Nature America, Inc.
Subjects
C reactive protein
oxygen
adult
Article
artificial ventilation
clinical outcome
cohort analysis
comparative study
controlled study
coronavirus disease 2019
electronic medical record
emergency ward
female
health care system
human
male
oxygen consumption
oxygen therapy
prediction
sensitivity and specificity
thorax radiography
electronic health record
isolation and purification
machine learning
pathophysiology
prognosis
therapy
virology
COVID-19
Electronic Health Records
Humans
Machine Learning
Outcome Assessment, Health Care
Prognosis
SARS-CoV-2
SDGs
Other Subjects
C reactive protein; oxygen; adult; Article; artificial ventilation; clinical outcome; cohort analysis; comparative study; controlled study; coronavirus disease 2019; electronic medical record; emergency ward; female; health care system; human; male; oxygen consumption; oxygen therapy; prediction; sensitivity and specificity; thorax radiography; electronic health record; isolation and purification; machine learning; pathophysiology; prognosis; therapy; virology; COVID-19; Electronic Health Records; Humans; Machine Learning; Outcome Assessment, Health Care; Prognosis; SARS-CoV-2
Type
journal article