A Prediction Model to Prioritize Individuals for a SARS-CoV-2 Test Built from National Symptom Surveys.
Artificial Intelligence
COVID-19
Diagnosis
Health Policies
Machine Learning
SARS-CoV-2
Journal
Med (New York, N.Y.)
ISSN: 2666-6340
Titre abrégé: Med
Pays: United States
ID NLM: 101769215
Informations de publication
Date de publication:
12 02 2021
12 02 2021
Historique:
received:
09
08
2020
revised:
07
09
2020
accepted:
06
10
2020
pubmed:
20
10
2020
medline:
20
10
2020
entrez:
19
10
2020
Statut:
ppublish
Résumé
The gold standard for COVID-19 diagnosis is detection of viral RNA through PCR. Due to global limitations in testing capacity, effective prioritization of individuals for testing is essential. We devised a model estimating the probability of an individual to test positive for COVID-19 based on answers to 9 simple questions that have been associated with SARS-CoV-2 infection. Our model was devised from a subsample of a national symptom survey that was answered over 2 million times in Israel in its first 2 months and a targeted survey distributed to all residents of several cities in Israel. Overall, 43,752 adults were included, from which 498 self-reported as being COVID-19 positive. Our model was validated on a held-out set of individuals from Israel where it achieved an auROC of 0.737 (CI: 0.712-0.759) and auPR of 0.144 (CI: 0.119-0.177) and demonstrated its applicability outside of Israel in an independently collected symptom survey dataset from the US, UK, and Sweden. Our analyses revealed interactions between several symptoms and age, suggesting variation in the clinical manifestation of the disease in different age groups. Our tool can be used online and without exposure to suspected patients, thus suggesting worldwide utility in combating COVID-19 by better directing the limited testing resources through prioritization of individuals for testing, thereby increasing the rate at which positive individuals can be identified. Moreover, individuals at high risk for a positive test result can be isolated prior to testing. E.S. is supported by the Crown Human Genome Center, Larson Charitable Foundation New Scientist Fund, Else Kroener Fresenius Foundation, White Rose International Foundation, Ben B. and Joyce E. Eisenberg Foundation, Nissenbaum Family, Marcos Pinheiro de Andrade and Vanessa Buchheim, Lady Michelle Michels, and Aliza Moussaieff and grants funded by the Minerva foundation with funding from the Federal German Ministry for Education and Research and by the European Research Council and the Israel Science Foundation. H.R. is supported by the Israeli Council for Higher Education (CHE) via the Weizmann Data Science Research Center and by a research grant from Madame Olga Klein - Astrachan.
Sections du résumé
BACKGROUND
The gold standard for COVID-19 diagnosis is detection of viral RNA through PCR. Due to global limitations in testing capacity, effective prioritization of individuals for testing is essential.
METHODS
We devised a model estimating the probability of an individual to test positive for COVID-19 based on answers to 9 simple questions that have been associated with SARS-CoV-2 infection. Our model was devised from a subsample of a national symptom survey that was answered over 2 million times in Israel in its first 2 months and a targeted survey distributed to all residents of several cities in Israel. Overall, 43,752 adults were included, from which 498 self-reported as being COVID-19 positive.
FINDINGS
Our model was validated on a held-out set of individuals from Israel where it achieved an auROC of 0.737 (CI: 0.712-0.759) and auPR of 0.144 (CI: 0.119-0.177) and demonstrated its applicability outside of Israel in an independently collected symptom survey dataset from the US, UK, and Sweden. Our analyses revealed interactions between several symptoms and age, suggesting variation in the clinical manifestation of the disease in different age groups.
CONCLUSIONS
Our tool can be used online and without exposure to suspected patients, thus suggesting worldwide utility in combating COVID-19 by better directing the limited testing resources through prioritization of individuals for testing, thereby increasing the rate at which positive individuals can be identified. Moreover, individuals at high risk for a positive test result can be isolated prior to testing.
FUNDING
E.S. is supported by the Crown Human Genome Center, Larson Charitable Foundation New Scientist Fund, Else Kroener Fresenius Foundation, White Rose International Foundation, Ben B. and Joyce E. Eisenberg Foundation, Nissenbaum Family, Marcos Pinheiro de Andrade and Vanessa Buchheim, Lady Michelle Michels, and Aliza Moussaieff and grants funded by the Minerva foundation with funding from the Federal German Ministry for Education and Research and by the European Research Council and the Israel Science Foundation. H.R. is supported by the Israeli Council for Higher Education (CHE) via the Weizmann Data Science Research Center and by a research grant from Madame Olga Klein - Astrachan.
Identifiants
pubmed: 33073258
doi: 10.1016/j.medj.2020.10.002
pii: S2666-6340(20)30019-2
pmc: PMC7547576
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Pagination
196-208.e4Informations de copyright
© 2020 Elsevier Inc.
Déclaration de conflit d'intérêts
The study protocol was approved by the Weizmann Institute of Science review board (IRB). Informed consent was waived by the IRB, as all identifying details of the participants were removed before the computational analysis. Participants were made fully aware of the way in which the data will be stored, handled, and shared, which was provided to them and is in accordance with the privacy and data-protection policy of the Weizmann Institute of Science (https://www.weizmann.ac.il/pages/privacy-policy).
Références
Nat Mach Intell. 2020 Jan;2(1):56-67
pubmed: 32607472
Intensive Care Med. 2020 May;46(5):837-840
pubmed: 32123994
J Infect Public Health. 2020 May;13(5):667-673
pubmed: 32340833
BMJ. 2015 Jan 07;350:g7594
pubmed: 25569120
Science. 2020 Jun 19;368(6497):1362-1367
pubmed: 32371477
Int Forum Allergy Rhinol. 2020 Jul;10(7):806-813
pubmed: 32279441
Pediatr Pulmonol. 2020 Aug;55(8):1892-1899
pubmed: 32492251
Nat Med. 2020 Aug;26(8):1224-1228
pubmed: 32427924
J Natl Cancer Inst. 1971 Jun;46(6):1291-7
pubmed: 5088049
Lancet Child Adolesc Health. 2020 Sep;4(9):653-661
pubmed: 32593339
Nat Biotechnol. 2020 Jul;38(7):769-772
pubmed: 32641849
Nat Med. 2020 May;26(5):634-638
pubmed: 32273611
N Engl J Med. 2020 Jun 11;382(24):2302-2315
pubmed: 32289214
BMJ. 2020 Apr 7;369:m1328
pubmed: 32265220
Nat Med. 2020 Aug;26(8):1161-1165
pubmed: 32488218
Ann Intern Med. 2020 Aug 18;173(4):262-267
pubmed: 32422057
JAMA. 2020 Apr 28;323(16):1545-1546
pubmed: 32167538