Introduction
In December 2019, a cluster of severe pneumonia occurred in the city of Wuhan, China. The causative pathogen was identified as a new betacoronavirus [
1]. It was later named the Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) and the infectious disease was termed coronavirus disease 2019 (COVID-19) [
2]. As of September 2020, more than 32 million infections were reported worldwide and over 970,000 people had died [
3]. Course and outcome of patients with COVID-19 are heterogeneous. While most SARS-CoV-2 infected patients are asymptomatic or exhibit mild symptoms, some deteriorate to the complicated stage and require medical treatment and hospitalization. COVID-19 symptoms can deteriorate within hours of hospital admission prompting need for oxygen supply or transfer to the intensive care unit [
4,
5]. Hence, identifying patients at this early stage of the disease is of paramount importance in medical decision-making regarding follow-up, hospitalization, and decision for medical treatment.
Many studies investigated predictors for progression to critical COVID-19, which was defined as admission to an intensive care unit (ICU) or need for mechanical ventilation [
6‐
10]. However, predictors for a COVID-19 deterioration causing oxygen therapy, have been rarely studied so far [
11‐
13]. Depending on the clinical perspective, this stage of the disease is denoted in the literature as severe, but not critical [
14‐
16] or moderate, but not severe [
11,
13]. To avoid misinterpretations of our analysis, in the following, we use the term advanced COVID-19 disease stage for this stage of the disease and this was used as our endpoint to be predicted. Patients presenting with asymptomatic SARS-CoV-2 infection or mild COVID-19 who are at risk for clinical deterioration benefit from close monitoring, swift medication and supportive measurements [
17]. Further, patients at risk may benefit from early therapeutic agents for COVID-19 [
14,
16]. In addition, due to the high prevalence of long-term COVID-19 symptoms and the association of severity of COVID-19 and severity of long-term COVID-19 symptoms [
18‐
20], the need for medical interventions avoiding COVID-19 disease progression in patients at risk is further emphasized.
Here, we present a predictor and score (SACOV-19, Score for the prediction of an Advanced disease stage of COVID-19) resulting from a robust risk-stratification algorithm to assess if a patient is at risk of developing the advanced COVID-19 disease stage, based on data available at the day of the first positive SARS-CoV-2 test. By identifying patients at risk with a high probability for advanced COVID-19, our score aims at supporting clinical decision making for these patients presenting with asymptomatic SARS-CoV-2 infection or mild COVID-19. A low predicted risk could support out-patient management. A high predicted risk could promote close follow-up, hospitalization or enter risk–benefit assessments regarding medical treatment.
The algorithm and SACOV-19 were developed using state-of-the-art machine learning methods and based on patient variables from the study cohort of the Lean European Open Survey on SARS-CoV-2 Infected Patients (LEOSS). LEOSS is a large multicenter cohort of medically supervised patients with predominant hospital contact [
21]. The algorithm and SACOV-19 were assessed by a temporal validation using the LEOSS data. The algorithm is implemented in a browser-based web application enabling straightforward usage of our predictor in future clinical studies and to make it accessible to the research community.
Discussion
We computed and validated a predictor and associated predictive score (SACOV-19) to predict a complicated or more severe COVID-19 stage in patients, who were tested positive for SARS-CoV-2 and presented at mainly inpatient settings asymptomatic or with mild COVID-19 symptoms. SACOV-19 is based on standard parameters, which can be acquired in most hospital and out-patient settings. In addition, we implemented a browser-based interactive graphical user interface making the data-driven model accessible to the research community.
Though most patients presenting asymptomatic or with mild COVID-19 symptoms do not require medical treatment, some patients rapidly deteriorate and need medical intervention [
17,
26]. By focusing on complicated or more severe COVID-19 as the endpoint, our score (SACOV-19) identifies patients requiring medical intervention and hospitalization. For asymptomatic/mild COVID-19 patients with increased risk predicted by our score, the attending physician might consider hospitalization or close follow-up. A high-risk result might also enter risk–benefit considerations when evaluating medical treatments with possible side effects. In turn, supporting the decision to discharge an asymptomatic/mild COVID-19 patient according to our score, enables physicians to prioritize patients in need for hospitalization and close monitoring.
As of now, management decisions for asymptomatic/mild COVID-19 patient are mainly based on the presence of risk factors, the clinical judgment of the attending physicians and the available resources [
17]. Unfortunately, course and outcome of COVID-19 are heterogeneous complicating this situation. Risk factors such as higher age, high BMI, male sex or arterial hypertension have been associated with poorer prognosis. However, they are also highly prevalent in patients with mild or asymptomatic courses [
5]. Earlier studies evaluated general disease severity scores such as CRB65, NEWS2, or qSOFA in COVID-19. Mostly, these scores were validated for risk of progression to severe COVID-19 or death, to guide IMC/ICU admission in hospitalized patients [
27‐
30]. Notably, patients of our cohort showed a very indistinctive qSOFA score at baseline, indicating its unsuitability for identifying asymptomatic patients or with mild COVID-19 who are at risk of developing an advanced stage (58% accuracy for a threshold of one, and Glasgow Coma Scale ≤ 12 instead of 14). Scores specifically developed for risk of progression in COVID-19 like the COVID-GRAM, Brescia-COVID Respiratory Severity Scale (BCRSS) or 4C Mortality Score most entirely focus on the progression to severe respiratory impairment and death not taking the early risk of progression into a complicated stage into consideration [
6,
8,
12,
31]. Exceptions are the CALL and EWAS score and the score published by Huang
et al. [
32], which were designed to predict risk for progression to advanced COVID-19. However, these scores were based on a relatively small patient cohort [
32,
33]. Though in validation studies, their performance in predicting the progression to complicated or more severe COVID-19 was poor (AUC < 0.67) [
13,
34]. To note, we could not evaluate these scores and most of the published scores for the critical endpoint as the needed thresholds for calculating the according variables are more complex and were not collected in LEOSS. LEOSS data were collected using predefined categories to preserve the anonymous data collection protocol. In the 4C Mortality score [
8], for example, which was rated as high quality [
12], categories for age, respiratory rate, oxygen saturation, urea and C reactive protein were not mappable to LEOSS. In future research the 4C mortality score, for example, could be adapted to the LEOSS data and could be evaluated on advanced COVID-19.
SACOV-19 is based on eleven patient characteristics (14 binary variables) which are often documented at first presentation. In line with previous studies, SACOV-19 shows that patients of higher age, higher BMI, and smokers or former smokers have a higher risk for advanced COVID-19 courses [
5,
12,
13,
26]. The respiratory parameters oxygen saturation, respiratory rate and feeling of dyspnea are included in SACOV-19 emphasizing the importance of examining pulmonary parameters at initial presentation.
A strength of the study is that it is based on data of a well-documented and curated multinational COVID-19 registry supported by the German Center for Infection Research and German Infectious Disease Society, and a well set up machine learning procedure. We trained the SACOV-19 on a discovery cohort including only patients from the first wave of the COVID-19 pandemic. SACOV-19 was tested on an independent validation cohort comprising patients from the first to the third wave, which have been collected after the development of the score. COVID-19 is a newly emerging infectious disease, for which the knowledge and standard of care evolved. Hence one may argue that our score which was developed based on data from March to July 2020 may not be useful anymore. But, most treatment options to date are administered after a COVID-19 disease deterioration [
35] which is our endpoint and hence would not affect the predictiveness of our score. Indeed, when we tested SACOV-19 on an independent validation cohort comprising patients from the first to the third wave (in which potential changes of care may have occurred), we didn’t recognize a drop in performance. The SACOV-19 stands out because it has been evaluated across regions and sectors. At the time of manuscript preparation, it contained, to our knowledge, the largest German data collection of comprehensive clinical data on high-risk patients. [
36]. Nevertheless, until now, the investigated patients may limit its general applicability. Most of the patients received care in an inpatient setting. When testing our score on outpatients we observed a similar performance result, however, we had only
n = 28 outpatients for this analysis and could hence not get a significant result. Furthermore, the majority of patients exhibited a mild disease and did not advance to the complicated phase. Therefore, patients with co-morbidities could have been overrepresented in our cohort, as these patients were mainly admitted without severe symptoms [
21]. To show the general applicability of our score, a further, clinical trial is necessary. We actually plan a trial testing in a primary care setting if SACOV-19 acceptably predicts COVID-19 deterioration.
While we included a large cohort of patients, a limitation is that the majority of patients were included at German health care facilities. Our results may not be fully applicable to countries or regions with different demographics or resource settings. Most of the patients received care in an inpatient setting. The majority exhibited a mild disease and did not advance to the complicated phase. Therefore, patients with co-morbidities could be overrepresented in our cohort, as these patients were mainly admitted without severe symptoms [
21]. Another caveat may be the high number of missing values for specific variables and, in particular, some laboratory values, as not all parameters were collected at the day of the first positive SARS-CoV-2 test. For example interleukin 6 has been shown to have predictive power for a severe COVID-19 course [
37] but was not selected by our algorithms, possibly due to its high number of missing values. Furthermore, thresholds for parameters were predefined in the study protocol. Metric available data could improve prediction models. The web application was designed for research use making our predictor accessible to the research community.
Acknowledgements
We express our deep gratitude to all study teams supporting the LEOSS study. The LEOSS study group contributed at least 5 per mille to the analyses of this study: Klinikum Ernst von Bergmann (Lukas Tometten), University Hospital Freiburg (Siegbert Rieg), University Hospital Heidelberg (Uta Merle), Johannes Wesling Hospital Minden (Kai Wille), Hospital Ingolstadt (Stefan Borgmann), University Hospital rechts der Isar (Christoph Spinner), University Hospital Essen (Sebastian Dolff), University Hospital Jena (Maria Madeleine Rüthrich), University Hospital Regensburg (Frank Hanses), Klinikum Dortmund (Martin Hower), University Hospital Erlangen (Richard Strauß), Hacettepe University Faculty of Medicine (Murat Akova), University Hospital of Cologne (Norma Jung), Ludwig Maximilians University Hospital Munich (Michael von Bergwelt-Baildon), University Hospital Frankfurt (Maria Vehreschild), University Hospital Ulm (Beate Grüner), Hospital Passau (Martina Haselberger), University Hospital Würzburg (Nora Isberner), Hospital Bremen-Mitte (Christiane Piepel), St. Josef-Hospital Bochum (Kerstin Hellwig), Bundeswehr Central Hospital Koblenz (Dominic Rauschning), Hospital Leverkusen (Lukas Eberwein), University Hospital Düsseldorf (Björn Jensen), Tropenklinik Paul-Lechler Hospital Tuebingen (Claudia Raichle), Medical practice for general medicine Dres. Elisabeth Schrödter and Gabriele Müller-Jörger (Gabriele Müller-Jörger), Petrus Hospital Wuppertal (Sven Stieglitz), Robert Koch Institute (Thomas Kratz), Municipal Hospital Karlsruhe (Christian Degenhardt), University Hospital Schleswig-Holstein site Kiel (Anette Friedrichs), University Hospital of Saarland (Robert Bals), Munich Clinic gGmbH (Susanne Rüger), University Hospital Carl Gustav Carus Dresden (Katja de With), Robert-Bosch-Hospital (Katja Rothfuss), University Hospital Tuebingen (Siri Goepel), University Hospital Bonn (Jacob Nattermann), University Hospital Hamburg-Eppendorf (Sabine Jordan), Sophien- und Hufeland Hospital Weimar (Jessica Rüddel), University Hospital Giessen und Marburg (Janina Trauth), Hannover Medical School (Gernot Beutel), Bakirkoy Dr Sadi Konuk Training and Research Hospital Istanbul (Ozlem Altuntas Aydin), St. Franziskus Hospital Flensburg (Milena Milovanovic), and St. Josefs-Hospital Wiesbaden (Michael Doll). LEOSS study infrastructure group: Jörg Janne Vehreschild (Goethe University Frankfurt), Lisa Pilgram (Goethe University Frankfurt), Melanie Stecher (University Hospital of Cologne), Carolin E. M. Jakob (University Hospital of Cologne), Maximilian Schons (University Hospital of Cologne), Annika Claßen (University Hospital of Cologne), Sandra Fuhrmann (University Hospital of Cologne), Susana Nunes de Miranda (University Hospital of Cologne), Bernd Franke (University Hospital of Cologne), Nick Schulze (University Hospital of Cologne), Fabian Prasser (Charité, Universitätsmedizin Berlin) und Martin Lablans (University Medical Center Mannheim).