| Literature DB >> 34807925 |
Leander Melms1, Evelyn Falk1, Bernhard Schieffer2, Andreas Jerrentrup3,4, Uwe Wagner5, Sami Matrood6, Jürgen R Schaefer4, Tobias Müller4, Martin Hirsch1.
Abstract
Pandemic scenarios like SARS-Cov-2 require rapid information aggregation. In the age of eHealth and data-driven medicine, publicly available symptom tracking tools offer efficient and scalable means of collecting and analyzing large amounts of data. As a result, information gains can be communicated to front-line providers. We have developed such an application in less than a month and reached more than 500 thousand users within 48 hours. The dataset contains information on basic epidemiological parameters, symptoms, risk factors and details on previous exposure to a COVID-19 patient. Exploratory Data Analysis revealed different symptoms reported by users with confirmed contacts vs. no confirmed contacts. The symptom combination of anosmia, cough and fatigue was the most important feature to differentiate the groups, while single symptoms such as anosmia, cough or fatigue alone were not sufficient. A linear regression model from the literature using the same symptom combination as features was applied on all data. Predictions matched the regional distribution of confirmed cases closely across Germany, while also indicating that the number of cases in northern federal states might be higher than officially reported. In conclusion, we report that symptom combinations anosmia, fatigue and cough are most likely to indicate an acute SARS-CoV-2 infection.Entities:
Mesh:
Year: 2021 PMID: 34807925 PMCID: PMC8608328 DOI: 10.1371/journal.pone.0258649
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Base characteristics of the study population.
| Variable | n | % |
|---|---|---|
| Age | ||
| • < 20 | 56676 | 08·42 |
| • 20–30 | 144652 | 21·49 |
| • 31–40 | 210658 | 31·29 |
| • 41–50 | 126731 | 18·82 |
| • 51–60 | 80993 | 12·03 |
| • 61–70 | 36801 | 05·47 |
| • > 70 | 16647 | 02·47 |
| Gender | ||
| • Male | 392326 | 58·28 |
| • Female | 279642 | 41·54 |
| • Other | 1190 | 00·17 |
| Weight | ||
| • < 60kg | 70690 | 10·50 |
| • 60–70 kg | 115590 | 17·17 |
| • 71–80 kg | 134079 | 19·91 |
| • 81–90 kg | 135229 | 20·09 |
| • 91–100 kg | 96334 | 14·31 |
| • 101–110 kg | 57436 | 08·53 |
| • 111–120 kg | 32141 | 04·77 |
| • > 120 kg | 31659 | 04·70 |
| Height | ||
| • < 1.50 m | 6036 | 00·90 |
| • 1.50–1.60 m | 52106 | 07·74 |
| • 1.61–1.70 m | 178958 | 26·58 |
| • 1.71–1.80 m | 235466 | 34·98 |
| • 1.81–1.90 m | 165267 | 24·55 |
| • 1.91–2.00 m | 33193 | 04·93 |
| • > 2.00 m | 2132 | 00·32 |
Results of the chi-square test of independence.
| Variable | X2 | p-value | Degrees of freedom |
|---|---|---|---|
| sniff | 551·734708 | < .001 | 1 |
| cough | 1671·296687 | < .001 | 1 |
| fatigue | 1780·535304 | < .001 | 1 |
| body aches | 2190·363285 | < .001 | 1 |
| headache | 2232·331668 | < .001 | 1 |
| diarrhea | 2389·594631 | < .001 | 1 |
| sore throat | 2630·390694 | < .001 | 1 |
| nausea | 4716·380914 | < .001 | 1 |
| dyspnea at rest | 5297·727920 | < .001 | 1 |
| anosmia | 7484·169351 | < .001 | 1 |
| fever | 8779·939422 | < .001 | 1 |
Fig 1Comparison of symptom distribution between patients with and without confirmed contact.
A) Single symptoms: Color coded Cramer’s V correlation of symptoms with the confirmed contact variable (dark red tones) & symptom frequency count in percent of positive statements broken down by groups with and without confirmed contact. Anosmia seems to be the strongest predictor followed by fever and dyspnea at rest. On the contrary, the least single important symptoms are sniff, fatigue and cough by itself. B) Complex symptoms: Symptom frequency count (total) with combinations. The symptom combination fatigue, anosmia and cough has been highlighted in red to illustrate the shift of importance between the two groups. The age distribution of the two groups is depicted in each case above graph B. A random sample of 19128 was taken from the population without confirmed contact for comparison. The percentage of positive cases in the total number of participants without contact was 6·24% whereas the percentage of positive cases in the group with confirmed contact was 23·21%.
Fig 2Maps illustrating the total number of users of COVID-Online (A), the number of predicted infections (B), the number of confirmed infections (C), the frequency of fatigue (D), the frequency of anosmia (E) and the frequency of fever (F). Data of maps A–F is based on the time period 03.04–10.04.2020. A, B, D, E, F: The district “Marburg-Biedenkopf” has been excluded from these charts as it contained too many records from internal tests carried out by associated personnel of COVID-Online and was also influenced by regional media reports. Unfortunately, due to the great time pressure in times of crisis, no test or staging instance could be installed for such purposes.
Number of tests submitted by 177 laboratories voluntarily in calendar week 14 (06.04–12.04.2020, source: RKI).
| Federal state | N tests / calendar week 15 2020 | Tests / 100k residents |
|---|---|---|
| BW | ~ 5 000 | ~ 45·17–67·75 |
| BY | ~ 15 000 | ~ 114·29 |
| NI | ~ 5 000 | ~ 62·55 |
| SH | ~ 1 500–2 500 | ~ 51·78–86·30 |
| NRW | ~ 25 000 | ~ 139·30 |
Data is from a sample of laboratories, not a complete survey of all tests in Germany. The coverage and representativeness of the data can greatly vary between the federal states. Unfortunately, the proportion of voluntary participation of the total is not known. Up to and including week 14/2020, 177 laboratories have registered for this RKI test laboratory interrogation or in one of the other transmitting networks. The data was derived from the Management Report on Coronavirus Disease (as of April 8, 2020) and the graphs contained therein.
Number of SARS-CoV-2-PCR tests (cumulative) broken down by federal state.
Data status until 23.04.2020 (source RKI).
| Federal state | N | N positive | % positive | Population | % tested |
|---|---|---|---|---|---|
|
| 148,968 | 12,128 | 8·1 | 17,947,221 | 0·8 |
|
| 91,463 | 9,480 | 10·4 | 13,124,737 | 0·7 |
|
| 46,265 | 5,514 | 11·9 | 11,069,533 | 0·4 |
|
| 39,563 | 1,893 | 4·8 | 7,993,608 | 0·5 |
|
| 35,784 | 2,626 | 7·3 | 4.093.903 | 0·9 |
|
| 7,860 | 313 | 4 | 2,896,712 | 0·3 |
|
| 4,389 | 100 | 2·3 | 1,608,138 | 0·3 |
The percentage of the total tests to the number of inhabitants of a federal state was displayed in green from a value of 0·5 and otherwise in red. Number of conducted tests greatly varied between federal states. Populous federal states in the south and west of Germany have performed significantly more tests than the northern and eastern regions.