| Literature DB >> 34623953 |
Aikaterini Galani1, Reza Aalizadeh1, Marios Kostakis1, Athina Markou1, Nikiforos Alygizakis1, Theodore Lytras2, Panagiotis G Adamopoulos3, Jordan Peccia4, David C Thompson5, Aikaterini Kontou1, Apostolos Karagiannidis1, Evi S Lianidou1, Margaritis Avgeris6, Dimitrios Paraskevis7, Sotirios Tsiodras8, Andreas Scorilas3, Vasilis Vasiliou9, Meletios-Athanasios Dimopoulos10, Nikolaos S Thomaidis11.
Abstract
We measured SARS-CoV-2 RNA load in raw wastewater in Attica, Greece, by RT-qPCR for the environmental surveillance of COVID-19 for 6 months. The lag between RNA load and pandemic indicators (COVID-19 hospital and intensive care unit (ICU) admissions) was calculated using a grid search. Our results showed that RNA load in raw wastewater is a leading indicator of positive COVID-19 cases, new hospitalization and admission into ICUs by 5, 8 and 9 days, respectively. Modelling techniques based on distributed/fixed lag modelling, linear regression and artificial neural networks were utilized to build relationships between SARS-CoV-2 RNA load in wastewater and pandemic health indicators. SARS-CoV-2 mutation analysis in wastewater during the third pandemic wave revealed that the alpha-variant was dominant. Our results demonstrate that clinical and environmental surveillance data can be combined to create robust models to study the on-going COVID-19 infection dynamics and provide an early warning for increased hospital admissions.Entities:
Keywords: COVID-19; Hospital admission rates; ICU admission rates; Method validation; RT-qPCR; Wastewater-based epidemiology
Mesh:
Substances:
Year: 2021 PMID: 34623953 PMCID: PMC8421077 DOI: 10.1016/j.scitotenv.2021.150151
Source DB: PubMed Journal: Sci Total Environ ISSN: 0048-9697 Impact factor: 7.963
Fig. 1Viral load in wastewater and surveillance data; the SARS-CoV-2 load (RNA copies SARS-CoV-2/100K inhabitants) in the wastewater from wastewater treatment plants in Athens (blue bars) and NPHO-reported COVID-19 confirmed cases (orange line) are shown for the period August 31, 2020 through March 21, 2021. SARS-CoV-2 data are presented as mean ± SE from 191 wastewater samples. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
SARS-CoV-2 variant analysis in the wastewater treatment plant of Athens; targeted DNA-seq analysis of genetic markers for detection and quantification of SARS-CoV-2 variants of concern.
| Position | Reference base | Alternative base | Alternative frequency (%) | Total depth | Reference amino acid | Alternative amino acid |
|---|---|---|---|---|---|---|
| del69-70HV | TACATG | −N(6) | 98.63 | 252,808 | HV | – |
| del144Y | TTA | −N(3) | 98.31 | 225,804 | Y | – |
| N501Y (A23063T) | A | T | 99.88 | 244,198 | N | Y |
| D614G (A23403G) | A | G | 84.26 | 261,583 | D | G |
| P681H (C23604A) | C | A | 93.29 | 344,390 | P | H |
| T716I (C23709T) | C | T | 99.85 | 338,016 | T | I |
| S982A (T24506G) | T | G | 99.90 | 257,801 | S | A |
SARS-CoV-2 variants present in the wastewater treatment plant of Athens; frequencies of analyzed SARS-CoV-2 variants of concern.
| Variant of concern | % frequency | Genetic markers analyzed |
|---|---|---|
| Β.1.1.7/UK lineage (variant VOC_202012/01) | 96.3 ± 2.2 | N501Y |
| B.1.351/South Africa lineage (variant 20H/501.V2) | ND | D80A, K417N |
| P.1/Japan-Brazil lineage (variant 20J/501Y.V3) | ND | L18F, T20N, P26S, K417N |
ND: not detected.
% proportion of SARS-CoV-2 variants of concern in wastewater samples. Data are presented as the mean ± SE.
Common genetic marker for Β.1.1.7/UK, B.1.351/South Africa & P.1/Japan-Brazil variants.
Common genetic marker for B.1.351/South Africa & P.1/Japan-Brazil variants.
Fig. 2Identifying the time lag between RNA load in wastewater and SARS-CoV-2 pandemic clinical cases; turning points and difference between changes in the scaled SARS-Cov-2 RNA copies in wastewater/100k inhabitants and number of positive cases are presented. The grey dashed line is the CVln (%) threshold (i.e., 2.6e+11 normalized SARS-CoV-2 RNA copies/100K inhabitants (scaled value is 14%)). The black dashed lines are the top five detected turning points in the scaled SARS-Cov-2 RNA copies in wastewater/100k inhabitants. The blue dashed lines are the turning points detected in the trend profile of COVID-19 positive cases. The temporal separation between the blue dashed lines and the black dashed lines were used to derive lag time (delay) between turning points and peaks. Kendall information theory is calculated from −log2 (probability value |t) at given time (P is the probability to observe a turning point at time (t)). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Predicting pandemic clinical cases in Athens using the averaged and lagged linear regression models; prediction of the number of SARS-CoV-2 positive clinical cases, new admissions to hospital or new admissions to ICU cases in Athens using normalized SARS-CoV-2 RNA copies/100K inhabitants identified in wastewater from February 19 to 22 (2021) by 4 days lag and 4 days average data.
| Variable | Experimental data | Predicted data |
|---|---|---|
| (Averaged by 4 days) | (Averaged by 4 days and +4 day lag) | |
| Positive clinical cases | 564 | 930 (874–987) |
| New hospital admissions | 156 | 180 (165–194) |
| New ICU admissions | 13 | 16 (14–17) |
Averaged data between February 15 and 18, 2021.
Numbers in parentheses represent the lower and upper 95% confidence intervals.
The number of people being tested for SARS-CoV-2 decreased due to adverse weather conditions occurring between February 15 and 21, 2021.
Fig. 3Grid search for optimal averaged lagged linear regression model; optimization of lag and averaging term using RMSE of leave-one-out cross validation in the estimation of positive cases (A), and new admissions to hospitals (B) or ICUs (C). The lower RMSECV results in better prediction performance. Each surface plot (A–C) shows the changes in RMSECV value around lag and averaging values. In plots D–F (positive cases, new admissions to hospitals, and new admissions to ICUs, respectively), data were subjected to linear regression analysis, with the line of best fit being shown as a dotted line. The test date with bad weather is shown with red marker. The grey dashed line is the CVln (%) threshold (i.e., 2.6e+11 normalized SARS-CoV-2 RNA copies/100K inhabitants). In the plots G–I (positive cases, new admissions to hospitals and new admissions to ICUs, respectively), the RNA load data were shuffled randomly and then subjected to linear regression analysis. R2 and Q2 values were calculated for the randomized data and compared with the main model (shown as red and green dotted lines, respectively). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Artificial Intelligence for modelling of SARS-CoV-2 pandemic data; the grid search results for selection of hidden layers in ANN in new admissions to hospitals (A) and new admissions to ICUs (B). The best combination of the hidden layer is the region where it showed the lowest RMSE value. Plots C and D show the ANN structure for new admissions to hospital and ICUs, respectively. The blue lines are bias in each node and the black line is the combination of layers and the weights used for each input data. The bar charts in the plots C and D show the importance of variables in ANN structure. Plots E and F show the predicted versus experimental data for new admissions to hospitals and ICUs, respectively. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)