Literature DB >> 35779714

Five-week warning of COVID-19 peaks prior to the Omicron surge in Detroit, Michigan using wastewater surveillance.

Liang Zhao1, Yangyang Zou1, Yabing Li1, Brijen Miyani1, Maddie Spooner1, Zachary Gentry1, Sydney Jacobi1, Randy E David2, Scott Withington2, Stacey McFarlane3, Russell Faust4, Johnathon Sheets5, Andrew Kaye5, James Broz5, Anil Gosine6, Palencia Mobley6, Andrea W U Busch7, John Norton7, Irene Xagoraraki8.   

Abstract

Wastewater-based epidemiology (WBE) is useful in predicting temporal fluctuations of COVID-19 incidence in communities and providing early warnings of pending outbreaks. To investigate the relationship between SARS-CoV-2 concentrations in wastewater and COVID-19 incidence in communities, a 12-month study between September 1, 2020, and August 31, 2021, prior to the Omicron surge, was conducted. 407 untreated wastewater samples were collected from the Great Lakes Water Authority (GLWA) in southeastern Michigan. N1 and N2 genes of SARS-CoV-2 were quantified using RT-ddPCR. Daily confirmed COVID-19 cases for the City of Detroit, and Wayne, Macomb, Oakland counties between September 1, 2020, and October 4, 2021, were collected from a public data source. The total concentrations of N1 and N2 genes ranged from 714.85 to 7145.98 gc/L and 820.47 to 6219.05 gc/L, respectively, which were strongly correlated with the 7-day moving average of total daily COVID-19 cases in the associated areas, after 5 weeks of the viral measurement. The results indicate a potential 5-week lag time of wastewater surveillance preceding COVID-19 incidence for the Detroit metropolitan area. Four statistical models were established to analyze the relationship between SARS-CoV-2 concentrations in wastewater and COVID-19 incidence in the study areas. Under a 5-week lag time scenario with both N1 and N2 genes, the autoregression model with seasonal patterns and vector autoregression model were more effective in predicting COVID-19 cases during the study period. To investigate the impact of flow parameters on the correlation, the original N1 and N2 gene concentrations were normalized by wastewater flow parameters. The statistical results indicated the optimum models were consistent for both normalized and non-normalized data. In addition, we discussed parameters that explain the observed lag time. Furthermore, we evaluated the impact of the omicron surge that followed, and the impact of different sampling methods on the estimation of lag time.
Copyright © 2022 The Authors. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  COVID-19; Detroit; Early warning; Prediction; SARS-CoV-2; Wastewater-based-epidemiology (WBE)

Mesh:

Substances:

Year:  2022        PMID: 35779714      PMCID: PMC9239917          DOI: 10.1016/j.scitotenv.2022.157040

Source DB:  PubMed          Journal:  Sci Total Environ        ISSN: 0048-9697            Impact factor:   10.753


Introduction

Wastewater-based epidemiology (WBE) for the prediction of viral outbreaks was proposed in 2019 and 2020 (Xagoraraki and Brien, 2019; O’Brien and Xagoraraki, 2019; Xagoraraki, 2020), and has been applied for the early detection of COVID-19 (Ahmed et al., 2020a; Ahmed et al., 2021; Brijen et al., 2020). One of the critical utilities of WBE is the possibility to forecast upcoming fluctuations of disease with a lag time. The lag time in our study is defined as the lag between peaks in measured concentrations of SARS-CoV-2 in wastewater and peaks in reported COVID-19 cases based on clinical testing. Lag times observed in published studies since the inception of COVID-19 pandemic (summarized in Table 1 ) vary widely between 2 days and 28 days. The observed lag times may depend on multiple parameters. These parameters include disease characteristics of SARS-CoV-2 such as incubation time and shedding duration summarized in Tables S1 and S2, respectively, that may change with novel variants (Yaniv et al., 2021). The parameters that affect observed lag times may also involve contributing populations and their demographic characteristics, including age (Omori et al., 2021), sex (Syangtan et al., 2021), racial ancestry (Allan-Blitz et al., 2021; Feehan et al., 2021), and traveling history of infected populations (Xiao et al., 2020). Furthermore, the hydraulic influence in the sewage network, including dilution events (Foladori et al., 2020), and sorption and desorption of the virus in wastewater (Ziqiang et al., 2018) as well as the methods of wastewater sampling (that may be include viruses sorbed on particles or supernatant viruses) are critically affecting the observed lag times. Moreover, manners of reporting the clinical data, including the accessibility to the testing (Wiens et al., 2021), and traveling time to the testing sites (Rader et al., 2020) are important.
Table 1

Lag time in published studies prior to Omicron surge.

LocationSample typeLag timeTest methodReferences
Milan & Rome, Italywastewaterwithin a few daysRT-qPCR(La Rosa et al., 2020)
Ottawa, Canadawastewater2 daysRT-qPCR(D’Aoust et al., 2021)
Montana, USAwastewater2–4 daysRT-qPCR(Nemudryi et al., 2020)
Wisconsin, USAwastewater0–6 daysRT-qPCR(Feng et al., 2021)
New York, USAwastewater3 daysRT-qPCR(Larsen et al., 2021)
New Haven, Connecticut, USAsludge0–2, 1–4, 6–8 days under given scenariosRT-qPCR(Peccia et al., 2020)
Charlotte, North Carolinawastewater5–12 daysRT-qPCR RT-ddPCR(Barua et al., 2022)
Gandhinagar, Gujarat, Indiawastewater7–14 daysRT-PCR(Kumar et al., 2021)
Paris, Francewastewater8 daysRT-qPCR(Wurtzer et al., 2020)
Minnesota, USAwastewaterstatewide: 15–17 days, regional level: 4–20 daysRT-qPCR(Melvin et al., 2021)
Massachusetts, USAwastewater4–10 daysRT-qPCR(Wu et al., 2022)
Netherlandswastewater4 daysRT-qPCR(Lodder & de Roda Husman, 2020)
Netherlandssewage samples6 daysRT-qPCR(Medema et al., 2020)
Utah, USAwastewater7 daysRT-qPCR(Weidhaas et al., 2021)
Milan Metropolitan Area, Italy,raw and treated wastewater8 daysRT-qPCR(Rimoldi et al., 2020)
Spainwastewater12–16 daysRT-qPCR(Randazzo et al., 2020)
Australiawastewater21 daysRT-qPCR(Ahmed et al., 2021)
Gothenburg, Swedenwastewater19–21 daysRT-qPCR(Saguti et al., 2021)
Australiawastewater28 daysRT-qPCR(Ahmed et al., 2020, Ahmed et al., 2020, Ahmed et al., 2020)
Lag time in published studies prior to Omicron surge. Some of the factors contributing to the lag time between wastewater-based data peaks and clinical data peaks are visualized in the timeline shown in Fig. 1 . The majority of published studies show empirical evidence that the range of the incubation time of SARS-CoV-2 prior to the Omicron surge was 0 to 14 days (shown in Table S1 and summarized in Fig. 1). Clinical testing is often delayed from the manifestation of clinical symptoms by several days, due to limited availability of testing supplies, limited ability to reach testing sites, or resistance of people to seek testing (Rader, 2020; Torres et al., 2021; Wiens et al., 2021). In some cases, the mean delay in the reporting of confirmed COVID-19 cases is 5 days with 15 % of cases reported after day 10 (Harris, 2020). Some clinical studies have demonstrated a delay of approximately 7 days from illness onset to clinical testing (Huang et al., 2020). Henceforth, the delay in gathering clinical data could vary significantly, particularly in demographically and socioeconomically varied populations, like that of Detroit, Michigan. Generally, between day 19 and 25, clinical data will become publicly available, after an estimated delay of 3 to 9 days of clinical data collection and processing time (Garg et al., 2020; Harris, 2020; Rader, 2020). Additionally, Fig. 1 demonstrates the temporal progress of data collection of viral loadings in wastewater. The detention time of wastewater in the collection network is estimated as 12 to 24 h (Table S4). In Fig. 1 it is assumed that in most cases, wastewater laboratory tests are completed within a day upon sample collection. A compilation of all the above-mentioned timelines indicates that the lag time may be estimated to be between 3 and 4 weeks (Fig. 1). This is expected to vary with different variants and different sampling methods.
Fig. 1

Time scale of clinical data collection and wastewater surveillance (incubation time and shedding duration are summarized in Tables S1 and S2, respectively).

Time scale of clinical data collection and wastewater surveillance (incubation time and shedding duration are summarized in Tables S1 and S2, respectively). Here we present a twelve-month consecutive study, prior to the omicron surge, using N1 and N2 gene RT-ddPCR to monitor SARS-CoV-2 concentrations in untreated wastewater samples collected from the WRRF of GLWA that serves the city of Detroit, as well as Wayne, Macomb, and Oakland counties in Michigan. We applied a sampling method that captured suspended viruses in the supernatant of wastewater to circumvent the potential input of “old” viruses via desorption of settled viruses during high flows. We investigated lag time through statistical analyses and established four models to predict COVID-19 clinical cases using normalized and non-normalized SARS-CoV-2 concentrations in wastewater. The performance of each model is evaluated using the Root Mean Square Error (RSME) and Pearson correlation between the predicted case number and actual clinical case number. Future incidences of COVID-19 were predicted based on SARS-CoV-2 concentrations in wastewater during the study period, using statistical models. In addition, we examined the influence of the omicron surge to the early waring lag time. We also performed another widely applied sampling method for comparison purposes to demonstrate the benefits of our current method in terms of providing early warning for COVID-19 incidences in Detroit.

Materials and methods

Wastewater treatment plant and sample collection

Untreated wastewater samples were collected from the WRRF of GLWA located in southeastern Michigan between September 1, 2020, and August 31, 2021, prior to the Omicron surge. The WRRF in Detroit is the largest single-site wastewater treatment plant in the U.S. with a primary treatment capacity of 1700 million gallons per day (MGD) and a secondary treatment capacity of 930 MGD. GLWA's WRRF has a semi-combined sewer-shed system, which collects and treats stormwater along with residential, industrial, and commercial waste, depending on service areas. The WRRF serves the three most populous Michigan counties: Wayne, Oakland, and Macomb. Fig. 2.a. shows all ZIP codes captured by the three main interceptors that discharge into the WRRF. The WRRF receives wastewater via three main interceptors including the Detroit River Interceptor (DRI), the North Interceptor-East Arm (NIEA), and Oakwood-Northwest-Wayne County Interceptor (ONWI) from its service areas which are shown in Fig. 2.b. The samples were collected from all three interceptors at the point of discharge into the WRRF.
Fig. 2

a. GLWA WRRF tributary areas; b. Three interceptors and GLWA WRRF locations.

a. GLWA WRRF tributary areas; b. Three interceptors and GLWA WRRF locations. Estimated populations served by each interceptor, daily flows, and other characteristics of the three interceptors, between September 2020 and August 2021, are shown in Table S27. Sampling occurred weekly between September 1, 2020, and August 30, 2021. A total of 407 untreated wastewater samples were collected at the influent of the WRRF, including 146, 117, and 144 samples from ONWI, NIEA, and DRI, respectively. In addition, to evaluate the effect of omicron surge in the estimated lag time, we performed testing between August 1, 2021, and February 28, 2022, using the same methods, with a total of 249 untreated wastewater samples from the same sites.

Sampling methods

Viruses were collected and isolated from wastewater using electropositive NanoCeram column filters (Argonide, Sanford, FL) based on the EPA Virus Adsorption-Elution (VIRADEL) method (Xagoraraki et al., 2014; Miyani, Zhao, et al., 2021b). Depending on the suspended solids of wastewater, approximately 10 to 50 L of raw wastewater passed through NanoCeram electropositive cartridge filters at a rate not more than 11 L/min using a previously described method (USEPA, 2001; USEPA, 2014; Miyani et al., 2021b). Flow meter readings were recorded at the inception and termination of each sampling event. After sampling, all NanoCeram column filters were placed in sealed plastic bags, on ice, and transported to the laboratory within 24 h for downstream analysis. In addition to the VIRADEL method, for method comparison purposes, 1 L of 24-h composite samples were collected weekly between August 1, 2021, and February 28, 2022, to conduct polyethylene glycol precipitation (PEG) for the virus concentration (Ahmed et al., 2020b; Ahmed et al., 2020c; D’Aoust et al., 2021; Kaya et al., 2022).

Virus elution, RNA extraction, RT-ddPCR, and variants testing

Viruses were eluted within 24 h after sampling based on a previously described method (Miyani et al., 2021b). Bacteriophage Phi6 was used as a proxy virus to estimate losses during virus elution and concentration (Kantor et al., 2021; Ye et al., 2016). The recoveries obtained were from 10.37 % to 58.96 %, with a mean recovery of 24.91 % (±22.89 %). Viral RNA was extracted using Viral RNA QIAGEN kit (QIAGEN, Germantown, Maryland), following the manufacturer's protocol with the modification described previously (Miyani et al., 2021b). RT-ddPCR was performed on a QX200 AutoDG Droplet Digital PCR system (Bio-Rad, Hercules, CA, USA), using the One-step RT-ddPCR Advanced Kit for Probes (Bio-Rad, Hercules, CA, USA). Per the CDC 2019-Novel Coronavirus (2019-nCoV) Real-Time RT-PCR Diagnostic Panel for SARS-CoV-2 detection (www.cdc.gov), the primers and probe targeting N1 and N2 of SARS-CoV-2 were shown in Table S5, which were proven to be the most sensitive assays to identify SARS-CoV-2 (Ahmed et al., 2022; Bivins et al., 2021) and were chosen in the current study. N1 N2 gene Duplex Assay Reaction Mixture is shown in Table S6. Samples were then run on a C1000 Touch Thermal Cycler (Bio-Rad, Hercules, CA, USA) using the conditions shown in Table S7, following a measurement of fluorescence on a QX200 Droplet Reader (Bio-Rad, Hercules, CA, USA). For each RT-ddPCR run, three positive controls (PTCs) and three negative controls (NTCs), and process negative controls (including virus elution and RNA extraction process controls) were included. 102 gc/μL Twist Bioscience Twist Synthetic SARS-CoV-2 RNA Control 2 (MN908947.3) was used for PTCs. Nuclease-free water was used for NTCs. Nanopure water was used as a substitute for 1.5 % beef extract in virus elution, as process negative controls. Sterile nuclease-free water was used as a substitute for 140 μL of sample for RNA extraction, as process negative controls. All samples were run in triplicate. Determination of Limit of Blank (LOB) and Limit of Detection (LOD) were based on the methods described in the manufacturer's (Bio-Rad) guidelines for evaluating analytical sensitivity and validation of RT-ddPCR (Bio-Rad, Hercules, CA, USA). The Limit of Blank (LOB) was determined by testing three types of samples using RT-ddPCR, across four consecutive days including the prior-to COVID-19 pandemic samples collected from the same interceptors, nuclease-free water, and negative process control samples from elution and extraction processes. The purpose of testing the LOB across four separate days was to include the unnoticeable impacts when the tests are performed on different days. The prior-to COVID-19 pandemic interceptor samples were collected on February 18, 2018, from the ONWI, NIEA, and DRI interceptors. LOB for N1 gene ddPCR was determined to be 0.09 gc/μL, and the LOB for N2 gene ddPCR was determined to be 0.08 gc/μL. The Limit of Detection (LOD) was determined for each sample using 10−4 and 102 gc/μL of positive SARS-CoV-2 RNA across nine consecutive days using a nonparametric method, as-described in the manufacturer's (Bio-rad) guidelines aforementioned. An LOD of 0.1 gc/μL with 72.92 % confidence for the N1 gene ddPCR and 0.1 gc/μL with 81.25 % confidence for the N2 gene ddPCR were determined. To elucidate the impact of SARS-CoV-2 variants on the lag time, the mutations of dominant SARS-CoV-2 variants including Alpha, Beta, Gamma, Delta, and Omicron variants were tested in the wastewater samples using the GT Molecular kits (Pulicharla et al., 2021), during the time when N1 and N2 measurements reached three peaks, which were defined in the current study as Peak I (10/6/20–10/28/20) and Peak II (2/17/21–3/8/21). For comparison purposes data were collected during the omicron surge Peak III (12/15/21–1/12/22) and are presented in the supplementary information. Table S8 shows the time of report of the dominant SARS-CoV-2 variants which are corresponded to the peaks' periods that were chosen for variant tests.

Clinical data of COVID-19

Publicly available data of confirmed COVID-19 incidence in the city of Detroit, as well as Wayne, Macomb, and Oakland counties were used for this study (michigan.gov/coronavirus/). The supplying database was accessed on January 10, 2022 (Fig. 3a). The range of clinical data was between September 1, 2020, and October 4, 2021. The database was accessed again on April 1, 2022, to supply data for the comparative analysis during the omicron surge. Data were reported as follows: “county” is based on the county of residence (or city in the case of Detroit); “cases” are aggregated by the date of onset of COVID-19 symptoms, if known, otherwise by laboratory specimen date, if known, otherwise by case referral date; “confirmed cases” only include individuals who have had a positive diagnostic laboratory test for COVID-19. Clinical data considering a 7-day moving average was used for further statistical analysis (Fig. 3b). COVID-19 data were only available per city/country for the Detroit metropolitan area. Moreover, each interceptor received wastewater from portions of each city/county, thus, only the total SARS-CoV-2 concentrations could be correlated to the total COVID-19 cases of city/counties.
Fig. 3

a. Total confirmed COVID-19 cases between September 1, 2020, and October 4, 2021, in the city of Detroit, and Wayne, Macomb, and Oakland counties; b. Total confirmed COVID-19 cases between September 1, 2020, and October 4, 2021, with 7-day moving averages in the city of Detroit, and Wayne, Macomb, and Oakland counties.

a. Total confirmed COVID-19 cases between September 1, 2020, and October 4, 2021, in the city of Detroit, and Wayne, Macomb, and Oakland counties; b. Total confirmed COVID-19 cases between September 1, 2020, and October 4, 2021, with 7-day moving averages in the city of Detroit, and Wayne, Macomb, and Oakland counties.

Contributing populations, flow rates, and normalization of SARS-CoV-2 concentrations

The contributing population of each interceptor was estimated from 2020 calculations, provided by the Southeast Michigan Council of Governments by traffic analysis zone (TAZ). Geographic information systems (GIS) analysis was adopted to intersect the TAZ boundaries with ZIP Code boundaries and proportionally allocate population from each TAZ to the intersected areas. ZIP Code boundaries were also intersected with the interceptor service areas to allow for a calculation of population by interceptor area. Daily flow rates for the three interceptors were estimated from the daily influent flow to the WRRF, calculated from GLWA-reported primary influent flow minus WRRF recycle flows, and a calibrated hydrologic and hydraulic model developed for the GLWA collection system. The collection system model was developed with the U.S. Environmental Protection Agency's (EPA's) Stormwater Management Model (SWMM) 5 as part of the GLWA Wastewater Master Plan. The SWMM model represents sanitary wastewater and infiltration/inflow, hydraulics in all physical assets of the collection system and at the WRRF entrances, and stages and flows in the Rouge and Detroit rivers. To account for the changing flow and dilution of the wastewater where those parameters are highly variable day to day, two approaches of normalizing the N1 and N2 gene concentrations of SARS-CoV-2 in gc/L were adopted using Eqs. (1), (2). The normalized and non-normalized N1 and N2 gene concentrations were used for statistical analysis and modeling. ( Normalized concentration of SARS-CoV-2 (gc/d) N1 or N2 gene concentrations of SARS-CoV-2 (gc/L) Volume of wastewater flowing into WWTP interceptors during sampling events 3.8 × 106, conversion factor between liter and million gallons (Normalized concentration of SARS-CoV-2 (gc/L of sanitary flow) Sanitary flow percentage of wastewater flowing into WWTP interceptors during sampling events (%)

Data analyses and visualization

Data were tracked and organized with Microsoft Excel. MATLAB of a 2019b edition was applied to perform the model regression analyses. All figures were generated using R Statistical Computing Software and MATLAB (2019b), depending essentially on ggplot2 package for visualization, and Forecast and VAR package for autoregression model. The inputs of the models are 7-day moving average of the clinical cases (Barua et al., 2022) and total N1 or N2 gene concentrations for the three interceptors. Model 1: Linear regressionwhere Y is reported clinical cases, x is SARS-CoV-2 concentration from the wastewater samples, b is the slope of the linear regression line, and a is the intercept from the line. The method uses least squares regression. Model 2: Linear regression model with autoregressive model (ARIMA model) Autoregressive Integrated Moving Average (ARIMA) is one of the most widely used forecasting methods for time series data. It applies to time series data which have a trend. In this study, ARIMA model is used to fit the residuals from the linear regression between clinical cases and SARS-CoV-2 concentration. The ARIMA model is characterized by autoregressive (AR) and moving average (MA), and number of differencing required to make the time series stationary. For example, for AR model, Yt depends only on its own lags Y , Y …Y . In this study, if Y is clinical case of week t, Y is the previous one-week (t-1) value of clinical case. β 1 is the coefficient of lag 1-week that the model estimates and α is the intercept term. The criterion to choose the best ARIMA model in this study are: 1) lower Akaike information criterion (AIC) value, a lower AIC score indicates a more predictive model 2) white noise after adjusting residuals 3) low standard error (SE) value. Model 3: Regression model with Autoregressive model has a seasonal pattern (SARIMA model) Seasonal ARIMA or SARIMA model is an extension of ARIMA model that applies to time series data with a seasonal component. In this study, ARIMA has a limitation fitting clinical cases and SARS-CoV-2 concentration time series data because it does not apply to these two time series data which have a seasonal pattern. Model 4: Vector autoregressive model (VAR) VAR model is a forecasting model that can be used when two or more time series influence each other. In this study, clinical cases and SARS-CoV-2 concentration time series data are considered as two time series data, and there is a relationship between these two time series data. where, Y 1, is the clinical cases at week t and Y 2, is the SARS-CoV-2 concentration at week t. Y 1, is the lag of one-week values of clinical cases, Y 2, is the lag of one-week values of SARS-CoV-2 concentration.

Results and discussion

Observed SARS-CoV-2 RNA in wastewater samples and observed COVID-19 cases

During the 12-month study, N1 and N2 genes of SARS-CoV-2 were detected and quantified in all 407 wastewater samples using RT-ddPCR shown in Table 2 . The overall observed trends of the total N1 and N2 gene concentrations increased steeply from mid-September 2020 and reached a peak in mid-October 2020 (Fig. 4 ), which heralded the first peak of COVID-19 infections, in mid-November 2020. Both N1 and N2 gene concentrations stayed comparatively steady between November 2020 and the end of January 2021, following a sharp increase in February 2021 and reached a peak by the end of the month. This brought about the second peak of COVID-19 infections, from approximately late March to April 2021. Subsequently, the total SARS-CoV-2 concentrations measured by N1 and N2 gene RT-ddPCR in wastewater samples reached comparatively lower peaks in June 2021, which preceded the increase of COVID-19 incidences towards the end of July and August 2021 (Fig. 4). The following decrease of N1 and N2 gene concentrations after each peak was largely due to the termination of shedding events which last a few weeks as demonstrated in Fig. 1 and Table S2.
Table 2

N1 and N2 gene concentrations measured by RT-ddPCR in wastewater samples collected from GLWA WRRF.

UnitGeneInterceptor
ONWINIEADRI
gc/lN1Maximum5773.421517.101405.36
Minimum256.39204.78185.41
Mean1069.11659.16550.48
N2Maximum4826.542583.641927.34
Minimum281.47202.71210.27
Mean1048.03729.26600.62
gc/dN1Maximum5.23E+121.29E+121.87E+12
Minimum1.66E+111.57E+111.61E+11
Mean7.68E+114.36E+114.41E+11
N2Maximum4.38E+121.75E+121.87E+12
Minimum1.56E+111.61E+111.82E+11
Mean7.58E+114.81E+114.79E+11
gc/l of SFN1Maximum2.54E+043.87E+039.99E+03
Minimum8.05E+024.71E+028.57E+02
Mean3.73E+031.31E+032.35E+03
N2Maximum2.13E+045.23E+039.99E+03
Minimum7.58E+024.82E+029.71E+02
Mean3.68E+031.44E+032.55E+03

Note: SF stands for “Sanitary Flow”.

Fig. 4

Total N1 and N2 gene concentrations in gc/L for the three interceptors and total confirmed COVID-19 cases in the city of Detroit, as well as Wayne, Macomb, and Oakland counties.

N1 and N2 gene concentrations measured by RT-ddPCR in wastewater samples collected from GLWA WRRF. Note: SF stands for “Sanitary Flow”. Total N1 and N2 gene concentrations in gc/L for the three interceptors and total confirmed COVID-19 cases in the city of Detroit, as well as Wayne, Macomb, and Oakland counties. Best prediction models based on (a) N1 gene concentrations (gc/L) and (b) N2 gene concentrations (gc/L) with a 5-week lag time.

Correlation between SARS-CoV-2 and confirmed COVID-19 cases

A series of statistical analysis with normalizations using different parameters, including BOD, TSS, wastewater flow volume and sanitary percentage of wastewater were conducted. Wastewater flow volume and sanitary percentage of wastewater were chosen for subsequent analysis in terms of stronger Pearson's correlation. Table 2 presents the N1 and N2 gene concentrations normalized by total flow volume and sanitary percentage, respectively. The fluctuations of the normalized results in gc/d and gc/L of sanitary flow stay relatively comparable to the original results in gc/L of the N1 and N2 genes, which predicted the peaks of COVID-19 incidence by 5 weeks. SARS-CoV-2 gene concentrations in wastewater were influenced by absolute clinical case numbers of COVID-19 on single consecutive days. Therefore, 7-day moving averages of clinical cases were performed to smooth the data and reduce outliers. Confirmed COVID-19 clinical data were then aligned with SARS-CoV-2 concentrations (N1/N2) on the exact sampling dates. Missing data from samples were filled using linear interpolation (Lepot et al., 2017). After aligning these two datasets, each SARS-CoV-2 concentrations (N1/N2) had a corresponding COVID-19 case data to be analyzed for pairwise correlation and statistical modeling in the next sections. We proposed the hypothesis that the fluctuations of SARS-CoV-2 concentrations in wastewater correlate to confirmed COVID-19 cases with a prescribed range of lag times. Hence, Pearson's correlations were first conducted to investigate the strength of a linear correlation between the total N1 and N2 gene concentrations in gc/L, gc/d, and gc/L of sanitary flow and total confirmed COVID-19 cases in the study areas with various lag times (Table S28). To estimate the approximate lag time, week-shift of the clinical cases was adopted. Notably, SARS-CoV-2 concentrations in wastewater strongly correlated with the 5-week shifting forward of a 7-day moving average of COVID-19 incidences (Pearson's r = 0.62 for N1 gene in gc/L, and Pearson's r = 0.64 for N2 gene in gc/L). Our suggested lag time of 5 weeks between peaks in SARS-CoV-2 genes in wastewater and peaks in reported COVID-19 clinical tests is relatively similar to the longest lag times reported in the literature (Table 1). For example, a lag of 21 days was reported in Australia (Ahmed et al., 2021) and Sweden (Saguti et al., 2021), and 28 days in Australia (Ahmed et al., 2020a). The increased lag time may be in partly due to our sampling method (VIRADEL) that focuses on viruses in the supernatant of untreated wastewater rather than in the precipitates and solids. These samples represent the near real-time increase in clinical cases in the community. This is important for samples collected in large interceptors where precipitation and resuspension of solids occurs to a large extent. Factors affecting our observed lag time are discussed in Section 3.4. Our analysis also demonstrated that the normalization of clinical data using wastewater flow rate and percentage of sanitary flow did not significantly improve the correlation between the SARS-CoV-2 concentrations in wastewater and COVID-19 clinical cases in communities during the study period. Similar observations were also demonstrated in another recent study (Ai et al., 2021).

Statistical models on the relationship between wastewater surveillance of SARS-CoV-2 and COVID-19 incidence

To better estimate the relationship between the measured SARS-CoV-2 concentrations in wastewater and reported COVID-19 incidence in the communities, four statistical models were established naming: linear model (L), autoregression model (A), autoregression with time effect model (AT), and vector autoregression model (VA). Confirmed COVID-19 cases with lag times of 3, 4, and 5 weeks were chosen to correlate with N1 and N2 gene concentrations in gc/L, gc/d, and gc/L of sanitary flow using the aforementioned models. The Root Mean Square Error (RMSE) and Pearson's coefficient between actual cases and predicted cases were calculated to estimate the performance of each model shown in Supplementary Tables S10 – S15. From the previous result in Section 3.2, a lag time of 5 weeks exhibits a stronger correlation, in agreement with the models in Table 3 and Tables S29 and S30, based on Pearson's r. For 5 weeks, in Table S29, linear regression only considers SARS-CoV-2 concentration as the predictor in the model, the correlation between actual case number and predicted case number is range from 0.4 to 0.62. Seasonal patterns from the residual of linear regression were also observed. Thus, we consider that there is an autocorrelation effect from the case number. The models were therefore improved with autoregressive errors using ARIMA. The correlation then consequently increased to 0.4–0.67. It is important to note that there is a limitation of ARIMA, it does not apply to seasonal data. After a seasonal component was included in the ARIMA model, the correlation further improved to 0.94–0.95. Thus, the analyses suggest that there is a seasonal pattern present in clinical COVID-19 cases. VA is also a better model (r ranges from 0.95 to 0.96) because it considers both SARS-CoV-2 concentration and case number as the predictors in the model and use their past values to predict current case number. Similarly, the modeling results of N2 gene concentrations show agreement with modeling results of N1 gene concentrations. Comparing the non-normalized data with normalized data in Table S29 and Table S30, for the L and A models, using normalized data does not improve the correlation. For AT and VA models, using normalized data shows similar results comparing to using non-normalized data. It may indicate that these two models reduce the effect of normalizing data. It also indicates that normalizations of the original N1 and N2 gene concentrations did not significantly improve the performance of the modeling and vice versa. All other detailed results are shown in Tables S10 – S15. Same models and analysis were also applied to both the measurements by VIRADEL method (results were shown in Tables S16 – S21) and PEG sampling method results (shown in Tables S22 and S23, Fig. S2) of the study period between August 2021 and February 2022 for comparison purposes as explained in Section 3.4.
Table 3

Statistical modeling results between N1 and N2 gene concentrations and total COVID-19 cases during the 5-week lag time study period in city of Detroit, as well as Wayne, Macomb, and Oakland counties (* is shown in Fig. 5.)

Lag time
Model
N1-based results
N2-based results
RMSE
RMSE
Unit of N1/N2 genegc/Lgc/dgc/L of sanitary flowgc/Lgc/dgc/L of sanitary flow
3 weekLinear7.22135.7611.7812.40926.305.62
Autoregression135.65780.00250.52341.27901.23700.34
Autoregression+ time effect10.1810.1810.9711.3412.3415.33
Vector Autoregression8.327.858.978.899.909.90
4 weekLinear7.26123.569.1816.37104.458.33
Autoregression182.92234.90635.69132.35730.74500.62
Autoregression+ time effect7.507.477.209.757.397.33
Vector Autoregression8.007.998.626.888.317.62
5 weekLinear1.8348.972.6213.9536.192.36
Autoregression105.81417.57642.83548.14570.56100.95
Autoregression+ time effect*1.47*1.601.603.21*1.601.42
Vector Autoregression*0.35*0.534.447.57*4.371.03
Statistical modeling results between N1 and N2 gene concentrations and total COVID-19 cases during the 5-week lag time study period in city of Detroit, as well as Wayne, Macomb, and Oakland counties (* is shown in Fig. 5.)
Fig. 5

Best prediction models based on (a) N1 gene concentrations (gc/L) and (b) N2 gene concentrations (gc/L) with a 5-week lag time.

Factors affecting lag time

Variants

All the discussions above are based on the study period between September 2020 and August 2021 prior to the Omicron surge. Variations of lag times could not be explicitly elucidated without addressing the changing epidemiological characteristics of emerging SARS-CoV-2 variants, involving incubation time, shedding durations, shedding dynamics, and so forth. Given the shortened incubation time (median of 3 days, (Baker et al., 2022; Brandal et al., 2021; Jansen et al., 2021)) and shedding duration (less than 10 days, (Lamers et al., 2022)) during the Omicron surge, we identified a 2-week lag time, between August 2021 and February 2022, with the same VIRADEL method and same modeling methods (Fig. S2). The incubation time (Baker et al., 2022) and shedding duration (Lamers et al., 2022) were apparently shorter comparing to the parental and previous variants, inevitably leading to a shorter lag time. Besides, the shedding dynamics changed amid Omicron surge which inevitably affected the lag time (Table S3). The variant test results shown in Table S9 demonstrate the different mutations of dominant variants identified in the samples which in addition correspond to the reported emerging variants shown in Table S8. Changing epidemiological characteristics of SARS-CoV-2 variants play a crucial role in affecting the lag time.

Sampling method

An additional factor that may influence lag time is the sampling method. The VIRADEL electropositive filtration method focuses on supernatant virus in wastewater and avoids the inclusion of large wastewater organic solids where viruses may adsorb onto the surfaces. This method has been recommended by the EPA (USEPA, 2001; USEPA, 2014) and has been extensively applied in the field (Brijen et al., 2020; McCall et al., 2020, McCall et al., 2021; Miyani et al., 2021a). On the other hand, grab or composite sampling followed by PEG precipitation incorporates viruses attached onto larger solid particles, which tend to settle in large interceptors and generally re-suspend during periods of high flow. If these larger particles are included in a grab or composite sample, they may include a portion of the viruses that have been settled for a while, having been excreted earlier into that sewer-shed, thus interfering with the desired prediction which is the objective of this work. This becomes a critical factor in interceptors of large urban centers. For small catchment areas with short hydraulic detention times in neighborhood sewer lines the importance of this factor is expected to decrease. To compare the two methods, we simultaneously collected samples from the same locations. This comparison took place during the omicron surge and the data are shown in the supplementary information. Between August 2021 and February 2022, 24-h composite samples followed by PEG were collected at the day as the VIRADEL electropositive filtration was performed. This allowed us to investigate on the impact of sampling methods on lag time and demonstrate the potential for providing early warning signals of the VIRADEL method through comparison. The PEG measurements (Fig. S1) and its Pearson's correlation (Table S26) demonstrated that the N1 and N2 concentrations did not correlate with COVID-19 cases with a lag time. Results from the same models presented above are shown in Table S22 and S23 for PEG measurements. Results demonstrate that N1 and N2 concentrations based on PEG method did not provide an early warning of COVID-19 cases or significant correlations with the COVID-19 cases in the study area in terms of Pearson's r and RMSE.

Clinical data uncertainties

The report time for associated clinical cases tends to be uncertain due to potential reluctance to be tested (Feng et al., 2021), disparities in reporting clinical data and availability of different testing methods (Ai et al., 2021), limited testing supplies and limited testing sites under rapid testing demand and so forth (Hasan and Nasution, 2021). Since the inception of the COVID-19 pandemic, communities in the U.S. as well as many countries across the world had troubles and limited access to COVID-19 testing and real-time reporting, resulting in delay of real-time tracking and monitoring the clinical cases (Bibby et al., 2021; Larsen et al., 2021). The lag time could not be explicitly elucidated without addressing all the challenges facing clinical testing, especially amid the early stages of the unprecedented pandemic. In the early stage of the pandemic, the health departments were struggling to provide prompt testing across the country, such as in Detroit and its surrounding counties, rendering a notable delay in clinical data collection and reporting, inevitably leading to a longer lag time (Rader, 2020; Torres et al., 2021; Wiens et al., 2021). Throughout the past two years' efforts amid COVID-19 pandemic, governmental agencies and LHDs have been adapting to increasing COVID-19 cases as well as testing demands by gradually building the testing capabilities, improving clinical data processing and organizing, pushing clinical data releasing and so forth (Alexander et al., 2022; Powell et al., 2021). The significant improvement and adaptation of LHD to the rapid changing transmissions and clinical incidences of COVID-19 enable a quicker and prompt clinical data reporting and releasing, inevitably leading to a shorter lag time. Despite these caveats, WBE is widely accepted as an effective tool for forewarning community fluctuations in COVID-19 infections (Bibby et al., 2021). Notwithstanding the uncertainties discussed above, we put forth that WBE is an important tool in predicting future fluctuations of COVID-19 infections. Overall, this study demonstrates the effectiveness of applying wastewater-based epidemiology (WBE) as an early warning tool for the prediction of fluctuations of COVID-19 cases in communities in the Detroit metropolitan area. To our knowledge, this is one of the first studies to systematically evaluate the lag time between peaks in measured concentrations of SARS-CoV-2 in wastewater and peaks in reported COVID-19 cases based on clinical testing. Also, this is, to our knowledge, one of the first studies to propose the use of an autoregression with seasonal pattern model and a vector autoregression model in predicting clinical COVID-19 incidences based on the N1 and N2 gene measurements in wastewater. Though WBE demonstrates great promise, potential limitations and challenges remain. More research is warranted to establish a standard framework for modeling the latency between early detection of COVID-19 and presentation of clinical cases. Future studies should include establishing predictive models to optimize wastewater surveillance for early warning of clinical manifestation.

Conclusions

During the 12-month study, prior to the omicron surge, 407 wastewater samples were collected and analyzed for SARS-CoV-2 genes using RT-ddPCR. Measured concentrations of SARS-CoV-2 ranged from 714.85 to 7145.98 gc/L by total N1 gene RT-ddPCR and 820.47 to 6219.05 gc/L by total N2 gene RT-ddPCR. Lag time, the latency from surge in viral concentration in wastewater and peak in clinical cases was estimated as 5 weeks prior to the Omicron surge. As compared to linear regression and autoregression (ARIMA) models, the autoregression model with seasonal patterns and vector autoregression model were more effective in predicting COVID-19 cases during the study period for the 5-week lag scenario. Original N1 and N2 gene concentrations were normalized by total flow volumes and sanitary percentage. The statistical results indicated the optimum 5-week prediction models were consistent for both normalized and non-normalized data. Surveillance through wastewater sampling and analysis can be employed for predicting infections and monitoring health conditions in large metropolitan areas such as Detroit.

Funding

This study was funded by the , Michigan Department of Environment, Great Lakes, and Energy (EGLE), and the Great Lakes Water Authority (GLWA).

CRediT authorship contribution statement

Liang Zhao: Methodology, Investigation, Data curation, Formal analysis, Visualization, Validation, Writing – original draft. Yangyang Zou: Investigation, Software, Formal analysis, Writing – review & editing. Yabing Li: Investigation, Writing – review & editing. Brijen Miyani: Investigation, Writing – review & editing. Maddie Spooner: Investigation, Writing – review & editing. Zachary Gentry: Investigation, Writing – review & editing. Sydney Jacobi: Investigation, Writing – review & editing. Randy E. David: Writing – review & editing. Scott Withington: Writing – review & editing. Stacey McFarlane: Writing – review & editing. Russell Faust: Writing – review & editing. Johnathon Sheets: Investigation, Data curation, Writing – review & editing. Andrew Kaye: Investigation, Data curation, Writing – review & editing. James Broz: Investigation, Data curation, Writing – review & editing. Anil Gosine: Writing – review & editing. Palencia Mobley: Writing – review & editing. Andrea W.U. Busch: Writing – review & editing. John Norton: Funding acquisition, Writing – review & editing. Irene Xagoraraki: Conceptualization, Funding acquisition, Methodology, Investigation, Project administration, Resources, Supervision, Writing – review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  4 in total

1.  Surveillance of SARS-CoV-2 in nine neighborhood sewersheds in Detroit Tri-County area, United States: Assessing per capita SARS-CoV-2 estimations and COVID-19 incidence.

Authors:  Yabing Li; Brijen Miyani; Liang Zhao; Maddie Spooner; Zach Gentry; Yangyang Zou; Geoff Rhodes; Hui Li; Andrew Kaye; John Norton; Irene Xagoraraki
Journal:  Sci Total Environ       Date:  2022-08-27       Impact factor: 10.753

2.  Citywide wastewater SARS-CoV-2 levels strongly correlated with multiple disease surveillance indicators and outcomes over three COVID-19 waves.

Authors:  Loren Hopkins; David Persse; Kelsey Caton; Katherine Ensor; Rebecca Schneider; Camille McCall; Lauren B Stadler
Journal:  Sci Total Environ       Date:  2022-09-23       Impact factor: 10.753

3.  Anthropogenic Sewage Water Circuit as Vector for SARS-CoV-2 Viral ARN Transport and Public Health Assessment, Monitoring and Forecasting-Sibiu Metropolitan Area (Transylvania/Romania) Study Case.

Authors:  Ioana Boeraș; Angela Curtean-Bănăduc; Doru Bănăduc; Gabriela Cioca
Journal:  Int J Environ Res Public Health       Date:  2022-09-17       Impact factor: 4.614

4.  SARS-CoV-2 Monitoring in Wastewater Reveals Novel Variants and Biomarkers of Infection.

Authors:  Jenna McGowan; Monica Borucki; Hicham Omairi; Merina Varghese; Shahnaz Vellani; Sukanya Chakravarty; Shumin Fan; Srestha Chattopadhyay; Mashuk Siddiquee; James B Thissen; Nisha Mulakken; Joseph Moon; Jeffrey Kimbrel; Amit K Tiwari; Roger Travis Taylor; Dae-Wook Kang; Crystal Jaing; Ritu Chakravarti; Saurabh Chattopadhyay
Journal:  Viruses       Date:  2022-09-13       Impact factor: 5.818

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.