| Literature DB >> 35444162 |
María Escobar1, Guillaume Jeanneret1, Laura Bravo-Sánchez1, Angela Castillo1, Catalina Gómez1,2, Diego Valderrama1, Mafe Roa1, Julián Martínez1, Jorge Madrid-Wolff3, Martha Cepeda4, Marcela Guevara-Suarez5, Olga L Sarmiento6, Andrés L Medaglia1,7, Manu Forero-Shelton8, Mauricio Velasco9, Juan M Pedraza8, Rachid Laajaj10, Silvia Restrepo5, Pablo Arbelaez11,12.
Abstract
Massive molecular testing for COVID-19 has been pointed out as fundamental to moderate the spread of the pandemic. Pooling methods can enhance testing efficiency, but they are viable only at low incidences of the disease. We propose Smart Pooling, a machine learning method that uses clinical and sociodemographic data from patients to increase the efficiency of informed Dorfman testing for COVID-19 by arranging samples into all-negative pools. To do this, we ran an automated method to train numerous machine learning models on a retrospective dataset from more than 8000 patients tested for SARS-CoV-2 from April to July 2020 in Bogotá, Colombia. We estimated the efficiency gains of using the predictor to support Dorfman testing by simulating the outcome of tests. We also computed the attainable efficiency gains of non-adaptive pooling schemes mathematically. Moreover, we measured the false-negative error rates in detecting the ORF1ab and N genes of the virus in RT-qPCR dilutions. Finally, we presented the efficiency gains of using our proposed pooling scheme on proof-of-concept pooled tests. We believe Smart Pooling will be efficient for optimizing massive testing of SARS-CoV-2.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35444162 PMCID: PMC9020431 DOI: 10.1038/s41598-022-10128-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Smart Pooling makes pooling methods efficient even at high disease prevalences. (a) In standard Dorfman’s testing methods, samples are pooled randomly. As prevalence increases, the efficiency of pooling without a priori information drops rapidly, making the strategy unviable. (b) Smart Pooling tackles this problem by thresholding samples according to the automated predicted risk probability, based on the sociodemographic characteristics. If the risk probability of a sample surpasses the defined threshold, this sample goes directly to individual testing. We arrange samples into pools with similar risk probability. (c) The Smart Pooling pipeline directly integrates machine learning with laboratory procedures.
Figure 2Smart Pooling decreases the expected number of tests per specimen. (a) Smart Pooling’s expected number of tests per specimen compared to standard testing methods on Patient Dataset. Efficiency improves by reducing the expected number of tests per specimen. Smart Pooling achieves higher efficiencies than Dorfman’s and individual testing for prevalences of the disease of up to 50%, with a fixed pool size of 10. (b) Smart Pooling’s expected number of tests per specimen trained with the coarse metadata from the Test Center Dataset. Efficiency improves by reducing the expected number of tests per specimen. Despite not having detailed patient metadata, Smart Pooling can produce higher efficiencies than Dorfman’s pooling for the simulated prevalence of the disease of up to 25%, with a fixed pool size of 10. (c) Expected number of tests per specimen for the proof-of-concept Smart Pooling Implementation. Each point on the graph represents the performance of both methods for a day in the proof-of-concept stage. The x-axis depicts the incidence of the corresponding day. Efficiency improves by reducing the expected number of tests per specimen. Smart Pooling has a similar efficiency compared to Dorfman’s testing since the daily incidence is lower than 10%. However, for every day of the implementation, Smart Pooling has a higher efficiency than individual testing.
Figure 3Comparison between Smart Pooling and Random ordering. Left: samples arranged by Smart Pooling. Right: random ordering of samples. Note that Smart Pooling groups positive samples generating long groups of negative samples, reducing the number of times it is necessary to perform the second stage of Dorfman’s testing, increasing the efficiency.
Figure 4Efficiency of two-stage Dorfman’s pooling as a function of prevalence p and pool size c.
Figure 5Optimal pooling strategies given a maximum pool size of (a) 10 and (b) 5.
Optimal pooling strategies restricted by a maximum pool size c.
| Pooling protocol | Prevalence Interval (%) | |
|---|---|---|
| Lower bound | Upper Bound | |
| 0.00 | 1.25 | |
| 1.25 | 1.375 | |
| 1.375 | 4.875 | |
| 4.875 | 5.25 | |
| 5.25 | 5.875 | |
| 5.875 | 6.375 | |
| 6.375 | 7.375 | |
| 7.375 | 8.125 | |
| 8.125 | 9.625 | |
| 9.625 | 10.875 | |
| 10.875 | 12.125 | |
| 12.125 | 12.375 | |
| 12.375 | 25.00 | |
| 0.00 | 6.625 | |
| 6.625 | 12.375 | |
| 12.375 | 25.00 | |
Prevalence intervals and their respective optimal pooling strategy. are single pooling protocols and are matrix pooling protocols.
Expected Ct, average , and the fraction of wells with no reading for markers N and ORF1ab for successive dilutions of the same sample.
| Dilution | ORF1ab | N | ||||
|---|---|---|---|---|---|---|
| Expected | Average | Fail fraction | Expected | Average | Fail fraction | |
| 0X | 38.4 | 39.0 | 0.00 | 38.0 | 37.5 | 0.00 |
| 1.4X | 38.9 | 39.0 | 0.13 | 38.5 | 38.0 | 0.00 |
| 2X | 39.4 | 40.1 | 0.00 | 39.0 | 38.5 | 0.00 |
| 2.7X | 39.9 | 40.5 | 0.00 | 39.5 | 38.9 | 0.00 |
| 3.8X | 40.3 | 40.9 | 0.63 | 39.9 | 39.3 | 0.13 |
| 5.4X | 40.8 | 42.1 | 0.67 | 40.4 | 39.9 | 0.33 |
| 7.5X | 41.3 | 41.6 | 0.50 | 40.9 | 399.0 | 0.50 |
| 10.5X | 41.8 | – | 1.00 | 41.4 | 40.2 | 0.57 |
| 14.8X | 42.3 | 42.0 | 0.88 | 41.9 | 40.6 | 0.75 |
| 20.7X | 42.8 | 41.3 | 0.88 | 42.4 | 39.4 | 0.75 |
| 28.9X | 43.3 | 41.7 | 0.88 | 42.9 | 40.3 | 0.88 |
Estimated upper bound for the error (false negative) rate for single pools of up to 16 samples for markers N and ORF1ab.
| Dilution | Error rate ORF1ab (%) | Error rate N (%) |
|---|---|---|
| 0.2 | 0.2 | |
| 0.2 | 0.2 | |
| 0.6 | 0.5 | |
| 2.6 | 1.5 | |
| 2.7 | 2.6 | |
| 3.5 | 3.2 | |
| 4.3 | 3.9 | |
| 5.0 | 4.4 | |
| 5.4 | 4.8 | |
| 6.4 | 5.8 | |
| 7.0 | 6.4 | |
| 7.6 | 7.0 | |
| 8.3 | 7.6 | |
| 9.0 | 8.2 | |
| 9.7 | 8.7 |