| Literature DB >> 35814295 |
Nathan C K Wong1, Sepehr Meshkinfamfard2, Valérian Turbé2, Matthew Whitaker3, Maya Moshe4, Alessia Bardanzellu1, Tianhong Dai1, Eduardo Pignatelli1, Wendy Barclay5,4,6, Ara Darzi5,7,6, Paul Elliott3,5,6,8, Helen Ward3,5,6, Reiko J Tanaka1, Graham S Cooke4,6, Rachel A McKendry2,9, Christina J Atchison3,5, Anil A Bharath1.
Abstract
Background: Lateral flow immunoassays (LFIAs) are being used worldwide for COVID-19 mass testing and antibody prevalence studies. Relatively simple to use and low cost, these tests can be self-administered at home, but rely on subjective interpretation of a test line by eye, risking false positives and false negatives. Here, we report on the development of ALFA (Automated Lateral Flow Analysis) to improve reported sensitivity and specificity.Entities:
Keywords: Databases; Public health
Year: 2022 PMID: 35814295 PMCID: PMC9259560 DOI: 10.1038/s43856-022-00146-z
Source DB: PubMed Journal: Commun Med (Lond) ISSN: 2730-664X
Fig. 1Key visual features of the Fortress SARS-CoV-2 Lateral Flow Immunoassay (LFIA) device.
a Diagram illustrating the test result window and blood sample well. This diagram shows a negative Immunoglobin G (IgG) test. b Example of a participant-submitted photographic image of a negative IgG test. c The result window has an initially blue control line, which will remain if the test is unsuccessful (Invalid). d In a successful test, the control line turns red, and if IgG antibodies are present in the blood sample, a secondary line will appear below the control. The tertiary line indicates the presence of Immunoglobin M (IgM) antibodies, which were not used as part of the analysis in the REACT-2 study.
A breakdown of the subsets used to develop ALFA.
| Dataset | 1 | 2 | 3 | 4 | |||
|---|---|---|---|---|---|---|---|
| Experiment | Segmentation Network | Validity (Control-line status) | IgG Status Classification 1 | IgG Status Classification 2 | |||
| Purpose | Develop | Test | Test | Develop | Test | Develop | Test |
| No. of samples | 415 | 83 | 187 | 864 | 294 | 1699 | 237 |
| No. of IgG positives | – | – | – | 351 | 143 | 641 | 79 |
| No. of invalids | – | – | 12 | – | – | – | – |
| The subsets from REACT-2 Study-5 Rounds which were combined to form the datasets (“Included” indicates inclusion) | |||||||
| Round-1, Sample A | Included | – | Included | Included | |||
| Round-1, Sample B | Included | Included | Included | Included | |||
| Round-1, Sample C | – | – | Included | Included | |||
| Round-2 sample | – | – | Included | Included | |||
| Round-5 sample | – | – | – | Included | |||
“Experiment” refers to which component of ALFA was being developed/tested. Most datasets were split into development and test subsets and indicated in the “purpose” row. The origins of the images are also shown in the bottom half of the table. “Round” refers to the round of the REACT-2 Study-5 from which the data were collected. “Included” indicate whether the sample was included in the dataset.
Results of classification experiment 1 (CE1) (N = 294, dataset 3’s test dataset with fewer weak positive results).
| Model/heuristic/participants | Specificity | Sensitivity | Accuracy | Cohen’s kappa |
|---|---|---|---|---|
| 2D CNN | 0.994 | 0.971 | 0.983 | 0.966 |
| 1D CNN | 0.999 | 0.879 | 0.941 | 0.882 |
| Heuristic-C | 0.994 | 0.757 | 0.881 | 0.759 |
| Study participants | 0.961 | 1.000 | 0.980 | 0.959 |
CNN Convolutional Neural Network.
Results of classification experiment 2 (CE2), including the performance of the 2D CNN trained in experiment 1 (CE1), on dataset 4’s test set with more weak positive results (N = 237).
| Model/heuristic/participants | Specificity | Sensitivity | Accuracy | Cohen’s kappa |
|---|---|---|---|---|
| 2D CNN (CE1) | 1.000 | 0.852 | 0.949 | 0.883 |
| 2D CNN (CE2, retrained) | 0.987 | 0.901 | 0.958 | 0.905 |
| 1D CNN | 0.968 | 0.844 | 0.926 | 0.733 |
| Heuristic-C | 0.976 | 0.487 | 0.815 | 0.525 |
| ViT Net | 0.968 | 0.952 | 0.962 | 0.917 |
| Study Participants | 0.974 | 0.679 | 0.873 | 0.699 |
CNN Convolutional Neural Network.
Fig. 2Examples of issues with the Lateral Flow Immunassay (LFIA) device.
a Examples of weak Immunoglobin G (IgG) positive LFIA results. The control line is blue if the test has not been completed, turning red if the sample and buffer have been correctly added. ‘Weak’ IgG positives were highlighted as being a challenging scenario for the algorithms to classify correctly. As seen in the examples, the line is faint, and with additional issues of variable lighting. The solution was to introduce more of these cases into the training data. b An invalid LFIA test with a partially converted control line. The method for determining validity looked at the normalised red and blue pixel intensity at the detected control line. A potential source of misclassification is partial conversion of the control line from blue to red. c Examples of blood leakage on the LFIA. The presence of blood leakage at the sample and buffer end of the read-out window was found to be a common source of misreading by participants. This led many participants to report the test result as being Immunoglobin M (IgM) positive even though the test was both IgG and IgM negative.
Fig. 3Comparison between automated lateral flow analysis (ALFA) pipeline and REACT-2 Study-5 participants.
a Cohen’s kappa between the two 2D Convolutional Neural Network (CNN) algorithms developed as part of the ALFA pipeline and study participants for REACT-2 rounds 1 to 5 (Respectively for each round, N = 93252, N = 95508, N = 123614, N = 133225, N = 140240). 2D CNN (Classification Experiment 2 (CE2), Blue) and 2D CNN (Classification Experiment 1 (CE1), Orange) indicate strong Cohen’s kappa, suggesting that participants are very good at reading their own results throughout the series of surveys. 2D CNN (CE1) is likely to have higher agreement than 2D CNN (CE2) because CE2 is picking up a greater proportion of weak positives, some of which are being missed by participants. The ALFA pipeline suggests substantial agreement between (0.70) participants’ readings and the ALFA pipeline, providing confidence in the self-reading of fingerprick blood home self-testing LFIA kits. Error bars represent a 95% confidence interval. b Estimated prevalence of SARS-CoV-2 Immunoglobin G (IgG) antibodies in adults in England in the REACT-2 study by i) participant-reported results (Blue) and ii) ALFA (2D CNN CE2, Orange) automated read-out. Error bars represent 95% confidence interval, respectively for each round, N = 88557, N = 94291, N = 122211, N = 131327, N = 138200. c ALFA (2D CNN CE2, Orange bars) estimated antibody prevalence (Respectively for each round, N = 88557, N = 94291, N = 122211, N = 131327, N = 138200) overlaying daily new test positive COVID-19 cases (Blue line graph, Data acquired from GOV.UK here: https://coronavirus.data.gov.uk/).
A comparison to show the advances of this study (Wong et al.) over other recent research.
| Sample/LFIA Target | Tablet device (single device type) | Study Participants | Technology | Train-Test Set | Deployment | |
|---|---|---|---|---|---|---|
| Turbe et al.[ | Blood Serum/HIV antibody | Tablet device (single device type) | 60 Field Workers | 11,374 | 40 tests in the field | |
| LFD AI Consortium (Beggs, corr author)[ | Nasopharyngeal Swab/ SARS-CoV-2 antigen | Expert Operator + participant at-home testing | Asymptomatic, provider-led community testing and NHS workers at home. | Multiple networks | 115,316 | Unknown |
| Mendels et al.[ | Blood Serum/SARS-CoV-2 antibody | Custom Stand with fixed distance/pose to camera; 3 image capture types, multiple LFIAs. | Not specified | Multiple networks | 4344 | N/A |
| Wong (this paper) | Blood Serum/ SARS-CoV-2 antibody | Hand-held camera, multiple capture types, untrained users, multiple LFIA types. | Lay population self-testing | Multiple networks | 1936 | 500,000+ in the field |