| Literature DB >> 33120404 |
Abstract
Entities:
Mesh:
Year: 2020 PMID: 33120404 PMCID: PMC7595277 DOI: 10.1371/journal.pntd.0008751
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Fig 1The biologically hierarchical problem of detecting a pathogen in its hosts.
A subject drawn from a population is infected with the pathogen with probability Ψ (which reflects the prevalence of infection in that population). A sample drawn from a subject contains the pathogen with probability θ, which is conditional on Ψ. Therefore, θ represents sample-level pathogen (or, more broadly, target) availability, given subject-level infection. Finally, pathogen detection tests run on samples drawn from a subject detect the target with a probability p that is conditional on θ; therefore, p represents test-level probability of detection, given sample-level availability (and, hence, subject-level infection). This shows that the biological problem of detecting a pathogen in its hosts involves a hierarchy of nested levels: subjects (within populations), samples within subjects, and tests within samples. To understand how the system works, we therefore need to estimate 3 probabilities: test-level detection (p), sample-level availability (θ), and subject-level infection (Ψ). Replicate tests inform on the value of p (which is, strictly speaking, the sensitivity of the test) and replicate samples (possibly encompassing different tissues or bodily fluids, as suggested by black/white drops) drawn from the same subject inform on the value of θ; this opens the possibility of estimating Ψ and, therefore, prevalence in the population. Note that all the parameters above (Ψ, θ, and p) are probabilities and hence can take on any value between 0 and 1.
Fig 2Detecting Trypanosoma cruzi in human subjects: A hierarchical approach to a hierarchical problem.
Two tests (T1 and T2) are run in duplicate on each of 2 samples (S1 and S2) drawn from subjects A and B; the observed data come in the form of a “history” of detection (coded “1”) or non-detection (“0”) of the parasite (or a surrogate biomarker). In A, both T1 and the first T2 (but not the second) replicates run on S1, and none of the 4 run on S2, yielded a “1.” This means (assuming no false positives) that T. cruzi (represented by a small curly object) (i) was infecting subject A (Ψ); (ii) was available for detection in S1 (red drop; θS1) and indeed detected twice by T1 (pT1-S1 × pT1-S1) and once by T2 (pT2-S1), yet the second T2 failed to detect it (1−pT2-S1) (this is, hence, a true false negative; highlighted in red); and (iii) was not detected by any test in S2—which may be because T. cruzi was unavailable for detection in S2 (1−θS2) or because it was available (θS2), yet all test failed to detect it (1−pT1-S2)2 × (1−pT2-S2)2. I use gray font to highlight ambiguities. Was the parasite available in S2, but went undetected? Or was it unavailable, so that the tests gave the correct (sample-level) answer? Are the “0”s from S2 true negatives, or are they false negatives? In B, no detections are recorded in the subject’s history, and all observations are therefore ambiguous. Was the subject infected? If she was infected, was the parasite unavailable in both samples? Or was it available, yet the tests failed to detect it? Are the “0”s true or false negatives? Note (i) that replicate samples may be taken from more than a single tissue or bodily fluid, as well as on more than 1 occasion, and (ii) that all the parameters above (Ψ, θ, and p) are probabilities and hence can take on any value between 0 and 1.