| Literature DB >> 28729702 |
Nguyen Thi Duy Nhat1,2, Stacy Todd1,3, Erwin de Bruin4,5, Tran Thi Nhu Thao1, Nguyen Ha Thao Vy1, Tran Minh Quan1, Dao Nguyen Vinh1, Janko van Beek5, Pham Hong Anh1, Ha Minh Lam1, Nguyen Thanh Hung1, Nguyen Thi Le Thanh1, Huynh Le Anh Huy1, Vo Thi Hong Ha6, Stephen Baker1,2, Guy E Thwaites1,2, Nguyen Thi Nam Lien7, Tran Thi Kim Hong8, Jeremy Farrar1,9, Cameron P Simmons10, Nguyen Van Vinh Chau11, Marion Koopmans4,5, Maciej F Boni12,13,14.
Abstract
Seroepidemiological studies aim to understand population-level exposure and immunity to infectious diseases. Their results are normally presented as binary outcomes describing the presence or absence of pathogen-specific antibody, despite the fact that many assays measure continuous quantities. A population's natural distribution of antibody titers to an endemic infectious disease may include information on multiple serological states - naiveté, recent infection, non-recent infection, childhood infection - depending on the disease in question and the acquisition and waning patterns of immunity. In this study, we investigate 20,152 general-population serum samples from southern Vietnam collected between 2009 and 2013 from which we report antibody titers to the influenza virus HA1 protein using a continuous titer measurement from a protein microarray assay. We describe the distributions of antibody titers to subtypes 2009 H1N1 and H3N2. Using a model selection approach to fit mixture distributions, we show that 2009 H1N1 antibody titers fall into four titer subgroups and that H3N2 titers fall into three subgroups. For H1N1, our interpretation is that the two highest-titer subgroups correspond to recent and historical infection, which is consistent with 2009 pandemic attack rates. Similar interpretations are available for H3N2, but right-censoring of titers makes these interpretations difficult to validate.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28729702 PMCID: PMC5519701 DOI: 10.1038/s41598-017-06177-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Antibody titer histograms for n = 20,152 individuals, plotted for all ages (top panels) and by age group (bottom four panels). Titers shown are to the HA1 components of the 2009 H1N1 pandemic influenza virus (left column) and to recently circulating H3N2 viruses (right column). The fractions of individuals with titers below the detection limit of 20 and above 1280 that were out of the plotting ranges are given next to the respective bar. Histograms were weighted to adjust for age and gender according to the Vietnam national housing census in 2009 for the four collection sites.
Figure 2Titer histograms for 2009 H1N1, showing fit results for mixture models with different numbers of normal components (top to bottom; the label to the left of the y‐axis is the number of mixture components) and grouped by collection sites. Histograms are weighted to adjust for age and gender according to the Vietnam national housing census in 2009 for each of the four collection sites. The blue lines in each panel are the normalized probability density functions of the component distributions with darker colors used for increasing μ. The black lines show the full mixture distribution density, and the black dots are the estimated cumulative distribution of the mixture models at 7.0 (titer of 1280). The numbers in the upper right corner of each panel are the BIC scores of the model fits. The fractions of individuals with titers below the detection limit of 20 and above 1280 that were out of the plotting ranges were given next to their respective bars.
Figure 3Visualization of model selection process for 2009 H1N1 titer-distribution models from Fig. 2. The y-axes show the fitted values of w i (mixture weights), μ i (means), and σ i (standard deviations). Components’ shades were ranked from lightest to darkest in the order of increasing μ. In the top panel, the “0th component” represents the point mass w 0 placed at 20 for titers below the lower detection limit of 20. Note that in many cases for five or six components, the weights or standard deviation parameters are close to zero; for some cases, two of the inferred mean parameters are very close to each other.
Change in BIC values as the number of normal distributions in the mixture increases from one to six for 2009 H1N1, for the aggregated data as well as the individual collection sites.
| ALL SITES | HUE | KHANH HOA | HCMC | DAK LAK | ||
|---|---|---|---|---|---|---|
| (N = 19335) | (N = 4390) | (N = 5053) | (N = 5753) | (N = 4139) | ||
| SumNegLLH, single component | 71504.04 | 15812.88 | 18726.69 | 21496.01 | 15432.06 | |
| change in number of components | 1 to 2 | −826.40 | −170.79 | −182.44 | −215.33 | −208.86 |
| 2 to 3 | −410.07 | −74.55 | −79.83 | −137.95 | −131.34 | |
| 3 to 4 | −318.12 | −66.76 | −56.37 | −77.14 | −32.68 | |
| 4 to 5 | −19.87 | −18.32 | 8.68 | 16.82 | 6.10 | |
| 5 to 6 | −27.42 | 2.13 | 24.23 | 9.57 | −2.59 | |
The values of the negative sum of the log likelihood were weighted to adjust for age and gender according to the Vietnam national housing census in 2009. The first row shows the exact values of sum of the negative log-likelihoods (SumNegLLH).
Figure 4Titer histograms and fit results for mixture models with different numbers of components (label on the left is the number of mixture components) and grouped by different age groups recommended by the CONCISE (http://consise.tghn.org/) consortium for 2009 H1N1 influenza. Histograms are weighted to adjust for age and gender according to the Vietnam national housing census in 2009. The numbers in the upper right corner of each panel are the fitted BIC scores of the respective model. For each panel, the blue lines are the normalized probability density of the component distributions with darker colors used for increasing μ. Black lines are the total mixture distribution density; and the black dots are estimated probability weight of the mixture model for titers ≥7.0. The fractions of individuals with titers below the detection limit of 20 and above 1280 that were out of the plotting ranges are shown next to their respective bars.