| Literature DB >> 28238698 |
Thomas W Burke1, Ricardo Henao2, Erik Soderblom3, Ephraim L Tsalik4, J Will Thompson3, Micah T McClain5, Marshall Nichols1, Bradly P Nicholson6, Timothy Veldman1, Joseph E Lucas2, M Arthur Moseley7, Ronald B Turner8, Robert Lambkin-Williams9, Alfred O Hero10, Christopher W Woods11, Geoffrey S Ginsburg12.
Abstract
Infection of respiratory mucosa with viral pathogens triggers complex immunologic events in the affected host. We sought to characterize this response through proteomic analysis of nasopharyngeal lavage in human subjects experimentally challenged with influenza A/H3N2 or human rhinovirus, and to develop targeted assays measuring peptides involved in this host response allowing classification of acute respiratory virus infection. Unbiased proteomic discovery analysis identified 3285 peptides corresponding to 438 unique proteins, and revealed that infection with H3N2 induces significant alterations in protein expression. These include proteins involved in acute inflammatory response, innate immune response, and the complement cascade. These data provide insights into the nature of the biological response to viral infection of the upper respiratory tract, and the proteins that are dysregulated by viral infection form the basis of signature that accurately classifies the infected state. Verification of this signature using targeted mass spectrometry in independent cohorts of subjects challenged with influenza or rhinovirus demonstrates that it performs with high accuracy (0.8623 AUROC, 75% TPR, 97.46% TNR). With further development as a clinical diagnostic, this signature may have utility in rapid screening for emerging infections, avoidance of inappropriate antibacterial therapy, and more rapid implementation of appropriate therapeutic and public health strategies.Entities:
Keywords: Diagnostic biomarker; Human rhinovirus; Infectious disease; Influenza; Proteomics
Mesh:
Substances:
Year: 2017 PMID: 28238698 PMCID: PMC5360578 DOI: 10.1016/j.ebiom.2017.02.015
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Description of experimental ARV challenge cohorts. Four experimental HRV challenge cohorts (two influenza A/H3N2 and two HRV) are described, including adjudicated phenotype summary data for each. Individuals with discordanta symptom and shedding labels, i.e. symptomatic non-shedders, or asymptomatic shedders, are shown. Sx = symptomatic; Asx = asymptomatic; mean Sx and Asx time T represents the average time of maximal self-reported symptoms among subject included in NPL analysis.
| Cohort | H3N2 #1 | H3N2 #2 | HRV #1 | HRV #2 |
|---|---|---|---|---|
| Subjects included in challenge | 17 | 21 | 20 | 30 |
| Subjects included in NPL analysis | 15 | 21 | 20 | 24 |
| Symptomatic | 9 | 13 | 12 | 15 |
| Shedding positive | 10 | 10 | 12 | 15 |
| Number discordant | 1 | 7 | 4 | 9 |
| Mean Sx time T (in hrs) | 69 | 68 | 66 | 76 |
| Corresponding Asx time T (in hrs) | 69.5 | 71 | 72 | 72 |
Individuals with discordant symptom and shedding labels, i.e. symptomatic non-shedders, or asymptomatic shedders. NPL = Nasopharyngeal lavage. Sx = Symptomatic. Asx = Asymptomatic. Time T represents the time of maximal symptoms.
Fig. 1Study design and experimental workflow. A two-phased strategy was employed to identify and characterize candidate protein biomarkers of ARV infection from NPL samples collected from participants in four experimental ARV challenge cohorts. For phase 1 discovery analysis, four NPL pools were prepared from H3N2 #1 cohort and analyzed using unbiased 2D-LC-MS/MS. The numbers of subject (N) with samples included in each pool are shown (Uninf = uninfected individuals; Inf = infected individuals; BL = baseline; T = time of maximal symptoms). For phase 2, the original and three additional independent challenge cohorts were assayed by targeted MRM. Quantitative peptide expression data from 80 individuals and 156 total samples were used in the derivation of an NPL ARV classifier, and classification performance was assessed in independent challenge cohorts using LOOCV.
Fig. 2Expression levels of candidate protein biomarkers as measured by unbiased LC/LC-MS/MS and targeted MRM. Four representative proteins (CFAB, TIG1, TBA1B, STATH) in H3N2 #1 challenge cohort were measured by (A) LC/LC-MS/MS unbiased discovery methods in sample pools (duplicate measures of multi-peptide meta-protein is shown), and (B) targeted MRM analysis of unique peptides (peptide sequence is shown above each plot) from individual participant samples from the same H3N2 #1 cohort. MRM data are expressed as relative ratio (endogenous “light” peptide to spiked SIL “heavy” peptide). Box-and-whisker plots for each group indicate intra-quartile range (box) with upper and lower (whisker) values representing 99% data coverage. Colored circles represent Uninf-BL (blue), Uninf-T (red), Inf-BL (yellow), and Inf-T (purple).
Candidate biomarker protein relative expression ratios. Rows represents the 26 host NPL proteins for which MRM assays were developed. UniProt gene symbol and protein names are shown for each candidate protein, with number of peptides measured for both unbiased pool and targeted MRM analyses. Unbiased pool ratios are shown for selection criteria 1, 2, and 3, as described in Supplementary Table S3. Fold expression changes from BL to T for infected individuals, as measured by peptide-specific MRM assay, across all four ARV challenge studies are shown with two-sided t-test Benjamini-Hochberg FDR-adjusted p-values (in parentheses) for each peptide, respectively.
| UniProt gene symbol/protein name | Unbiased pool peptides | Unbiased (pool) H3N2 #1 | Targeted MRM (individual) | ||||
|---|---|---|---|---|---|---|---|
| Criterion | Targeted MRM peptides | H3N2 #1/2 and HRV #1/2 | Notes | ||||
| 1 | 2 | 3 | |||||
| CFAB | Complement factor B | 34 | 3.6 | 2.1 | 1.6 | 2 | 2.0, 2.0 (1.8e − 4, 2.2e − 4) | |
| IC1 | Plasma protease C1 inhibitor | 8 | 6.4 | 1.8 | 1.0 | 2 | 1.3, 1.8 (0.089, 3.9e − 4) | |
| A1AT | Alpha-1-antitrypsin | 100 | 5.1 | 2.1 | 1.3 | 2 | 1.7, 1.7 (7.8e − 4, 5.0e − 4) | |
| A1AG1 | Alpha-1-acid glycoprotein 1 | 19 | 4.1 | 2.6 | 1.1 | 2 | 1.8, 1.8 (3.9e − 4, 3.9e − 3) | |
| ALB | Albumin | 805 | 2.4 | 1.4 | 1.0 | 2 | 1.5, 1.6 (7.8e − 4, 7.8e − 4) | |
| TIG1 | Retinoic acid receptor responder prot 1 | 3 | 3.3 | 4.2 | 5.7 | 2 | 2.3, 2.0 (6.2e − 4) | |
| LCN15 | Lipocalin-15 | 6 | − 2.5 | − 3.7 | − 2.9 | 2 | − 3.2, − 3.3 (1.1e − 3, 1.1e − 3) | |
| TBA1B | Tubulin alpha-1B chain | 3 | 2.8 | 2.6 | 2.2 | 2 | 1.8, 1.9 (1.0e − 3, 8.6e − 4) | |
| CYTA | Cystatin-A | 5 | − 2.6 | − 2.8 | − 2.4 | 2 | − 3.5, − 2.8 (8.6e − 4, 6.7e − 4) | |
| APOA1 | Apolipoprotein A-I | 57 | 5.1 | − 1.3 | − 1.8 | 2 | 2.1 (1.2e − 3) | |
| ANGT | Angiotensinogen | 6 | 4.9 | 2.3 | 1.5 | 2 | 1.4, 1.7 (0.018, 8.4e − 3) | |
| SPR1A | Cornifin-A | 2 | − 2.7 | − 2.0 | − 1.2 | 2 | − 2.2, − 1.8 (9.0e − 3, 3.9e − 3) | |
| CASPE | Caspase-14 | 10 | − 3.8 | − 2.1 | − 2.2 | 2 | − 3.1, − 2.7 (6.7e − 4, 1.0e − 3) | |
| APOA4 | Apolipoprotein A-IV | 7 | 11.4 | 1.1 | 1.0 | 2 | 2.2, 2.0 (1.3e − 3, 1.4e − 3) | |
| AMY1 | Alpha-amylase 1 | 7 | − 2.4 | − 2.9 | − 1.7 | 2 | − 2.1 (2.9e − 3) | |
| VMO1 | Vitelline membrane outer layer prot 1 | 3 | − 2.0 | − 2.9 | − 1.4 | 2 | − 2.2, − 2.6 (0.016, 8.7e − 3) | |
| ANXA2 | Annexin A2 | 8 | − 2.0 | − 2.0 | − 1.6 | 2 | − 1.5, − 1.6 (5.7e − 3, 4.7e − 3) | |
| CAYP1 | Calcyphosin | 4 | 7.5 | 2.5 | 2.1 | 3 | 2.6, 2.7, 2.4 (2.9e − 3, 1.6e − 3, 2.4e − 3) | |
| GGCT | Gamma-glutamylcyclotransferase | 2 | − 4.6 | − 2.0 | − 1.7 | 2 | − 1.7, − 2.1 (0.029, 9.9e − 4) | |
| DCD | Dermcidin | 3 | − 7.4 | − 2.5 | − 4.3 | 2 | − 2.3, − 2.2 (6.4e − 3, 6.4e − 3) | |
| SPRR3 | Small proline-rich protein 3 | 2 | 2.4 | 2.1 | 1.2 | 2 | − 1.6, − 1.9 (0.023, 7.8e − 3) | |
| APOA2 | Apolipoprotein A-I | 9 | 8.7 | 1.3 | 1.3 | 2 | 1.7, 1.5 (0.016, 0.07) | |
| STATH | Statherin | 2 | 112.6 | 17.5 | 140.0 | 1 | 5.4 (1.0e − 3) | |
| PROL4 | Proline-rich protein 4 | 66 | − 2.6 | − 2.0 | − 2.3 | 2 | − 1.1 (0.802) | |
| GSTA2 | Glutathione S-transferase A2 | 3 | 9.1 | 4.2 | 1.6 | 2 | 1.0 (0.902) | |
| FILA | Filaggrin | 2 | − 7.3 | − 3.8 | − 3.9 | 1 | − 1.6 (0.061) | |
Fold change in pooled analysis for each of 3 prioritization criteria.
Average fold change per peptide across all cohorts (p-values adjusted for false discovery rate).
Included due to potential biological significance.
Included as potential endogenous control.
Fig. 3Correlation of the peptide expression changes for all four H3N2 and HRV cohorts (r = 0.871). Graph shows peptides with significant expression changes (Benjamini-Hochberg FDR < 0.05) in H3N2 (red circle, 10 peptides), HRV (yellow circle, 14 peptides), both H3N2 and HRV (blue circle, 16 peptides), or neither cohort (purple circle) as measured by MRM.
Fig. 4Performance of a 10-peptide classification model in independent ARV cohorts. The model was fit to discriminate Inf-T from Inf-BL and Uninf-T. (A) Probability of positive is plotted (x-axis) for individual participant samples from 4 cohorts, with threshold for positive classification set at > 0.5. Lines connect paired samples from individuals at baseline (left) and time T (right). Solid red line = infected; dashed blue line = uninfected; dashed green line = uninfected sham inoculation; open circles = classifier agrees with phenotype label; X = classifier disagrees with phenotype label. (B) Receiver operating curve for LOOCV with the 10 peptide (9 unique proteins) classifier. The optimal threshold on the curve (open red circle) produces 0.8623 AUROC, 75% TPR, 97.46% TNR, with the confusion matrix and peptide weights shown in Table 3A and B, respectively.
10-peptide classification model performance. (A) Classifier performance on individual samples from all four ARV cohorts as represented by LOOCV confusion matrix, with (B) identity and contribution (weight) for 10 peptides, with amino acid sequence in one-letter notation. Average peptide length is 11.8 amino acids, with range between 8 and 30 residues. Negative weight values indicate down-regulation upon infection.
| A: 10-peptide classification model performance on ARV challenge cohorts | ||
|---|---|---|
| Actual/predicted | Othersa | Inf at T |
| Othersa | 2 | |
| Inf at T | 10 | |
| aInf-BL and Uninf-T; Confusion matrix based on LOOCV using 10 peptide (9 protein) classification model. | ||