| Literature DB >> 30294592 |
Xiaojing Zheng1, Catherine M O'Connell1, Wujuan Zhong2, Taylor B Poston1, Harold C Wiesenfeld3,4, Sharon L Hillier3,4, Maria Trent5, Charlotte Gaydos6, George Tseng7, Brandie D Taylor8, Toni Darville1.
Abstract
Sexually transmitted infection (STI) of the upper reproductive tract can result in inflammation and infertility. A biomarker of STI-induced upper tract inflammation would be significant as many women are asymptomatic and delayed treatment increases risk of sequelae. Blood mRNA from 111 women from three cohorts was profiled using microarray. Unsupervised analysis revealed a transcriptional profile that distinguished 9 cases of STI-induced endometritis from 18 with cervical STI or uninfected controls. Using a hybrid feature selection algorithm we identified 21 genes that yielded maximal classification accuracy within our training dataset. Predictive accuracy was evaluated using an independent testing dataset of 5 cases and 10 controls. Sensitivity was evaluated in a separate test set of 12 women with asymptomatic STI-induced endometritis in whom cervical burden was determined by PCR; and specificity in an additional test set of 15 uninfected women with pelvic pain due to unknown cause. Disease module preservation was assessed in 42 women with a clinical diagnosis of pelvic inflammatory disease (PID). We also tested the ability of the biomarker to discriminate STI-induced endometritis from other diseases. The biomarker was 86.7% (13/15) accurate in correctly distinguishing cases from controls in the testing dataset. Sensitivity was 83.3% (5/6) in women with high cervical Chlamydia trachomatis burden and asymptomatic endometritis, but 0% (0/6) in women with low burden. Specificity in patients with non-STI-induced pelvic pain was 86.7% (13/15). Disease modules were preserved in all 8 biomarker predicted cases. The 21-gene biomarker was highly discriminatory for systemic infections, lupus, and appendicitis, but wrongly predicted tuberculosis as STI-induced endometritis in 52.4%. A 21-gene biomarker can identify asymptomatic women with STI-induced endometritis that places them at risk for chronic disease development and discriminate STI-induced endometritis from non-STI pelvic pain and other diseases.Entities:
Keywords: Chlamydia; biomarker; gonorrhea; mRNA; pelvic inflammatory disease
Mesh:
Substances:
Year: 2018 PMID: 30294592 PMCID: PMC6158555 DOI: 10.3389/fcimb.2018.00307
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Figure 1Diagram of study organization. Blood transcriptional mRNA profiles from cases and controls were analyzed in a training dataset. Identified classifier genes were subsequently validated in an independent testing dataset. The classifier genes were also evaluated in three additional independent patient datasets for sensitivity, specificity, and disease module preservation, respectively. †Cases: women with symptoms consistent with PID and with biopsy confirmed endometrial STI with N. gonorrhoeae and/or C. trachomatis and/or M. genitalium and endometritis. ‡‡Controls: asymptomatic women who were uninfected or infected at the cervix only, with no endometrial inflammation. *Participants recruited from TRAC cohort. **Participants recruited from ACE cohort. ***Participants recruited from TECH-N cohort comprised women with symptoms of PID who had cervical microbiology determined but lacked endometrial data.
Demographic, clinical, and microbiological characteristics of training and testing datasets.
| 9 | 18 | 5 | 10 | |||
| 23 (20–37) | 21 (18–35) | 0.288 | 21 (18–25) | 19 (18–22) | 0.117 | |
| 0.108 | 0.341 | |||||
| African American | 3 (33) | 10 (56) | 5 (100) | 6 (60) | ||
| White | 5 (56) | 2 (11) | 0 (0) | 0 (0) | ||
| American Indian/Alaska Native | 0 (0) | 1 (6) | 0 (0) | 0 (0) | ||
| Multiracial | 1 (11) | 5 (28) | 0 (0) | 3 (30) | ||
| Other | 0 (0) | 0 (0) | 0 (0) | 1 (10) | ||
| 0.333 | 0.524 | |||||
| Hispanic or Latino | 1 (11) | 0 (0) | 0 (0) | 2 (20) | ||
| Not Hispanic or Latino | 8 (89) | 18 (100) | 5 (100) | 8 (80) | ||
| 0.628 | 1 | |||||
| Single | 8 (89) | 13 (72) | 5 (100) | 9 (90) | ||
| Living with partner (at least 4 months) | 1 (11) | 5 (28) | 0 (0) | 1 (10) | ||
| 0.36 | 0.417 | |||||
| < HS graduate | 2 (22) | 3 (17) | 0 (0) | 2 (20) | ||
| HS graduate of GED | 2 (22) | 8 (44) | 3 (60) | 4 (40) | ||
| some college | 2 (22) | 6 (33) | 1 (20) | 4 (40) | ||
| college graduate | 2 (22) | 1 (6) | 0 (0) | 0 (0) | ||
| vocational | 1 (12) | 0 (0) | 1 (20) | 0 (0) | ||
| 0.439 | 0.529 | |||||
| None | 1 (11) | 6 (33) | 3 (60) | 1 (10) | ||
| Private | 3 (33) | 2 (11) | 0 (0) | 1 (10) | ||
| Public | 0 (0) | 0 (0) | 0 (0) | 1 (10) | ||
| Don't know | 0 (0) | 0 (0) | 0 (0) | 1 (10) | ||
| Medicaid HMO | 5 (56) | 9 (50) | 2 (40) | 5 (50) | ||
| Other | 0 (0) | 1 (6) | 0 (0) | 1 (10) | ||
| 0.268 | 1 | |||||
| Yes | 0 (0) | 4 (22) | 0 (0) | 0 (0) | ||
| No | 9 (100) | 14 (78) | 5 (100) | 10 (100) | ||
| 0.106 | 0.1305 | |||||
| Nugent Score 0–3 | 1 (11) | 7 (39) | 1 (20) | 6 (60) | ||
| Nugent Score 4–6 | 1 (11) | 5 (28) | 0 (0) | 2 (20) | ||
| Nugent Score 7–10 | 7 (78) | 6 (33) | 4 (80) | 2 (20) | ||
| Oral contraceptive pill | 0.333 | 0.524 | ||||
| Yes | 1 (11) | 0 (0) | 0 (0) | 2 (20) | ||
| No | 8 (89) | 18 (100) | 5 (100) | 8 (80) | ||
| Depot medroxyprogesterone acetate | 1 | 0.6 | ||||
| Yes | 0 (0) | 1 (6) | 1 (20) | 4 (40) | ||
| No | 9 (100) | 17 (94) | 4 (80) | 6 (60) | ||
| Intrauterine device | 0.333 | 1 | ||||
| Yes | 1 (11) | 0 (0) | 0 (0) | 0 (0) | ||
| No | 8 (89) | 18 (100) | 5 (100) | 10 (100) | ||
| Condoms | 0.695 | 1 | ||||
| Yes | 4 (44) | 10 (56) | 4 (80) | 7 (70) | ||
| No | 5 (56) | 8 (44) | 1 (20) | 3 (30) | ||
Figure 2Unsupervised hierarchical clustering and principal component analysis (PCA) revealed distinctive blood mRNA transcriptional profiles separating cases from controls. Unsupervised hierarchical clustering of blood transcriptional profiles of women in training (A) and testing (B) datasets using 4952 transcripts. Samples were ordered by hierarchical clustering (Spearman correlation with average linkage) creating a condition tree, upper horizontal edge of heat map; study groupings (clinical phenotypes) are the colored blocks on the top of each profile. Heat map rows are genes; columns are participants. Principle component analysis of the variance in mRNA expression of the subjects depicted in the heat maps in training (C) and testing (D) datasets, using 4,952 transcripts and the same color scheme with each colored square depicting one subject. The x axis represents the first principal component, PC1, which accounts for the largest variance of mRNA expression, and the y axis, PC2, explains the second largest variance.
A list of 21 classifiers of STI PID using blood gene expression arrays.
| AIM2 | 1.31E-05 | 1.32E-08 | 2.06 |
| DSC2 | 1.65E-05 | 3.02E-08 | 2.3 |
| SIGLEC5 | 1.65E-05 | 2.81E-08 | 1.97 |
| CASP1 | 1.87E-05 | 6.99E-08 | 1.58 |
| CD58 | 1.87E-05 | 5.34E-08 | 1.53 |
| GBP2 | 1.87E-05 | 7.90E-08 | 1.47 |
| RNF19B | 1.87E-05 | 8.15E-08 | 1.44 |
| LIN7A | 4.47E-05 | 5.87E-07 | 1.6 |
| ADM | 4.83E-05 | 7.22E-07 | 2.48 |
| NAMPT | 5.85E-05 | 1.08E-06 | 2.09 |
| LOC440731 | 6.76E-05 | 1.48E-06 | 1.61 |
| CYB5R4 | 6.88E-05 | 1.56E-06 | 1.46 |
| PLAUR | 9.56E-05 | 2.93E-06 | 1.56 |
| FAS | 0.00022 | 1.30E-05 | 1.39 |
| LYSMD2 | 0.000698 | 8.78E-05 | 1.4 |
| RPS15A | 0.009756 | 0.0032 | 1.35 |
| FAIM3 | 1.87E-05 | 8.15E-08 | −1.89 |
| CD79A | 3.33E-05 | 3.29E-07 | −1.96 |
| TCL1A | 6.67E-05 | 1.41E-06 | −1.85 |
| EEF1D | 0.000134 | 5.55E-06 | −1.54 |
| TSPAN3 | 0.001352 | 0.00025 | −1.35 |
Figure 3Predicted probability of STI-induced endometritis in testing datasets using a 21-gene signature and SVM learning algorithm. Each dot represents one subject. The x-axis indicates the biopsy confirmed groups of subjects and y-axis indicates the predicted probability of STI-induced endometritis. The dotted line corresponds to a predicted probability cutoff of 0.5. If the predicted probability of PID is > 0.5, we consider the predicted response as high risk.
Differential expression of 21 genes transcripts detected in blood using nCounter expression assay (NanoString Technologies) in training and testing datasets.
| AIM2 | 5.54 | 2.7 |
| DSC2 | 6.78 | 3.99 |
| SIGLEC5 | 2.91 | 2.75 |
| CASP1 | 1.03 | 1.81 |
| CD58 | 5.55 | 1.8 |
| GBP2 | 1.14 | 2.01 |
| RNF19B | 3.85 | 1.74 |
| LIN7A | 1.40 | 2.47 |
| ADM | 1.11 | 3.74 |
| NAMPT | 5.87 | 2.9 |
| LOC440731 | 5.42 | 2.98 |
| CYB5R4 | 4.21 | 1.8 |
| PLAUR | 2.59 | 1.98 |
| FAS | 1.40 | 1.72 |
| LYSMD2 | 1.45 | 1.57 |
| RPS15A | 4.29 | 1.71 |
| FAIM3 | 6.66 | −1.85 |
| CD79A | 7.66 | −2.2 |
| TCL1A | 3.50 | −2.62 |
| EEF1D | 2.26 | −1.26 |
| TSPAN3 | 7.37 | −1.61 |
| AIM2 | 0.0104 | 1.78 |
| DSC2 | 0.004 | 3.25 |
| SIGLEC5 | 0.0034 | 2.15 |
| CASP1 | 0.0559 | 1.47 |
| CD58 | 0.0001 | 1.84 |
| GBP2 | 0.0649 | 1.48 |
| RNF19B | 0.0004 | 1.78 |
| LIN7A | 0.0001 | 2.99 |
| ADM | 0.0008 | 3.64 |
| NAMPT | 0.0001 | 3.05 |
| LOC440731 | 0.0315 | 2.16 |
| CYB5R4 | 0.0015 | 1.78 |
| PLAUR | 0.0008 | 2.03 |
| FAS | 0.028 | 1.49 |
| LYSMD2 | 0.0634 | 1.45 |
| RPS15A | 0.0466 | 1.54 |
| FAIM3 | 0.0564 | −1.5 |
| CD79A | 0.0564 | −1.53 |
| TCL1A | 0.0629 | −1.65 |
| EEF1D | 0.0258 | −1.25 |
| TSPAN3 | 0.0243 | −1.32 |
Figure 4Predicted probability using 21-gene signature and SVM of STI-induced endometritis in a target dataset of asymptomatic subjects with biopsy proven subclinical CT/GC-induced endometritis. Each dot represents one subject. The x-axis indicates the log10 cervical C. trachomatis load and the y-axis indicates the predicted probability of STI-induced endometritis.
Figure 5Predicted probability using 21-gene signature and SVM of STI-induced endometritis in a dataset of women who were symptomatic but were uninfected. Subsets of patients within the dataset were positive and negative for histologic endometritis. Each dot represents one subject. The x-axis indicates two groups of women with pelvic pain without STI. One group has normal histology, the other group has chronic endometritis; y-axis indicates the predicted probability of STI-induced endometritis.
Figure 6Zsummary statistics reveal module preservation of blood profiles from women with biopsy-confirmed CT/GC-induced endometritis compared to women with biomarker-predicted STI-induced endometritis from independent TECH-N cohort. Colored circles correspond with highly correlated gene modules identified in biopsy-confirmed CT/GC-induced endometritis. The x-axis indicates the number of genes in each module; y axis is the preservation Zsummary value. The dotted lines with cutoff Zsummary of 2 and 10 indicate preservation and high level preservation respectively.
Preservation and functional annotation of significantly differentially expressed modules between cases and controls by Gene Set Enrichment Analysis (GSEA).
| Turquoise | 32 | 1.58E-84 | 5.75E-15 | Overexpression of myeloid cell genes |
| Yellow | 20 | 2.47E-34 | 6.29E-14 | Suppression of T-cell specific genes |
| Brown | 28 | 4.07E-21 | 3.55E-10 | Suppression of protein synthesis and mitochondrial oxidative phosphorylation |
| Midnight Blue | 8 | 3.64E-06 | 2.46E-02 | Overexpression of NF-κB and Integrin-linked kinase signaling |
| Purple | 19 | 1.76E-03 | Suppression of cell cycle control and regulation |
FDR, False discovery rate;
NS, Not significant (FDR > 0.05).
Figure 7The 21-gene biomarker distinguishes STI-induced endometritis from other infectious and inflammatory conditions. (A) The number of samples within each disease and control group predicted as STI-induced endometritis is summarized in the table. (B) The predicted probability of STI-induced endometritis in each disease and control group using a 21-gene biomarker. Each dot represents one subject. The x-axis indicates the groups of subjects and y-axis indicates the predicted probability of STI-induced endometritis. The dotted line corresponds to a predicted probability cutoff of 0.5. (C) Unsupervised hierarchical clustering using 21-genes revealed distinctive blood mRNA transcriptional profiles separating STI-induced endometritis from other diseases and controls. Study groupings (clinical phenotypes) are the colored blocks on the top of each profile. Heat map rows are genes; columns are participants. Transformed expression levels are indicated by color scale, with red representing relative high expression and blue relative low expression.
Top 10 up-regulated pathways in acute appendicitis patients vs. controls with non-specific abdominal pain.
| 3-phosphoinositide biosynthesis | 1.51E-05 | 0.001778 | 0.1 | PPFIBP2, PIP4K2B, PPP4R1, HACD2, PTPN12, UBLCP1, SACM1L, PTEN, MTMR6, PPP1CC, MTMR4, SYNJ1, PPP1R12A, DUSP1, PIK3CG, PPM1F, PIK3AP1, RNGTT, PTPN22, ATM |
| Production of nitric oxide and reactive oxygen species in macrophages | 3.31E-05 | 0.001778 | 0.0979 | RAP1B, MAP3K1, IFNGR1, NCF4, JAK2, IRF1, TLR2, PPP1CC, LYZ, RHOQ, RHOB, PPP1R12A, PIK3CG, CAT, NCF2, CYBB, MAP3K2, ATM, PRKCA |
| Inflammasome pathway | 3.72E-05 | 0.001778 | 0.3 | IL18, NLRP3, AIM2, CASP1, CTSB, CASP5 |
| ILK signaling | 4.07E-05 | 0.001778 | 0.0964 | PXN, PDPK1, VIM, HIF1A, CREB5, PTEN, NCK2, MYC, RHOQ, RHOB, PPP1R12A, LIMS1, CDC42, PIK3CG, CREB1, RSU1, PTGS2, ITGB5, ATM |
| Role of pattern recognition receptors in recognition of bacteria and viruses | 6.17E-05 | 0.002455 | 0.109 | NLRP3, RNASEL, TLR2, IFIH1, CLEC7A, IL18, TLR5, PIK3CG, CREB1, TLR1, CASP1, OSM, EIF2AK2, ATM, PRKCA |
| PI3K/AKT signaling | 8.32E-05 | 0.00309 | 0.112 | YWHAG, ITGA5, PDPK1, JAK2, PTEN, YWHAQ, HSP90B1, SYNJ1, LIMS1, PIK3CG, HSP90AA1, CDKN1B, PTGS2, MCL1 |
| TREM1 signaling | 0.000204 | 0.005754 | 0.133 | TLR2, TREM1, IL18, NLRP3, TLR5, TLR1, CASP1, ITGA5, JAK2, CASP5 |
| Fcγ receptor-mediated phagocytosis in macrophages and monocytes | 0.000295 | 0.007413 | 0.118 | NCK2, PXN, ACTR3, CBL, FCGR2A, CDC42, PIK3CG, ARPC5, FYB1, PRKCA, PTEN |
| Integrin signaling | 0.000479 | 0.010715 | 0.0822 | RAP1B, PXN, ASAP1, ARPC5, ITGA5, PTEN, NCK2, WIPF1, RHOQ, ACTR3, RHOB, PPP1R12A, LIMS1, CDC42, PIK3CG, NEDD9, ITGB5, ATM |
| Phagosome maturation | 0.010715 | 0.010715 | 0.0946 | TUBB1, DCTN4, VPS41, CTSO, TUBB2A, TUBA4A, ATP6V1A, LAMP2, CTSH, HLA-DRB3, NCF2, CYBB, CTSB, CTSC |
P-value: The p value identifies statistically significant over-representation of focus genes in a given process. Over represented functional or pathway processes are processes which have more focus genes than expected by chance.
False Discovery Rate by Benjamini–Hochberg procedure.
Ratio: The ratio indicates the number of differentially expressed genes that map to the pathway divided by the total number of genes that map to the canonical pathway.
Top 10 down-regulated pathways in acute appendicitis patients vs. controls with non-specific abdominal pain.
| Mitochondrial dysfunction | 0.000145 | 0.016982 | 0.0468 | FIS1, COX17, UCP2, ATP5D, UQCRH, NDUFB7, NDUFA12, NDUFA3 |
| Systemic lupus erythematosus signaling | 0.000182 | 0.016982 | 0.04 | CD3E, IL1RN, SNRPF, IL6R, SNRPB2, SNRNP70, PLCG1, CD79A, HLA-F |
| Calcium-induced T lymphocyte apoptosis | 0.000295 | 0.019498 | 0.0758 | CD3E, ZAP70, PLCG1, ATP2A3, EP300 |
| Remodeling of epithelial adherens junctions | 0.000363 | 0.019498 | 0.0725 | RAB5C, ARPC1B, TUBA1C, TUBB, ACTG1 |
| Oxidative phosphorylation | 0.000417 | 0.019498 | 0.055 | COX17, ATP5D, UQCRH, NDUFB7, NDUFA12, NDUFA3 |
| CTLA4 signaling in cytotoxic T lymphocytes | 0.001862 | 0.066069 | 0.0505 | FYN, AP2M1, CD3E, ZAP70, PLCG1 |
| Interferon signaling | 0.003548 | 0.100462 | 0.0833 | IFI6, PSMB8, BAX |
| EIF2 signaling | 0.003548 | 0.100462 | 0.0317 | RPL4, RPS26, RPL17, RPS15A, RPS29, RPL10A, RPLP0 |
| Caveolar-mediated endocytosis signaling | 0.003631 | 0.100462 | 0.0563 | FYN, RAB5C, ACTG1, ITGAX |
| Antigen presentation pathway | 0.004571 | 0.10617 | 0.0789 | PSMB8, CD74, HLA-F |
False Discovery Rate by Benjamini-Hochberg procedure.
P-value: The p value identifies statistically significant over-representation of focus genes in a given process. Over represented functional or pathway processes are processes which have more focus genes than expected by chance.
Ratio: The ratio indicates the number of differentially expressed genes that map to the pathway divided by the total number of genes that map to the canonical pathway.