| Literature DB >> 35336723 |
Weihao Chen1, Xiaoyang Lv2, Weibo Zhang1, Tingyan Hu1, Xiukai Cao2, Ziming Ren1, Tesfaye Getachew3, Joram M Mwacharo3, Aynalem Haile3, Wei Sun1,2.
Abstract
It has long been recognized that enterotoxigenic Escherichia coli (ETEC) is the major pathogen responsible for vomiting and diarrhea. E. coli F17, a main subtype of ETEC, is characterized by high morbidity and mortality in young livestock. However, the transcriptomic basis underlying E. coli F17 infection has not been fully understood. In this study, RNA sequencing was performed to explore the expression profiles of circRNAs and miRNAs in the jejunum of E. coli F17-antagonism (AN) and -sensitive (SE) lambs. A total of 16,534 circRNAs and 271 miRNAs (125 novel miRNAs and 146 annotated miRNAs) were screened, and 214 differentially expressed (DE) circRNAs and 53 DE miRNAs were detected between the AN and SE lambs (i.e., novel_circ_0025840, novel_circ_0022779, novel_miR_107, miR-10b). Functional enrichment analyses showed that source genes of DE circRNAs were mainly involved in metabolic-related pathways, while target genes of DE miRNAs were mainly enriched in the immune response pathways. Then, a two-step machine learning approach combining Random Forest (RF) and XGBoost (candidates were first selected by RF and further assessed by XGBoost) was performed, which identified 44 circRNAs and 39 miRNAs as potential biomarkers (i.e., novel_circ_0000180, novel_circ_0000365, novel_miR_192, oar-miR-496-3p) for E. coli infection. Furthermore, circRNA-related and lncRNA-related ceRNA networks were constructed, containing 46 circRNA-miRNA-mRNA competing triplets and 630 lncRNA-miRNA-mRNA competing triplets, respectively. By conducting a serious of bioinformatic analyses, our results revealed important circRNAs and miRNAs that could be potentially developed as candidate biomarkers for intestinal inflammatory response against E. coli F17 infection; our study can provide novel insights into the underlying mechanisms of intestinal immunity.Entities:
Keywords: E. coli F17; ceRNA; circRNA; lamb; machine learning; miRNA
Year: 2022 PMID: 35336723 PMCID: PMC8945857 DOI: 10.3390/biology11030348
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Summary of the circRNA library.
| Sample | Raw Reads | Clean Reads | Mapping Rate (%) | Error Rate (%) | Q20 (%) | Q30 (%) | GC Content (%) |
|---|---|---|---|---|---|---|---|
| AN1 | 86,448,964 | 85,310,470 | 98.68 | 0.03 | 97.55 | 93.27 | 51.32 |
| AN2 | 82,985,976 | 82,314,372 | 99.19 | 0.03 | 97.00 | 91.76 | 46.32 |
| AN3 | 81,095,934 | 79,701,960 | 98.28 | 0.03 | 97.49 | 93.16 | 51.61 |
| AN4 | 94,502,330 | 93,722,960 | 99.18 | 0.03 | 97.25 | 92.49 | 48.07 |
| AN5 | 84,496,940 | 83,246,004 | 98.52 | 0.03 | 97.45 | 93.06 | 50.25 |
| AN6 | 83,613,850 | 82,012,052 | 98.08 | 0.03 | 97.49 | 93.19 | 54.43 |
| SE1 | 82,325,980 | 81,420,394 | 98.90 | 0.03 | 97.31 | 92.67 | 52.18 |
| SE2 | 83,101,628 | 81,439,640 | 98.00 | 0.03 | 97.39 | 93.00 | 48.07 |
| SE3 | 83,731,304 | 82,241,834 | 98.22 | 0.03 | 97.45 | 93.09 | 49.79 |
| SE4 | 80,794,124 | 79,478,658 | 98.37 | 0.03 | 96.90 | 91.99 | 56.07 |
| SE5 | 92,174,900 | 90,902,860 | 98.62 | 0.03 | 97.35 | 92.99 | 49.55 |
| SE6 | 84,577,884 | 83,190,218 | 98.36 | 0.03 | 96.54 | 91.14 | 49.89 |
Note: AN and SE represent antagonism group and sensitive group, respectively. Error rate% represents overall sequencing error rate. Quality score (Q) represent the probability of incorrect based call.
Summary of the miRNA library.
| Sample Name | Raw Reads | Clean Reads | Clean Bases | Error Rate (%) | Q20 (%) | Q30 (%) | GC Content (%) |
|---|---|---|---|---|---|---|---|
| AN1 | 16,273,383 | 16,059,313 | 98.68 | 0.01 | 99.49 | 98.16 | 49.67 |
| AN2 | 13,434,545 | 13,274,363 | 98.81 | 0.01 | 99.50 | 98.35 | 48.56 |
| AN3 | 14,558,297 | 14,136,725 | 97.10 | 0.01 | 99.06 | 96.65 | 48.87 |
| AN4 | 11,883,680 | 11,545,885 | 97.16 | 0.01 | 99.10 | 96.96 | 49.54 |
| AN5 | 15,402,425 | 15,008,710 | 97.44 | 0.01 | 99.04 | 96.79 | 49.09 |
| AN6 | 11,625,621 | 10,918,820 | 93.92 | 0.01 | 99.30 | 97.26 | 50.01 |
| SE1 | 18,148,953 | 17,949,815 | 98.90 | 0.01 | 99.49 | 98.30 | 48.85 |
| SE2 | 13,392,060 | 13,198,054 | 98.55 | 0.01 | 99.32 | 97.92 | 49.46 |
| SE3 | 10,839,760 | 10,594,527 | 97.74 | 0.01 | 99.34 | 97.74 | 49.80 |
| SE4 | 13,718,249 | 13,297,138 | 96.93 | 0.01 | 99.02 | 97.04 | 49.09 |
| SE5 | 12,498,474 | 11,906,416 | 95.26 | 0.01 | 98.97 | 96.90 | 50.58 |
| SE6 | 11,896,114 | 11,605,384 | 97.56 | 0.01 | 99.33 | 97.73 | 48.81 |
Note: AN and SE represent antagonism group and sensitive group, respectively. Error rate% represents overall sequencing error rate. Quality score (Q) represents the probability of incorrect based call.
Figure 1Length distribution of the identified circRNAs (A) and miRNAs (B).
Figure 2Volcano plot of differentially expressed (DE) circRNAs (A) and DE miRNAs (B).
Figure 3Top annotated GO terms (A) and top enriched KEGG pathways (B) of source genes of DE circRNAs. Top annotated GO terms (C) and top enriched KEGG pathways (D) of target genes of DE miRNAs.
Figure 4Gain value of top circRNAs (A) and miRNAs (B) selected by Random Forest-XGBoost.
Figure 5ceRNA networks of circRNA-miRNA-mRNA (A) and lncRNA-miRNA-mRNA (B), where the “V” shape (blue), triangle (blue), and rectangle (red) represent circRNAs (lncRNAs), miRNAs, and mRNAs, respectively.
Figure 6Comparisons of the results of the RNA–seq and RT–qPCR analyses of selected circRNAs and miRNAs.