| Literature DB >> 33969037 |
Andrew Marete1, Olivier Ariel1,2, Eveline Ibeagha-Awemu1, Nathalie Bissonnette1.
Abstract
Mycobacterium avium ssp. paratuberculosis (MAP) causes chronic enteritis in most ruminants. The pathogen MAP causes Johne's disease (JD), a chronic, incurable, wasting disease. Weight loss, diarrhea, and a gradual drop in milk production characterize the disease's clinical phase, culminating in death. Several studies have characterized long non-coding RNA (lncRNA) in bovine tissues, and a previous study characterizes (lncRNA) in macrophages infected with MAP in vitro. In this study, we aim to characterize the lncRNA in macrophages from cows naturally infected with MAP. From 15 herds, feces and blood samples were collected for each cow older than 24 months, twice yearly over 3-5 years. Paired samples were analyzed by fecal PCR and blood ELISA. We used RNA-seq data to study lncRNA in macrophages from 33 JD(+) and 33 JD(-) dairy cows. We performed RNA-seq analysis using the "new Tuxedo" suite. We characterized lncRNA using logistic regression and multilayered neural networks and used DESeq2 for differential expression analysis and Panther and Reactome classification systems for gene ontology (GO) analysis. The study identified 13,301 lncRNA, 605 of which were novel lncRNA. We found seven genes close to differentially expressed lncRNA, including CCDC174, ERI1, FZD1, TWSG1, ZBTB38, ZNF814, and ZSCAN4. None of the genes associated with susceptibility to JD have been cited in the literature. LncRNA target genes were significantly enriched for biological process GO terms involved in immunity and nucleic acid regulation. These include the MyD88 pathway (TLR5), GO:0043312 (neutrophil degranulation), GO:0002446 (neutrophil-mediated immunity), and GO:0042119 (neutrophil activation). These results identified lncRNA with potential roles in host immunity and potential candidate genes and pathways through which lncRNA might function in response to MAP infection.Entities:
Keywords: Johne's disease; MAP disease; bovine; genomics; long non-coding RNA; macrophages; paratuberculosis (Mycobacterium avium ssp. paratuberculosis)
Year: 2021 PMID: 33969037 PMCID: PMC8100051 DOI: 10.3389/fvets.2021.639053
Source DB: PubMed Journal: Front Vet Sci ISSN: 2297-1769
Figure 1(A) The study workflow from raw fastq to lncRNA identification and DE of the nearest genes. (B) Two-graph receiver operating characteristic curve (ROC) to determine the optimal coding-probability cutoff value. (C) Combinatorial effects of Fickett score, hexamer score, and ORF size on coding transcripts (brown dots) and non-coding genes (blue dots). (D) Mapping statistics (vertical axis) showing unique (red), multi-mapped (yellow-green), and overall (blue) alignment rate for all cows (horizontal axis). (E) Snapshot of lncRNA statistics identified by two tools with known lncRNAs in parentheses.
Location of the highest DE lncRNAs in macrophages from JD(+) vs. JD(–) cows identified on Bos taurus autosomes 1–29 and chromosome X.
| 1 | 128523379 | 128576527 | 53.15 | — | 10198.41 | 0.25 (±0.08) | 3.23 | 1.25e-03 | |
| 2 | 59365700 | 59366021 | 0.32 | 10.25 | 43.3 | −0.23 (±0.15) | −1.48 | 1.30e-01 | |
| 3 | 120249265 | 120249575 | 0.31 | 20.37 | 56.44 | −0.64 (±0.22) | −2.91 | 3.65e-03 | |
| 4 | 7843837 | 8440753 | 596.92 | 49.92 | 3159.91 | 0.53 (±0.18) | 2.92 | 3.45e-03 | |
| 5 | 28846068 | 28846987 | 0.92 | 0.12 | 111.3 | −0.34 (±0.14) | −2.36 | 1.00e-02 | |
| 6 | 52972592 | 52972837 | 0.24 | 964.81 | 8.3 | 1.12 (±0.54) | 2.09 | 3.00e-02 | |
| 7 | 21107248 | 21107521 | 0.27 | 7.63 | 35.87 | −0.59 (±0.21) | −2.78 | 5.51e-03 | |
| 8 | 23054554 | 23054855 | 0.3 | 0.01 | 15.44 | −1.41 (±0.79) | −1.79 | 7.00e-02 | |
| 9 | 89536157 | 89567424 | 31.27 | — | 1033.91 | 1.23 (±0.36) | 3.4 | 6.65e-04 | |
| 10 | 98235095 | 98343170 | 108.08 | — | 218.23 | 1.49 (±0.42) | 3.51 | 4.45e-04 | |
| 11 | 43739783 | 43741807 | 2.02 | 24.14 | 10148.11 | 0.18 (±0.1) | 1.86 | 6.00e-02 | |
| 12 | 52352608 | 52352826 | 0.22 | 0.28 | 18.24 | −0.49 (±0.22) | −2.19 | 2.00e-02 | |
| 13 | 46917583 | 46917875 | 0.29 | 82.89 | 14.21 | −0.32 (±0.23) | −1.37 | 1.70e-01 | |
| 14 | 57874677 | 57874911 | 0.23 | 89.6 | 5.22 | 1.09 (±0.41) | 2.66 | 7.89e-03 | |
| 15 | 22944062 | 22952348 | 8.29 | 23.66 | 5.9 | 0.67 (±0.29) | 2.3 | 2.00e-02 | |
| 16 | 44005434 | 44008857 | 3.42 | 0.77 | 2764.31 | −0.34 (±0.19) | −1.79 | 7.00e-02 | |
| 17 | 6578427 | 6636825 | 58.4 | — | 129.96 | 0.64 (±0.27) | 2.34 | 1.00e-02 | |
| 18 | 65524855 | 65525608 | 0.75 | 1.97 | 314.44 | 0.36 (±0.11) | 3.29 | 9.86e-04 | |
| 19 | 43812878 | 43813292 | 0.41 | 0.64 | 1843.89 | 0.29 (±0.15) | 1.94 | 5.00e-02 | |
| 20 | 5251758 | 5252057 | 0.3 | 13.05 | 12.56 | −1.23 (±0.37) | −3.35 | 8.22e-04 | |
| 21 | 14494659 | 14522280 | 27.62 | 10.08 | 18545.49 | 0.21 (±0.12) | 1.8 | 7.00e-02 | |
| 22 | 58285995 | 58286273 | 0.28 | 0.45 | 9.15 | 1 (±0.3) | 3.39 | 6.97e-04 | |
| 23 | 9006094 | 9006521 | 0.43 | 1.1 | 547.63 | −0.2 (±0.1) | −1.93 | 5.00e-02 | |
| 24 | 42014626 | 42014991 | 0.36 | 6.48 | 8.31 | −1.58 (±0.04) | −4.19 | 2.81e-05 | |
| 25 | 32467533 | 32467886 | 0.35 | — | 25187.9 | 0.5 (±0.04) | 1.23 | 2.10e-01 | |
| 26 | 21892569 | 21892785 | 0.22 | 1.62 | 8.22 | −0.75 (±0.01) | −1.28 | 2.00e-01 | |
| 27 | 24191077 | 24191301 | 0.22 | 1.82 | 27.11 | 1.11 (±0.3) | 3.77 | 1.63e-04 | |
| 28 | 280436 | 280721 | 0.28 | 0.76 | 108.71 | 0.28 (±0.01) | 1.42 | 1.50e-01 | |
| 29 | 30263006 | 30263284 | 0.28 | 44.06 | 1.65 | −1.27 (±0.5) | −2.35 | 1.00e-02 | |
| X | 137567452 | 137567677 | 0.22 | 10.99 | 2.4 | −1.88 (±0.6) | −2.87 | 4.07e-03 |
Chrom: Bos taurus chromosome.
Length (kb): length of the lncRNA transcript calculated as the difference between the start and end positions expressed as thousand base pairs.
Closest gene (kb): the distance to the closest coding gene.
Gene: the gene closest to the significant transcript.
Mean: the average of normalized counts for all cows.
Fold change: for a particular gene, the log.
Wald statistic: Results from a likelihood ratio test comparing the estimated standard error of a log.
P-value: False discovery rate adjusted P-value at 0.05.
Figure 2Distribution of putative, known, and novel lncRNAs in chromosomes 1–29 and X.
Figure 3Normalized count distributions in JD(+) macrophages (purple) and JD(–) macrophages (blue) for 10 highly expressed genes close to lncRNA transcripts.
The top 10 most significant DE lncRNAs in macrophages ranked by the FDR corrected P-value.
| 24 | 42014626 | 42014991 | 0.36 | 6.48 | 8.31 | −1.58 (± 0.4) | −4.19 | 2.81e-05 | ||
| 27 | 24191077 | 24191301 | 0.22 | 1.82 | 27.11 | 1.11 (± 0.3) | 3.77 | 1.63e-04 | ||
| 10 | 98235095 | 98343170 | 108.08 | — | 218.23 | 1.49 (± 0.4) | 3.51 | 4.45e-04 | ||
| 9 | 89536157 | 89567424 | 31.27 | — | 1033.91 | 1.23 (± 0.4) | 3.4 | 6.65e-04 | ||
| 22 | 58285995 | 58286273 | 0.28 | 0.45 | 9.15 | 1 (± 0.3) | 3.39 | 6.97e-04 | ||
| 20 | 5251758 | 5252057 | 0.3 | 13.05 | 12.56 | −1.23 (± 0.4) | −3.35 | 8.22e-04 | ||
| 18 | 65524855 | 65525608 | 0.75 | 1.97 | 314.44 | 0.36 (± 0.1) | 3.29 | 9.86e-04 | ||
| 18 | 65255175 | 65265109 | 9.93 | 13.44 | 1216.85 | 0.31 (± 0.1) | 3.29 | 1.01e-03 | ||
| 1 | 128523379 | 128576527 | 53.15 | — | 10198.41 | 0.25 (± 0.1) | 3.23 | 1.25e-03 | ||
| 4 | 7843837 | 8440753 | 596.92 | 49.92 | 3159.91 | 0.53 (± 0.2) | 2.92 | 3.45e-03 |
Dispersion: Visual representation of normalized count dispersion of lncRNA in a specific region of a selected chromosome in JD(+) macrophages (purple) and JD(–) macrophages (blue).
Length (kb): Length of the lncRNA transcript calculated as the difference between the start and end positions expressed as thousand base pairs.
Closest gene (kb): The distance from the significant lncRNA to the closest coding gene.
Gene: The closest gene to the significant lncRNA transcript. Genes expressed in bovine macrophages are in bold characters.
Mean: The average of normalized counts for all cows.
Fold change: For a particular gene, the log.
Wald statistic: Result of a likelihood ratio test comparing the estimated standard error of a log.
P-value: False discovery rate adjusted P-value at 0.05.
Enriched Reactome pathways using 255 DE lncRNA.
| Nuclear receptor transcription pathway | 10/107 | 5.84e-03 | 1.69e-05 | 1.53e-04 | |
| Deletions in the AMER1 gene destabilize the destruction complex | 1/1 | 6.79e-05 | 1.66e-02 | 7.63e-05 | |
| RNA polymerase I promoter escape | 4/64 | 4.35e-03 | 2.34e-02 | 1.53e-04 | |
| Misspliced GSK3beta mutants stabilize beta-catenin | 2/15 | 1.02e-03 | 2.66e-02 | 7.63e-05 | |
| Phosphorylation site mutants of CTNNB1 | 2/16 | 1.09e-03 | 3.00e-02 | 3.05e-04 | |
| Beta-catenin phosphorylation cascade | 2/19 | 1.29e-03 | 4.09e-02 | 3.05e-04 | |
| RUNX3 regulates YAP1-mediated transcription | 2/20 | 6.11e-04 | 4.48e-02 | 2.29e-04 | |
| Physiological factors | 2/21 | 8.83e-04 | 4.89e-02 | 3.05e-04 | |
| Defective ALG12 causes ALG12-CDG (CDG-1g) | 1/3 | 2.04e-04 | 4.89e-02 | 7.63e-05 | |
| MyD88 deficiency ( | 1/3 | 2.04e-04 | 4.89e-02 | 7.63e-05 | |
Ratio: the ratio of entities from this pathway to all Reactome entities.
P-value: the result of the statistical test for overrepresentation, for molecules of the type of the results selected which have been corrected for overrepresentation probability.
Reactions ratio: the ratio of reactions from this pathway to all Reactome reactions.
Genes found: some of the genes associated with the Reactome pathway. Genes expressed in bovine macrophages are in bold characters.
Gene ontology results using 255 DE lncRNAs.
| GO:0070062 | Extracellular exosome | 31.75 | 3.18e-25 | 37.53 | |
| GO:1903561 | Extracellular vesicle | 27.40 | 5.63e-24 | 32.39 | |
| GO:0043230 | Extracellular organelle | 25.97 | 1.59e-23 | 30.71 | |
| GO:0043227 | Membrane-bounded organelle | 1.36 | 2.86e-16 | 1.61 | |
| GO:0043226 | Organelle | 1.28 | 6.34e-16 | 1.51 | |
| GO:0005622 | Intracellular | 1.21 | 5.43e-15 | 1.42 | |
| GO:0031982 | Vesicle | 2.83 | 1.57e-13 | 3.34 | |
| GO:0005737 | Cytoplasm | 1.32 | 2.27e-12 | 1.56 | |
| GO:0043229 | Intracellular organelle | 1.24 | 3.84e-12 | 1.46 | |
| GO:0005515 | Protein binding | 2.29 | 1.15e-44 | 2.70 | |
| GO:0005488 | Binding | 1.37 | 3.18e-24 | 1.62 | |
| GO:0046872 | Metal ion binding | 1.57 | 4.98e-06 | 1.85 | |
| GO:0043169 | Cation binding | 1.53 | 9.96e-06 | 1.81 | |
| GO:0043167 | Ion binding | 1.34 | 1.10e-05 | 1.58 | |
| GO:0000978 | RNA polymerase II cis-regulatory region sequence-specific DNA binding | 2.12 | 1.16e-04 | 2.5 | |
| GO:0000987 | cis-Regulatory region sequence-specific DNA binding | 2.07 | 1.56e-04 | 2.44 | |
| GO:0043565 | Sequence-specific DNA binding | 1.81 | 1.57e-04 | 2.14 | |
| GO:0003677 | DNA binding | 1.59 | 2.03e-04 | 1.88 | |
| GO:0043312 | Neutrophil degranulation | 75.00 | 7.08e-12 | 141.90 | |
| GO:0019222 | Regulation of metabolic process | 1.53 | 1.25e-10 | 1.80 | |
| GO:0002283 | Neutrophil activation involved in immune response | 60.00 | 4.38e-10 | 70.95 | |
| GO:0050789 | Regulation of biological process | 1.22 | 8.09e-10 | 1.43 | |
| GO:0031323 | Regulation of cellular metabolic process | 1.55 | 9.18e-10 | 1.83 | |
| GO:0050794 | Regulation of cellular process | 1.22 | 2.46e-09 | 1.44 | |
| GO:0002446 | Neutrophil-mediated immunity | 42.86 | 3.20e-09 | 50.67 | |
| GO:0060255 | Regulation of macromolecule metabolic process | 1.51 | 3.81e-09 | 1.78 | |
| GO:0065007 | Biological regulation | 1.17 | 4.04e-09 | 1.38 | |
| GO:0048518 | Positive regulation of biological process | 1.53 | 7.12e-09 | 1.81 | |
| GO:0051171 | Regulation of nitrogen compound metabolic process | 1.53 | 1.04e-08 | 1.80 | |
| GO:0042119 | Neutrophil activation | 30.00 | 2.61e-08 | 35.47 | |
| GO:0080090 | Regulation of primary metabolic process | 1.49 | 3.38e-08 | 1.76 | |
| GO:0043299 | Leukocyte degranulation | 28.57 | 3.47e-08 | 33.78 | |
Id: gene ontology (GO) symbol accepted by the broader scientific community and sorted by ascending p-value.
Label: GO term accepted by the broader scientific community.
Ratio: the proportion of genes submitted and those found to be associated with the GO term.
Fold enrichment: statistical estimation of obtaining the GO term that is not attributed to random chance.
Genes: a snippet of genes associated with the GO domain, more detailed genes available in .