| Literature DB >> 30112357 |
Victor F Oliveira1, Lauro A G Moares2, Ester A Mota1, Liana K Jannotti-Passos3, Paulo M Z Coelho3, Ana C A Mattos3, Flávia F B Couto3, Brian E Caffrey4, Annalisa Marsico4,5, Renata Guerra-Sá1.
Abstract
Long noncoding RNAs (lncRNAs) are transcripts generally longer than 200 nucleotides with no or poor protein coding potential, and most of their functions are also poorly characterized. Recently, an increasing number of studies have shown that lncRNAs can be involved in various critical biological processes such as organism development or cancer progression. Little, however, is known about their effects in helminths parasites, such as Schistosoma mansoni. Here, we present a computational pipeline to identify and characterize lncRNAs from RNA-seq data with high confidence from S. mansoni adult worms. Through the utilization of different criteria such as genome localization, exon number, gene length, and stability, we identified 170 new putative lncRNAs. All novel S. mansoni lncRNAs have no conserved synteny including human and mouse. These closest protein coding genes were enriched in 10 significant Gene Ontology terms related to metabolism, transport, and biosynthesis. Fifteen putative lncRNAs showed differential expression, and three displayed sex-specific differential expressions in praziquantel sensitive and resistant adult worm couples. Together, our method can predict a set of novel lncRNAs from the RNA-seq data. Some lncRNAs are shown to be differentially expressed suggesting that those novel lncRNAs can be given high priority in further functional studies focused on praziquantel resistance.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30112357 PMCID: PMC6077669 DOI: 10.1155/2018/1264697
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Integrative computational pipeline for the identification of lncRNAs in S. mansoni. (a) The raw RNA-seq data was preprocessed, aligned with STAR, and assembled using Cufflinks in ab initio mode. (b) The output was analyzed in several steps and algorithms were written in the Python and Perl programming languages. The numbers in parentheses represent the number of transcripts after each filtering step.
Figure 2Features of S. mansoni adult lncRNAs. (a) Genomic localization of lncRNAs, (b) number of exons per transcripts, (c) transcripts length, (d) log2 FPKM expression.
Figure 3Relative expression of lncRNAs by RT-qPCR. (a) Fifteen Sm-lncRNAs and the LTR retrotransposon were selected for validation by RT-qPCR at the parasite adult stage (∗significantly different from all other lncRNAs and LTR, ∗∗significantly different from all lncRNAs). (b) The first three lncRNAs and the LTR retrotransposon were selected to verify the expression between sexes into four groups Control-Male, Control-Female, PZQ-Male, and PZQ-Female (PZQ-Male and PZQ-Female are related to S. mansoni LE praziquantel-resistant strains) (∗significantly different from Control-Male, ∗∗ different from PZQ-Female). Two-way ANOVA and Tukey's posttest were used for calculating the statistical significance (P value ≤ 0.05).
Figure 4GO enrichment analysis of the lncRNA-target genes. The top 10 most enriched GO categories were calculated and plotted: (a) biological process; (b) molecular function; (c) cellular component.