| Literature DB >> 33281881 |
Xinyi Zhang1, Tao Zhou1, Baohua Chen1, Huaqiang Bai1, Yulin Bai1, Ji Zhao1, Fei Pu1, Yidi Wu1, Lin Chen1, Yue Shi1, Qiaozhen Ke1,2, Weiqiang Zheng2, Jia Chen2, Peng Xu1,2,3,4.
Abstract
Large-scale transcription studies have revealed numerous lncRNAs (long non-coding RNAs). lncRNAs have been proposed to participate in the regulation of a diverse range of biological processes, including transcriptional regulation. Although lncRNAs have attracted increasing attention, the studies in large yellow croaker (Larimichthys crocea) are still rare, and they lack systematic analysis. In this study, 101 RNA-seq datasets varied in ages, sexes, and tissues were retrieved from the NCBI database to generate a comprehensive catalog of large yellow croaker transcriptome database. A set of 14,599 high-confidence lncRNAs from 13,673 loci were identified and characterized. Furthermore, RNA-seq datasets obtained from the infection of C. irritans were employed to investigate the differential expression pattern of lncRNAs and analyze potential biological functions. A total of 77 differentially expressed lncRNAs targeting to 567 protein-coding genes were identified by using expression analysis. Several immune genes, including TLR5, CD2AP, and MMP9, were highlighted. With GO enrichment and KEGG pathway analysis, the immune-related terms or pathways were enriched. This study created a comprehensive dataset of lncRNAs for large yellow croaker, which would be helpful for the researches of functional roles of lncRNAs in large yellow croaker.Entities:
Keywords: Cryptocaryon irritans; Larimichthys crocea; immune response; infection; lncRNAs
Year: 2020 PMID: 33281881 PMCID: PMC7689269 DOI: 10.3389/fgene.2020.590475
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1The filtering pipeline for identification of lncRNAs in L. crocea. Venn diagrams show the results of three coding potential prediction tools, blast against known annotation, and blast against public database. In the end, a total of 14,599 transcripts were identified as lncRNAs in L. crocea.
Summary statistic for lncRNA dataset identified in this study.
| Items | Statistics value |
| Total transcripts | 14599 |
| Total loci | 13673 |
| N50 length (nt) | 999 |
| N60 length (nt) | 857 |
| N70 length (nt) | 726 |
| N80 length (nt) | 596 |
| N90 length (nt) | 444 |
| Total length (nt) | 12065150 |
| Max length (nt) | 10080 |
| Min length (nt) | 201 |
| Average length (nt) | 684 |
| Number of exons | |
| =1 | 12858 (88.06%) |
| ≤2 | 14009 (95.95%) |
| ≤3 | 14374 (98.45%) |
Different characteristics of lncRNAs and mRNAs identified in this study.
| Items | lncRNA | mRNA |
| Total transcripts | 14599 | 23807 |
| Average length (nt) | 684 | 1485 |
| Average number of exons | 1.19 | 9.45 |
| Largest number of exons | 14 | 143 |
FIGURE 2Distribution of sequence length and exon number of lncRNAs compared to mRNAs in large yellow croaker. (A) lncRNAs is much shorter than mRNA. The majority of lncRNAs (73.27%) is shorter than 1000 nt while only 31.45% for mRNAs. (B) lncRNAs have less exons than mRNAs. Of lncRNAs, 88.06% have only one exon while 18.06% of mRNAs have exons more than 14.
FIGURE 3Expression patterns of target genes in infection analysis. Membership values indicated the degree that a transcript belonged to this cluster. The more degree, the more relevance.
The hub genes of each cluster.
| Cluster | Rank | Gene name | Score | Annotation (Uniprot) |
| 1 | 1 | Syne2 | 58 | Nesprin-2 |
| 1 | 2 | PODN | 54 | Podocan |
| 1 | 2 | LRRC20 | 54 | Leucine-rich repeat-containing protein 20 |
| 1 | 4 | RHOBTB2 | 52 | Rho-related BTB domain-containing protein 2 |
| 1 | 5 | Myh8 | 50 | Myosin-8 |
| 1 | 6 | ANK2 | 48 | Ankyrin-2 |
| 2 | 1 | Bcl3 | 44 | B-cell lymphoma 3 protein |
| 2 | 2 | RND1 | 38 | Rho-related GTP-binding protein Rho6 |
| 2 | 3 | Rhog | 36 | Rho-related GTP-binding protein RhoG |
| 2 | 4 | zgc:110179 | 34 | Ras-like protein family member 11A-like protein |
| 3 | 1 | CD2AP | 46 | CD2-associated protein |
| 3 | 2 | RAB25 | 34 | Ras-related protein Rab-25 |
| 3 | 2 | Tlr5 | 34 | Toll-like receptor 5 |
| 3 | 4 | ZBTB7B | 32 | Zinc finger and BTB domain-containing protein 7B |
| 3 | 4 | KLF5 | 32 | Krueppel-like factor 5 |
| 4 | 1 | pbk | 22 | Lymphokine-activated killer T-cell-originated protein kinase |
| 5 | 1 | GP1BB | 46 | Platelet glycoprotein Ib beta chain |
| 5 | 1 | Gp5 | 46 | Baseplate central spike complex protein gp5 |
| 5 | 3 | PKM | 44 | Pyruvate kinase PKM |
| 5 | 3 | TESK1 | 44 | Dual specificity testis-specific protein kinase 1 |
| 5 | 5 | ASB5 | 36 | Ankyrin repeat and SOCS box protein 5 |
| 5 | 5 | spata5l1 | 36 | Spermatogenesis-associated protein 5-like protein 1 |
| 5 | 5 | atp2a1 | 36 | Sarcoplasmic/endoplasmic reticulum calcium ATPase 1 |
FIGURE 4Interaction network of lncRNA TCONS_00012911 with putative target genes. Yellow square nodes: lncRNA TCONS_00012911; green nodes: target genes; brown lines: interactions between genes based on STRING database; red lines: positive interactions between lncRNAs and genes (Pearson’s correlation coefficient > 0.99).