Wen Chen1,2, Xuan Zhang1, Jing Li1, Shulan Huang2, Shuanglin Xiang2, Xiang Hu3, Changning Liu4.
Abstract
BACKGROUND: Zebrafish is a fully developed model system for studying developmental processes and human disease. Recent deep-sequencing studies have discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only a few of them have been functionally characterized. How to take advantage of the mature zebrafish system to investigate lncRNA function and conservation in depth is therefore an intriguing question.
Keywords: Co-expression network; Conservation; Gene ontology; KEGG; LncRNA; Zebrafish
Year: 2018 PMID: 29764394 PMCID: PMC5954278 DOI: 10.1186/s12864-018-4458-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1 Integration of all sources of zebrafish lncRNA. a Data sources and analysis pipeline. b The number of lncRNA transcripts from each source. c Venn diagram between different sources
Fig. 2 Features of zebrafish lncRNA. a Distribution of lncRNA subtypes. b Zebrafish lncRNA distribution in chromosomes. c Distribution of zebrafish lncRNA isoform number. d Distribution of zebrafish lncRNA exon number
Fig. 3 Features of zebrafish coding-lncRNA gene co-expression network. a Cumulative distribution of gene expression Spearman’s correlation coefficient. b Network statistics by different correlation coefficient cutoffs. c Evaluation of function prediction performance of the network with different cutoffs. d Network degree distribution (correlation coefficient cutoff = 0.5)
Fig. 4 Functional annotation of zebrafish lncRNAs. a LncRNA GO BP enrichment slim (top 10). b LncRNA KEGG pathway enrichment (top 10). c Conserved lncRNA GO BP enrichment slim (top 10). d Conserved lncRNA KEGG pathway enrichment (top 10)
Fig. 5 Conservation analysis of zebrafish lncRNAs. a Cumulative distribution of conservation levels computed using PhastCons applied to the 8-way whole-genome. b Cumulative distribution of TSI (tissue specificity index). c Cumulative distribution of Spearman’s correlation coefficient of gene expression. d Cumulative distribution of TF families’ intersection over union score
Fig. 6 ZFLNCG05544 is a candidate lncRNA gene related to human neuron diseases. a The two transcripts of ZFLNCG05544 (ZFLNCT08573, ZFLNCT08573) and durga in UCSC genome browser. b ZFLNCG05544 co-expression subnetwork. c GO annotation of ZFLNCG05544 (top 10). d KEGG annotation of ZFLNCG05544 (top 10)
Fig. 7 ZFLNCG08251 is a human MALAT1 homolog in zebrafish. a ZFLNCG08251 co-expression subnetwork. b GO annotation of ZFLNCG08251 (top 10). c KEGG annotation of ZFLNCG08251 (top 10)
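The Fig. 3 caption refers to a co-expression network built from Spearman's correlation coefficients with a cutoff of 0.5, and to its degree distribution. The following is a minimal sketch of that general idea, not the authors' pipeline. The gene names, the toy expression values, the helper names (`average_ranks`, `spearman`, `coexpression_edges`, `degrees`) and the choice to threshold on the absolute value of the coefficient are all assumptions made for illustration.

```python
from itertools import combinations
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either vector is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def spearman(x, y):
    """Spearman's rho is the Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def coexpression_edges(expression, cutoff=0.5):
    """Keep gene pairs with |rho| >= cutoff (absolute value is an assumption)."""
    edges = []
    for a, b in combinations(sorted(expression), 2):
        rho = spearman(expression[a], expression[b])
        if abs(rho) >= cutoff:
            edges.append((a, b, rho))
    return edges


def degrees(edges):
    """Count the number of edges touching each node."""
    deg = {}
    for a, b, _ in edges:
        deg[a] = deg.get(a, 0) + 1
        deg[b] = deg.get(b, 0) + 1
    return deg


# Hypothetical expression profiles across five samples.
toy_expression = {
    "lncRNA_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "gene_B": [2.0, 4.0, 6.0, 8.0, 11.0],
    "gene_C": [5.0, 1.0, 4.0, 2.0, 3.0],
}
toy_edges = coexpression_edges(toy_expression, cutoff=0.5)
print(toy_edges)
print(degrees(toy_edges))
```

In this toy input only the lncRNA_A and gene_B profiles are monotonically related, so they form the single edge. Raising or lowering `cutoff` trades network density against edge reliability, which is the kind of comparison Fig. 3b and 3c describe.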