| Literature DB >> 28073357 |
Kévin Muret1,2, Christophe Klopp3, Valentin Wucher4, Diane Esquerré5,6, Fabrice Legeai7,8, Frédéric Lecerf1,2, Colette Désert1,2, Morgane Boutin1,2, Frédéric Jehl1,2, Hervé Acloque6, Elisabetta Giuffra9, Sarah Djebali6, Sylvain Foissac6, Thomas Derrien10, Sandrine Lagarrigue11,12.
Abstract
BACKGROUND: Improving functional annotation of the chicken genome is a key challenge in bridging the gap between genotype and phenotype. Among all transcribed regions, long noncoding RNAs (lncRNAs) are a major component of the transcriptome and its regulation, and whole-transcriptome sequencing (RNA-Seq) has greatly improved their identification and characterization. We performed an extensive profiling of the lncRNA transcriptome in the chicken liver and adipose tissue by RNA-Seq. We focused on these two tissues because of their importance in various economical traits for which energy storage and mobilization play key roles and also because of their high cell homogeneity. To predict lncRNAs, we used a recently developed tool called FEELnc, which also classifies them with respect to their distance and strand orientation to the closest protein-coding genes. Moreover, to confidently identify the genes/transcripts expressed in each tissue (a complex task for weakly expressed molecules such as lncRNAs), we probed a particularly large number of biological replicates (16 per tissue) compared to common multi-tissue studies with a larger set of tissues but less sampling.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28073357 PMCID: PMC5225574 DOI: 10.1186/s12711-016-0275-0
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Fig. 1Predicted lncRNA features. a LncRNA prediction with a user-defined lncRNA specificity/sensitivity cut-off according to the two ROC curve graph provided by FEELnc. b Expression distribution within the three classes (new lncRNAs, ambiguous RNAs and new mRNAs) compared to that of known protein-coding genes from Ensembl. c Structural features for lncRNAs and Ensembl protein-coding RNAs in three species (G = Gallus gallus, M = Mus musculus, H = Homo sapiens). For the chicken lncRNAs, the data were generated in this study, while for the human and mouse lncRNAs, the data are taken from Ensembl V83. d Number of genes considered as expressed (FPKM-UQ ≥ 0.1) (y-axis) according to the number of biological replicates (x-axis) in the liver (left) and adipose tissue (right) for lncRNAs and Ensembl protein-coding genes. On each plot are indicated the number of genes for which at least 10 samples have a FPKM-UQ ≥ 0.1 (right number) and the number of genes for which a maximum of four samples have a FPKM-UQ ≥ 0.1 (left number). e Classification by FEELnc of the 1670 reliable lncRNA genes for 2412 transcripts
Fig. 2Gene density and structural features for protein-coding genes and lncRNA genes across the chicken macro- and micro-chromosomes. a Gene density for all chromosomes (except for chromosomes 16, 25, and W that are not sufficiently well sequenced). b Correlation of gene densities between protein-coding genes (y-axis) and long noncoding genes (x-axis). c Exon size, exon number and intron size for macro-chromosomes 1–5 and micro-chromosomes 20, 21, 23, 26, 27 and 28
Fig. 3Chicken/human lncRNA conservation in terms of sequence (a) and syntenic position (b–e). a An example of chicken lncRNA (XLOC_014262) that has a conserved sequence with the human RP11-386B13.3 lncRNA and a similar syntenic position in both species. b Schematic picture illustrating our approach for identifying syntenic lncRNAs between the chicken and human genomes. c, d Schematic representations of the SLC38A4-AMIGO2 (d) and VPS18-DLL4 loci. e Distributions of the expression of the two subsets of lncRNAs with conserved or not synteny
Fig. 4Tissue expression of lncRNA and protein-coding RNA genes in liver and adipose tissue in chicken. a Expression levels in both tissues. b Tissue-specific expression for the whole lncRNAs and Ensembl protein-coding RNAs. c Tissue-specific expression for a subset of the lncRNAs and protein-coding RNAs with similar expression (between the extreme medians of the lncRNA and mRNA expression distributions represented by x = 0.76 and y = 9.94 FPKM-UQ, respectively). The read counts were normalized for library size and gene size, and the biological replicates per tissue were taken into account as explained in “Methods” section
Significant correlations between expression for lncRNA-mRNA and mRNA–mRNA pairs considering FEELnc classes and distance between genes
| Same strand | Convergent | Divergent | Antisense exon | Antisense intron | |
|---|---|---|---|---|---|
| Genes | 674 | 194 | 370 | 23 | 100 |
| lncRNA-mRNA | |||||
| ≤1 kb | 51/91 (56%) +49/−2 | 5/28 (18%) +3/−2 | 23/105 (22%) +19/−4 | 5/23 (22%) | 13/100 (13%) |
| >1 kb | 139/583 (24%) +127/−12 | 13/166 (8%) +10/−3 | 34/265 (13%) +27/−7 | ||
| p value | 2.37 × 10−9 | NS (0.15) | 3.7 × 10−2 | NA | |
| mRNA–mRNA | |||||
| ≤1 kb | 28% | 17% | 30% | 24% | |
| >1 kb | 22% | 19% | 22% | ||
| p value | NS (0.19) | NS (0.80) | 0.09 | NA | |
The FEELnc classes are “same strand”, “divergent” and “convergent” for intergenic lncRNAs, and “antisense exon” and “antisense intron” for genic lncRNAs
For intergenic lncRNAs, co-expression was tested according to the physical distance (1 kb) between the two genes considered
The difference in correlated pairs between the “distance” sets was tested by a Fisher test (NS: non-significant). Note that this co-expression table depends on the modeling of protein-coding genes obtained by Ensembl V84.4
Fig. 5NPNT gene and its antisense lncRNA gene. a Gene models of the lncRNA/mRNA pair in the chicken and human genomes. b Expression of the lncRNA/mRNA pair analyzed with RNA-Seq data in liver (left) and adipose tissue (right). c Expression analysis with RT-qPCR data. d Expression of 20 fed and fasted birds (analyzed by RT-qPCR). Correlation significance: ***p value <0.001
Fig. 6DHCR24 gene and its divergent lncRNA gene. a Gene models of the lncRNA/mRNA pair in the chicken and human genomes. b Expression correlation in liver using RNA-Seq data (left) and confirmed by RT-qPCR (right). c Expression in adult birds analyzed by RNA-Seq (left) and young birds under fasted and fed statuses analyzed by RT-qPCR (right). d Expression across 17 tissues (see the “Methods” section). Correlation significance: ***p value <0.001
RT-qPCR primers used to amplify genes of interest
| Gene of interest | Forward | Reverse |
|---|---|---|
| DHCR24 | TGGAGAGCCCAAAACGAAACA | CGCGGGTCATGTAGCAATC |
| lncRNA_DHCR24 | GAGAGAAGCTGGATGGTCCTG | CTGAAGGAGACTGCAAGGTGT |
| NPNT | CGATGAATGTGCTACTGGGAGA | AACTACCACACTGATGCTGGC |
| lncNPNT-AS | TGCACTCTCATCTTGTGTGCT | CAACGTGACCATAAGGGCTG |