| Literature DB >> 31824560 |
Wietje Nolte1, Rosemarie Weikard1, Ronald M Brunner1, Elke Albrecht2, Harald M Hammon3, Antonio Reverter4, Christa Kühn1,5.
Abstract
Background: Genomic regions associated with divergent livestock feed efficiency have been found predominantly outside protein coding sequences. Long non-coding RNAs (lncRNA) can modulate chromatin accessibility, gene expression and act as important metabolic regulators in mammals. By integrating phenotypic, transcriptomic, and metabolomic data with quantitative trait locus data in prioritizing co-expression network analyses, we aimed to identify and functionally characterize lncRNAs with a potential key regulatory role in metabolic efficiency in cattle. Materials andEntities:
Keywords: Bos taurus; Functional Annotation of Animal Genomes; co-expression network analysis; long non-coding RNA; metabolic efficiency
Year: 2019 PMID: 31824560 PMCID: PMC6883949 DOI: 10.3389/fgene.2019.01130
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Sample characteristics.
| Metabolic efficiency group | Number of animals | Sex | RFI1 in last month of life (bulls) | ECM2w (cows) | IMF3 (both sexes) | CFC4 (both sexes) |
|---|---|---|---|---|---|---|
| µ5 (SD6) | µ (SD) | µ (SD) | µ (SD) | |||
| 25 | 12 males 13 females | -21.30 (4.44) | 190.87 (22.02) | 3.46 (1.30) | 15.93 (3.16) | |
| 23 | 12 males 11 females | 20.83 (4.41) | 30.97 (9.18) | 5.51 (2.34) | 22.93 (4.88) |
1RFI, residual feed intake; 2ECMw, energy corrected milk 7 days before slaughter; 3IMF, intramuscular fat content (given in percent, measured in M. longissimus dorsi); 4CFC, carcass fat content; 5µ, mean; 6SD, standard deviation.
Figure 1Workflow for the identification and functional characterization of key lncRNAs with regulatory potential in two contrasting biological conditions. The phenotypes under investigation were high and low metabolic efficiency in a Charolais x Holstein cross-population. lncRNA, long non-coding RNA; FPKM, fragments per kilobase transcript length per million reads; TS, tissue specific; DE, differentially expressed; QTL, quantitative trait locus; RFI, residual feed intake; MY, milk production; RIF, regulatory impact factor; PCIT, partial correlation and information theory.
Overall and tissue-specific RNA sequencing, alignment, and mapping statistics.
| RIN1 | Sequencing depth [read pairs] | Alignment to UMD.3.1 [%] | Mapping to project-specific annotation [%] | |||||
|---|---|---|---|---|---|---|---|---|
| µ2 | SD3 | µ | SD | µ | SD | µ | SD | |
| 8.22 | 0.81 | 48,041,209 | 5,601,638 | 92.98 | 9.50 | 81.89 | 8.67 | |
| 8.73 | 0.44 | 48,954,376 | 3,993,201 | 96.91 | 0.31 | 84.99 | 2.20 | |
| 8.00 | 0.62 | 50,093,826 | 5,869,833 | 98.43 | 0.20 | 91.36 | 1.21 | |
| 7.55 | 0.85 | 47,117,156 | 5,815,843 | 98.59 | 0.13 | 82.42 | 1.79 | |
| 8.41 | 0.86 | 45,999,477 | 5,587,407 | 78.00 | 7.75 | 69.05 | 4.67 | |
1RIN, RNA integrity number, 2µ, mean, 3SD, standard deviation.
Characterization of high connectivity long non-coding RNAs from networks specific for high or low metabolic efficiency in cattle.
| Identifier | FEElnc Prediction (based on UMD3.1 release 92) | FPKM | Position & Structure | Differential Expression Analysis | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MSTRG | TN | Closest partner gene (gene symbol) | Direction, type, location | Distance bp | Mean | BTA | Exons | Start bp | Strand | Exonic length | Log2 FC | p-value | q-value (BH) | Tissue |
| MSTRG.4740 | 1 | ENSBTAG00000002062 (TRPA1) | Anti-sense, genic, intronic | 0 | 6.74 | 14 | 2 | 37760114 | + | 437 | 0.77 | 2.10E-04 | 9.13E-03 | Liver |
| MSTRG.4926 | 1 | ENSBTAG00000021964 (CDH17) | Anti-sense, genic, exonic | 0 | 0.78 | 14 | 18 | 72437085 | - | 3,321 | NA | NA | NA | None |
| MSTRG.9051 | 1 | ENSBTAG00000004651 (NME1) | Anti-sense, intergenic, downstream | 358 | 2.57 | 19 | 2 | 36225984 | - | 1,866 | NA | NA | NA | None |
| 2 | 1,587 | 36227213 | ||||||||||||
| MSTRG.10337 | 1 | ENSBTAG00000005353 (DES) | Anti-sense, genic, exonic | 0 | 5.37 | 2 | 9 | 108075380 | - | 2,989 | NA | NA | NA | None |
| MSTRG.17681 | 1 | ENSBTAG00000005726 (HNRNPA2B1) | Anti-sense, intergenic, upstream | 9,170 | 19.80 | 4 | 3 | 70184682 | - | 1,552 | -0.38 | 8.45E-05 | 5.04E-03 | Liver |
| MSTRG.18433 | 1 | ENSBTAG00000015828 (FKBP11) | Sense, intergenic, upstream | 5,808 | 8.94 | 5 | 2 | 31038376 | + | 2,683 | NA | NA | NA | None |
| MSTRG.19098 | 1 | ENSBTAG00000046324 (C-type lectin domain family 2 member D11) | Anti-sense, genic, exonic | 0 | 1.72 | 5 | 5 | 100926473 | + | 1,570 | -0.63 | 1.27E-06 | 3.04E-04 | Liver |
| MSTRG.19312 | 1 | ENSBTAG00000009886 (KDELR3 ) | Anti-sense, genic, exonic | 0 | 25.84 | 5 | 3 | 110665971 | - | 5,768 | NA | NA | NA | None |
| 3 | 110670286 | |||||||||||||
*MSTRG, identifier from project-specific bovine transcriptome annotation; TN, transcript number; FPKM, fragments per kilobase million; BTA, bovine chromosome, bp, base pair; FC, foldchange; BH, Benjamini–Hochberg, NA, not available.
Figure 2Venn diagrams of (A) all loci (B) exclusively lncRNAs with differential expression (DE) between high and low metabolic efficiency in cattle. DE analysis was performed within the tissues jejunum, liver, muscle, and rumen. No loci were significantly [q-value (Benjamini–Hochberg) < 0.05] DE in rumen.
Figure 3Venn diagram of 3,754 loci selected for co-expression network construction. Loci belonging to at least one of these four categories were considered: differential expression (DE) in at least one tissue, tissue specific (TS) expression, harboring a QTL for residual feed intake and or milk production (QTL) and key regulatory long non-coding (lnc) RNAs [significant (p < 0.05) regulatory impact factor score].
BLAST results for eight high connectivity long non-coding RNAs (>100 nodes) in partial correlation and information theory networks with connections exclusive for high or low metabolic efficiency.
| lncRNA | BLAST against bovine reference genome (ARS-UCD1.2, release 106) | |||||
|---|---|---|---|---|---|---|
| Identifier | Network (connectivity in nodes) | Annotated gene with highest sequence homology | Identity [%] | Query cover [%] | E-Value | Position of lncRNA relative to homologous gene in ARS-UCD1.2 |
| MSTRG.4740 | Low (147) | mRNA-transient receptor potential cation channel subfamily A member 1 (TRPA1) | 100.00 | 100.00 | 9.00E-116 | Intronic, anti-sense |
| ADP-ribosylation factor 4 (ARF4) | 98.57 | 91.00 | 3.00E-100 | Exonic, sense | ||
| MSTRG.4926 | High (144) | Cadherin-17 precursor (CDH17) | 100.00 | 100.00 | 0.00E+00 | Anti-sense |
| MSTRG.9051 | High (170) | Nucleoside diphosphate kinase A 1 isoform X1 (NME1) | 99.72 | 100.00 | 0.00E+00 | Sense, genic |
| MSTRG.10337 | Low (239) | Desmin (DES) | 99.93 | 100.00 | 0.00E+00 | Exonic, anti-sense |
| MSTRG.17681 | High (120) | 39,201 bp at 5' side: alpha-aminoadipic semialdehyde synthase, mitochondrial precursor 88559 bp at 3' side: fez family zinc finger protein 1 | 98.40 | 99.00 | 0.00E+00 | Sense, genic |
| Chromobox protein homolog 3 isoform X1 (CBX3) | 99.00 | 89.00 | 0.00E+00 | Sense, genic | ||
| MSTRG.18433 | High (268) | 364 bp at 5' side: ADP-ribosylation factor 3; 37831 bp at 3' side: peptidyl-prolyl cis-trans isomerase FKBP11 precursor | 99.96 | 100.00 | 0.00E+00 | Sense, intergenic |
| MSTRG.19098 | High (184) | C-type lectin domain family 2 member D11 | 100.00 | 100.00 | 0.00E+00 | Anti-sense, genic |
| MSTRG.19312 | High (212) | ER lumen protein-retaining receptor 3 (KDELR3) | 100.00 | 99.00 | 0.00E+00 | Anti-sense, genic |
Figure 4Co-expression network for the novel long non-coding (lnc) RNA MSTRG.4740 with key regulatory potential for metabolic efficiency in cattle and significantly (p < 0.05) correlated genes with a minimal correlation coefficient of |r| > 0.8. Correlations are exclusive for animals with low metabolic efficiency.
Top 10 enriched pathways derived from genes and metabolites significantly correlated with key long non-coding RNAs associated with metabolic efficiency
| ID | Ingenuity Canonical Pathways | log(p) | Ratio | Molecules |
|---|---|---|---|---|
| MSTRG.4740 | tRNA Charging | 5.56E00 | 8.54E-02 | L-valine, L-phenylalanine, L-tryptophan, glycine, L-arginine, L-tyrosine, L-lysine |
| EIF2 Signaling | 4.13E00 | 3.83E-02 | MYC, RPS7, RPL27A, RPL35, RPL23A, RPL37, RPL26, EIF3E, RPL31 | |
| Glucose and Glucose-1-phosphate Degradation | 3.18E00 | 1.3E-01 | D-glucose, PGM3, phosphate | |
| Tyrosine Biosynthesis IV | 2.94E00 | 2.86E-01 | L-phenylalanine, L-tyrosine | |
| Acetyl-CoA Biosynthesis III (from Citrate) | 2.82E00 | 2.5E-01 | phosphate, citric acid | |
| Glycine Degradation (Creatine Biosynthesis) | 2.71E00 | 2.22E-01 | glycine, L-arginine | |
| Phenylalanine Degradation IV (Mammalian, via Side Chain) | 2.68E00 | 8.82E-02 | L-phenylalanine, phenylpyruvic acid, glycine | |
| Glutathione Biosynthesis | 2.53E00 | 1.82E-01 | phosphate, glycine | |
| Thymine Degradation | 2.53E00 | 1.82E-01 | 5, 6-dihydrothymine, beta-ureidoisobutyric acid | |
| MSTRG.10337 | Calcium Signaling | 1.63E01 | 9.35E-02 | TNNT1, CHRNA1, CACNB1, CACNG1, CACNA1S, MYL2, TNNI2, TNNT3,T NNC2, TNNC1, MYL1, ATP2A1, CAMK2A, CASQ1, RYR1, TNNI1, CASQ2, MYL3, ACTA1, CAMK2B |
| Protein Kinase A Signaling | 7.45E00 | 3.88E-02 | TNNI2, MYL2, MYLPF, MYLK2, PPP1R3A, TTN, MYL1, EPM2A, CAMK2A, PLCB1, RYR1, TNNI1, EYA1, MYL3, CAMK2B, PHKG1 | |
| nNOS Signaling in Skeletal Muscle Cells | 6.1E00 | 1.3E-01 | CACNG1, CACNB1, CACNA1S, CHRNA1, RYR1, L-arginine | |
| Cellular Effects of Sildenafil (Viagra) | 6.09E00 | 6.25E-02 | CACNA1S, CACNG1, MYL2, MYLPF, PLCB1, L-arginine, MYL1, MYL3, ACTA1 | |
| RhoA Signaling | 4.55E00 | 5.6E-02 | MYL2, MYLPF, MYLK2, TTN, MYL1, MYL3, ACTA1 | |
| Apelin Cardiomyocyte Signaling Pathway | 3.7E00 | 5.00E-02 | MYL2, MYLPF, PLCB1, MYL3, MYL1, ATP2A1 | |
| Actin Cytoskeleton Signaling | 3.55E00 | 3.36E-02 | MYL2, MYLPF, ACTN3, MYLK2, TTN, ACTA1, MYL3, MYL1 | |
| Regulation of Actin-based Motility by Rho | 3.24E00 | 5.21E-02 | MYL2, MYLPF, MYL3, ACTA1, MYL1 | |
| ILK Signaling | 3.19E00 | 3.38E-02 | PARVB, MYL2, TNFRSF1A, ACTN3, MYL1, MYL3, ACTA1 | |
| Thrombin Signaling | 2.93E00 | 3.06E-02 | CAMK2A, MYL2, MYLPF, PLCB1, MYL1, MYL3, CAMK2B | |
| MSTRG.17681 | Caveolar-mediated Endocytosis Signaling | 3.56E00 | 5.48E-02 | ARCN1, COPA, COPE, COPB2 |
| Fatty Acid α-oxidation | 2.29E00 | 8.00E-02 | ALDH3A2, ALDH9A1 | |
| Death Receptor Signaling | 2.15E00 | 3.3E-02 | PARP10, PARP4, HTRA2 | |
| Histamine Degradation | 2.05E00 | 6.06E-02 | ALDH3A2, ALDH9A1 | |
| Oxidative Ethanol Degradation III | 2.05E00 | 6.06E-02 | ALDH3A2, ALDH9A1 | |
| G Protein Signaling Mediated by Tubby | 2.03E00 | 5.88E-02 | GNG2, GNAQ | |
| Tryptophan Degradation X (Mammalian, via Tryptamine) | 2.00E00 | 5.71E-02 | ALDH3A2, ALDH9A1 | |
| Putrescine Degradation III | 2.00E00 | 5.71E-02 | ALDH3A2, ALDH9A1 | |
| Ethanol Degradation IV | 1.98E00 | 5.56E-02 | ALDH3A2, ALDH9A1 | |
| NER Pathway | 1.96E00 | 2.8E-02 | HIST2H4B, XAB2, RAD23B |
Figure 5Co-expression network for the novel long non-coding (lnc) RNA MSTRG.17681 with key regulatory potential for metabolic efficiency in cattle and significantly (p < 0.05) correlated genes with a minimal correlation coefficient of |r| > 0.8. Correlations are exclusive for animals with high metabolic efficiency.
Figure 6Co-expression network for the novel long non-coding (lnc) RNA MSTRG.10337 with key regulatory potential for metabolic efficiency in cattle and significantly (p < 0.05) correlated genes with a minimal correlation coefficient of |r| > 0.8. Correlations are exclusive for animals with low metabolic efficiency.