Alberto Lopez-Ezquerra, Mark C Harrison, Erich Bornberg-Bauer.
Abstract
BACKGROUND: The ever-increasing availability of genomes makes it possible to investigate and compare not only the genomic complements of genes and proteins, but also those of RNAs. One class of RNAs, the long noncoding RNAs (lncRNAs), and in particular their subclass of long intergenic noncoding RNAs (lincRNAs), has recently gained much attention because of its roles in the regulation of important biological processes such as immune response or cell differentiation, and as a possible source of evolutionary precursors for protein-coding genes. lincRNAs seem to be poorly conserved at the sequence level, but at least some lincRNAs have conserved structural elements and syntenic genomic positions. Previous studies showed that transposable elements are a major contributor to the evolution of lincRNAs in mammals. In contrast, plant lincRNA emergence and evolution have been linked to local duplication events. However, little is known about their evolutionary dynamics in general, and in insect genomes in particular.
Keywords: Evolution; RNA secondary structure; Transcriptomics; lincRNA
Year: 2017 PMID: 28673235 PMCID: PMC5494802 DOI: 10.1186/s12862-017-0985-0
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
LincRNAs used in this study
| Species | Total lincRNAs | Monoexonic lincRNAs | Multiexonic lincRNAs | Reference |
|---|---|---|---|---|
| | 1559 | 1327 | 232 | Here assembled |
| | 2602 | 1807 | 795 | [ |
| | 2066 | 330 | 1735 | [ |
| | 1529 | 310 | 1199 | [ |
| | 2459 | 379 | 2080 | [ |
| | 2176 | 431 | 1713 | Here assembled |
| | 1770 | 655 | 1115 | [ |
Fig. 1 GC content and TE content of lincRNAs. a GC content of lincRNAs, lincRNA introns and coding sequences. LincRNAs have an intermediate GC content: higher than introns but lower than coding sequences. b Percentage of repeats in lincRNAs. LincRNAs also have an intermediate level of repeats: more than coding sequences but fewer than introns. c Conserved lincRNAs have fewer TEs. In contrast, lincRNAs with signals of conservation in their ORF or with paralogs have more TEs
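The GC content compared in panel a can be sketched as below. This is a minimal illustration, not the authors' pipeline: the function name `gc_content`, the choice to ignore ambiguous bases such as `N`, and the toy sequences are all assumptions made for the example.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among unambiguous bases (A, C, G, T/U).

    Assumption: ambiguous symbols such as N are excluded from the denominator.
    """
    seq = seq.upper()
    gc = sum(seq.count(base) for base in "GC")
    total = gc + sum(seq.count(base) for base in "ATU")
    if total == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return gc / total


# Hypothetical toy sequences, one per feature class compared in panel a.
examples = {
    "lincRNA exon": "ATGCGCATTA",
    "lincRNA intron": "ATTTAATATA",
    "CDS": "ATGGCCGCGC",
}
for label, seq in examples.items():
    print(f"{label}: {gc_content(seq):.2f}")
```

Applied to real data, the same function would be run over every lincRNA exon, lincRNA intron and coding sequence, and the three distributions compared.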
Fig. 2 Secondary structure analysis of lincRNAs. a Distribution of Z-scores with folding strength (FS). Both are highly correlated (rho = 0.46, p-value < 2.2e-16), which indicates that strongly folded sequences also tend to be more stable than their shuffled controls. b Distribution of Z-scores obtained from MFE of lincRNAs compared to shuffled sequences. A negative correlation (rho = -0.22, p-value < 2.2e-16) indicates that thermodynamically stable sequences (i.e., longer ones, because MFE scales with length) have higher Z-scores, although several short-sequence outliers with very strong Z-scores are present. c Comparison of Z-scores for FS between lincRNAs and 10000 CDS from the seven species. CDS show significantly higher FS than lincRNAs. d Comparison of Z-scores for MFE between lincRNAs and 10000 CDS from the seven species. Z-scores are significantly higher for lincRNAs
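A Z-score against shuffled controls, as referred to in this caption, can be sketched as follows. Several things here are assumptions rather than the authors' method: the controls are simple mononucleotide shuffles, the number of shuffles is arbitrary, and `toy_energy` is a hypothetical stand-in that only counts end-to-end complementary pairs. It is not a thermodynamic model. A real analysis would pass an actual MFE function (for example one built on the ViennaRNA Python bindings, if installed) as the `score` argument.

```python
import random
import statistics
from typing import Callable

# Canonical and wobble pairs, used only by the toy scoring function.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def toy_energy(seq: str) -> float:
    """Hypothetical stand-in for an MFE: minus the number of positions i
    whose base can pair with the base mirrored from the 3' end."""
    half = len(seq) // 2
    return -float(sum((seq[i], seq[-1 - i]) in PAIRS for i in range(half)))


def shuffle_sequence(seq: str, rng: random.Random) -> str:
    """Mononucleotide shuffle: same base composition, randomized order."""
    bases = list(seq)
    rng.shuffle(bases)
    return "".join(bases)


def z_score(seq: str, score: Callable[[str], float],
            n_shuffles: int = 100, seed: int = 0) -> float:
    """Z = (score(seq) - mean(score(shuffled))) / sd(score(shuffled))."""
    rng = random.Random(seed)
    controls = [score(shuffle_sequence(seq, rng)) for _ in range(n_shuffles)]
    mu = statistics.mean(controls)
    sigma = statistics.stdev(controls)
    if sigma == 0:
        raise ValueError("shuffled controls have zero variance")
    return (score(seq) - mu) / sigma


# Hypothetical hairpin-like sequence: 8 G, a 4-base loop, 8 C.
hairpin = "GGGGGGGGAAAACCCCCCCC"
print(f"toy energy: {toy_energy(hairpin)}")
print(f"Z-score vs shuffled controls: {z_score(hairpin, toy_energy):.2f}")
```

With an energy-like score, this formula gives a negative Z-score when the native sequence scores lower (more stable) than its shuffled controls; the sign convention used in the figure may differ.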
Fig. 3 Conservation analysis of lincRNAs in the seven studied species. Comparison of lincRNA exons was performed using BLAST with the native lincRNAs, with the lincRNA sequences after masking repeats, and with the longest ORF of each lincRNA. Low sequence conservation was observed for lincRNAs in insects. Five hundred ninety-three lincRNAs were conserved in their nucleotide sequence both with and without masking repeats. Furthermore, 43 lincRNAs showed signals of conservation in their ORFs and 68 showed indications of conservation only without masking repeats
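The three categories reported in this caption can be thought of as set operations over the lincRNA identifiers that produced hits in each BLAST run. The sketch below is only an illustration under that assumption: the function name, the category labels and the identifiers are hypothetical, and whether the ORF category overlaps the other two is not stated in the caption.

```python
from typing import Dict, Set


def classify_conservation(unmasked_hits: Set[str],
                          masked_hits: Set[str],
                          orf_hits: Set[str]) -> Dict[str, Set[str]]:
    """Group lincRNA IDs by which BLAST comparison supported conservation."""
    return {
        # Hit with the native sequence and again after masking repeats.
        "conserved_masked_and_unmasked": unmasked_hits & masked_hits,
        # Hit only while repeats were left unmasked.
        "conserved_only_unmasked": unmasked_hits - masked_hits,
        # Hit when only the longest ORF was compared.
        "conserved_orf": set(orf_hits),
    }


# Hypothetical identifiers, for illustration only.
groups = classify_conservation(
    unmasked_hits={"linc1", "linc2", "linc3"},
    masked_hits={"linc1", "linc2"},
    orf_hits={"linc4"},
)
for name, ids in groups.items():
    print(name, sorted(ids))
```

A lincRNA that is found only without masking is a candidate for similarity driven by shared repeats rather than by the lincRNA sequence itself.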
LincRNAs with signals of conservation in other species (labelled as conserved), with paralogous stretches (paralogs), and containing overlaps with transposable elements in their spliced exonic sequences (transposable element derived)
| Species | Total lincRNAs | Conserved (after masking repeats) | Paralogs | TE related | Structured |
|---|---|---|---|---|---|
| | 1559 | 0 | 768 | 827 | 37 |
| | 2602 | 174 | 869 | 443 | 79 |
| | 2066 | 0 | 422 | 237 | 180 |
| | 1529 | 129 | 83 | 1 | 80 |
| | 2459 | 159 | 356 | 0 | 56 |
| | 2176 | 16 | 259 | 109 | 205 |
| | 1770 | 155 | 506 | 246 | 72 |