| Literature DB >> 25799544 |
Zhiqiang Hao1, Chunyan Fan2, Tian Cheng2, Ya Su2, Qiang Wei2, Guanglin Li1.
Abstract
Long intergenic noncoding RNAs (lincRNAs) are intergenic transcripts with a length of at least 200 nt that lack coding potential. Emerging evidence suggests that lincRNAs from animals participate in many fundamental biological processes. However, the systemic identification of lincRNAs has been undertaken in only a few plants. We chose to use cucumber (Cucumis sativus) as a model to analyze lincRNAs due to its importance as a model plant for studying sex differentiation and fruit development and the rich genomic and transcriptome data available. The application of a bioinformatics pipeline to multiple types of gene expression data resulted in the identification and characterization of 3,274 lincRNAs. Next, 10 lincRNAs targeted by 17 miRNAs were also explored. Based on co-expression analysis between lincRNAs and mRNAs, 94 lincRNAs were annotated, which may be involved in response to stimuli, multi-organism processes, reproduction, reproductive processes, and growth. Finally, examination of the evolution of lincRNAs showed that most lincRNAs are under purifying selection, while 16 lincRNAs are under natural selection. Our results provide a rich resource for further validation of cucumber lincRNAs and their function. The identification of lincRNAs targeted by miRNAs offers new clues for investigations into the role of lincRNAs in regulating gene expression. Finally, evaluation of the lincRNAs suggested that some lincRNAs are under positive and balancing selection.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25799544 PMCID: PMC4370693 DOI: 10.1371/journal.pone.0121800
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Pipeline for lincRNA identification.
The left frames contain the number of transcripts that passed the previous filter. The right frames show the process of filtering and the screened transcript numbers. Three sources were used to identify lincRNAs: hi-seq, EST, and 454-seq data. Hi-Seq: Illumina high-throughput RNA Sequencing; EST: expressed sequence tags; 454-Seq: 454-pyrosequencing; CPC: coding potential calculator; TU: transcript unit.
Fig 2Characteristics of lincRNAs.
(A) Length distribution of 3,274 lincRNAs. The X-axis represents the length of lincRNAs. The Y-axis represents the number of lincRNAs with specific lengths. (B) Distribution of exon numbers. The X-axis displays the exon numbers; the Y-axis shows the number of lincRNAs corresponding to specific exon numbers. (C) The nearest distance between lincRNAs and their neighboring protein-coding genes. (D) The conservation of lincRNAs.
Fig 3Expression pattern of lincRNAs.
(A) The numbers of lincRNAs showing different expression levels in each tissue. (B) Different expression levels of lincRNAs and mRNAs in the stem. (C) Proportion of lincRNAs exhibiting tissue-specific expression in different tissues. (D) Heat map of lincRNAs with tissue-specific expression.
LincRNAs targeted by miRNAs.
|
|
|
|
|
|---|---|---|---|
| csa-miR156a | CU2NC895 | 0 | miRNA: 1 GCTCACTTCTCTCTCTGTCAGA 22 |
| |||||||||||||||||||||| | |||
| lincRNA: 304 CGAGTGAAGAGAGAGACAGTCT 283 | |||
| csa-miR156b | CU2NC895 | 1 | miRNA: 1 GCTCACTTCTCTTTCTGTCAG-T 22 |
| ||||||||||||:|||||||| | | |||
| lincRNA: 304 CGAGTGAAGAGAGAGACAGTCTA 282 | |||
| csa-miR156d | CU7NC3095 | 2.5 | miRNA: 1 TGCCAGAAGAGAGTGAGCAC 20 |
| |:|||||||||||: |||| | |||
| lincRNA: 108 ATGGTCTTCTCTCTTTCGTT 89 | |||
| csa-miR159c | CU5NC2102 | 2.5 | miRNA: 1 TTTGGATTGAAGGGAGCTCT 20 |
| ||||:||||||:|||||| | |||
| lincRNA: 228 TCACCTGACTTCCTTCGAGA 209 | |||
| csa-miR162a | CU6NC2683 | 2.2 | miRNA: 1 GGAGGCAGCGGTTCATCGACC 21 |
| : | ||||:|||||||||||| | |||
| lincRNA: 142 TCGCCGTTGCCAAGTAGCTGT 122 | |||
| csa-miR166 | CU2NC947 | 0.8 | miRNA: 1 TCGGACCAGGCTTCATTC-TCG 21 |
| |||||||||||||||||| ||: | |||
| lincRNA: 275 AGCCTGGTCCGAAGTAAGGAGT 254 | |||
| csa-miR172c | CU3NC1224 | 2.5 | miRNA: 1 GAGAATCTTGATGATGCTGCA 21 |
| |||||||||||| ||| ||| | |||
| lincRNA: 246 CTCTTAGAACTAATACTTCGT 226 | |||
| csa-miR319 | CU5NC2102 | 1.5 | miRNA: 1 TTGGACTGAAGGGAGCTCCCT 21 |
| |||||||||||:||||| || | |||
| lincRNA: 227 CACCTGACTTCCTTCGAGAGA 207 | |||
| csa-miR396a | CU1NC272 | 1 | miRNA: 1 CCACAGCTTTCTTGAACTGCA 21 |
| |||||||||||||||||| | | |||
| lincRNA: 181 GGTGTCGAAAGAACTTGAATT 161 | |||
| csa-miR396b | CU1NC272 | 0.8 | miRNA: 1 GTTCAAGAAAGCTGTGGGAGA 21 |
| |:|||||||||||||||||:| | |||
| lincRNA: 102 CGAGTTCTTTCGACACCCTTT 82 | |||
| csa-miR396c | CU1NC272 | 1.8 | miRNA: 1 GTTCAATAAAGCTGTGGGAAG 21 |
| |:|||| |||||||||||||: | |||
| lincRNA: 102 CGAGTTCTTTCGACACCCTTT 82 | |||
| csa-miR396d | CU1NC272 | 0 | miRNA: 1 TTCCACAGCTTTCTTGAACTT 21 |
| ||||||||||||||||||||| | |||
| lincRNA: 183 AAGGTGTCGAAAGAACTTGAA 163 | |||
| csa-miR399a | CU5NC2035 | 0 | miRNA: 1 AGGGCTTCTCTCCATTGGCAGG 22 |
| |||||||||||||||||||||| | |||
| lincRNA: 818 TCCCGAAGAGAGGTAACCGTCC 797 | |||
| csa-miR399b | CU6NC2935 | 2 | miRNA: 1 TGCCAAAAGAGACTTGCCC 19 |
| |||||| |||||||:|| | | |||
| lincRNA: 148 ACGGTTATCTCTGAGCGTG 130 | |||
| csa-miR399b | CU5NC2035 | 2 | miRNA: 1 TGCCAAAAGAGACTTGCCC 19 |
| ||||||| |||| |||||| | |||
| lincRNA: 771 ACGGTTTCCTCTCAACGGG 753 | |||
| csa-miR399c | CU5NC2035 | 0 | miRNA: 1 TGCCAAAGGAGAGTTGCCCTT 21 |
| ||||||||||||||||||||| | |||
| lincRNA: 771 ACGGTTTCCTCTCAACGGGAA 751 | |||
| csa-miR399d | CU5NC2035 | 2 | miRNA: 1 TGCCAAAGGAGATTTGCCCGG 21 |
| |||||||||||| |||||| | |||
| lincRNA: 771 ACGGTTTCCTCTCAACGGGAA 751 | |||
| csa-miRn2–3p | CU5NC2296 | 2.5 | miRNA: 1 ATCTAACGATGTAGGAGCAAT 21 |
| || |||||||:|||:||||| | |||
| lincRNA: 210 TACATTGCTATATCTTCGTTT 190 |
a indicates the total score of the alignment.
b indicates the alignment between the miRNA and target (lincRNA).
Fig 4LincRNAs targeted by miRNAs.
Secondary structure of one lincRNA (CU1NC272) and the base-pairing relationship between the lincRNA (CU1NC272) and Csa-miRNA396b. The predicted secondary structure was generated using RNAfold (minimum free energy: −63.50).
Fig 5LincRNA-mRNA co-expression network.
Nodes with red circles represent lincRNAs, and nodes with blue circles represent mRNAs. The edges represent connected nodes that exhibit a high correlation. Several large modules highlighted in yellow are also shown and annotated according to BP (Biological Processes).
Fig 6Functions of lincRNAs.
(A) A Venn Diagram showing the number of lincRNAs predicted via the hub-based and module-based methods. (B) The main biological processes (BP) of overlapping lincRNAs predicted by two methods. The X-axis indicates the number of enriched mRNAs.
Fig 7Distribution of SNPs and Tajima’s D values.
(A) The distribution of SNPs for all lincRNAs. (B) The distribution of SNPs per 100 nt for all lincRNAs. (C) The distribution of Tajima’s D values for all lincRNAs from 102 cultivated cucumber accessions. (D) The distribution of Tajima’s D values for all intergenic regions from 102 cultivated cucumber accessions. The horizontal line indicates the 95% confidence interval.
Cucumber lincRNAs under positive selection and balancing selection.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| CU1NC10 | Chr1 | 239 | 9 | 0.001105 | −2.2659441 | No | NA |
| CU1NC88 | Chr1 | 278 | 4 | 0.0062904 | 2.5505173 | Yes | Ovary1 |
| CU1NC163 | Chr1 | 291 | 2 | 0.0034695 | 2.4648383 | Yes | Root |
| CU1NC200 | Chr1 | 501 | 7 | 0.0050127 | 2.0830965 | Yes | Root |
| CU1NC212 | Chr1 | 636 | 4 | 0.0026174 | 2.3492285 | Yes | Root |
| CU1NC213 | Chr1 | 544 | 4 | 0.0030601 | 2.3492285 | Yes | Root |
| CU1NC281 | Chr1 | 203 | 3 | 0.0070937 | 2.7242963 | No | Ovary3 |
| CU1NC336 | Chr1 | 513 | 3 | 0.0029223 | 2.8775269 | No | Leaf |
| CU1NC346 | Chr1 | 253 | 2 | 0.0039875 | 2.503925 | Yes | Root |
| CU1NC413 | Chr1 | 294 | 2 | 0.0032004 | 2.2522393 | Yes | Root |
| CU1NC476 | Chr1 | 256 | 3 | 0.0057236 | 2.7862235 | No | Ovary3 |
| CU1NC531 | Chr1 | 569 | 7 | 0.0003921 | −1.9389757 | Yes | Ovary3 |
| CU1NC532 | Chr1 | 612 | 5 | 0.0001901 | −2.0106222 | Yes | Ovary3 |
| CU2NC602 | Chr2 | 419 | 3 | 0.0005007 | −1.9692466 | Yes | Ovary3 |
| CU2NC712 | Chr2 | 313 | 10 | 0.0039749 | −2.0240647 | No | Ovary3 |
| CU2NC844 | Chr2 | 492 | 4 | 0.000271 | −1.8212527 | Yes | Tendril |
| CU2NC897 | Chr2 | 303 | 4 | 0.0012324 | −1.8243111 | Yes | Ovary3 |
| CU2NC903 | Chr2 | 365 | 3 | 0.0040301 | 2.8203487 | No | Root |
| CU2NC943 | Chr2 | 1005 | 7 | 0.000392 | −1.9146939 | Yes | Ovary3 |
| CU3NC998 | Chr3 | 206 | 4 | 0.007585 | 2.0897819 | Yes | Ovary2 |
| CU3NC1042 | Chr3 | 261 | 2 | 0.0037901 | 2.4445273 | Yes | Root |
| CU3NC1046 | Chr3 | 211 | 2 | 0.0047828 | 2.5080474 | Yes | Leaf |
| CU3NC1070 | Chr3 | 224 | 2 | 0.0042939 | 2.3222843 | Yes | Ovary3 |
| CU3NC1211 | Chr3 | 342 | 5 | 0.0004525 | −2.1717458 | No | Root |
| CU3NC1228 | Chr3 | 205 | 4 | 0.0020915 | −1.8572054 | Yes | Ovary3 |
| CU3NC1230 | Chr3 | 232 | 5 | 0.0004226 | −1.9060085 | Yes | Root |
| CU3NC1394 | Chr3 | 325 | 4 | 0.00515 | 2.3797784 | Yes | Ovary3 |
| CU3NC1397 | Chr3 | 271 | 3 | 0.0050691 | 2.5022962 | Yes | Ovary3 |
| CU3NC1466 | Chr3 | 359 | 2 | 0.0025722 | 2.1949941 | Yes | Male_flower |
| CU3NC1523 | Chr3 | 271 | 6 | 0.0115769 | 2.5886134 | Yes | Root |
| CU4NC1592 | Chr4 | 519 | 3 | 0.0028003 | 2.7559386 | No | Root |
| CU4NC1593 | Chr4 | 270 | 5 | 0.0072616 | 2.2587333 | Yes | Root |
| CU4NC1605 | Chr4 | 367 | 4 | 0.0045567 | 2.3688663 | Yes | Ovary3 |
| CU4NC1606 | Chr4 | 568 | 3 | 0.0021774 | 2.0839754 | Yes | Male_flower |
| CU4NC1644 | Chr4 | 350 | 3 | 0.0037895 | 2.3348271 | Yes | Ovary3 |
| CU4NC1659 | Chr4 | 488 | 2 | 0.0019324 | 2.2553741 | Yes | Female_flower |
| CU4NC1667 | Chr4 | 246 | 2 | 0.0037646 | 2.1717333 | Yes | Ovary1 |
| CU4NC1767 | Chr4 | 260 | 3 | 0.0058175 | 2.9329276 | No | Ovary2 |
| CU4NC1781 | Chr4 | 283 | 3 | 0.0044405 | 2.135983 | Yes | Ovary1 |
| CU4NC1796 | Chr4 | 196 | 5 | 0.0030814 | −1.8400142 | Yes | Ovary3 |
| CU4NC1800 | Chr4 | 292 | 2 | 0.0033105 | 2.3304458 | Yes | Ovary3 |
| CU4NC1802 | Chr4 | 417 | 2 | 0.0023499 | 2.3860244 | Yes | Male_flower |
| CU4NC1868 | Chr4 | 304 | 3 | 0.0041277 | 2.1525254 | Yes | Root |
| CU4NC1885 | Chr4 | 516 | 3 | 0.0002247 | −2.0113611 | Yes | Ovary2 |
| CU5NC1977 | Chr5 | 302 | 2 | 0.0033386 | 2.5229679 | Yes | Ovary3 |
| CU5NC2008 | Chr5 | 467 | 2 | 0.0019797 | 2.1919823 | Yes | Female_flower |
| CU5NC2111 | Chr5 | 277 | 3 | 0.0008379 | −1.9128665 | Yes | Ovary3 |
| CU5NC2118 | Chr5 | 385 | 2 | 0.0023619 | 2.1313885 | Yes | Stem |
| CU5NC2119 | Chr5 | 605 | 2 | 0.001503 | 2.1313885 | Yes | Stem |
| CU5NC2199 | Chr5 | 409 | 5 | 0.0002844 | −2.0106222 | Yes | Tendril_base |
| CU5NC2215 | Chr5 | 280 | 6 | 0.0006261 | −1.8997299 | Yes | Male_flower |
| CU5NC2264 | Chr5 | 361 | 2 | 0.002709 | 2.389199 | Yes | Ovary2 |
| CU5NC2297 | Chr5 | 750 | 3 | 0.0001277 | −1.9040852 | Yes | Ovary2 |
| CU5NC2308 | Chr5 | 306 | 5 | 0.0004429 | −2.0981125 | No | Ovary1 |
| CU5NC2322 | Chr5 | 289 | 5 | 0.0004683 | −1.9736752 | Yes | Ovary1 |
| CU5NC2333 | Chr5 | 1185 | 12 | 0.0003154 | −2.2046959 | No | Ovary3 |
| CU5NC2398 | Chr5 | 462 | 5 | 0.0006118 | −1.839069 | Yes | Ovary2 |
| CU5NC2405 | Chr5 | 346 | 2 | 0.002886 | 2.4711995 | Yes | Tendril |
| CU5NC2413 | Chr5 | 302 | 5 | 0.0003246 | −1.9060085 | Yes | Leaf |
| CU5NC2414 | Chr5 | 258 | 5 | 0.0004501 | −1.8626232 | Yes | Tendril |
| CU6NC2488 | Chr6 | 233 | 3 | 0.0013168 | −1.7863936 | Yes | Female_flower |
| CU6NC2519 | Chr6 | 435 | 5 | 0.0058552 | 2.1541265 | Yes | Female_flower |
| CU6NC2591 | Chr6 | 368 | 8 | 0.0006311 | −2.3781938 | No | Ovary1 |
| CU6NC2592 | Chr6 | 388 | 7 | 0.0003503 | −2.0974517 | No | Ovary1 |
| CU6NC2614 | Chr6 | 376 | 3 | 0.0039007 | 2.7872349 | No | Root |
| CU6NC2664 | Chr6 | 1414 | 5 | 0.0003119 | −1.8432369 | Yes | Ovary3 |
| CU6NC2673 | Chr6 | 258 | 3 | 0.000859 | −1.9415208 | Yes | Ovary1 |
| CU6NC2686 | Chr6 | 205 | 4 | 0.0080763 | 2.3471704 | Yes | Female_flower |
| CU6NC2748 | Chr6 | 260 | 2 | 0.0035033 | 2.1110979 | Yes | Female_flower |
| CU6NC2798 | Chr6 | 517 | 2 | 0.0002231 | −2.0124695 | Yes | Ovary3 |
| CU6NC2830 | Chr6 | 1978 | 12 | 0.0001921 | −2.273675 | No | Ovary3 |
| CU6NC2833 | Chr6 | 754 | 7 | 0.0036891 | 2.5633424 | Yes | Ovary3 |
| CU6NC2881 | Chr6 | 231 | 2 | 0.0043548 | 2.5263375 | Yes | Male_flower |
| CU7NC3029 | Chr7 | 370 | 2 | 0.0026393 | 2.3991555 | Yes | Root |
| CU7NC3079 | Chr7 | 453 | 6 | 0.0003417 | −2.1717458 | No | Root |
| CU7NC3086 | Chr7 | 202 | 2 | 0.0009368 | −1.8691152 | Yes | Female_flower |
| CU7NC3093 | Chr7 | 339 | 6 | 0.0018118 | −1.8311426 | Yes | Root |
| CU7NC3094 | Chr7 | 330 | 5 | 0.0014267 | −1.9062613 | Yes | Ovary2 |
| CU7NC3201 | Chr7 | 345 | 3 | 0.0036221 | 2.130209 | Yes | Tendril |
| CU7NC3207 | Chr7 | 465 | 3 | 0.0002485 | −1.8638834 | Yes | Male_flower |
| CU7NC3276 | Chr7 | 389 | 4 | 0.000346 | −1.9751531 | Yes | Ovary3 |
* indicates a significance level of p<0.05
** indicates a significance level of p<0.01.
a Chr: Chromosome where the lincRNA is located.
b S: Number of polymorphic (segregating) sites.
c Pi: Nucleotide diversity.
d Tajima’s D calculated for lincRNAs from 102 cucumber accessions.
e The threshold 95% confidence interval calculated for all cucumber intergenic regions from 102 cucumber accessions. "Yes" indicates that the Tajima’s D value for each lincRNA is within the 95% confidence interval, while "No" indicates that the Tajima’s D value for each lincRNA is outside of the 95% confidence interval.
f Peak expression tissue: tissue with the highest expression level of lincRNAs. Ovary1: unexpanded ovary. Ovary2: expanded ovary (fertilized). Ovary3: expanded ovary (unfertilized).