| Literature DB >> 30285615 |
You-Jie Zhao1,2, Yong Cao2, Juan Wang3, Zhi Xiong4.
Abstract
BACKGROUND: Pines are widely distributed in the Northern Hemisphere and have a long evolutionary history. The availability of transcriptome data has facilitated comparative transcriptomics for studying the evolutionary patterns associated with the different geographical distributions of species in the Pinus phylogeny.Entities:
Keywords: Comparative transcriptomics; Pinus kesiya var. langbianensis; Pinus phylogeny; Transcriptome sequencing
Mesh:
Year: 2018 PMID: 30285615 PMCID: PMC6171231 DOI: 10.1186/s12864-018-5127-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Transcriptome sequencing in 13 conifer species
| Main distribution | Data source | Number of Unigenes | Total length(bp) | Mean length(bp) | |
|---|---|---|---|---|---|
|
| Southwestern China southeastern Asian | Illumina Sequencing | 68,881 | 58,975,225 | 856 |
|
| Northern China northern Korea | NCBI SRA(SRR546476) | 10,285 | 8,669,639 | 843 |
|
| Middle and high latitudes of Eurasia | PlantGDB | 73,609 | 29,552,311 | 401 |
|
| Mediterranean region southern Europe | NCBI SRA(SRR445497, SRR445498) | 11,403 | 10,122,178 | 888 |
|
| Mediterranean region | NCBI SRA(SRR942848) | 72,028 | 47,224,748 | 656 |
|
| Mediterranean region | PlantGDB | 15,648 | 11,498,176 | 735 |
|
| Western North America | PlantGDB | 13,570 | 15,334,600 | 1130 |
|
| North American Canada | PlantGDB | 13,040 | 14,756,117 | 1132 |
|
| Southeastern United States | Treegenesdb (v2.01) | 91,046 | 75,408,866 | 828 |
|
| Southeastern United States | NCBI SRA(SRR065012) | 15,013 | 14,111,464 | 940 |
|
| Mountains of western United States and Canada | NCBI SRA(SRR1013836) | 86,230 | 33,420,814 | 388 |
|
| Mountains of western North America | NCBI SRA(SRR064207) | 30,981 | 33,041,789 | 1067 |
|
| Northern United States southern/central Canada | PlantGDB | 48,619 | 54,962,881 | 1130 |
Transcriptome sequencing and assembly of P. kesiya var. langbianensis transcriptome
| Sequencing results | Assembly results | ||
|---|---|---|---|
| Total number of raw reads | 141,234,042 | Total number of unigenes | 68,881 |
| Total number of clean reads | 133,236,316 | Total unigenes length (bp) | 56,537,842 |
| Total clean Nucleotides (bp) | 11,991,268,440 | Contig N50 (bp) | 1402 |
| Q20 (%) | 98.25% | Mean length (bp) | 821 |
| N content(%) | 0.01% | Max length (bp) | 15,498 |
| GC content (%) | 45.02% | Min length (bp) | 200 |
Unigene annotation of P. kesiya var. langbianensis transcriptome
| Unigenes | NR | NT | Swiss-Prot | KEGG | COG | GO | ALL |
|---|---|---|---|---|---|---|---|
| 68,881 | 37,757 | 42,869 | 24,093 | 21,433 | 13,528 | 23,491 | 48,035 |
Fig. 1Summary and taxonomic source of the BLASTX matches to unigenes. Number of unique best BLASTX matches of unigenes grouped by genus
Fig. 2Distribution of SSRs in the P. kesiya var. langbianensis transcriptome
Fig. 3Distribution of SNPs in the P. kesiya var. langbianensis transcriptome
Number and Ks peaks of orthologous genes in 13 conifer species
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||||
|
| 6351/0.01 | ||||||||||||
|
| 4331/0.01 | 898/0.01 | |||||||||||
|
| 8112/0.03 | 4144/0.03 | 1153/0.02 | ||||||||||
|
| 16,051/0.03 | 5498/0.03 | 727/0.02 | 5839/0.03 | |||||||||
|
| 7561/0.02 | 3311/0.03 | 944/0.02 | 3844/0.03 | 6419/0.03 | ||||||||
|
| 8922/0.03 | 3953/0.03 | 1129/0.03 | 4392/0.04 | 6578/0.04 | 4011/0.03 | |||||||
|
| 7784/0.03 | 3577/0.03 | 1374/0.03 | 3815/0.04 | 5642/0.04 | 3610/0.04 | 4996/0.01 | ||||||
|
| 13,493/0.04 | 4981/0.03 | 2624/0.03 | 6114/0.04 | 14,088/0.04 | 6496/0.04 | 6836/0.01 | 5744/0.01 | |||||
|
| 9311/0.03 | 3514/0.03 | 1092/0.03 | 4097/0.04 | 8425/0.04 | 4035/0.04 | 4321/0.01 | 3994/0.01 | 7551/0.01 | ||||
|
| 7472/0.08 | 3538/0.08 | 1433/0.06 | 4481/0.08 | 5940/0.08 | 3790/0.09 | 4773/0.07 | 4769/0.06 | 4847/0.09 | 5087/0.08 | |||
|
| 11,185/0.08 | 4477/0.08 | 1964/0.06 | 5233/0.08 | 8269/0.09 | 4783/0.08 | 5308/0.08 | 4722/0.08 | 8349/0.08 | 6428/0.08 | 7169/0.01 | ||
|
| 13,691/0.14 | 4808/0.15 | 959/0.14 | 6493/0.15 | 12,041/0.15 | 5082/0.15 | 7999/0.14 | 7092/0.14 | 9724/0.15 | 6019/0.14 | 7710/0.15 | 9969/0.15 |
Fig. 4Functional annotation and divergence between the orthologues of 12 pine and one spruce species. The heat map is based on the 130 putative orthologous transcripts of the 13 species. The orthologues were annotated to different functions based on GO terms (Additional file 3: File S1). Colors indicate similarity from yellow (highly similar) to red (weakly similar)
Fig. 5Distribution of Ks values of the orthologous pairs between P. kesiya var. langbianensis and others
Fig. 6Phylogenetic tree of 12 Pinus and one Picea species. The phylogram was derived using the pairwise synonymous substitution rates of the orthologous transcripts as a distance metric and the neighbor-joining method
Number and contained function of positive selection genes in Genus Pinus
|
|
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||||
|
| 2750/AS | |||||||||||
|
| 1700/D | 422 | ||||||||||
|
| 949 | 505 | 267 | |||||||||
|
| 2173 | 720/A | 154 | 697 | ||||||||
|
| 1267 | 561 | 227 | 573 | 1032 | |||||||
|
| 1169 | 519 | 299 | 411 | 720 | 511 | ||||||
|
| 960/A | 453 | 338 | 361/A | 603/A | 444 | 1685/A | |||||
|
| 1823/S | 648 | 697 | 512 | 1798 | 887 | 1685/S | 1386/AS | ||||
|
| 1101 | 444/A | 268 | 323 | 754 | 497 | 1335 | 1110/A | 2839 | |||
|
| 271 | 189 | 135 | 177 | 209 | 195 | 235 | 266/A | 209 | 215 | ||
|
| 398 | 154/A | 200 | 122 | 221 | 205 | 156 | 166 | 296 | 204 | 2688/D |
A Abscisic acid stress, D Drought stress, S Salt stress
Information of resistance genes involved in positive selection in Genus Pinus
| Funtion | Nr Annotation | Specie1 | Gene1 | Specie2 | Gene2 | Ka/Ks | Ka | Ks |
|---|---|---|---|---|---|---|---|---|
| Salt stress | NP_001105589.1 |
| isotig03867 |
| CL1082.Contig1_All | > 1 | 0.0013 | 0 |
| Salt stress | NP_001105589.1 |
| PITA_000000350-RA |
| PUT-177a-Pinus_banksiana-8213 | 4.3893 | 0.0873 | 0.0199 |
| Salt stress | NP_001105589.1 |
| PITA_000000350-RA |
| CL1082.Contig1_All | 4.9134 | 0.0994 | 0.0202 |
| Salt stress | NP_001105589.1 |
| PITA_000000350-RA |
| PUT-175a-Pinus_contorta-5514 | 4.3893 | 0.0873 | 0.0199 |
| Drought stress | AAD37375.1 |
| PUT-187a-Pinus_sylvestris-52,858 |
| Unigene23170_All | 1.0047 | 0.0083 | 0.0083 |
| Drought stress | AAD37375.1 |
| isotig24007 |
| comp65821_c3 | > 1 | 0.0046 | 0 |
| ABA stress | NP_001105719.1 |
| isotig03955 |
| PUT-177a-Pinus_banksiana-10,254 | 1.2072 | 0.0054 | 0.0045 |
| ABA stress | NP_001105719.1 |
| Unigene11406_All |
| PUT-177a-Pinus_banksiana-10,253 | 1.166 | 0.0262 | 0.0224 |
| ABA stress | NP_001105719.1 |
| PUT-175a-Pinus_contorta-11,400 |
| PUT-177a-Pinus_banksiana-10,253 | 1.2 | 0.0162 | 0.0135 |
| ABA stress | NP_001105719.1 |
| PITA_000045763-RA |
| PUT-177a-Pinus_banksiana-10,254 | > 1 | 0.004 | 0 |
| ABA stress | NP_001105719.1 |
| isotig05514 |
| PUT-177a-Pinus_banksiana-10,253 | 1.0145 | 0.05 | 0.0493 |
| ABA stress | NP_001105719.1 |
| comp18337_c1 |
| PUT-177a-Pinus_banksiana-10,253 | 1.2039 | 0.0601 | 0.0499 |
ABA is Abscisic acid. Sequences showed in Additional file 4: File S2