| Literature DB >> 25887584 |
Witold Wachowiak1,2, Urmi Trivedi3, Annika Perry4, Stephen Cavers5.
Abstract
BACKGROUND: Pinus sylvestris, P. mugo, P. uliginosa and P. uncinata are closely related but phenotypically and ecologically very distinct European pine species providing an excellent study system for analysis of the genetic basis of adaptive variation and speciation. For comparative genomic analysis of the species, transcriptome sequence was generated for 17 samples collected across the European distribution range using Illumina paired-end sequencing technology.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25887584 PMCID: PMC4458023 DOI: 10.1186/s12864-015-1401-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Locations of the populations of the four pine species sampled for the study. Populations labelled PS - P. sylvestris, M - P. mugo, UN - P. uncinata, UG - P. uliginosa.
Plant material used for transcriptome sequencing
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| ||||||
| PS1 | 1_SD_30 | Scotland, Shieldaig | 57°30′35″ | −5°38′24″ | 81 | |
| PS2 | 2_GT_31 | Scotland, Glen Tanar | 57°2′60″ | −2°51′36″ | 334 | |
| PS3 | 3_Punk_39 | Finland, Punkaharju | 61°45′33″ | 29°23′21″ | 80 | |
| PS4 | 4_Jar_43 | Poland, Jarocin | 51°58′20″ | 17°28′40″ | 120 | |
| PS5 | 5_Trev_37 | Spain, Trevenque | 37°05′47″ | 3°32′51″ | 1170 | |
|
| ||||||
| M1 | 6_SC_5 | Romania, Southern Carpathians, Busteni | 45°25′55″ | 25°27′06″ | 2070 | |
| M2 | 7_BH_9 | Bosnia and Herzegovina, Bjelasnica Mts | 43°45′00″ | 18°13′08″ | 2120 | |
| M3 | 8_Abr_16 | Italy, Abruzzi, La Maiella | 41°46′20″ | 13°58′30″ | 2200 | |
| M4 | 9_Alps_12 | Austria, Karwendel Mts., Scharnitz | 47°22′42″ | 11°17′45″ | 1400 | |
| M5 | 10_Sdt_1 | Poland, Sudety Mts, Śląskie Kamienie | 50°46′35″ | 15°36′08″ | 1400 | |
|
| ||||||
| UN1 | 11_CC_28 | France, Col de la Croix de Morand | 45°35′58″ | 2°50′44″ | 1200 | |
| UN2 | 12_LaT_23 | Spain, Pyrenees, La Trapa | 0°32′12″ | 42°41′19″ | 1720 | |
| UN3 | 13_VdR_17 | Andorra, Eastern Pyrenees, Vall de Ransol | 42°35′02″ | 1°38′21″ | 2025 | |
| UN4 | 14_Val_24 | Spain, Sierra de Gudar | 40°28′49″ | -0°41′51″ | 2000 | |
|
| ||||||
| UG1 | 15_Weg_57 | Poland, Low Silesian Pinewood, Węgliniec | 51°17′50″ | 15°14′20″ | 190 | |
| UG2 | 16_Bat_59 | Poland, Wielkie Torfowisko Batorowskie reserve | 50°27′32″ | 16°23′01″ | 750 | |
| UG3 | 17_Mit_58 | Germany, Mittenwald | 47°28′50″ | 11°16′27″ | 856 |
Statistics for de novo transcriptome assembly of the reference sample (2_GT_31)
|
|
|
|
|---|---|---|
| Max contig length | 16652 | 16652 |
| Num contigs >100 | 151932 | 40798 |
| Total bases in contigs >100 | 119849194 | 61246267 |
| N50 for contigs >100 | 1555 | 2118 |
| Contigs >100 in N50 | 22593 | 9640 |
| GC contigs >100 | 41.8 | 42.5 |
| nonATGC in contigs >100 | 0 | 0 |
| Mean length for contigs >100 | 788.8 | 1501.2 |
N50 - the contig length for which the collection of all contigs of equal or longer length produces half the bases of the contigs.
Non ATGC - non ATGC bases (such as Ns).
Figure 2Gene ontology classification of the unigenes.
Mapping statistics of the samples to the reference transcriptome sequence (2_GT_31)
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 1_SD_30 |
| 31116472 | 26922787 | 86.52 | 26.36 | 84.14 | 67817 |
| 2_GT_31 |
| 258401512 | 229042493 | 88.64 | 43.22 | 85.34 | 81519 |
| 3_Punk_39 |
| 37849782 | 31980676 | 84.49 | 34.37 | 80.21 | 63874 |
| 4_Jar_43 |
| 38970706 | 31951845 | 81.99 | 19.34 | 79.47 | 94021 |
| 5_Trev_37 |
| 45140044 | 38150182 | 84.52 | 19.44 | 81.73 | 95814 |
| 6_SC_5 |
| 43752078 | 35804512 | 81.83 | 24.08 | 78.50 | 116762 |
| 7_BH_9 |
| 32600000 | 26565484 | 81.49 | 26.23 | 78.40 | 103818 |
| 8_Abr_16 |
| 40104880 | 33111153 | 82.56 | 29.18 | 78.19 | 100602 |
| 9_Alps_12 |
| 52934684 | 43411825 | 82.01 | 23.36 | 78.67 | 130942 |
| 10_Sdt_1 |
| 69248828 | 57560565 | 83.12 | 28.02 | 80.03 | 138989 |
| 11_CC_28 |
| 34805254 | 28627271 | 82.25 | 16.74 | 79.34 | 102037 |
| 12_LaT_23 |
| 30291214 | 24678075 | 81.47 | 36.80 | 78.06 | 87181 |
| 13_VdR_17 |
| 45834740 | 36942040 | 80.60 | 18.27 | 77.85 | 118800 |
| 14_Val_24 |
| 48550644 | 39819426 | 82.02 | 21.98 | 79.12 | 115050 |
| 15_Weg_57 |
| 52718596 | 42948249 | 81.47 | 18.33 | 78.55 | 127068 |
| 16_Bat_59 |
| 40729720 | 33913094 | 83.26 | 21.04 | 80.38 | 116297 |
| 17_Mit_58 |
| 69457322 | 57212028 | 82.37 | 22.07 | 79.37 | 147646 |
|
|
|
|
|
|
|
|
|
Sample ID with reference to Table 1.
Common and unique SNPs in pair-wise comparisons between species
|
| ||||
|---|---|---|---|---|
|
| ||||
|
|
|
|
| |
|
|
| |||
|
| 65646 |
| ||
|
| 69345 | 149679 |
| |
|
| 65390 | 162769 | 144422 |
|
|
| ||||
|
| 229312 | 189735 | 190084 | |
|
| 53728 | 109363 | 92675 | |
|
| 50042 | 145279 | 111026 | |
|
| 53997 | 132189 | 114644 | |
Total number of SNPs within each species is marked in bold.
Figure 3Shared and unique SNPs in pairwise comparisons between (S), (M), (UN) and (UG). SNPs total: P. sylvestris (225544), P. mugo (262582), P. uncinata (220365), P. uliginosa (232822).
Nucleotide variation at 676 merged nuclear ( DNA) contigs in the pine species
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| 5 | 1364676 | 12920 | 8674 | 0.0044 | −0.243 |
|
| 5 | 1364676 | 13129 | 8710 | 0.0045 | −0.221 |
|
| 4 | 1364676 | 13420 | 10374 | 0.0053 | −0.169 |
|
| 3 | 1364676 | 9581 | 9581 | 0.0047 | - |
|
|
|
|
|
|
|
|
N- number of samples analysed; L – length of the sequences in base pairs; SNPs- number of polymorphic sites detected; Sing – number of singleton mutations; πtot – total nucleotide diversity (Nei [31]); D – multilocus Tajima’s D statistics [32].
Figure 4Principal Coordinates Analysis (PCoA) based on pairwise nucleotide difference matrix at 676 contigs (>1.3Mbp, 27929 SNPs) showing genetic relationships between (●), (■), (♦) and (▲) samples.
Pairwise between species at 27929 SNPs identified at 676 merged nuclear ( DNA) contigs
|
|
|
| |
|---|---|---|---|
|
| 0.257** | ||
|
| 0.142* | 0.121* | |
|
| 0.212* | 0.030 | 0.075 |
Significance level: * p < 0.05, **p < 0.01.