| Literature DB >> 31580414 |
Deyou Qiu1, Shenglong Bai2, Jianchao Ma2, Lisha Zhang1, Fenjuan Shao1, Kaikai Zhang1,3, Yanfang Yang1, Ting Sun2, Jinling Huang2, Yun Zhou2, David W Galbraith2,4, Zhaoshan Wang1, Guiling Sun2.
Abstract
Poplar 84K (Populus alba x P. tremula var. glandulosa) is a fast-growing poplar hybrid. Originated in South Korea, this hybrid has been extensively cultivated in northern China. Due to the economic and ecological importance of this hybrid and high transformability, we now report the de novo sequencing and assembly of a male individual of poplar 84K using PacBio and Hi-C technologies. The final reference nuclear genome (747.5 Mb) has a contig N50 size of 1.99 Mb and a scaffold N50 size of 19.6 Mb. Complete chloroplast and mitochondrial genomes were also assembled from the sequencing data. Based on similarities to the genomes of P. alba var. pyramidalis and P. tremula, we were able to identify two subgenomes, representing 356 Mb from P. alba (subgenome A) and 354 Mb from P. tremula var. glandulosa (subgenome G). The phased assembly allowed us to detect the transcriptional bias between the two subgenomes, and we found that the subgenome from P. tremula displayed dominant expression in both 84K and another widely used hybrid, P. tremula x P. alba. This high-quality poplar 84K genome will be a valuable resource for poplar breeding and for molecular biology studies.Entities:
Keywords: zzm321990 P. albazzm321990 ; zzm321990 P. tremulazzm321990 ; genome sequencing; poplar 84K; subgenome assignment
Mesh:
Substances:
Year: 2019 PMID: 31580414 PMCID: PMC6796506 DOI: 10.1093/dnares/dsz020
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Statistics of genome and subgenome assembly of poplar 84K using different sequencing data
| Length (bp) /number | PacBio assembly | Hi-C assembly | Pseudomolecules | Subgenome A | Subgenome G |
|---|---|---|---|---|---|
| N90 | 429,473 | 12,722,015 | 13,367,995 | 13,017,457 | 13,367,995 |
| N80 | 959,980 | 13,867,023 | 14,133,090 | 13,907,236 | 14,133,090 |
| N70 | 1,381,280 | 14,269,014 | 16,373,422 | 16,345,784 | 16,373,422 |
| N60 | 1,818,021 | 17,170,365 | 17,428,093 | 17,170,365 | 17,629,401 |
| N50 | 2,239,559 | 19,641,611 | 19,971,399 | 18,769,030 | 20,657,227 |
| N40 | 2,832,264 | 20,657,227 | 20,657,227 | 20,018,725 | 21,475,114 |
| N30 | 3,174,752 | 21,475,114 | 21,827,254 | 21,194,248 | 22,685,518 |
| N20 | 4,522,243 | 23,093,407 | 24,417,576 | 24,417,576 | 25,197,140 |
| N10 | 6,295,541 | 49,053,667 | 49,053,667 | 49,053,667 | 49,243,677 |
| Maximum length | 12,124,769 | 49,243,677 | 49,243,677 | 49,053,667 | 49,243,677 |
| Minimum length | 1,245 | 1,245 | 6,540,224 | 11,888,367 | 6,540,224 |
| Total length | 753,822,854a | 747,538,837b | 710,053,368 | 356,027,211 | 354,026,157 |
| Total sequences | 1,384 | 841 | 38 | 19 | 19 |
| Gap numbers | — | 505 | 505 | 227 | 278 |
a The original assembly obtained using CANU.
b Popar 84K nuclear genome assembly after removing organellar sequences and microorganism contamination.
Figure 1Assessment of the assembled 84K genome. (a) Interaction frequency distributions of Hi-C linkage groups. The log2 of the valid interaction link number of Hi-C data between any pair of 500 kb non-overlapping bins were calculated and is displayed as a heatmap by HiCPlotter. The black/white bar of the heatmap indicates the interaction frequency of the Hi-C links. The square indicates the abnormal high frequency of interaction between linkage group 35 (chromosome 11 of subgenome G) and linkage group 10 (chromosome 12 in subgenome A). (b) BUSCO analysis of the genomes from poplar 84K, P. trichocarpa, P. alba var. pyramidalis, and P. tremula. (C) complete; (S) single-copy; (D) duplicated; (F) Fragmented; (M) Missing.
Figure 2Characteristics of the poplar 84K genome. From the outer edge inward: (a) circles represent the subgenomes A (right) and G (left); (b) gene density on each chromosome; (c) repeat density at 10 kb intervals; (d) GC content at 10 kb intervals; (e) the sequencing depth of Illumina short reads at 10 kb intervals, and collinear blocks linked by grey lines.
Figure 3Allelic genes with transcriptional bias and their promoter region identity in poplar 84K and P. tremula x P. alba. (a) Identified allelic genes with transcriptional bias in poplar 84K. (b) Identified allelic genes with transcriptional bias in P. tremula x P. alba. (c) Boxplot of the identity in allelic gene promoter region. Significant differences of identity between allelic genes with transcriptional bias and no-bias were supported by the Kolmogorov–Smirnov test (P-value < 0.001) in poplar 84K and P. tremula x P. alba.
Genes with transcriptional bias within different tissues in two poplar hybrids
| Species | Tissues | DEG numbers | Dominant expression in subgenome A | Dominant expression in subgenome G |
|---|---|---|---|---|
|
| Shoot | 4124 | 1,856 (45.00%) | 2,268 (55.00%) |
|
| Rooting | 4109 | 1,826 (44.44%) | 2,283 (55.56%) |
|
| Callus | 3965 | 1,802 (45.45%) | 2,163 (54.55%) |
|
| Leaf | 4253 | 1,936 (45.52%) | 2,317 (54.48%) |
|
| Leaf | 2792 | 1,065 (38.14%) | 1,727 (61.86%) |
|
| Bark | 3197 | 1,314 (41.10%) | 1,883 (58.90%) |
|
| Xylem | 2808 | 1,150 (40.95%) | 1,658 (59.05%) |
|
| Root | 3121 | 1,288 (41.27%) | 1,833 (58.73%) |
P. alba x P. tremula var. glandulosa = 84K.