| Literature DB >> 29893845 |
Bagdevi Mishra1,2, Deepak K Gupta1,2, Markus Pfenninger1,3, Thomas Hickler1,4, Ewald Langer5, Bora Nam1,2, Juraj Paule6, Rahul Sharma1, Bartosz Ulaszewski7, Joanna Warmbier7, Jaroslaw Burczyk7, Marco Thines1,2.
Abstract
Background: The European beech is arguably the most important climax broad-leaved tree species in Central Europe, widely planted for its valuable wood. Here, we report the 542 Mb draft genome sequence of an up to 300-year-old individual (Bhaga) from an undisturbed stand in the Kellerwald-Edersee National Park in central Germany. Findings: Using a hybrid assembly approach, Illumina reads with short- and long-insert libraries, coupled with long Pacific Biosciences reads, we obtained an assembled genome size of 542 Mb, in line with flow cytometric genome size estimation. The largest scaffold was of 1.15 Mb, the N50 length was 145 kb, and the L50 count was 983. The assembly contained 0.12% of Ns. A Benchmarking with Universal Single-Copy Orthologs (BUSCO) analysis retrieved 94% complete BUSCO genes, well in the range of other high-quality draft genomes of trees. A total of 62,012 protein-coding genes were predicted, assisted by transcriptome sequencing. In addition, we are reporting an efficient method for extracting high-molecular-weight DNA from dormant buds, by which contamination by environmental bacteria and fungi was kept at a minimum. Conclusions: The assembled genome will be a valuable resource and reference for future population genomics studies on the evolution and past climate change adaptation of beech and will be helpful for identifying genes, e.g., involved in drought tolerance, in order to select and breed individuals to adapt forestry to climate change in Europe. A continuously updated genome browser and download page can be accessed from beechgenome.net, which will include future genome versions of the reference individual Bhaga, as new sequencing approaches develop.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29893845 PMCID: PMC6014182 DOI: 10.1093/gigascience/giy063
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:The sequenced individual Bhaga at the time of sampling. Note the very low branching on the cliff, with a major part of the individual reaching over the edge.
Figure 2:Parameter correlations in the Fagus sylvatica genome. (A) Gene density vs the GC content in each of the 50,994 nonoverlapping 10-kb windows. (B) Gene density vs. the proportion of heterozygous sites.
Figure 3:Parameter frequency distributions in 50-994 nonoverlapping 10-kb windows. (A) Gene density, measured as proportion of the window annotated as gene. (B) Proportion of GC bases. (C) Genetic diversity, measured as proportion of heterozygous sites.
Statistics of the completeness of de novo genome assembly of Fagus sylvatica assessed with CEGMA and BUSCO
| Genome | BUSCO complete (in %) | BUSCO duplicated (in %) | BUSCO fragmented (in %) | BUSCO missing (in %) | CEGMA complete (in %) | CEGMA partial (in %) | Reference |
|---|---|---|---|---|---|---|---|
|
| 94 | 19 | 1.7 | 3.6 | 82 | 94 | This study |
|
| 91 | 13 | 4.2 | 4.0 | 77 | 94 | [ |
|
| 92 | 10 | 2.7 | 4.8 | 81 | 96 | [ |
|
| 94 | 11 | 2.4 | 3.0 | 83 | 98 | [ |
|
| 87 | 19 | 5.2 | 7.6 | 90 | 96 | [ |
|
| 96 | 17 | 1.4 | 2.1 | 92 | 97 | [ |
|
| 94 | 5 | 1.8 | 4.7 | 93 | 100 | [ |