| Literature DB >> 28938721 |
Wenlu Yang1, Kun Wang1, Jian Zhang2, Jianchao Ma2, Jianquan Liu1,2, Tao Ma1.
Abstract
Populus pruinosa is a large tree that grows in deserts and shows distinct differences in both morphology and adaptation compared to its sister species, P. euphratica. Here we present a draft genome sequence for P. pruinosa and examine genomic variations between the 2 species. A total of 60 Gb of clean reads from whole-genome sequencing of a P. pruinosa individual were generated using the Illumina HiSeq2000 platform. The assembled genome is 479.3 Mb in length, with an N50 contig size of 14.0 kb and a scaffold size of 698.5 kb; 45.47% of the genome is composed of repetitive elements. We predicted 35 131 protein-coding genes, of which 88.06% were functionally annotated. Gene family clustering revealed 224 unique and 640 expanded gene families in the P. pruinosa genome. Further evolutionary analysis identified numerous genes with elevated values for pairwise genetic differentiation between P. pruinosa and P. euphratica. We provide the genome sequence and gene annotation for P. pruinosa. A large number of genetic variations were recovered by comparison of the genomes between P. pruinosa and P. euphratica. These variations will provide a valuable resource for studying the genetic bases for the phenotypic and adaptive divergence of the 2 sister species.Entities:
Keywords: Illumina sequencing; Populus pruinosa; annotation; genome assembly
Mesh:
Year: 2017 PMID: 28938721 PMCID: PMC5603765 DOI: 10.1093/gigascience/gix075
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Summary of genome assembly and annotation of P. pruinosa
| Genome assembly | |
|---|---|
| Estimate of genome size | 590 Mb |
| GC content | 31.80% |
| Contigs | |
| N50 size | 14 011 bp |
| Longest | 197 623 bp |
| Total number | 170 219 |
| Total size | 450 157 195 bp |
| Scaffolds | |
| N50 size | 698 525 bp |
| Longest | 10 688 665 bp |
| Total number | 78 960 |
| Total length | 479 307 600 bp |
| Genome annotation | |
| Transposable elements | |
| LTR | 142 923 156 bp (29.82%) |
| LINE | 4 956 260 bp (1.03%) |
| DNA | 20 990 612 bp (4.38%) |
| Total | 213 236 753 bp (45.47%) |
| Protein coding genes | |
| Total number | 35 131 |
| Mean transcript length | 3703.4 bp |
| Mean coding sequence length | 1224.38 bp |
| Mean exon length | 226.27 bp |
| Mean intron length | 561.98 bp |
| Functional annotation | |
| GO | 22 361 (63.64%) |
| KEGG | 11 746 (33.43%) |
| Total | 30 938 (88.06%) |
Figure 1:Synteny relationship of P. pruinosa, P. euphratica, and P. trichocarpa.