| Literature DB >> 34946942 |
Xiaolong Gan1,2,3, Shiming Li4,5, Yuan Zong2,3, Dong Cao2,3, Yun Li2,3, Ruijuan Liu2,3, Shu Cheng4,5, Baolong Liu1,2,3, Huaigang Zhang1,2,3.
Abstract
Potentilla anserina is a perennial stoloniferous plant with edible tuberous roots in Rosaceae, served as important food and medicine sources for Tibetans in the Qinghai-Tibetan Plateau (QTP), China, over thousands of years. However, a lack of genome information hindered the genetic study. Here, we presented a chromosome-level genome assembly using single-molecule long-read sequencing, and the Hi-C technique. The assembled genome was 454.28 Mb, containing 14 chromosomes, with contig N50 of 2.14 Mb. A total of 46,495 protein-coding genes, 169.74 Mb repeat regions, and 31.76 Kb non-coding RNA were predicted. P. anserina diverged from Potentilla micrantha ∼28.52 million years ago (Mya). Furthermore, P. anserina underwent a recent tetraploidization ∼6.4 Mya. The species-specific genes were enriched in Starch and sucrose metabolism and Galactose metabolism pathways. We identified the sub-genome structures of P. anserina, with A sub-genome was larger than B sub-genome and closer to P. micrantha phylogenetically. Despite lacking significant genome-wide expression dominance, the A sub-genome had higher homoeologous gene expression in shoot apical meristem, flower and tuberous root. The resistance genes was contracted in P. anserina genome. Key genes involved in starch biosynthesis were expanded and highly expressed in tuberous roots, which probably drives the tuber formation. The genomics and transcriptomics data generated in this study advance our understanding of the genomic landscape of P. anserina, and will accelerate genetic studies and breeding programs.Entities:
Keywords: Potentilla anserina; allopolyploids; gene family; genome assembly; starch metabolism; sub-genome expression dominance
Mesh:
Year: 2021 PMID: 34946942 PMCID: PMC8700974 DOI: 10.3390/genes12121993
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Genome size and ploidy analysis. (a) The k-mer (k = 17) frequency distribution. The sharp peak (∼83) is the homozygous region of genome and the smooth peak (∼168) reflected allotetraploidy. (b) The ploidy analysis for reference-free profiling of polyploid genomes.
Figure 2LAI scores and genome landscape of 14 chromosomes. (a) LAI scores distribution of Poan 14 chromosomes. The LAI score between 10 and 20 means the assembly reached the reference genome level. (b) Circos plot of genomic features. The tracks from the outermost to innermost are: 1 chromosome, 2 GC content, 3 gene density, 4 repeat density, 5 LTR-Gypsy density, 6 LTR-Copia density, and 7 collinear genes of Poan genome. The data are shown with a 500 Kb window.
Statistics of Non-coding RNAs and TE sequences.
| Type | Number | Total Length (bp) | Proportion in the Genome (%) | |
|---|---|---|---|---|
| miRNA | 479 | 58,847 | 0.013 | |
| tRNA | 1116 | 83,079 | 0.018 | |
| Non-coding RNA | rRNA | 148 | 98,151 | 0.022 |
| snRNA | 655 | 77,523 | 0.017 | |
| Total | 2398 | 317,600 | 0.07 | |
| Retroelements | 166,482 | 118,458,221 | 26.08 | |
| LINEs | 39,955 | 12,791,517 | 2.82 | |
| SINEs | 2373 | 269,946 | 0.06 | |
| LTR elements | 124,154 | 105,396,758 | 23.20 | |
| TE |
| 56,850 | 53,227,095 | 11.72 |
|
| 31,449 | 38,942,113 | 8.57 | |
| DNA transposons | 167,514 | 4,317,7431 | 9.50 | |
| Others | 27,886 | 6,454,876 | 1.42 | |
| Total | 368,838 | 169,737,457 | 37.26 |
Figure 3Evolution of the Poan genome. (a) Orthologous gene families among Rosaceae plants. (b) Phylogenetic relationship of Poan and eight other species. Herbaceous plants are shown with a purple background and woody plants are shown with a yellow background. (c) Gene duplication events among nine different species. (d) Ks distribution between Poan, Pomi and Frve.
Figure 4Sub-genome structure and expression dominance analysis. (a) Intra-genome gene synteny within Poan 14 chromosomes. (b) The phylogenetic tree of single copy genes between Pomi and Poan sub-genomes. (c) Sub-genome constitution of Poan. (d) Ka/Ks values of two sub-genomes. (e) Expression values for co-expression of homoeologous gene pairs.
Figure 5Analysis of key genes involved in starch metabolism. (a) A diagram depicting the main reactions involved starch metabolism. (b) Key gene copy numbers among Poan, Pomi, Frve, Roch, and Ruoc. (c) Heatmap of the expression level of genes related in starch metabolism in different tissues.