| Literature DB >> 31241154 |
Quanxin Bi1,2, Yang Zhao1, Wei Du2, Ying Lu3, Lang Gui3, Zhimin Zheng4,5, Haiyan Yu1,6, Yifan Cui1, Zhi Liu4,5, Tianpeng Cui7, Deshi Cui7, Xiaojuan Liu1, Yingchao Li1, Siqi Fan1, Xiaoyu Hu1, Guanghui Fu1, Jian Ding2, Chengjiang Ruan2, Libing Wang1.
Abstract
BACKGROUND: Yellowhorn (Xanthoceras sorbifolium) is a species of the Sapindaceae family native to China and is an oil tree that can withstand cold and drought conditions. A pseudomolecule-level genome assembly for this species will not only contribute to understanding the evolution of its genes and chromosomes but also bring yellowhorn breeding into the genomic era.Entities:
Keywords: zzm321990 Xanthoceras sorbifoliumzzm321990 ; Hi-C; PacBio sequencing; conserved chromosome; genome annotation; genome assembly; yellowhorn
Mesh:
Year: 2019 PMID: 31241154 PMCID: PMC6593361 DOI: 10.1093/gigascience/giz070
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Images of yellowhorn plants. (A) Yellowhorn tree in an artificial forest. (B) Ripe fruit, which dehisce into 3 parts by the carpels. (C) A harvest scene of yellowhorn in northern China. (D) Seeds in ripe fruits, which number 18–24 in 1 fruit.
Figure 2:Estimation of genome size. (A) Test results of yellowhorn, poplar, and yellowhorn + soybean samples using flow cytometry. (B) Distribution of 17-mer frequency. The x-axis and y-axis indicate the 17-mer frequency and number, respectively. The leftmost truncated peak at a low occurrence frequency (1–2) was mainly due to random base errors in the raw sequencing reads.
Overview of assembly and annotation for the yellowhorn genome
| Total length | 504,196,643 bp |
| Length of unclosed gaps | 73,800 bp |
| N50 length | |
| Initial contigs | 1,044,891 bp |
| Scaffolds | 32,173,403 bp |
| N90 length (scaffolds) | 25,069,408 bp |
| No. of scaffolds (>N90 length) | 21 |
| Largest scaffold | 40,097,451 bp |
| Guanine-cytosine content | 36.95% |
| No. predicted | |
| Protein-coding genes | 24,672 |
| Noncoding RNA genes | 1,066 |
| Content of repetitive sequences | 68.67% |
| Length of genome anchored on linkage groups | 489,286,946 bp (97.04%) |
Quantity of the contigs anchored with Hi-C
| Linkage Group | No. of anchored contigs | Sequence length (bp) |
|---|---|---|
| 1 | 68 | 40,738,791 |
| 2 | 92 | 40,039,835 |
| 3 | 38 | 37,159,809 |
| 4 | 112 | 35,552,403 |
| 5 | 84 | 35,291,867 |
| 6 | 62 | 35,706,508 |
| 7 | 66 | 33,002,525 |
| 8 | 46 | 32,947,898 |
| 9 | 66 | 30,804,552 |
| 10 | 62 | 30,699,318 |
| 11 | 68 | 29,306,026 |
| 12 | 56 | 29,390,540 |
| 13 | 47 | 29,816,145 |
| 14 | 71 | 25,601,946 |
| 15 | 72 | 23,228,783 |
| Total (ratio %) | 1,010 (35.61) | 489,286,946 (97.04) |
Figure 3:Genome evolution. (A) Distribution of insertion ages of LTR-retrotransposons. The x-axis represents the estimated insertion age (mya) of the LTR-retrotransposons. The y-axis represents the number of intact LTR-retrotransposons. (B) Comparison of copy numbers in gene clusters of analysed eudicot genomes. According to the identified gene clusters, the genes were grouped into single-copy, multiple-copy, and species-specific (specific) genes. (C) Constructed phylogenetic tree and divergence time estimation. The black numbers represent estimated divergence times (mya), which are measured with a scale bar of 20 million years, and green numbers represent bootstrap values. Grape (V. vinifera) was used as an outgroup. (D) Genome duplication in eudicot genomes as revealed through 4DTv analyses. The percentages of the orthologous pairs (Y vs L) between yellowhorn (Y) and longan (L) and paralogous gene pairs within the yellowhorn (Y vs Y) and longan (L vs L) genomes are plotted against their calculated 4DTv values.
Figure 4:Chromosome synteny. The circularized blocks represent the chromosomes of yellowhorn and other genomes. Aligned genes identified by MCscanX are connected by lines, with their chromosome locations shown in different colours. (A) Chromosome alignment of yellowhorn and Arabidopsis. (B) Chromosome alignment of yellowhorn and grape. (C) Chromosome alignment of yellowhorn and clementine. Coloured ribbons connect the aligned genes. Yellowhorn linkage groups are labelled LG 1−15, Arabidopsis chromosomes labelled Chr 1−5, grape chromosomes are labelled C1−19 and CUn (chromosome location unknown), and clementine scaffolds are labelled Sc 1−9. Scale, 10 Mb. (D) Chromosome rearrangements between Arabidopsis and yellowhorn. (E) Chromosome rearrangements between clementine and yellowhorn. Arabidopsis and clementine chromosomes are represented as bars of different colours. Synteny and rearrangement of the yellowhorn chromosomes are indicated by different blocks, corresponding to the reference Arabidopsis and clementine chromosomes.