| Literature DB >> 35040911 |
Kenta Shirasawa1, Saki Ueta2, Kyoko Murakami2, Mostafa Abdelrahman3,4,5, Akira Kanno3, Sachiko Isobe1.
Abstract
Asparagus kiusianus is a disease-resistant dioecious plant species and a wild relative of garden asparagus (Asparagus officinalis). To enhance A. kiusianus genomic resources, advance plant science, and facilitate asparagus breeding, we determined the genome sequences of the male and female lines of A. kiusianus. Genome sequence reads obtained with a linked-read technology were assembled into four haplotype-phased contig sequences (∼1.6 Gb each) for the male and female lines. The contig sequences were aligned onto the chromosome sequences of garden asparagus to construct pseudomolecule sequences. Approximately 55,000 potential protein-encoding genes were predicted in each genome assembly, and ∼70% of the genome sequence was annotated as repetitive. Comparative analysis of the genomes of the two species revealed structural and sequence variants between the two species as well as between the male and female lines of each species. Genes with high sequence similarity with the male-specific sex determinant gene in A. officinalis, MSE1/AoMYB35/AspTDF1, were presented in the genomes of the male line but absent from the female genome assemblies. Overall, the genome sequence assemblies, gene sequences, and structural and sequence variants determined in this study will reveal the genetic mechanisms underlying sexual differentiation in plants, and will accelerate disease-resistance breeding in garden asparagus.Entities:
Keywords: dioecy; genome assembly; haplotype phasing sequence; linked reads; structural variation
Mesh:
Year: 2022 PMID: 35040911 PMCID: PMC8826022 DOI: 10.1093/dnares/dsac002
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.477
Figure 1Estimation of the genome sizes of the male and female lines of A. kiusianus, based on k-mer analysis (k = 17) with the given multiplicity values.
Statistics of the genome assemblies of the male and female lines of A. kiusianus
| AKIK1p1 | AKIK1p2 | AKIK2p1 | AKIK2p2 | |
|---|---|---|---|---|
| Total contig size (bp) | 1,618,866,837 | 1,618,519,636 | 1,567,596,515 | 1,567,276,205 |
| Number of sequences | 111,443 | 111,443 | 125,241 | 125,241 |
| Sequence N50 length (bp) | 155,490 | 155,269 | 59,656 | 59,610 |
| Longest sequence size (bp) | 2,375,321 | 2,375,321 | 1,231,068 | 1,222,162 |
| Gap size (bp) | 168,294,890 | 168,291,350 | 132,162,460 | 132,161,290 |
| Single-copy complete BUSCOs | 82.4% | 82.6% | 83.5% | 83.4% |
| Duplicated complete BUSCOs | 6.0% | 6.0% | 5.2% | 5.2% |
| Fragmented BUSCOs | 5.8% | 5.5% | 5.6% | 5.6% |
| Missing BUSCOs | 5.8% | 5.9% | 5.7% | 5.8% |
Statistics of the pseudomolecule sequences of A. kiusianus
| AKIK1p1 | AKIK1p2 | |||||||
|---|---|---|---|---|---|---|---|---|
| Total length (bp) | Gap (%) | Number of contigs | Number of genes | Total length (bp) | Gap (%) | Number of contigs | Number of genes | |
| ch01 (Y) | 179,686,458 | 11.8 | 10,537 | 6,938 | 179,663,709 | 11.8 | 10,543 | 6,706 |
| ch02 | 94,379,980 | 12.5 | 6,056 | 3,631 | 94,286,185 | 12.5 | 6,066 | 3,427 |
| ch03 | 160,690,220 | 10.4 | 10,792 | 6,135 | 161,086,385 | 10.5 | 10,791 | 5,855 |
| ch04 | 222,888,549 | 11.1 | 13,146 | 7,547 | 222,850,935 | 11.1 | 13,139 | 7,251 |
| ch05 | 175,875,079 | 12.0 | 10,573 | 6,878 | 176,035,479 | 12.0 | 10,582 | 6,668 |
| ch06 | 112,254,750 | 9.4 | 7,553 | 3,696 | 112,182,592 | 9.4 | 7,553 | 3,504 |
| ch07 | 231,128,771 | 11.0 | 13,973 | 8,285 | 230,178,869 | 11.0 | 13,928 | 7,953 |
| ch08 | 209,954,455 | 10.5 | 13,763 | 7,167 | 209,720,314 | 10.5 | 13,796 | 6,882 |
| ch09 | 73,375,337 | 10.6 | 4,435 | 2,759 | 73,241,043 | 10.7 | 4,437 | 2,638 |
| ch10 | 85,316,880 | 10.8 | 5,396 | 3,353 | 85,678,944 | 10.9 | 5,389 | 3,229 |
|
|
|
|
|
|
|
|
|
|
| ch00 | 84,496,458 | 10.4 | 15,223 | 2,819 | 84,775,281 | 10.6 | 15,223 | 2,593 |
|
|
|
|
|
|
|
|
|
|
|
| ||||||||
| AKIK2p1 | AKIK2p2 | |||||||
|
|
| |||||||
| Total length (bp) | Gap (%) | Number of contigs | Number of genes | Total length (bp) | Gap (%) | Number of contigs | Number of genes | |
|
| ||||||||
| ch01 (X) | 167,748,676 | 9.5 | 11,500 | 6,772 | 167,821,022 | 9.5 | 11,527 | 6,835 |
| ch02 | 88,515,862 | 10.3 | 6,726 | 3,557 | 87,905,216 | 10.2 | 6,698 | 3,651 |
| ch03 | 159,691,030 | 9.0 | 12,193 | 5,956 | 159,408,608 | 9.0 | 12,181 | 6,030 |
| ch04 | 217,008,733 | 9.0 | 15,036 | 7,377 | 217,068,856 | 9.0 | 15,080 | 7,655 |
| ch05 | 172,460,497 | 9.9 | 11,974 | 6,762 | 171,835,019 | 9.9 | 11,943 | 6,970 |
| ch06 | 107,102,574 | 8.1 | 8,296 | 3,565 | 107,106,266 | 8.1 | 8,278 | 3,621 |
| ch07 | 222,804,915 | 9.2 | 15,815 | 7,998 | 221,855,507 | 9.2 | 15,811 | 8,158 |
| ch08 | 213,448,917 | 8.6 | 15,408 | 7,141 | 213,032,411 | 8.6 | 15,401 | 7,250 |
| ch09 | 71,157,753 | 9.4 | 5,055 | 2,517 | 71,672,067 | 9.5 | 5,054 | 2,573 |
| ch10 | 82,583,459 | 9.3 | 5,872 | 3,262 | 82,780,279 | 9.2 | 5,891 | 3,270 |
|
|
|
|
|
|
|
|
|
|
| ch00 | 77,633,199 | 8.8 | 17,366 | 2,616 | 79,350,054 | 8.9 | 17,377 | 2,681 |
|
|
|
|
|
|
|
|
|
|
Bold indicates subtotal (ch01 to ch10) and total values (ch01 to ch10 and ch00).
Repetitive sequences in the A. kiusianus genomes
| Repeat type | AKIK1p1 | AKIK1p2 | AKIK2p1 | AKIK2p2 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number of elements | Length occupied (bp) | % | Number of elements | Length occupied (bp) | % | Number of elements | Length occupied (bp) | % | Number of elements | Length occupied (bp) | % | |
| SINEs | 296 | 100,129 | 0.0 | 295 | 100,876 | 0.0 | 260 | 88,942 | 0.0 | 269 | 88,784 | 0.0 |
| LINEs | 36,139 | 22,836,398 | 1.4 | 36,171 | 22,868,973 | 1.4 | 36,022 | 22,848,619 | 1.5 | 36,257 | 22,865,141 | 1.5 |
| LTR elements | 534,586 | 740,043,346 | 45.4 | 534,419 | 739,629,919 | 45.4 | 538,917 | 731,783,197 | 46.3 | 538,772 | 731,702,512 | 46.3 |
| DNA transposons | 119,624 | 77,129,754 | 4.7 | 119,369 | 77,003,389 | 4.7 | 117,911 | 76,193,533 | 4.8 | 118,264 | 76,216,494 | 4.8 |
| Unclassified | 426,795 | 242,401,476 | 14.9 | 426,435 | 242,359,956 | 14.9 | 422,640 | 239,811,052 | 15.2 | 422,494 | 239,544,071 | 15.2 |
| Small RNA | 447 | 231,343 | 0.0 | 451 | 233,377 | 0.0 | 435 | 232,256 | 0.0 | 442 | 228,647 | 0.0 |
| Satellites | 1,945 | 843,970 | 0.1 | 1,894 | 847,909 | 0.1 | 2,155 | 901,555 | 0.1 | 2,171 | 901,844 | 0.1 |
| Simple repeats | 197,902 | 17,199,114 | 1.1 | 198,009 | 17,248,715 | 1.1 | 195,639 | 16,992,727 | 1.1 | 195,634 | 17,024,257 | 1.1 |
| Low complexity | 32,476 | 1,822,571 | 0.1 | 32,548 | 1,822,378 | 0.1 | 32,388 | 1,816,175 | 0.1 | 32,482 | 1,817,483 | 0.1 |
Completeness of gene predictions, based on BUSCO assessment
| Within a line | Between male and female lines | |||||
|---|---|---|---|---|---|---|
| AKIK1p1 vs AKIK1p2 | AKIK2p1 vs AKIK2p2 | AKIK1p1 vs AKIK2p1 | AKIK1p1 vs AKIK2p2 | AKIK1p2 vs AKIK2p1 | AKIK1p2 vs AKIK2p2 | |
| Base substitutions | 386,723 | 293,196 | 324,361 | 327,460 | 314,704 | 318,810 |
| Ts/Tv ratio | 3.0 | 3.0 | 2.9 | 2.9 | 2.9 | 2.9 |
| 1-bp indels | 23,833 | 17,283 | 20,344 | 20,379 | 19,692 | 19,970 |
| 2-bp indels | 5,799 | 4,477 | 5,722 | 5,763 | 5,645 | 5,691 |
| 3- to 49-bp indels | 11,100 | 8,641 | 13,931 | 13,897 | 13,647 | 13,777 |
| 50- to 1000-bp indels | 1,871 | 1,472 | 7,141 | 7,127 | 7,096 | 7,096 |
| ≥1000-bp indels | 3,404 | 3,859 | 3,199 | 3,229 | 3,160 | 3,179 |
Figure 2.Comparative analysis of the genome sequence and structure of A. kiusianus and A. officinalis. Dots represent similarities between the genome sequence and structure of the two species. Genomes of the male and female lines of A. kiusianus are indicated as AKIK1 and AKIK2, respectively. Haplotype phases 1 and 2 are indicated as p1 and p2, respectively. AsparagusV1 indicates A. officinalis genome.
Figure 3Comparison between the M locus in the Y-specific region on chromosome 1 (Y) of the male line and the corresponding region (X) of the female line. Grey bars indicate similarities in the genome sequence and structure. Black arrows indicate the position and orientation of MSE1/AoMYB35/AspTDF1.