| Literature DB >> 31878322 |
Sunil Kumar Sahu1,2, Min Liu1,2, Anna Yssel3, Robert Kariba4, Samuel Muthemba4, Sanjie Jiang1, Bo Song1, Prasad S Hendre4, Alice Muchugi4, Ramni Jamnadass4, Shu-Min Kao5,6, Jonathan Featherston7, Nyree J C Zerega8,9, Xun Xu1,2, Huanming Yang1, Allen Van Deynze4,10, Yves Van de Peer3,4,6, Xin Liu1,2, Huan Liu1,2,11.
Abstract
Two of the most economically important plants in the Artocarpus genus are jackfruit (A. heterophyllus Lam.) and breadfruit (A. altilis (Parkinson) Fosberg). Both species are long-lived trees that have been cultivated for thousands of years in their native regions. Today they are grown throughout tropical to subtropical areas as an important source of starch and other valuable nutrients. There are hundreds of breadfruit varieties that are native to Oceania, of which the most commonly distributed types are seedless triploids. Jackfruit is likely native to the Western Ghats of India and produces one of the largest tree-borne fruit structures (reaching up to 45 kg). To-date, there is limited genomic information for these two economically important species. Here, we generated 273 Gb and 227 Gb of raw data from jackfruit and breadfruit, respectively. The high-quality reads from jackfruit were assembled into 162,440 scaffolds totaling 982 Mb with 35,858 genes. Similarly, the breadfruit reads were assembled into 180,971 scaffolds totaling 833 Mb with 34,010 genes. A total of 2822 and 2034 expanded gene families were found in jackfruit and breadfruit, respectively, enriched in pathways including starch and sucrose metabolism, photosynthesis, and others. The copy number of several starch synthesis-related genes were found to be increased in jackfruit and breadfruit compared to closely-related species, and the tissue-specific expression might imply their sugar-rich and starch-rich characteristics. Overall, the publication of high-quality genomes for jackfruit and breadfruit provides information about their specific composition and the underlying genes involved in sugar and starch metabolism.Entities:
Keywords: A. altilis; A. heterophyllus; breadfruit; jackfruit; starch synthesis
Mesh:
Year: 2019 PMID: 31878322 PMCID: PMC7017358 DOI: 10.3390/genes11010027
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Phylogenetic and evolutionary analysis. (A,B) Gene conservation and gene family expansion and contraction in A. heterophyllus and A. altilis. The scale bar indicates 10 million years. The values at the branch points indicate the estimates of divergence time (mya), while the green numbers show the divergence time (million years ago, Mya), and the red nodes indicate the previously published calibration times. (C) The distribution of gene families among the model species and Artocarpus genus. (D) Distribution of 4DTv distance between collinearity gene pairs among A. heterophyllus, A. altilis, M. notabilis, and Arabidopsis thaliana.
Statistics of the genome assembly of A. altilis and A. heterophyllus.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
| Parameters | Contig | Scaffold | Contig | Scaffold | ||||
| Length (bp) | Number | Length (bp) | Number | Length (bp) | Number | Length (bp) | Number | |
| N90 | 3361 | 52,085 | 183,851 | 637 | 4902 | 39,073 | 77,281 | 2115 |
| N50 | 16,898 | 13,662 | 1,536,010 | 151 | 26,681 | 9516 | 547,861 | 527 |
| N10 | 47,070 | 1284 | 5,076,803 | 14 | 82,850 | 846 | 1,422,119 | 54 |
| Total length | 803,695,923 | 833,038,871 | 930,343,435 | 982,020,585 | ||||
| Maximum length | 174,221 | 7,444,155 | 255,416 | 3,088,173 | ||||
| Total number ≥ 100 bp | 180,971 | 98,152 | 162,440 | 108,267 | ||||
| Total number ≥ 2000 bp | 61,693 | 4338 | 52,444 | 7263 | ||||
| N content (%) | 3.52 | 5.26 | ||||||
BUSCO evaluation of genome assembly of A. altilis and A. heterophyllus.
| BUSCOs |
|
| ||
|---|---|---|---|---|
| N | P (%) | N | P (%) | |
| Complete BUSCOs | 1371 | 95.20 | 1369 | 95.00 |
| Complete single-copy | 988 | 68.60 | 932 | 64.70 |
| Complete duplicated | 383 | 26.60 | 437 | 30.30 |
| Fragmented | 14 | 1.00 | 15 | 1.00 |
| Missing | 55 | 3.80 | 56 | 4.00 |
Abbreviation: BUSCO, Benchmarking universal single-copy orthologs; N, number; P, percentage of complete BUSCOs compared to the total BUSCOs.
The gene coverage based on transcriptome data.
| Species | Dataset | Number | Total Length (bp) | Base Coverage by Assembly (%) | Sequence Coverage by Assembly (%) |
|---|---|---|---|---|---|
|
| All | 141,626 | 165,794,671 | 87.7 | 97.92 |
| >200 bp | 141,626 | 165,794,671 | 87.7 | 97.92 | |
| >500 bp | 79,410 | 146,265,291 | 86.81 | 97.62 | |
| >1000 bp | 49,485 | 125,138,638 | 85.97 | 96.99 | |
|
| All | 14,858 | 6,364,445 | 90.39 | 98.89 |
| >200 bp | 14,858 | 6,364,445 | 90.39 | 98.89 | |
| >500 bp | 2949 | 2,853,909 | 84.41 | 96.74 | |
| >1000 bp | 765 | 1,386,949 | 74.83 | 92.16 |
Classification of predicted transposable elements in the genome of A. altilis and A. heterophyllus.
|
|
| |||
|---|---|---|---|---|
| Repeat Type | in Genome (%) | Length (bp) | in Genome (%) | Length (bp) |
| SINE | 0 | 1187 | 0.03 | 384,983 |
| LINE | 0.14 | 1,214,650 | 0.99 | 9,775,316 |
| LTR | 45.95 | 382,841,531 | 36.99 | 363,293,617 |
| DNA | 2.95 | 24,608,939 | 3.76 | 36,982,825 |
| Satellite | 0 | 34,585 | 0.3 | 3,001,478 |
| Simple repeat | 0.03 | 253,818 | 0.04 | 485,582 |
| Unknown | 5.4 | 45,013,282 | 12.23 | 120,128,962 |
| Total | 52.04 | 433,486,547 | 51.01 | 500,968,186 |
Statistics of gene models of A. altilis, A. heterophyllus, and other species in Rosids.
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
| Protein-coding gene number | 34,010 | 35,858 | 34,301 | 61,721 | 27,085 | 28,701 | 37,526 |
| Mean gene length (bp) | 3545.4 | 3472.2 | 2824.6 | 2692.5 | 2866.8 | 2464.8 | 3313.5 |
| Mean cds length (bp) | 1252.6 | 1241.5 | 1174.7 | 1141.4 | 1086.9 | 1210.8 | 1353.0 |
| Mean exons per gene | 5.4 | 5.5 | 5.1 | 4.8 | 4.6 | 4.9 | 5.5 |
| Mean exon length (bp) | 227.8 | 226.5 | 232.5 | 236.7 | 236.4 | 243.6 | 246.0 |
| Mean intron length (bp) | 509.6 | 497.7 | 407.1 | 405.8 | 494.6 | 315.8 | 435.7 |
Annotation of non-coding RNA genes in the A. altilis and A. heterophyllus genomes.
| Species | Type | Copy (w) | Average Length (bp) | Total Length (bp) | % of Genome |
|---|---|---|---|---|---|
|
| miRNA | 159 | 126.7 | 20,145 | 0.002418 |
| tRNA | 713 | 75.3 | 53,705 | 0.006447 | |
| rRNA | 466 | 183.2 | 85,353 | 0.010246 | |
| 18S | 76 | 551.4 | 41,907 | 0.005031 | |
| 28S | 98 | 125.5 | 12,296 | 0.001476 | |
| 5.8S | 32 | 135.6 | 4338 | 0.000521 | |
| 5S | 260 | 103.1 | 26,812 | 0.003219 | |
| snRNA | 1554 | 105.4 | 163,744 | 0.019656 | |
| CD-box | 1410 | 102.6 | 144,676 | 0.017367 | |
| HACA-box | 52 | 130.1 | 6765 | 0.000812 | |
| splicing | 92 | 133.7 | 12,303 | 0.001477 | |
|
| miRNA | 168 | 126.3 | 21,227 | 0.002162 |
| tRNA | 689 | 75.2 | 51,813 | 0.005276 | |
| rRNA | 2706 | 268.2 | 725,709 | 0.073900 | |
| 18S | 654 | 737.5 | 482,306 | 0.049114 | |
| 28S | 920 | 123.6 | 113,667 | 0.011575 | |
| 5.8S | 242 | 151.6 | 36,699 | 0.003737 | |
| 5S | 890 | 104.5 | 93,037 | 0.009474 | |
| snRNA | 1005 | 108.2 | 108,724 | 0.011071 | |
| CD-box | 814 | 102.5 | 83,426 | 0.008495 | |
| HACA-box | 68 | 127.4 | 8665 | 0.000882 | |
| splicing | 123 | 135.2 | 16,633 | 0.001694 |
Statistics of functional annotation of protein-coding genes in the A. altilis and A. heterophyllus genomes.
|
|
| |||
|---|---|---|---|---|
| Values | Number | Percentage | Number | Percentage |
| Total | 34,010 | 100.0% | 35,858 | 100.0% |
| Nr | 33,353 | 98.1% | 35,076 | 97.8% |
| Swissprot | 26,689 | 78.5% | 27,741 | 77.4% |
| KEGG | 24,860 | 73.1% | 25,804 | 72.0% |
| COG | 12,875 | 37.9% | 13,408 | 37.4% |
| TrEMBL | 33,240 | 97.7% | 34,968 | 97.5% |
| Interpro | 26,422 | 77.7% | 27,632 | 77.1% |
| GO | 17,428 | 51.2% | 18,336 | 51.1% |
| Overall | 33,394 | 98.2% | 35,109 | 97.9% |
| Unannotated | 616 | 1.8% | 749 | 2.1% |
Figure 2A graph showing the A. heterophyllus Ks distributions (pink) and the Ks distributions of its anchor pairs (red), A altilis Ks distributions (dark blue) and the distributions of its anchor pairs (light blue) overlaid with the Ks distributions of the one-to-one orthologs of A. heterophyllus, and A. altilis (green). (B) log transformed version of (A). (C,D) are the log transformed Ka and ω distributions, respectively.
Figure 3Ks distribution (dark pink) and anchor pair Ks distribution (yellow) of A. altilis in overlay with the results of whole paranome distributions between A. altilis and M. notabilis (light blue) and A. altilis and Z. jujube (dark blue).
Figure 4Copy number expansion of starch synthesis-related genes in A. heterophyllus and A. altilis.
Figure 5Tissue specific expression of starch synthesis-related genes in A. heterophyllus and A. altilis.