| Literature DB >> 35325123 |
Katherine M D'Amico-Willman1, Wilberforce Z Ouma2, Tea Meulia3, Gina M Sideli4, Thomas M Gradziel4, Jonathan Fresnedo-Ramírez5.
Abstract
Almond [Prunus dulcis (Mill.) D.A. Webb] is an economically important, specialty nut crop grown almost exclusively in the United States. Breeding and improvement efforts worldwide have led to the development of key, productive cultivars, including 'Nonpareil,' which is the most widely grown almond cultivar. Thus far, genomic resources for this species have been limited, and a whole-genome assembly for 'Nonpareil' is not currently available despite its economic importance and use in almond breeding worldwide. We generated a 571X coverage genome sequence using Illumina, PacBio, and optical mapping technologies. Gene prediction revealed 49,321 putative genes using MinION Oxford nanopore and Illumina RNA sequencing, and genome annotation found that 68% of predicted models are associated with at least one biological function. Furthermore, epigenetic signatures of almond, namely DNA cytosine methylation, have been implicated in a variety of phenotypes including self-compatibility, bud dormancy, and development of noninfectious bud failure. In addition to the genome sequence and annotation, this report also provides the complete methylome of several almond tissues, including leaf, flower, endocarp, mesocarp, exocarp, and seed coat. Comparisons between methylation profiles in these tissues revealed differences in genome-wide weighted % methylation and chromosome-level methylation enrichment.Entities:
Keywords: Nonpareil; almond; cytosine methylation; enzymatic methylation sequencing; multiplatform genome assembly
Mesh:
Year: 2022 PMID: 35325123 PMCID: PMC9073694 DOI: 10.1093/g3journal/jkac065
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
Summary of scaffolding using the optical map.
| Contig_v1.0 | Contig_v1.1 | Scaffold_all | Scaffold_hap | |
|---|---|---|---|---|
| No. of sequences | 593 | 740 | 499 | 50 |
| Max length | 6,727,137 | 6,081,999 | 27,216,209 | 27,216,209 |
| Total size | 456,658,006 | 455,350,334 | 458,275,742 | 254,451,385 |
| Sequence N50 | 1,838,609 | 1,308,716 | 3,168,198 | 9,199,899 |
|
| 0 | 0 | 0.37% | 0.40% |
Summary of the sequences of the genomes of ‘Nonpareil.’
| Chromosome | Length (bp) | Effective length (bp) |
|
|---|---|---|---|
| Pdu1 | 53,037,566 | 52,621,683 | 0.78 |
| Pdu2 | 33,982,222 | 33,650,668 | 0.98 |
| Pdu3 | 30,537,282 | 30,345,236 | 0.63 |
| Pdu4 | 30,643,767 | 30,642,693 | 0.01 |
| Pdu5 | 21,032,966 | 21,019,937 | 0.06 |
| Pdu6 | 32,793,846 | 32,733,303 | 0.18 |
| Pdu7 | 26,338,084 | 26,325,646 | 0.05 |
| Pdu8 | 26,045,089 | 26,041,993 | 0.01 |
| Additional contigs | 2,660,941 | 2,660,941 | 0.00 |
|
|
|
|
|
| Plastid | 142,856 | 142,854 | 0.00 |
| Mitochondrion | 444,092 | 444,092 | 0.00 |
| Grand total | 257,658,711 | 256,629,046 | 0.30 |
Fig. 1.BUSCO metrics graphical representation for genome assembly and proteome, respectively, for almond cultivars ‘Nonpareil,’ ‘Texas,’ and ‘Lauranne,’ in reference to the metrics from the peach reference genome sequence version 2.0.1a for the cultivar ‘Lovell.’
Transposable element composition of the ‘Nonpareil’ genome.
| Class | Count | Bp masked | % Masked |
|---|---|---|---|
| LTR—Copia | 10,221 | 7,448,079 | 2.90 |
| LTR—Gypsy | 22,245 | 20,186,500 | 7.87 |
| LTR—unknown | 24,658 | 14,319,765 | 5.58 |
| TIR—CACTA | 22,403 | 10,089,917 | 3.93 |
| TIR—Mutator | 40,465 | 10,846,793 | 4.23 |
| TIR—PIF_Harbinger | 15,287 | 5,579,626 | 2.17 |
| TIR—Tc1 Mariner | 1,564 | 349,110 | 0.14 |
| TIR—hAT | 9,151 | 3,118,210 | 1.22 |
| nonLTR—LINE element | 593 | 272,476 | 0.11 |
| ninLTR—unknown | 225 | 129,418 | 0.05 |
| nonTIR—helitron | 7,240 | 1,892,602 | 0.74 |
| Repeat region | 37,622 | 12,025,696 | 4.69 |
| Total interspersed | 191,674 | 86,258,192 | 33.61 |
Genome-wide weighted percent methylation values for each tissue type in ‘Nonpareil.’
| Tissue type | % CG | % CHG | % CHH |
|---|---|---|---|
| Leaf | 45.2 | 14.6 | 1.3 |
| Flower | 56.4 | 29.4 | 2.9 |
| Exocarp | 52.5 | 17.7 | 4.0 |
| Mesocarp | 52.9 | 18.3 | 4.7 |
| Endocarp | 51.7 | 17.7 | 4.3 |
| Seed coat | 50.5 | 16.7 | 4.3 |
Weighted percent methylation is depicted for each methylation context [CG, CHG, and CHH (H = A, T, or C)].
Fig. 2.Circos plots depicting genome-wide weighted methylation across each of the 8 almond chromosomes for each methylation context [CG, CHG, and CHH (H = A, T, or C)]. Five almond tissue types are represented in concentric genomic tracks from outer to inner: Leaf, Exocarp, Mesocarp, Endocarp, and Seedcoat.