| Literature DB >> 30962943 |
Maria José Aranzana1, Véronique Decroocq2, Elisabeth Dirlewanger2, Iban Eduardo1, Zhong Shan Gao3, Ksenija Gasic4, Amy Iezzoni5, Sook Jung6, Cameron Peace6, Humberto Prieto7, Ryutaro Tao8, Ignazio Verde9, Albert G Abbott10, Pere Arús1.
Abstract
Prior to the availability of whole-genome sequences, our understanding of the structural and functional aspects of Prunus tree genomes was limited mostly to molecular genetic mapping of important traits and development of EST resources. With public release of the peach genome and others that followed, significant advances in our knowledge of Prunus genomes and the genetic underpinnings of important traits ensued. In this review, we highlight key achievements in Prunus genetics and breeding driven by the availability of these whole-genome sequences. Within the structural and evolutionary contexts, we summarize: (1) the current status of Prunus whole-genome sequences; (2) preliminary and ongoing work on the sequence structure and diversity of the genomes; (3) the analyses of Prunus genome evolution driven by natural and man-made selection; and (4) provide insight into haploblocking genomes as a means to define genome-scale patterns of evolution that can be leveraged for trait selection in pedigree-based Prunus tree breeding programs worldwide. Functionally, we summarize recent and ongoing work that leverages whole-genome sequences to identify and characterize genes controlling 22 agronomically important Prunus traits. These include phenology, fruit quality, allergens, disease resistance, tree architecture, and self-incompatibility. Translationally, we explore the application of sequence-based marker-assisted breeding technologies and other sequence-guided biotechnological approaches for Prunus crop improvement. Finally, we present the current status of publically available Prunus genomics and genetics data housed mainly in the Genome Database for Rosaceae (GDR) and its updated functionalities for future bioinformatics-based Prunus genetics and genomics inquiry.Entities:
Year: 2019 PMID: 30962943 PMCID: PMC6450939 DOI: 10.1038/s41438-019-0140-8
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Prunus major genes cloned or fine mapped with strong candidate genes
| Trait | Origin | Positiona | Gene descriptionb | Strategy | Causal polymorphism |
|---|---|---|---|---|---|
| Evergrowing |
| G1 ( | MADS box transcription factors/ | Mapping mutants, positional cloning, candidate gene (CG) functional classification and expression analysis[ | Deletion in the DAM-box gene region of the |
| Maturity date/slow ripening |
| G4 ( | NAC transcription factor ( | Fine mapping, whole-genome sequence analysis, and CG functional classification[ | In-frame insertion of 9 bp in the last exon for |
| Fruit size |
| G2 | Cell number regulator ( | CG approach, segregation and association analysis[ | – |
| Fruit size |
| G6 | Cell number regulator ( | CG approach, segregation and association analysis[ | – |
| Fruit shape |
| G6 ( | Leucine-rich repeat receptor-like kinase | Linkage and association mapping, whole-genome sequencing, CG functional classification and expression analysis[ | Large deletion affecting the promoter and part of the gene |
| Flesh color (white/yellow) |
| G1 ( | Carotenoid cleavage dioxygenase ( | CG approach, whole-genome resequence analysis of cultivars and somatic mutants, gene expression and volatile compound analysis[ | Three independent polymorphisms causing non-functional genes: 1 SNP, an SSR, and a transposable element (TE) |
| Flesh and skin color |
| G3 ( | MYB transcription factor | QTL analysis and CG functional classification[ | SNPs in |
| Blood flesh |
| G5 ( | NAC transcription factor | CG approach, comparative transcriptome and functional analysis[ | – |
| Green vs. red leaf color |
| G6 ( | MYB transcription factor | Transcriptome comparison; transient expression in tobacco and peach leaves[ | – |
| Fruit acidity |
| G5 ( | Auxin efflux carrier family protein ( | Fine mapping[ | SNP |
| Melting vs. nonmelting flesh and clingston vs. freestone |
| G4 ( | Endopolygalacturonase | Fine mapping and CG functional classification[ | SNPs and small indels |
| Stony hard texture |
| G6 ( | Fine mapping, GWAS, and CG approach[ | TE | |
| Hairy vs. glabrous skin |
| G5 ( | R2R3 MYB transcription factor ( | Fine mapping/whole-genome sequence analysis and CG functional classification[ | TE |
| Root-knot nematode resistance |
| G7 ( | Toll/interleukin1 receptor-nucleotide binding site-leucine-rich repeat ( | Fine mapping, positional cloning, and functional validation[ | – |
| Brachytic dwarf |
| G6 ( | Giberellic acid (GA) receptor | Sequencing‐based mapping strategy, sequence analysis, and functional validation in | Nonsense mutation |
| Weeping growth habit |
| G3 ( | SAM domain gene ( | Whole-genome sequencing of mutant and functional validation in | 1.8 kb deletion spanning the gene’s 5’ end |
| Broomy (pillar) |
| G2 ( | Tiller angle control ( | Sequencing‐based mapping strategy, whole-genome sequence analysis, expression analysis, and CG homology with Arabidopsis mutant[ | Insertion producing a nonsense mutation |
| Gametophytic self-incompatibility (pistil) |
| G6 ( | S-Ribonuclease ( | Allele cloning, genotyping, and self-compatibility screening[ | – |
| Gametophytic self-incompatibility (pollen) |
| G6 ( | F-box protein ( | Allele cloning, genotyping, and self-compatibility screening[ | – |
| Gametophytic self-incompatibility (pollen) |
| G3 ( | Disulfide bond A-like oxidoreductase ( | Fine mapping, gene expression analysis, whole-genome resequence analysis[ | Miniature inverted-repeat transposable element (MITE) insertion |
| Double flower |
| G6 ( | EU-AP2 transcription factor, TOE-type ( | Fine mapping, whole-genome resequence analysis, and CG functional classification and expression analysis[ | Deletion of an miRNA target site |
GWAS genome-wide association study, QTL quantitative trait locus, SNP single-nucleotide polymorphism, SSR simple sequence repeat
aIn parenthesis, major gene name
bGenes starting with ppa and Prupe correspond to the annotation of the v1.0 and v2.0 versions of the peach genome, respectively; Pav are genes from the sweet cherry genome v1.0
Comparison of the assembly and statistics among the Prunus genomes so far published
| Peach v2.0 | Peach v1.0 |
| PAV_r1.0 ( |
| |
|---|---|---|---|---|---|
| Sequencing strategy | Sanger–NGS Illumina | Sanger | NGS Illumina | NGS Illumina | NGS Illumina–PacBio |
| Genome coverage | 8.47×–NGS 64× | Sanger 8.47× | 180× | 130.34× | 1265.6× (1192× + 73×) |
| Scaffold number total | 191 (241)a | 202 (234) | 1449 | 10,148 | 3185 |
| Contig number total | 2525 | 2730 | 10,894 | – | 4292 |
| Scaffold sequence total | 227.4 Mb | 227.3 | 237.1 Mb | 272.4 Mb | 323.8 Mb |
| Mapped scaffold sequence total | 225.7 Mb (99.2%) | 218.4 Mb (96%) | 198.9 (83.9%) | 191.7 Mb (70.4%) | 281.6 Mb (87.0%) |
| Oriented scaffold sequence | 223.3 Mb (98.2%) | 194.6 Mb (85.6%) | – | – | – |
| Contig sequence total | 224.6 Mb | 224.6 Mb | 219.9 Mb | – | 318.7 Mb |
| Scaffold N50/L50 | 10/7.3 Mb | 9/8.9 Mb | 120/577.8 kb | –/219.6 kb | 519/198.9 kb |
| Scaffold N50/L50 (chromosome-scale assembly) | 4/27.4 Mb | 4/26.8 Mb | – | – | – |
| Contig N50/L50 | 250/255.4 kb | 294/214.2 kb | 2009/31.8 kb | –/276 bp | 706/132.6 kb |
| Number of scaffolds >50 kb | 11 | 21 | – | – | – |
| % main genome in scaffolds >50 kb | 99.4% | 99.4 | – | – | – |
| Repetitive sequences | 79.38 Mb (35.34%) | 84.41 Mb (37.14%) | 106.8 Mb (44.9%) | 119.4 Mb (43.8%) | 150.8 Mb (47.2%) |
| Protein-coding genes | 26,873 | 27,852 | 31,390 | 43,673 | 41,294 |
| Protein-coding transcripts | 47,089 | 28,689 | – | – | 45,581 |
| Alternative transcripts | 20,216 | 837 | – | – | 4287 |
aIn brackets, assembled scaffolds prior the build of pseudomolecules
Fig. 1Dividing a crop’s genome into qualitative, cognitively manageable segments by haploblocking adjacent sets of loci can be done in several ways, such as the pedigree-based approach, in which loci within haploblocks have not recombined throughout the pedigree of known progenitors of cultivars.
Haplotypes are the variants of haploblocks—sets of co-inherited alleles. To each haplotype can be assigned trait influences, ancestry, and other genetic features. If a breeding parent does not have coupling-phase linkage for desirable alleles within a haploblock containing multiple quantitative trait loci, such tight linkage might be targeted for recombination in the next generation
Fig. 2Flat fruit shape in peach.
a Manhattan plot from Micheletti et al.[23] data. Chromosomes are marked with different colors on the horizontal axis. The horizontal green line represents the significance threshold for the association. b Images of flowers, pistils, and fruits of a round (left) and a flat (right) fruit cultivar where it can be seen that the flat vs. round character is determined early in flower formation
Fig. 3Haplotype mosaic of “Rainier”, a classic Washington-bred sweet cherry cultivar.
Segments that “Rainier” inherited via its parents from its three specified ancestors are displayed across the eight chromosomes of sweet cherry. In some cases, these ancestral segments are homozygous, highlighting consequences of inbreeding and signifying common ancestry in generations behind known ancestors. Trait locus alleles are indicated with phenotypic effects and ancestral origins; despite the commercial success of “Rainier,” it can be seen that there is still much to be improved. These results were obtained from single-nucleotide polymorphism (SNP) data curation and pedigree ascertainment by L. Cai and C. Peace using the RosBREED cherry 6K SNP array v1 on a U.S. breeding germplasm set (n ~ 500)[21]. Diagram is from Peace et al.[175]