| Literature DB >> 35269811 |
Cassandria Geraldine Tay Fernandez1, Benjamin John Nestor1, Monica Furaste Danilevicz1, Mitchell Gill1, Jakob Petereit1, Philipp Emanuel Bayer1, Patrick Michael Finnegan1, Jacqueline Batley1, David Edwards1.
Abstract
Pangenomes are a rich resource to examine the genomic variation observed within a species or genera, supporting population genetics studies, with applications for the improvement of crop traits. Major crop species such as maize (Zea mays), rice (Oryza sativa), Brassica (Brassica spp.), and soybean (Glycine max) have had pangenomes constructed and released, and this has led to the discovery of valuable genes associated with disease resistance and yield components. However, pangenome data are not available for many less prominent crop species that are currently under-utilised. Despite many under-utilised species being important food sources in regional populations, the scarcity of genomic data for these species hinders their improvement. Here, we assess several under-utilised crops and review the pangenome approaches that could be used to build resources for their improvement. Many of these under-utilised crops are cultivated in arid or semi-arid environments, suggesting that novel genes related to drought tolerance may be identified and used for introgression into related major crop species. In addition, we discuss how previously collected data could be used to enrich pangenome functional analysis in genome-wide association studies (GWAS) based on studies in major crops. Considering the technological advances in genome sequencing, pangenome references for under-utilised species are becoming more obtainable, offering the opportunity to identify novel genes related to agro-morphological traits in these species.Entities:
Keywords: QTLs; graph pangenomes; pangenome assembly; presence absence variation; trait discovery
Mesh:
Year: 2022 PMID: 35269811 PMCID: PMC8910360 DOI: 10.3390/ijms23052671
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Plant pangenomes constructed to date and method of assembly.
| Species Name | # of Individual Genomes | Assembly Method | References |
|---|---|---|---|
|
| 10 | Iterative mapping and assembly | [ |
|
| 7 | De novo assembly | [ |
|
| 54 | De novo assembly | [ |
|
| 4 | De novo assembly | [ |
|
| 53 | Iterative mapping and assembly | [ |
|
| 8 | De novo assembly | [ |
|
| 10 | Iterative mapping and assembly | [ |
|
| 89 | Iterative mapping and assembly | [ |
|
| 5 | Iterative mapping and assembly | [ |
|
| 29 | Graph-based de novo assembly | [ |
|
| 1110 | Iterative mapping and assembly | [ |
|
| 1961 | De novo assembly | [ |
|
| 20 | De novo assembly | [ |
|
| 287 | De novo assembly | [ |
|
| 91 | De novo assembly | [ |
|
| 57 | Practical haplotype graphs | [ |
|
| 15 | De novo assembly | [ |
|
| 3 | De novo assembly | [ |
|
| 31 | De novo assembly | [ |
|
| 10 | De novo assembly | [ |
|
| 5 | De novo assembly | [ |
|
| 725 | De novo assembly | [ |
|
| 398 | Practical haplotype graphs | [ |
|
| 176 | Iterative mapping and assembly | [ |
|
| 18 | Iterative mapping and assembly | [ |
|
| 4705 | Practical haplotype graphs | [ |
Research involving underutilised crops without genomic references.
| Scientific Names | Common Names | Type of Resource | References |
|---|---|---|---|
|
| Malabar spinach | Reports of viruses infecting Malbar spinach | [ |
| Chromosome counts/Nuclear DNA quantification | [ | ||
|
| Guinea arrowroot | Future prospects for underutilised medicinally valuable plants | [ |
|
| Milk tree | Identifying pollinators in edible Amazon fruit plants | [ |
|
| Greater sea-kale | Ancestral chromosomal blocks in Brassiceae species | [ |
|
| Tassel grape hyacinth | Identifying physiological responses | [ |
| Mineral content and chemical analysis | [ | ||
|
| Mongongo tree | Sustainability review | [ |
| Chemical composition of oil | [ | ||
|
| Ulluco | Viruses detected in ulluco | [ |
| High throughput sequencing to detect novel viruses in ulluco | [ |
Underutilised crops with genetic resources.
| Scientific Names | Common Names | Type of Genomic Resources | References |
|---|---|---|---|
| Cereal grains | |||
|
| African arrowroot | Chloroplast genome sequence | [ |
|
| White fonio | Genome assembly and annotation | [ |
| Genotype-by-sequencing and SNP data | [ | ||
|
| Little Millet | Chloroplast genome sequences | [ |
| De novo transcriptome assembly | [ | ||
| Vegetable/Pulse crops | |||
|
| Hyacinth bean/Lablab bean | Chloroplast genome assembly | [ |
| Draft genome assembly | [ | ||
| Upregulation of drought tolerant genes | [ | ||
| RFLP markers | [ | ||
|
| Black nightshade plant | Transcriptome sequence | [ |
| Chloroplast genome sequence | [ | ||
|
| Moth bean | Genetic linkage map | [ |
| Novel | [ | ||
| Tuberous crops | |||
|
| Yam bean | Draft genome assembly | [ |
|
| Zombi pea or Wild cowpea | Anti-inflammatory bioactivity | [ |
| QTL analysis | [ | ||
| Molecular linkage analysis | [ | ||
| Hybridisation accession analysis | [ | ||
| Industrial Crops | |||
|
| Safflowers | Transcriptome sequencing | [ |
| Chromosome-scale reference genome | [ | ||
| Chloroplast genome sequence | [ | ||
| Genetic mapping of SNPs | [ | ||
|
| Kenaf | Mitochondrial genome assembly | [ |
| Genome assembly and annotation | [ | ||
| [ | |||
| Fruit/Nuts | |||
|
| Peach palm | Chloroplast DNA for phylogenetic study | [ |
| Macaúba palm transcriptome sequencing | [ | ||
| RNA-seq of tropical palms | [ | ||
| Plastome sequence | [ | ||
|
| Desert Watermelon or Wild watermelon | Gene markers | [ |
| Transcriptome assembly | [ | ||
| Genome Resequencing | [ | ||
|
| Russian olive or wild olive | Geographic study using machine learning | [ |
| Hi-C assembly | [ | ||
| Transciptome profiling | [ | ||
| Plant signalling regarding salt | [ | ||
|
| Ethiopian Banana | Genome assembly | [ |
| Pangenome assembly | [ | ||
| Markers/Microsatellites | [ | ||
| Metabolite data | [ | ||
|
| Açaí | Chemical genomic profiling | [ |
| Karyotype and genome size | [ | ||
|
| Guava | Genome assembly | [ |
| Genome Markers | [ | ||
| RNA-seq/transcriptome assembly | [ | ||
|
| Agraz or Colombian Berry | Phylogenetic relationships within the | [ |
| Chemical, antimicrobial and molecular characterisation | [ | ||
| Characterisation of phenotypic plasticity | [ | ||
| Antiproliferative potential of Agraz juice | [ | ||
Figure 1Scheme showing three pangenome assembly methods. Sequence reads from genomes A, B and C can be used to assemble the species pangenome using de novo method yielding three separate genomes that will be compared to define the core and variable regions. In the iterative assembly, genome A is assembled de novo and used as a reference for assembling the remaining genomes B and C. Because genome A has different genes from genome B and C, it may change the gene order in genome B (highlighted in the blue box) or collapsing CNV in genome C (highlighted in the blue box). In the iterative assembly, genes not represented in the reference genome (genome A) have to be assembled de novo and may lose their location information as shown by the green gene below genome B assembly. Graph pangenome assembly of genomes A, B and C represent the genes as interconnected nodes, each path representing a genome.
Figure 2Predicted benefits to plant breeding from future developments in pangenomics. Improvements in pangenome assembly and annotation combined with machine learning (ML) technology will increase the accuracy of analyses on gene presence/absence variation (PAV) and structural variation (SV) in different individuals of crop species. These analyses will be available to plant breeders through new tools and browsers, allowing easier selection of traits and genetic diversity in crop plants.