| Literature DB >> 35567182 |
Jakob Petereit1, Jacob I Marsh1, Philipp E Bayer1, Monica F Danilevicz1, William J W Thomas1, Jacqueline Batley1, David Edwards1.
Abstract
Soybean (Glycine max) is a legume species of significant economic and nutritional value. The yield of soybean continues to increase with the breeding of improved varieties, and this is likely to continue with the application of advanced genetic and genomic approaches for breeding. Genome technologies continue to advance rapidly, with an increasing number of high-quality genome assemblies becoming available. With accumulating data from marker arrays and whole-genome resequencing, studying variations between individuals and populations is becoming increasingly accessible. Furthermore, the recent development of soybean pangenomes has highlighted the significant structural variation between individuals, together with knowledge of what has been selected for or lost during domestication and breeding, information that can be applied for the breeding of improved cultivars. Because of this, resources such as genome assemblies, SNP datasets, pangenomes and associated databases are becoming increasingly important for research underlying soybean crop improvement.Entities:
Keywords: assemblies; breeding; databases; genetic variation; genetics; genomics; germplasm; pangenome; soybean
Year: 2022 PMID: 35567182 PMCID: PMC9101001 DOI: 10.3390/plants11091181
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Sequencing required and information gained from core genomic resources for soybean breeding research (n—number of individuals included in a typical study).
Figure 2Milestones progressing soybean genomics and pangenomics. Red boxes indicate genome assemblies of modern cultivars (Wm82.a1/a2/a3—G. max Williams82, first, second and fourth revision, Lee—G. max Lee, Zh13, Zh13 imp—G. max Zhonhuang 13 and Zhonghuang 13 improved), green boxes indicate wild genome assemblies (W05—G. soja accession W05, PI 483463—G. soja accession, PI 559298—G. latifolia accession), blue boxes indicate pangenomes, including the used accessions for their construction, and the red-blue-green box depicts the Glycine super-pangenome, including G. max, G. soja and 7 perennial Glycines. Arrows indicate the use of a constructed genome assembly in a later study.
Available database resources for soybean genome investigation and computational tools.
| Data Type | Database | Description | Website | |
|---|---|---|---|---|
| Genome and | SFGD—Soybean Functional Genomics Database | Integration-friendly genome, transcriptome and protein data for the functional characterization of soybean pathways. Has a dataset, focused on soybean acyl–lipid pathways | [ | |
| Plant-Impute DB | Database for genotype imputation using high-quality reference panels | [ | ||
| GmHapMap—Haplotype Map | Haplotype map constructed using genome sequence data from 1007 soybean accessions, with 4.3 M SNPs identified | [ | ||
| SoyKB—Soybean knowledge base | Soybean data hub with genomic and genetic information linked to external datasets | [ | ||
| SGMD—Soybean genomics and microarray database | Integrated view of the interaction of soybean with the soybean cyst nematode and contains genomic, EST and microarray data with embedded analytical tools, allowing the correlation of soybean ESTs with their gene expression profiles | [ | ||
| SoyTEdb—Soybean Transposable Element database | Database of transposable elements identified by genetic and physical maps based on the Glyma1.01 assembly | [ | ||
| DAIZUbase | Genome visualization and data mining tools (Gbrownse, Unifiedmap, Geneviewer and BLAST) | [ | ||
| LegumeIP V3 | Translational genomics, offering tools to analyze gene expression data and pathway analysis | [ | ||
| SoyBase | Integrates genetic and genomic data, including QTLs and GWAS for several hybrid lines. Facilitates BLAST using soybean pangenome of all cultivars within the database | [ | ||
| Legume Federation | Visualization of genotype comparison, genome context viewer, gene annotation and visualiz+ation, synteny, QTLS and genetic markers search. SNPs and GWAS results available | |||
| Phytozome | Up-to-date repository of genome assemblies and annotation. Useful BLAST functionality between species | [ | ||
| PlantGSAD v2 | Numerous gene set annotations, including metabolic pathways and customized SEA annotations and integrated visualization features | [ | ||
| PlantGDB | Soybean genome and annotation tools for comparative genomics | [ | ||
| SoyTB—Transporter | Comparative analysis of transporter genes in 47 plant genomes and transcriptomes | [ | ||
| PCMDB—Plant cell markers database | Cell markers from 6 plant species to label 263 cell types across 22 tissues | [ | ||
| SoyVCF 2 Genomes | Compares the user-supplied genomic data in the database for the identification of the closest soybean relative in a 222 germplasm collection | [ | ||
| SNPViz v2.0 | Web-based tool for the visualization of large-scale haplotype blocks with detailed SNPs and indels grouped by their chromosomal coordinates, along with their overlapping gene models, phenotype to genotype accuracies, Gene Ontology (GO) annotations, protein families (Pfam) annotations, genomic variant annotations and their functional effects | [ | ||
| Functional networks, and co-expression data | SoyFN—Soybean Functional Networks | Gene and miRNA interaction database built into functional networks, with KEGG pathways and Gene Ontology annotations | [ | |
| SoyNet | Searchable network of soybean genes for network-based functional predictions | [ | ||
| SoyCSN—context-specific network | Computational pipeline to analyze, annotate, retrieve and visualize context-specific network at the transcriptome and interactome levels—based on the Soybean Gene Atlas project | [ | ||
| PlaNet | Platform of web-based tools for the visualization of whole-genome co-expression networks in multiple species, including soybean | [ | ||
| Protein-related data | SoyProDB—Soybean Seed Protein Database | Identification of soybean seed proteins from 2D-PAGE gels | [ | |
| PlantEAR | Database of EAR motif-containing proteins across 71 species | [ | ||
| Expressed sequence tag (EST) | HarvEST | EST data filtered using the soybean genome assembly Glyma1 | [ | |
| OcsESTdb—oil crop seed EST database | EST libraries of four oilseed species with annotated sequences | [ | ||
| Soybean Marker database | Linkage map of soybean genome and genetic markers | [ | ||
| Rsoy—Riken Soybean | cDNA sequences for the functional analysis of genomic features | [ | ||
| Manual and image phenotype | RhizoVision Crown | Crown root images and phenotypic measurements of 187 soybean lines. Additionally, it offers a tool for root phenotyping | [ | |
| SoyNAM | Phenotype and genotype data from 5555 SoyNAM lines, available through the R package NAM | [ |