| Literature DB >> 35126443 |
Zhibin Zhang1, Mao Chai1, Zhaoen Yang1,2, Zuoren Yang1,2, Liqiang Fan1.
Abstract
With the increasing amount of cotton omics data, breeding scientists are confronted with the question of how to use massive cotton data to mine effective breeding information. Here, we construct a Gossypium Resource And Network Database (GRAND), which integrates 18 cotton genome sequences, genome annotations, two cotton genome variations information, and also four transcriptomes for Gossypium species. GRAND allows to explore and mine this data with the help of a toolbox that comprises a flexible search system, BLAST and BLAT suite, orthologous gene ID, networks of co-expressed genes, primer design, Gbrowse and Jbrowse, and drawing instruments. GRAND provides important information regarding Gossypium resources and hopefully can accelerate the progress of cultivating cotton varieties.Entities:
Keywords: GRAND; Gossypium; comparative genomics; genome; variation
Year: 2022 PMID: 35126443 PMCID: PMC8814657 DOI: 10.3389/fpls.2022.773107
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of 18 cotton genome assemblies representing 14 Gossypium species.
| Species | Genome size (Mb) | Number of protein-coding genes | Contig N50 (Mb) | Scaffold N50 (Mb) | References | Data sources |
|---|---|---|---|---|---|---|
| 2,225 | 75,071 | 0.08 | 23.44 |
|
| |
| 2,295 | 72,761 | 0.11 | 15.51 | |||
| 735 | 40,743 | 6.3 | 6.3 |
|
| |
| 2,266 | 71,297 | 2.15 | 92.88 |
|
| |
| 2,347 | 70,199 | 1.89 | 97.74 | |||
| 1,710 | 40,960 | 1.10 | NA |
|
| |
| 2,286 | 73,624 | 4.76 | NA |
|
| |
| 2,309 | 73,707 | 1.98 | NA | |||
| 1,752 | 40,694 | 1.83 | 143.6 |
|
| |
| 801 | 41,471 | 26.8 | NA |
|
| |
| 780 | 41,316 | 24.7 | NA | |||
|
| 2,229 | 72,648 | 11.98 | 103.05 |
|
|
|
| 2,210 | 78,303 | 9.07 | 101.9 |
|
|
|
| 2,344 | 74,699 | 2.31 | 106.76 |
| |
|
| 1,208 | 37,016 | 10.81 | 97.68 |
|
|
|
| 1,572 | 43,952 | 1.91 | 117.88 |
|
|
|
| 1,205 | 38,378 | 17.53 | 95.88 |
|
|
|
| 765 | 39,692 | 7.91 | 60.46 |
|
|
Figure 1Schematic of GRAND database structure and web interface features. GRAND gathers 18 genomes, four transcriptomes of cotton and associated genome variations, and annotations data. All data are stored in a MySQL database.
Figure 2An example of a gene page showing multiple types of information associated with the gene. (A) The annotation information of gene “Gh_D03G071700.” (B) the gene “Gh_D03G071700” is annotated as the hAT family C-terminal dimerization region and can be also found at the end page by searching the gene family function panel with PF05699. (C) The SNP Variation or INDEL Variation analyzes the genomic variation data (SNP or INDEL) on each chromosome of a group of individuals. (D,E) The Gbrowse and Jbrowse page.
Figure 3Search functions in GRAND. (A) The results of a BLAST search. (B) The Search page contains Comparative Search, Multicriteria Search, and Gene Search. (C) The orthologous gene ID function allows searching for orthologous gene IDs among different species and versions of cotton genome.
Figure 4Tools in GRAND. (A) A heatmap showing gene expression search results. Each row represents the expression data of a gene across many samples, and each column shows the expression in a particular sample. (B) Co-expression network search results. The network shows the top 20 nodes near the target gene, and the corresponding information can be viewed by clicking. (C) Primer design page. (D) TargetDesign tool for designing primers for CRISPR/Cas9/Cpf1 genome editing.