| Literature DB >> 35365708 |
Ephrem Habyarimana1, Sunita Gorthy2, Faheem S Baloch3, Sezai Ercisli4, Gyuhwa Chung5.
Abstract
Sorghum (Sorghum bicolor L. (Moench)) is the world's fifth economically most important cereal and is a staple particularly in the semi-arid tropics of Africa and Asia. Genetic gains in this crop can benefit from wild relatives such as Sorghum halepense. Genome sequences including those from this wild species can boost the study of genome-wide and intraspecific variation for dissecting the genetic basis and improving important traits in sorghum. The whole-genome resequencing carried out in this work on a panel of 172 populations of S. bicolor and S. bicolor × S. halepense (SbxSh) advanced lines generated a total of 567,046,841 SNPs, 91,825,474 indels, 1,532,171 SVs, and 4,973,961 CNVs. Clearly, SbxSh accumulated more variants and mutations with powerful effects on genetic differentiation. A total of 5,548 genes private to SbxSh mapped to biological process GO enrichment terms; 34 of these genes mapped to root system development (GO: 0022622). Two of the root specific genes i.e., ROOT PRIMORDIUM DEFECTIVE 1 (RPD1; GeneID: 8054879) and RETARDED ROOT GROWTH (RRG, GeneID: 8072111), were found to exert direct effect on root growth and development. This is the first report on whole-genome resequencing of a sorghum panel that includes S. halepense genome. Mining the private variants and genes of this wild species can provide insights capable of boosting sorghum genetic improvement, particularly the perenniality trait that is compliant with agroecological practices, sustainable agriculture, and climate change resilience.Entities:
Mesh:
Year: 2022 PMID: 35365708 PMCID: PMC8976056 DOI: 10.1038/s41598-022-09433-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Genetic differentiation analysis in Sb and SbxSh populations. The ellipses are drawn accounting for the 95% confidence interval and the Euclidean distance from the center “o”. Genotypes outside the ellipses are outliers[74].
Tukey HSD for SNPs, InDels, SVs, and CNVs in S. bicolor vs. S. bicolor × S. halepense results.
| SNPs | Total SNPs | Homozygous | Heterozygous | Syn_CDS | Nonsyn_CDS | Exon | Gene | Intron | mRNA | Pseudogene | Transcript | tRNA |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sb-SNP | 2,740,707.91 b | 1,568,460.64 a | 1,172,247.27 b | 51,930.82 b | 53,090.27 b | 259,126.96 b | 367,089.86 b | 0.28 b | 502,355.61 b | 24,551.5 b | 66,349.91 b | 47.39 b |
| SbxSh-SNP | 7,774,659.53 a | 266,261.84 b | 7,508,397.68 a | 224,011.11 a | 192,976.26 a | 1,039,206.53 a | 1,487,971.84 a | 0.79 a | 2,098,316.74 a | 72,998.58 a | 248,660 a | 97.89 a |
| CNVs, SV, InDels | CDS | cDNA match | Direct repeat | Exon | Gene | Intron | Lnc RNA | mRNA | Pseudogene | rRNA | tRNA | Transcript |
| Sb-CNV | 220,969.2 a | 813.5 b | 11.71 a | 299,489.67 b | 35,240.17 b | 6.01 a | 6870.76 b | 44,166.06 b | 1481.2 b | 11.03 a | 569.77 a | 3868.22 b |
| SbxSh-CNV | 220,873.05 a | 816 a | 12.42 a | 299,802.58 a | 35,634.95 a | 6:00 AM | 7426.47 a | 44,819.11 a | 1565 a | 11.05 a | 569.74 a | 4134.21 a |
| Sb-SV | 613,211.95 b | 2270.12 a | 6459.33 a | 831,108.19 b | 89,783.18 b | 293.41 a | 12,531.91 b | 1616.44 a | 3531.11 a | 212.8 a | 1615.27 a | 5385.1 b |
| SbxSh-SV | 714,082.79 a | 2530.32 a | 19.68 a | 976,566.21 a | 104,589.79 a | 18.79 a | 15,119.68 a | 1739.21 a | 3927.74 a | 18.63 a | 1640.47 a | 6678.53 a |
| Sb-InDels | 12,638.73 b | 205.48 b | 48,117.67 a | 88,723.88 b | 27,024.42 b | NA | 74,829.22 b | 361,489.41 b | 345,870.29 a | NA | 6755.33 a | 1708.89 b |
| SbxSh-InDels | 36,324.74 a | 604.53 a | 65,756.26 a | 262,396.16 a | 268,724.05 a | NA | 134,405.79 a | 662,402.89 a | 419,384.79 a | NA | 8914.84 a | 20,483.42 a |
*Numbers followed by same letter within a column and for same variant, are not statistically different at the 5% probability level.
Figure 2Chromosomal distribution of WGRS information in Sb and BC1 SbxSh lines, and GWAS data from the evaluated populations[74]. The x-axis corresponds to the genomic coordinate. Tracks 1–4: visualize the genomic density of regions (defined as the fraction of a genomic window that is covered by genomic regions). Tracks 5, 8: significant SNPs and candidate genes are displayed according to their genomic coordinates (x-axis), while y-values were set for the sole purpose improving the resolution (legibility) of the corresponding SNPs and candidate genes.
Figure 3Statistics of different types of large-effect SNPs. Re-sequenced entries are represented on x-axis, numbers on top of each bar represent the number of SNPs[74].
Figure 4Number of shared and private genes among Sb1 (IESV 99,091 DL) and two sister perennial RILs (SbxSh9 and SbxSh50) derived from SbxSh cross backcrossed (2 recurrent parent doses: Tx623*2/Gypsum 9; BC1) to S. bicolor[74].
Hierarchical relations between over-represented (enriched) functional classes using GO biological process annotation in private genes of S. bicolor*2/S. halepense relative to S. bicolor lines.
| Biological process category¥ | Obs | Exp | Fold | O/U | P.value | |
|---|---|---|---|---|---|---|
| Plant-type cell wall organization (GO:0009664) | 66 | 28 | 10.87 | 2.58 | + | 2.02E−02 |
| Cell wall organization (GO:0071555) | 249 | 78 | 41.00 | 1.90 | + | 3.20E−04 |
| Cell wall organization or biogenesis (GO:0071554) | 365 | 104 | 60.10 | 1.73 | + | 3.05E−04 |
| Cellular process (GO:0009987) | 11,897 | 2536 | 1959.00 | 1.29 | + | 5.48E−54 |
| External encapsulating structure organization (GO:0045229) | 268 | 81 | 44.13 | 1.84 | + | 7.57E−04 |
| Cellular component organization (GO:0016043) | 1929 | 469 | 317.64 | 1.48 | + | 2.85E−13 |
| Cellular component organization or biogenesis (GO:0071840) | 2271 | 558 | 373.95 | 1.49 | + | 3.22E−17 |
| Root development (GO:0048364) | 87 | 34 | 14.33 | 2.37 | + | 1.36E−02 |
| Root system development (GO:0022622) | 87 | 34 | 14.33 | 2.37 | + | 1.36E−02 |
| Anatomical structure development (GO:0048856) | 645 | 152 | 106.21 | 1.43 | + | 2.87E−02 |
| Cell wall polysaccharide metabolic process (GO:0010383) | 118 | 43 | 19.43 | 2.21 | + | 5.25E−03 |
| Cellular macromolecule metabolic process (GO:0044260) | 4903 | 983 | 807.34 | 1.22 | + | 8.72E−08 |
| Cellular metabolic process (GO:0044237) | 8202 | 1731 | 1350.57 | 1.28 | + | 6.81E−28 |
| Metabolic process (GO:0008152) | 9514 | 1998 | 1566.61 | 1.28 | + | 4.27E−33 |
| Macromolecule metabolic process (GO:0043170) | 6247 | 1253 | 1028.65 | 1.22 | + | 4.80E−11 |
| Organic substance metabolic process (GO:0071704) | 8662 | 1785 | 1426.31 | 1.25 | + | 6.68E−24 |
| Carbohydrate metabolic process (GO:0005975) | 976 | 218 | 160.71 | 1.36 | + | 1.52E−02 |
| Primary metabolic process (GO:0044238) | 8162 | 1643 | 1343.98 | 1.22 | + | 6.05E−17 |
| Glutathione metabolic process (GO:0006749) | 107 | 38 | 17.62 | 2.16 | + | 3.35E−02 |
| Peptide metabolic process (GO:000006518) | 683 | 190 | 112.47 | 1.69 | + | 2.06E−08 |
| Organonitrogen compound metabolic process (GO:1901564) | 5182 | 1074 | 853.29 | 1.26 | + | 2.05E−12 |
| Nitrogen compound metabolic process (GO:0006807) | 6844 | 1407 | 1126.96 | 1.25 | + | 1.16E−16 |
| Cellular amide metabolic process (GO:0043603) | 786 | 210 | 129.43 | 1.62 | + | 5.56E−08 |
| Cellular nitrogen compound metabolic process (GO:0034641) | 2981 | 669 | 490.86 | 1.36 | + | 1.21E−12 |
| Sulfur compound metabolic process (GO:0006790) | 301 | 81 | 49.56 | 1.63 | + | 4.80E−02 |
| Hydrogen peroxide catabolic process (GO:0042744) | 162 | 53 | 26.68 | 1.99 | + | 8.69E−03 |
| Cellular catabolic process (GO:0044248) | 1250 | 300 | 205.83 | 1.46 | + | 4.10E−07 |
| Catabolic process (GO:0009056) | 1527 | 375 | 251.44 | 1.49 | + | 1.06E−10 |
| Hydrogen peroxide metabolic process (GO:0042743) | 163 | 54 | 26.84 | 2.01 | + | 5.00E−03 |
| Reactive oxygen species metabolic process (GO:0072593) | 178 | 58 | 29.31 | 1.98 | + | 3.59E−03 |
| Anatomical structure morphogenesis (GO:0009653) | 182 | 58 | 29.97 | 1.94 | + | 6.91E−03 |
| Response to oxidative stress (GO:06979) | 274 | 85 | 45.12 | 1.88 | + | 1.38E−04 |
| Response to stress (GO:0006950) | 1514 | 343 | 249.30 | 1.38 | + | 9.78E−06 |
| Response to stimulus (GO:050896) | 2823 | 614 | 464.84 | 1.32 | + | 5.24E−09 |
| Proteasomal protein catabolic process (GO:0010498) | 216 | 65 | 35.57 | 1.83 | + | 1.14E−02 |
| Protein metabolic process (GO:0019538) | 4243 | 836 | 698.67 | 1.20 | + | 6.82E−05 |
| Organonitrogen compound catabolic process (GO:1901565) | 715 | 165 | 117.73 | 1.40 | + | 3.75E−02 |
| Organic substance catabolic process (GO:1901575) | 1308 | 312 | 215.38 | 1.45 | + | 3.32E−07 |
| Macromolecule catabolic process (GO:0009057) | 842 | 191 | 138.65 | 1.38 | + | 2.31E−02 |
| Cellular protein metabolic process (GO:0044267) | 3814 | 761 | 628.03 | 1.21 | + | 4.83E−05 |
| Response to oxygen-containing compound (GO:1901700) | 368 | 101 | 60.60 | 1.67 | + | 2.31E−03 |
| Response to chemical (GO:0042221) | 1117 | 281 | 183.93 | 1.53 | + | 1.55E−08 |
| Generation of precursor metabolites and energy (GO:0006091) | 440 | 119 | 72.45 | 1.64 | + | 5.64E−04 |
| Translation (GO:0006412) | 541 | 146 | 89.08 | 1.64 | + | 3.08E−05 |
| Peptide biosynthetic process (GO:0043043) | 548 | 149 | 90.24 | 1.65 | + | 1.40E−05 |
| Amide biosynthetic process (GO:0043604) | 601 | 158 | 98.96 | 1.60 | + | 4.19E−05 |
| Cellular nitrogen compound biosynthetic process (GO:0044271) | 1258 | 293 | 207.15 | 1.41 | + | 1.14E−05 |
| Cellular biosynthetic process (GO:0044249) | 2478 | 558 | 408.04 | 1.37 | + | 2.55E−10 |
| Biosynthetic process (GO:0009058) | 2735 | 603 | 450.35 | 1.34 | + | 8.91E−10 |
| Organonitrogen compound biosynthetic process (GO:1901566) | 1278 | 323 | 210.44 | 1.53 | + | 2.32E−10 |
| Organic substance biosynthetic process (GO:1901576) | 2576 | 569 | 424.17 | 1.34 | + | 3.32E−09 |
| Gene expression (GO:0010467) | 1647 | 386 | 271.20 | 1.42 | + | 1.85E−08 |
| Cellular macromolecule biosynthetic process (GO:0034645) | 1332 | 309 | 219.33 | 1.41 | + | 6.35E−06 |
| Macromolecule biosynthetic process (GO:0009059) | 1379 | 316 | 227.07 | 1.39 | + | 1.37E−05 |
| Hormone-mediated signaling pathway (GO:0009755) | 315 | 85 | 51.87 | 1.64 | + | 2.80E−02 |
| Cellular response to hormone stimulus (GO:0032870) | 319 | 85 | 52.53 | 1.62 | + | 4.33E−02 |
| Cellular response to organic substance (GO:0071310) | 396 | 107 | 65.21 | 1.64 | + | 2.26E−03 |
| Cellular response to chemical stimulus (GO:0070887) | 677 | 185 | 111.48 | 1.66 | + | 1.53E−07 |
| Cellular response to stimulus (GO:0051716) | 1731 | 370 | 285.03 | 1.30 | + | 8.32E−04 |
| Response to organic substance (GO:0010033) | 685 | 182 | 112.79 | 1.61 | + | 1.72E−06 |
| Response to endogenous stimulus (GO:0009719) | 543 | 144 | 89.41 | 1.61 | + | 1.06E−04 |
| Response to hormone (GO:0009725) | 538 | 144 | 88.59 | 1.63 | + | 6.22E−05 |
| Regulation of cellular process (GO:0050794) | 3891 | 756 | 640.70 | 1.18 | + | 2.51E−03 |
| Regulation of biological process (GO:0050789) | 4249 | 809 | 699.65 | 1.16 | + | 1.62E−02 |
| Biological regulation (GO:0065007) | 4812 | 939 | 792.36 | 1.19 | + | 4.36E−05 |
| Response to abiotic stimulus (GO:0009628) | 505 | 136 | 83.15 | 1.64 | + | 1.06E−04 |
| Vesicle-mediated transport (GO:0016192) | 454 | 117 | 74.76 | 1.57 | + | 6.54E−03 |
| Transport (GO:0006810) | 2373 | 517 | 390.75 | 1.32 | + | 2.73E−07 |
| Establishment of localization (GO:0051234) | 2407 | 523 | 396.34 | 1.32 | + | 3.07E−07 |
| Localization (GO:0051179) | 2509 | 551 | 413.14 | 1.33 | + | 1.90E−08 |
| Ribonucleoprotein complex biogenesis (GO:0022613) | 433 | 111 | 71.30 | 1.56 | + | 1.43E−02 |
| Cellular component biogenesis (GO:0044085) | 1079 | 275 | 177.67 | 1.55 | + | 6.97E−09 |
| Protein-containing complex subunit organization (GO:0043933) | 530 | 130 | 87.27 | 1.49 | + | 2.00E−02 |
| Intracellular transport (GO:0046907) | 666 | 163 | 109.67 | 1.49 | + | 1.84E−03 |
| Establishment of localization in cell (GO:0051649) | 700 | 170 | 115.26 | 1.47 | + | 1.71E−03 |
| Cellular localization (GO:0051641) | 821 | 204 | 135.19 | 1.51 | + | 2.85E−05 |
| Cellular component assembly (GO:0022607) | 641 | 156 | 105.55 | 1.48 | + | 4.23E−03 |
| Protein localization (GO:0008104) | 736 | 169 | 121.19 | 1.39 | + | 3.86E−02 |
| Macromolecule localization (GO:0033036) | 904 | 211 | 148.86 | 1.42 | + | 1.33E−03 |
| Organelle organization (GO:0006996) | 1308 | 297 | 215.38 | 1.38 | + | 8.81E−05 |
| Nitrogen compound transport (GO:0071705) | 968 | 217 | 159.39 | 1.36 | + | 1.27E−02 |
| Organic substance transport (GO:0071702) | 1156 | 258 | 190.35 | 1.36 | + | 2.43E−03 |
| Small molecule metabolic process (GO:0044281) | 1339 | 292 | 220.48 | 1.32 | + | 3.09E−03 |
| Regulation of transcription DNA-templated (GO:0006355) | 2044 | 428 | 336.57 | 1.27 | + | 8.47E−04 |
| Regulation of nucleic acid-templated transcription (GO:1903506) | 2044 | 428 | 336.57 | 1.27 | + | 8.47E−04 |
| Regulation of RNA biosynthetic process (GO:201141) | 2044 | 428 | 336.57 | 1.27 | + | 8.47E−04 |
| Regulation of RNA metabolic process (GO:0051252) | 2141 | 448 | 352.54 | 1.27 | + | 4.82E−04 |
| Regulation of nucleobase-containing compound m. proc. (GO:0019219) | 2179 | 455 | 358.80 | 1.27 | + | 4.83E−04 |
| Nucleobase-containing compound metabolic process (GO:0006139) | 2183 | 449 | 359.46 | 1.25 | + | 2.72E−03 |
| Organic cyclic compound metabolic process (GO:1901360) | 2580 | 546 | 424.83 | 1.29 | + | 4.65E−06 |
| Heterocycle metabolic process (GO:0046483) | 2389 | 505 | 393.38 | 1.28 | + | 2.27E−05 |
| Cellular aromatic compound metabolic process (GO:0006725) | 2510 | 538 | 413.30 | 1.30 | + | 1.10E−06 |
| Unclassified | 17557 | 2200 | 2890.99 | 0.76 | − | 0.00E00 |
| Positive regulation of translation (GO:0045727) | 230 | 2 | 37.87 | 0.05 | − | 4.89E−11 |
| Positive regulation of cellular protein metabolic process (GO:0032270) | 323 | 24 | 53.19 | 0.45 | − | 1.13E−02 |
| Positive regulation of protein metabolic process (GO:0051247) | 330 | 24 | 54.34 | 0.44 | − | 5.83E−03 |
| Positive regulation of gene expression (GO:0010628) | 266 | 10 | 43.80 | 0.23 | − | 1.61E−06 |
| Positive regulation of cellular amide metabolic process (GO:0034250) | 232 | 2 | 38.20 | 0.05 | − | 3.57E−11 |
| Regulation of cellular amide metabolic process (GO:0034248) | 388 | 29 | 63.89 | 0.45 | − | 1.51E−03 |
| Regulation of translation (GO:0006417) | 383 | 28 | 63.07 | 0.44 | − | 1.06E−03 |
| Posttranscriptional regulation of gene expression (GO:0010608) | 436 | 34 | 71.79 | 0.47 | − | 9.51E−04 |
¥Sorting is done only by the most specific subclass first, with its parent terms indented directly below it. These are all related classes in an ontology, and are often interpretable as a group rather than individually. If a term is a parent of more than one term in the results table, it is shown only under its first descendant. Obs/Exp, Fold, O/U, respectively, observed/expected genes unique to SbxSh9 and SbxSh50, fold enrichment, and over-/under-representation. Analysis Type: PANTHER Overrepresentation Test (Released 20,210,224); Annotation Version and Release Date: GO Ontology database https://doi.org/10.5281/zenodo.5080993 Released 2021-07-02; Reference List: Sorghum bicolor (all 33,693 genes in database); annotation dataset: GO Biological process complete; test type: binomial; correction: Bonferroni correction for multiple testing. Displayed only results for Bonferroni-corrected P < 0.05. Of the 12,484 SbxSh9 and SbxSh50 private genes, 5563 were uniquely mapped 36 of which with multiple mapping, and 6921 unmapped, respectively.