| Literature DB >> 35095966 |
Yuting Li1,2, Hongchun Xiong2, Jiazi Zhang2, Huijun Guo2, Chunyun Zhou2, Yongdun Xie2, Linshu Zhao2, Jiayu Gu2, Shirong Zhao2, Yuping Ding2, Zhengwu Fang1, Luxiang Liu2.
Abstract
Induced mutagenesis is a powerful approach for the creation of novel germplasm and the improvement of agronomic traits. The evaluation of mutagenic effects and functional variations in crops is needed for breeding mutant strains. To investigate the mutagenic effects of gamma-ray irradiation in wheat, this study characterized genomic variations of wheat early heading mutant (eh1) as compared to wild-type (WT) Zhongyuan 9 (ZY9). Whole-genome resequencing of eh1 and ZY9 produced 737.7 Gb sequencing data and identified a total of 23,537,117 homozygous single nucleotide polymorphism (SNP) and 1,608,468 Indel. Analysis of SNP distribution across the chromosome suggests that mutation hotspots existed in certain chromosomal regions. Among the three subgenomes, the variation frequency in subgenome D was significantly lower than in subgenomes A and B. A total of 27.8 Gb data were obtained by exome-capturing sequencing, while 217,948 SNP and 13,554 Indel were identified. Variation annotation in the gene-coding sequences demonstrated that 5.0% of the SNP and 5.3% of the Indel were functionally important. Characterization of exomic variations in 12 additional gamma-ray-induced mutant lines further provided additional insights into the mutagenic effects of this approach. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genome (KEGG) analysis suggested that genes with functional variations were enriched in several metabolic pathways, including plant-pathogen interactions and ADP binding. Kompetitive allele-specific PCR (KASP) genotyping with selected SNP within functional genes indicated that 85.7% of the SNPs were polymorphic between the eh1 and wild type. This study provides a basic understanding of the mechanism behind gamma-ray irradiation in hexaploid wheat.Entities:
Keywords: exome; gamma-ray; genomic variation; metabolic pathways; wheat
Year: 2022 PMID: 35095966 PMCID: PMC8790116 DOI: 10.3389/fpls.2021.793496
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Statistics of whole genome re-sequencing and mapping data.
| Sample |
| Wildtype |
| Number of raw reads | 2,493,075,922 | 2,546,130,554 |
| Number of clean reads | 2,439,545,586 | 2,479,168,134 |
| Number of clean bases | 365,931,837,900 | 371,875,220,100 |
| Mapped bases | 359,893,553,535 | 364,529,898,773 |
| Mapping rate (%) | 98.35 | 98.02 |
| Repetitive rate (%) | 22.20 | 20.80 |
| Coverage rate ≥ 1X (%) | 95.16 | 95.43 |
| Coverage rate ≥ 4X (%) | 92.86 | 93.19 |
| Average sequencing depth | 25.21 | 25.53 |
FIGURE 1Diagrammatic distribution of SNP frequency across 21 chromosomes. The rectangle represents a distribution of SNP frequency in each chromosome. The x-axis indicates the physical position on the chromosomes and the y-axis shows the SNP number in every million bases.
Numbers and density of SNP and Indel variations on all chromosomes.
| Chromosome | SNP numbers | SNP density (/Mb) | Indel numbers | Indel density (/Mb) |
| 1A | 1,078,430 | 1815.5 | 72,408 | 121.9 |
| 1B | 1,027,394 | 1467.7 | 71,942 | 102.8 |
| 1D | 204,414 | 403.2 | 23,188 | 45.7 |
| 2A | 2,797,338 | 3581.7 | 154,887 | 198.3 |
| 2B | 2,709,968 | 3333.3 | 178,616 | 219.7 |
| 2D | 260,449 | 392.8 | 31,869 | 48.1 |
| 3A | 814,340 | 1068.7 | 62,718 | 82.3 |
| 3B | 1,590,668 | 1891.4 | 109,414 | 130.1 |
| 3D | 125,267 | 199.8 | 15,836 | 25.3 |
| 4A | 3,093,150 | 4091.5 | 174,294 | 230.5 |
| 4B | 1,063,554 | 1552.6 | 68,982 | 100.7 |
| 4D | 120,996 | 232.2 | 12,606 | 24.2 |
| 5A | 1,460,181 | 2025.2 | 97,871 | 135.7 |
| 5B | 1,874,889 | 2586.1 | 126,961 | 175.1 |
| 5D | 155,776 | 270.0 | 19,419 | 33.7 |
| 6A | 510,962 | 811.1 | 48,492 | 77.0 |
| 6B | 1,429,256 | 1952.5 | 94,927 | 129.7 |
| 6D | 172,587 | 355.8 | 20,017 | 41.3 |
| 7A | 1,574,567 | 2105.0 | 110,313 | 147.5 |
| 7B | 1,275,568 | 1674.0 | 89,764 | 117.8 |
| 7D | 197,363 | 303.6 | 23,944 | 36.8 |
| A genome | 11,328,968 | 2214.1 | 720,983 | 141.9 |
| B genome | 10,971,297 | 2065.3 | 740,606 | 139.4 |
| D genome | 1,236,852 | 308.2 | 146,879 | 36.4 |
| Whole genome | 23,537,117 | 1529.2 | 1,608,468 | 105.9 |
FIGURE 2Variation patterns of genomic SNPs and Indels in eh1. (A) Numbers of specific types of SNPs in the whole genome of the gamma-ray-induced mutant eh1. (B) Statistics of Indel size identified by genome resequencing. The positive and negative values represent insertions and deletions, respectively.
Statistics of exome-capturing sequencing and mapping data.
| Sample | Mutant | Wildtype |
| Number of raw reads | 92,668,310 | 94,511,490 |
| Number of clean reads | 91,693,900 | 93,029,978 |
| Number of clean bases | 13,582,499,408 | 14,176,723,500 |
| Mapping rate (%) | 99.88 | 99.89 |
| Coverage rate ≥ 5X (%) | 74.66 | 73.76 |
| Coverage rate ≥ 20X (%) | 54.82 | 54.28 |
| Coverage rate ≥ 30X (%) | 44.06 | 43.68 |
| Average sequencing depth (%) | 69.50 | 72.54 |
FIGURE 3Distribution and functional annotation of exomic variations between eh1 and WT. (A) Frequency of SNPs in different exome regions. (B) Functional annotation of SNPs in gene-coding region. (C) Frequency of Indel in different regions in the exome. (D) Functional annotation of Indels in gene-coding region.
FIGURE 4Distribution and functional annotation of exomic variations in 12 mutant lines. (A) Distribution of SNPs in different regions in the exome. (B) Functional annotation of SNPs in structural genes. (C) Distribution of Indels in different regions in the exome. (D) Functional annotation of Indels in structural genes.
FIGURE 5GO and KEGG enrichment analysis of genes with functional variations. (A) Top 20 GO terms are enriched in genes with termination or frameshift variations. The significance of enriched terms was determined by REVIGO. (B) The most enriched KEGG pathways in genes with termination or frameshift variations. The rich factor represents the degree of mutated genes in a particular pathway. The number of enriched genes is indicated by the size of the circle, and the circle color reflects the range of the corrected p-value.
FIGURE 6Scatter plot of KASP assay for genotyping selected SNPs. The blue-colored dots represent alleles with FAM tails. The HEX-tailed allele is indicated by red dots. Black dots reflect the negative control or alleles yielding no PCR products. All 26 SNPs were genotyped with four KASP assays.