| Literature DB >> 32410666 |
Lingbo Zhou1, Can Wang1, Xu Gao1, Yanqing Ding1, Bin Cheng1, Guobing Zhang1, Ning Cao1, Yan Xu1, Mingbo Shao1, Liyi Zhang2.
Abstract
BACKGROUND: Hongyingzi is a sorghum (Sorghum bicolor L. Moench) cultivar for brewing Moutai liquor. For an overall understanding of the whole genome of Hongyingzi, we performed whole-genome resequencing technology to reveal its comprehensive variations.Entities:
Keywords: Genome-wide variations analysis; Hongyingzi; Sorghum; Whole-genome resequencing technology
Mesh:
Substances:
Year: 2020 PMID: 32410666 PMCID: PMC7227080 DOI: 10.1186/s41065-020-00130-4
Source DB: PubMed Journal: Hereditas ISSN: 0018-0661 Impact factor: 3.271
Summary of resequencing data of Hongyingzi
| Raw base (Gb) | Clean base (Gb) | Effect rate (%) | Error rate (%) | Q20 (%) | Q30 (%) | GC content (%) |
|---|---|---|---|---|---|---|
| 45.84 | 45.79 | 99.82 | 0.03 | 97.55 | 93.10 | 44.30 |
Sequence alignment of Hongyingzi to BTx623
| Mapped reads | Total reads | Mapping rate (%) | Average depth (X) | Coverage at least | Coverage at least |
|---|---|---|---|---|---|
| 297,504,853 | 305,064,750 | 97.52 | 56.10 | 95.94 | 94.17 |
Fig. 1Genome-wide landscape of genetic variation in Hongyingzi. Cycles from outside to inside indicate chromosome, single nucleotide polymorphism (SNP), small fragments insertion and deletion (Indel), copy number variation (CNV) duplication and deletion, structural variation (SV) insertion, SV deletion, SV invertion, and SV intrachromosomal translocation (ITX) and SV interchromosomal translocation (CTX)
Annotations of single nucleotide polymorphisms (SNPs), small fragments insertions and deletions (Indels), structural variations (SVs), and copy number variations (CNVs) detected in Hongyingzi
| Category | Numbers of SNPs | Numbers of Indels | Numbers of SVs | Numbers of CNVs | Region |
|---|---|---|---|---|---|
| Intergenic | 1,515,993 | 190,165 | 9661 | 17,082 | |
| 1 kb of upstream | 89,326 | 38,198 | 1915 | 985 | |
| 1 kb of downstream | 75,170 | 28,361 | 1460 | 789 | |
| Upstream/downstream | 6344 | 2779 | 176 | 96 | |
| Gain of stop codons | 453 | 103 | Coding regions | ||
| Loss of stop codons | 125 | 22 | Coding regions | ||
| Synonymous | 38,176 | Coding regions | |||
| Non-synonymous | 37,774 | Coding regions | |||
| Frameshift (insertions) | 1354 | Coding regions | |||
| Frameshift (deletions) | 1476 | Coding regions | |||
| Non-frameshift (insertions) | 3219 | Coding regions | |||
| Non-frameshift (deletions) | 3201 | Coding regions | |||
| Exonic | 3657 | 1822 | |||
| Intronic | 122,211 | 40,223 | 1119 | 496 | |
| Splicing sites | 202 | 189 | 5 |
Fig. 2Summary of genes with three types of variations, single nucleotide polymorphism (SNPs), small fragments insertions and deletions (Indels), and structural variations (SVs). The blue pie indicates SNPs, the red pie indicates Indels, and the green pie indicates SVs
Fig. 3WEGO clustering of genes with three types of variations, single nucleotide polymorphism (SNPs) and small fragments insertions and deletions (Indels). The red bar indicates SNPs, the blue bar indicates Indels
Fig. 4Classification of gene variations compared with Kyoto Encyclopedia of Genes and Genomes (KEGG) database. a: Top 20 significantly enriched pathways with single nucleotide polymorphism (SNPs) variations. b: Top 20 significantly enriched pathways with small fragments insertions and deletions (Indels) variations. Gene number: number of target genes in each term or pathway. Rich factor: the ratio of the number of target genes divided by the number of all the gene in each term or pathway
Variation genes involved in tannin synthesis
| Gene name | Chromosome | Annotation | Variation type | Variation information |
|---|---|---|---|---|
| 1 | Non-synonymous SNP | 1,175,307 bp, C/A | ||
| 1 | Non-synonymous SNP | 15,851,598 bp, G/A; 15,851,639 bp, C/T; 15,851,643 bp, G/C; 15,851,644 bp, G/C; 15,857,015 bp, G/A | ||
| Non-frameshift insertion | 15,851,633 bp, −/GGT | |||
| 1 | Non-frameshift insertion | 15,867,230 bp, −/GCACGG | ||
| 1 | Non-synonymous SNP | 15,877,514 bp, G/T; 158,77,530 bp, T/A; 15,879,898 bp, T/G | ||
| Non-frameshift deletion | 15,880,626 bp, ACCGGCGCC/− | |||
| 4 | Non-frameshift insertion | 67,834,717 bp, −/GCTGCT | ||
| 4 | Non-synonymous SNP | 67,848,132 bp, C/G; 67,848,135 bp, C/G; 67,848,250 bp, T/G | ||
| 7 | Non-synonymous SNP | 60,025,990 bp, C/A | ||
| 1 | Non-synonymous SNP | 65,069,116 bp, T/C | ||
| 1 | Non-synonymous SNP | 80,740,375 bp, C/G | ||
| 3 | Non-frameshift insertion | 73,733,903 bp, −/CCG | ||
| 10 | Non-synonymous SNP | 5,033,142 bp, C/A; 5,033,614 bp, C/T; 5,033,877 bp, C/A; 5,034,044 bp, C/T; 5,034,053 bp, G/C | ||
| Frameshift insertion | 5,032,581 bp, −/GC; 5,032,758 bp, −/GAGC; 5,033,017 bp, −/ATCT | |||
| Non-frameshift deletion | 5,032,669 bp, GTGCTGTTC/− | |||
| Non-frameshift insertion | 5,033,742 bp, −/GGG; 5,034,105 bp, −/TTCCAC | |||
| Gain of stop codons | 5,034,223 bp, −/CTATTTCA | |||
| 10 | Non-synonymous SNP | 55,088,876 bp, A/C | ||
| 2 | Non-synonymous SNP | 14,508,960 bp, C/A | ||
| 2 | Non-synonymous SNP | 68,442,264 bp, A/C; 68,442,283 bp, G/A | ||
| 4 | Non-synonymous SNP | 53,190,344 bp, C/T | ||
| 5 | Non-synonymous SNP | 58,503,342 bp, T/A; 58,503,472 bp, T/C; 58,503,507 bp, G/A; 58,503,555 bp, C/G | ||
| 5 | Non-synonymous SNP | 58,859,286 bp, C/G | ||
| 5 | Non-synonymous SNP | 58,881,162 bp, G/A | ||
| 5 | Non-synonymous SNP | 58,943,632 bp, C/T | ||
| 8 | Non-synonymous SNP | 3,477,776 bp, G/A | ||
| Non-frameshift deletion | 3,477,795 bp, ACG/− | |||
| 3 | Non-synonymous SNP | 57,029,960 bp, C/T | ||
| 3 | Non-frameshift deletion | 57,041,941 bp, CTGGGA/− | ||
| 4 | Non-frameshift deletion | 4,052,019 bp, AAC/− | ||
| 9 | Non-synonymous SNP | 4,149,752 bp, T/C; 4,149,842 bp, G/A; 4,149,896 bp, G/T; 4,149,998 bp, T/C; 4,150,031 bp, C/G | ||
| 4 | Non-synonymous SNP | 55,234,140 bp, T/G | ||
| Non-frameshift deletion | 55,233,739 bp, CGGGAA/− | |||
| 9 | Non-synonymous SNP | 51,944,205 bp, A/G; 51,948,174 bp, C/G | ||
| 4 | Non-synonymous SNP | 58,382,355 bp, G/A; 58,382,419 bp, G/A; 58,383,602 bp, A/G; 28,383,682 bp, G/T | ||
| 4 | Non-synonymous SNP | 58,391,947 bp, C/T | ||
| Frameshift deletion | 58,392,294 bp, CTAC/− | |||
| 5 | Non-synonymous SNP | 62,814,031 bp, G/A; 62,814,043 bp, A/G; 62,814,250 bp. C/G | ||
| Non-frameshift deletion | 62,816,156 bp, CGTCAACGT/− | |||
| Frameshift deletion | 62,813,716 bp, C/−; 62,813,926 bp, A/−; 62,814,183 bp, C/− | |||
| Frameshift insertion | 62,813,832 bp, −/A; 62,814,474 bp, −/TA | |||
| 4 | Non-synonymous SNP | 64,699,203 bp, G/A | ||
| 6 | Non-synonymous SNP | 59,157,048 bp, A/T; 59,157,274 bp, C/T; 59,158,255 bp, T/A | ||
| 6 | Non-synonymous SNP | 59,160,879 bp, A/C; 59,161,461 bp, G/A | ||
| 7 | Non-synonymous SNP | 4,721,737 bp, G/C; 4,721,966 bp, C/T; 4,724,116 bp, T/C | ||
| 10 | Non-synonymous SNP | 4,072,017 bp, C/G | ||
| 8 | Non-synonymous SNP | 2,684,008 bp, C/G |
Fig. 5DNA sequence alignments of 11 genes that had only 1 bp non-synonymous SNP variation in coding regions. a:Sobic.001G012600. b:Sobic.001G543900. c:Sobic.003G230900. d:Sobic.004G179000. e:Sobic.004G310100. f:Sobic.005G136200.g:Sobic.005G136300. h:Sobic.007G165500. i:Sobic.008G030100. j:Sobic.010G207800. k: Sobic.010G052200