| Literature DB >> 35596150 |
Yue Zhao1, Yufeng Sun1, Kun Cao1, Xiaoyan Zhang1, Jing Bian1, Chengwei Han1, Ying Jiang1, Lei Xu2, Xiaonan Wang3.
Abstract
Hemp (Cannabis sativa L.), an ancient crop, is a significant source of high-quality fiber that primarily caters to the textile industry worldwide. Fiber content is a crucial quantitative trait for evaluating fiber yield in hemp. Understanding the genetic mechanisms involved in hemp breeding is essential for improving yield. In this study, we developed 660 F1 plants from a cross between Jindao-15 (high fiber content fiber-use variety) and Fire No.1 (low fiber content fiber-use variety), and thirty plants each with high and low fiber content were selected from 305 monoecious plants of this population according to 5%-10% of population size for quantitative traits. The DNA from these plants was extracted to establish two bulk DNA pools and then subjected to the restriction digestion by the enzymes RsaI and HaeIII to obtain 314-364 bp digestion fragments and subjected to sequencing using specific length amplified fragment sequencing (SLAF-seq). Finally, we successfully developed 368,404 SLAF tags, which led to the detection of 25,133 high-quality SNPs. Combing with the resequencing results of parents, the SNPs of mixed pools were then subjected to the SNP-Index correlation algorithm, which revealed four candidate regions related to fiber content traits on Chromosome 1, with a length of 8.68 Mb and containing 389 annotated genes. The annotation information and the comparison results identified 15 genes that were highly likely to modulate the fiber content of hemp. Further, qPCR validation identified six genes (LOC115705530, LOC115705875, LOC115704794, LOC115705371, LOC115705688 and LOC115707511) that were highly positively correlated with influencing the hemp fiber content. These genes were involved in the transcription regulation, auxin and water transportion, one carbon and sugar metabolism. And non-synnoumous mutation SNPs which may play vital role in influencing the fiber content were detected in LOC115705875, LOC115704794, LOC115705688 and LOC115707511. Thus, our study highlights the importance of the combined use of SLAF-Seq and Bulked Segregant analysis (BSA) to locate genes related to hemp fiber content rapidly. Hence, our study provides novel mechanistic inputs for the fast identification of genes related to important agronomic traits of hemp and other crops catering to the textile industry.Entities:
Keywords: Bulked Segregant analysis; Candidate gene; Hemp fiber content; SNP analysis; Specific length amplified fragment sequencing
Mesh:
Year: 2022 PMID: 35596150 PMCID: PMC9123736 DOI: 10.1186/s12870-022-03594-w
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 5.260
Fig. 1Distribution diagrams of all SLAFs (black lines) on each chromosome of hemp. The black lines indicate SLAF tags
SNP filtering statistic
| Total SNP | Locus of multiple alleles | Locus with Read support less than 4 | Locus of mixed pool genotypic uniformity | Locus filtered by the parent | High quality SNP |
|---|---|---|---|---|---|
| 7,331,769 | 44,040 | 6,773,604 | 122,330 | 366,662 | 25,133 |
Fig. 2Distribution of SNP-index correlation values on chromosomes. The x-axis indicate chromosome name. Color dot represents the calculated SNPS—index (or Δ SNPS—index) values, the black line represents fitting SNP-index (or Δ SNPS—index) values. A SNP-index graphs of high fiber content mixed pool. B SNP-index graphs of low fiber content mixed pool. C Distribution diagram of ΔSNP-index value, of which the red dashed line represents the threshold line for the 99th percentile
Fig. 3GO annotated results of genes in candidate regions
Fig. 4Genes enrichment results in the candidate regions via KEGG analysis [82–84]. The x-axis represents the enrichment factor and the y-axis represents Q-value. The smaller the Q-value, the higher the enrichment degree
Fig. 5GO annotated results of non-synonymous genes
Fig. 6KEGG [82–84] annotated results of non-synonymous genes
Fig. 7A The total fiber rate of three different hemp varieties. B-G Relative expression level of six candidate genes in different hemp varieties. Each bar represents the average expression level of three independent biological replicates. Error bars show standard errors of the average values. (*P < 0.05, **P < 0.01)
Fig. 8Experimental cultivars and materials A Monoecious Jindao-15. B Female plant of dioecious Fire No.1 C Male plant of Fire No.1.D Fresh bast fiber peeled from stem
Statistic results of sequencing data from each sample
| Sample | Sample ID | Clean-Read number | Clean-Base number | Q30 percentage(%) | GC percentage(%) |
|---|---|---|---|---|---|
| Male parent | R01 | 61,937,343 | 18,556,453,868 | 93.05 | 35.00 |
| Female parent | R02 | 59,738,133 | 17,895,069,796 | 93.60 | 34.16 |
| High-fiber content pool | aa | 12,354,037 | 3,113,217,324 | 93.40 | 42.51 |
| Low-fiber content pool | ab | 19,527,628 | 4,920,962,256 | 93.60 | 40.08 |