| Literature DB >> 35336623 |
Yang Liu1, Xiaomei Fang1, Tian Tang1, Yudong Wang1, Yinhuan Wu1, Jinyu Luo1, Haotian Wu1, Yingqian Wang1, Jian Zhang1, Renwu Ruan1, Meiliang Zhou2, Kaixuan Zhang2, Zelin Yi1.
Abstract
Common buckwheat (Fagopyrum esculentum M.) is known for its adaptability, good nutrition, and medicinal and health care value. However, genetic studies of buckwheat have been hindered by limited genomic resources and genetic markers. In this study, Illumina HiSeq 4000 high-throughput sequencing technology was used to sequence the transcriptome of green-flower common buckwheat (Gr) with coarse pedicels and white-flower Ukrainian daliqiao (UD) with fine pedicels. A total of 118,448 unigenes were obtained, with an average length of 1248 bp and an N50 of 1850 bp. A total of 39,432 differentially expressed genes (DEGs) were identified, and the DEGs of the porphyrins and chlorophyll metabolic pathway had significantly upregulated expression in Gr. Then, a total of 17,579 sequences containing SSR loci were detected, and 20,756 EST-SSR loci were found. The distribution frequency of EST-SSR in the transcriptome was 17.52%, and the average distribution density was 8.21 kb. A total of 224 pairs of primers were randomly selected for synthesis; 35 varieties of common buckwheat and 13 varieties of Tartary buckwheat were verified through these primers. The clustering results well verified the previous conclusion that common buckwheat and Tartary buckwheat had a distant genetic relationship. The EST-SSR markers identified and developed in this study will be helpful to enrich the transcriptome information and marker-assisted selection breeding of buckwheat.Entities:
Keywords: DEGs; EST-SSR; RNA-seq; transcriptome
Year: 2022 PMID: 35336623 PMCID: PMC8950064 DOI: 10.3390/plants11060742
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Summary of the transcriptome.
| Nucleotide Length (bp) | Transcripts | Genes |
|---|---|---|
| Min Length (bp) | 201 | 201 |
| Mean Length (bp) | 932 | 1248 |
| Median Length (bp) | 514 | 882 |
| Max Length (bp) | 16,788 | 16,788 |
| N50 (bp) | 1657 | 1850 |
| N90 (bp) | 356 | 578 |
| Total Nucleotides (bp) | 165,113,675 | 147,868,721 |
| Length Interval | Numbers | |
| <301 bp | 48,071 | 9402 |
| 301–500 bp | 38,807 | 20,658 |
| 501–1000 bp | 36,940 | 34,910 |
| 1001–2000 bp | 31,069 | 31,040 |
| >2000 bp | 22,238 | 22,238 |
Figure 1The distribution of species similarity of Nr annotations. (A) The similarity distribution of the Nr database. (B) The species classification of Nr annotations.
Figure 2Analysis of the traits of green-flower buckwheat (Gr_F) and white-flower buckwheat (UD_F). (A) Electron microscopic photograph of the inflorescence. (B) The chlorophyll content of the flowers. (C) Pedicel diameter. Asterisks denote significant difference of Gr_F and UD_F (** p < 0.01).
Figure 3Differentially expressed genes in chlorophyll metabolism. (A) The DEGs in chlorophyll metabolism, where the genes ID in red show the significantly upregulated expression and the genes in green show the downregulated expression in green-flower common buckwheat. (B) The chlorophyll synthesis pathway. The arrows show the direction of metabolism.
Figure 4Differential expression analysis of flavonoid synthesis pathway. The color represents the amount of expression, the up and down arrows next to the expression represent upregulated or downregulated expression, respectively.
Summary of the identified EST-SSRs.
| Summary of Identified EST-SSRs | Numbers |
|---|---|
| Total number of sequences examined | 118,448 |
| Total size of examined sequence (bp) | 147,868,721 |
| Total number of identified SSRs | 20,756 |
| Number of SSR containing sequences | 17,579 |
| Number of sequences containing more than 1 SSR | 2579 |
| Number of SSRs present in compound formation | 920 |
| Mono-nucleotide | 12,447 |
| Di-nucleotide | 3691 |
| Tri-nucleotide | 4417 |
| Tetra-nucleotide | 153 |
| Penta-nucleotide | 28 |
| Hexa-nucleotide | 20 |
Figure 5Frequency of different SSR repeat types in the entire sample. (A) Frequency of different motif lengths in SSRs. The X-axis shows the repeated SSR motif unit, the Y-axis shows the repeat counts, and the color shows the number of repetitions. (B) Frequency of EST-SSR motif types. The X-axis shows the SSR motif types and the Y-axis shows the SSR motif numbers.
Frequency of EST-SSR motif types. “–”: Unit Size Number of undetected duplicates.
| Repeat Numbers | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Repeated Motif | 5 | 6 | 7 | 8 | 9 | 10 | >10 | Total | % |
| A/T | - | - | - | - | - | 5369 | 6944 | 12,313 | 59.32 |
| C/G | - | - | - | - | - | 31 | 103 | 134 | 0.65 |
| AC/GT | - | 181 | 82 | 54 | 33 | 20 | 31 | 401 | 1.92 |
| AG/CT | - | 618 | 291 | 138 | 65 | 32 | 43 | 1187 | 5.72 |
| AT/AT | - | 802 | 375 | 293 | 243 | 284 | 96 | 2093 | 10.08 |
| CG/CG | - | 7 | 2 | 1 | - | - | - | 10 | 0.05 |
| AAC/GTT | 294 | 98 | 71 | 3 | - | - | 1 | 467 | 2.25 |
| AAG/CTT | 815 | 296 | 138 | 9 | - | - | 5 | 1263 | 6.09 |
| AAT/ATT | 296 | 173 | 129 | 2 | - | 1 | - | 601 | 2.90 |
| ACC/GGT | 273 | 140 | 56 | 15 | - | - | - | 484 | 2.33 |
| ACG/CGT | 80 | 35 | 24 | 2 | - | - | - | 141 | 0.68 |
| ACT/AGT | 54 | 13 | 14 | - | 1 | - | - | 82 | 0.40 |
| AGC/CTG | 192 | 69 | 30 | 2 | - | - | - | 293 | 1.41 |
| AGG/CCT | 243 | 62 | 24 | 4 | - | - | - | 333 | 1.60 |
| ATC/ATG | 352 | 151 | 64 | 1 | - | - | - | 568 | 2.74 |
| CCG/CGG | 100 | 59 | 22 | 4 | - | - | - | 185 | 0.90 |
| Others | 162 | 29 | 6 | 1 | - | 1 | 2 | 201 | 0.97 |
| Total | 2861 | 2733 | 1328 | 529 | 342 | 5738 | 7225 | 20,756 | |
| % | 13.78 | 13.17 | 6.40 | 2.55 | 1.65 | 27.65 | 34.81 | ||
Figure 6UPGMA clustering graph for buckwheat materials. The X-axis shows the genetic similarity coefficient.