| Literature DB >> 32831010 |
Anna Bienias1, Magdalena Góralska1, Piotr Masojć1, Paweł Milczarski1, Beata Myśków2.
Abstract
BACKGROUND: Transcription factor (TF) GAMYB, belonging to MYB family (named after the gene of the avian myeloblastosis virus) is a master gibberellin (GA)-induced regulatory protein that is crucial for development and germination of cereal grain and involved in anther formation. It activates many genes including high-molecular-weight glutenin and α-amylase gene families. This study presents the first attempt to characterize the rye gene encoding GAMYB in relation to its sequence, polymorphisms, and phenotypic effects.Entities:
Keywords: DArTseq; High-density genetic map; RNAseq; SNP; ScGAMYB; Secale cereale L.; transcription factor
Mesh:
Substances:
Year: 2020 PMID: 32831010 PMCID: PMC7444254 DOI: 10.1186/s12864-020-06991-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Result of data processing of RNAseq performed for four rye inbred lines using Illumina HiSeq 2000 system
| Sample | Raw Data | Trimmed Data | ||||
|---|---|---|---|---|---|---|
| Total Bases | Read Count | Q30 (%) | Total Bases | Read Count | Q30 (%) | |
| M12w1 | 11,702,831,822 | 115,869,622 | 96.27 | 11,314,892,610 | 112,863,536 | 97.56 |
| M12w2 | 9,580,521,852 | 94,856,652 | 96.61 | 9,305,022,511 | 92,771,556 | 97.71 |
| M12wl1 | 8,383,791,436 | 83,007,836 | 96.39 | 8,121,465,377 | 80,988,190 | 97.61 |
| M12wl2 | 10,188,945,852 | 100,880,652 | 96.52 | 9,902,654,357 | 98,700,562 | 97.64 |
| M12wl3 | 8,262,443,572 | 81,806,372 | 96.44 | 8,012,781,602 | 79,886,624 | 97.63 |
| DS2w1 | 8,990,270,782 | 89,012,582 | 96.7 | 8,745,071,781 | 87,164,468 | 97.75 |
| DS2w2 | 8,919,988,720 | 88,316,720 | 96.46 | 8,651,277,777 | 86,255,968 | 97.63 |
| DS2wl1 | 9,516,783,378 | 94,225,578 | 96.4 | 9,221,235,694 | 91,939,090 | 97.6 |
| DS2wl2 | 9,282,121,392 | 91,902,192 | 96.42 | 8,997,604,730 | 89,717,130 | 97.61 |
| DS2wl3 | 10,305,669,936 | 102,036,336 | 96.01 | 10,003,200,236 | 99,819,220 | 97.19 |
| RXL10w1 | 9,914,476,130 | 98,163,130 | 96.39 | 9,618,122,251 | 95,877,590 | 97.57 |
| RXL10w2 | 8,362,917,362 | 82,801,162 | 95.97 | 8,111,094,232 | 80,948,782 | 97.18 |
| RXL10wl1 | 6,894,170,918 | 68,259,118 | 95.69 | 6,680,084,234 | 66,723,890 | 96.95 |
| RXL10wl2 | 7,355,772,834 | 72,829,434 | 96.44 | 7,133,202,875 | 71,110,558 | 97.61 |
| L35w3 | 7,553,455,084 | 74,786,684 | 96.39 | 7,319,205,497 | 72,975,450 | 97.59 |
| L35wl3 | 8,013,652,696 | 79,343,096 | 96.59 | 7,795,961,803 | 77,714,250 | 97.65 |
| L35wl | 9,717,103,344 | 96,208,944 | 96.58 | 9,444,304,014 | 94,146,958 | 97.66 |
De novo transcriptome assembly statistics with Trinity program for 17 RNA samples of four rye inbred lines
| Sample | All transcript contigs | Only longest isoform per ‘GENE’ |
|---|---|---|
| Total trinity ‘genes’ | 150,758 | 150,758 |
| Total trinity transcripts | 218,935 | 150,758 |
| N50 | 1366 | 1366 |
| Maximum contig length | 16,024 | 16,024 |
| Minimum contig length | 201 | 201 |
| Median contig length | 500 | 401 |
| Average contig length | 839.29 | 703.25 |
| Total assembled bases | 183,750,947 | 106,021,123 |
Similarity of ScGAMYB coding sequence (cds) with sequences of related cereal species - NCBI BLAST analyze results
| Species | Max Score | Query cover | % Identity | Accession no. |
|---|---|---|---|---|
| 1277 | 99% | 96.74 | AY008692.1 | |
| 1279 | 100% | 96.63 | JF951917.1 | |
| 1254 | 100% | 96.10 | AB214883.1 | |
| 878 | 99% | 87.50 | AJ133638.1 | |
| 852 | 100% | 86.68 | XM_003564404.3 |
Fig. 1The structure of the ScGAMYB gene determined based on the sequence data of six studied rye inbred lines with respect to the genomic sequence of the Lo7 inbred line scaffold [21]
Polymorphisms between rye inbred lines in ScGAMYB sequence detected in nucleotide (nt) sequence, expected amino acid (aa) sequence and predicted protein second-structure (2D)
| changed nt position | type of nt change* | line with change | changed aa position | type of aa change | line with change | change in 2D |
|---|---|---|---|---|---|---|
| Lines: DS2, L35, M12, RXL10 | ||||||
| 99 | C → T | DS2 | 33 | – | – | – |
| 129 | G → A | DS2 | 43 | – | – | – |
| 204 | C → G | DS2 | 68 | – | – | – |
| 226, 227 | AA→TT | M12 | 76 | N → F | M12 | + |
| 259–261 | ATC → GCG | M12 | 87 | I → A | M12 | + |
| 265 | T → C | M12 | 89 | Y → H | M12 | + |
| 271 | C → A | M12 | 91 | – | – | – |
| 277 | G → A | M12 | 93 | D → N | M12 | – |
| 564 | C → T | DS2 | 188 | – | – | – |
| Lines: 541, Ot1–3, DS2, L35, M12, RXL10 | ||||||
| 681 | C↔T | L35, M12, RXL10 | 227 | – | – | – |
| 698 | A↔C | L35, M12, RXL10 | 233 | Q↔P | L35, M12, RXL10 | + |
| 720 | A↔G | L35, M12, RXL10 | 240 | – | – | – |
| 735 | G↔A | L35, M12, RXL10 | 245 | – | – | – |
| 741 | C↔T | L35, M12, RXL10 | 247 | – | – | – |
| 944 | G↔A | L35, M12, RXL10 | 315 | S↔N | L35, M12, RXL10 | – |
| | ||||||
| 1110 | G → C | 541, DS2 | 370 | – | – | + |
| 1185 | C → T | 541, DS2 | 395 | – | – | – |
| 1190, 1191 | GG → GA/AG | 541/DS2 | 397 | R → K | DS2 | + |
| 1375 | T → C | 541,Ot1–3 | 459 | S → P | 541, Ot1–3 | – |
*The unidirectional arrows were used when the change concerned a minority of lines from the set of four or six lines tested. Changes important for the 2D protein structure and related to the effect of ScGAMYB on AMY are bolded
Characteristics of AS-PCR primers for ScGAMYB identification in rye, differentiating rye inbred lines used as parental components of three mapping populations: BSR-F2 (S32N/07 × RXL10), RIL-K (541 × Ot1–3)), and RIL-L (DS2 × RXL10)
| Common primer | AS primer designed to 541 line sequence | AS primer designed to Ot1–3 line sequence | SNP position (cds / exon 3 fragment) | Product size (bp) | |||||
|---|---|---|---|---|---|---|---|---|---|
| Name | Sequence | Name | Sequence | Allele presented in line | Name | Sequence | Allele presented in line | ||
| 31F20 | catcaggcgacgcagtgctc | 541.285R20c | ccgtcagtgaaatcggagtc | 541, DS2, S32N/07 | Ot.285R20g | ccgtcagtgaaatcggagtg | Ot1–3 | 1100/485 | 274 |
| 541.275R20g | aatcggagtstgcgtcgccg | 541, DS2 | Ot.275R20c | aatcggagtctgcgtcgccc | Ot1–3 | 1110/495 | 264 | ||
| 17F21 | ggagagctgaaaaacatcagg | 541.285R20c | ccgtcagtgaaatcggagtc | 541, DS2, S32N/07 | Ot.285R20g | ccgtcagtgaaatcggagtg | Ot1–3, RXL10 | 1100/485 | 288 |
| 541.275R20g | aatcggagtstgcgtcgccg | 541, DS2 | Ot.275R20c | aatcggagtctgcgtcgccc | Ot1–3, RXL10 | 1110/495 | 278 | ||
| 583R24 | cggtcgatcagttctcaaatgact | 541.266F20c | gctgttcctcggcgacgcac | 541, DS2, S32N/07 | Ot.266F20g | gctgttcctcggcgacgcag | Ot1–3, RXL10 | 1100/485 | 341 |
Fig. 2The changes in secondary structure of ScGAMYB of rye inbred lines resulting from SNPs (the protein structure predicted with EMBOSS 6.5.7 plug)
Fig. 3Relationships between rye inbred lines (DS2, L35, M12, RXL10, Lo7) and related species established based on GAMYB sequences using UPGMA method. The Lo7 sequence is sequence of scaffold Sc17168 [21]
Fig. 4Location of the ScGAMYB on the chromosome 3R of populations BSR-F2, RIL-K [22] and RIL-L [22]. To integrate these three maps the reference map of RIL-S population [23] was used
Relationship between allele polymorphism in the ScGAMYB locus and variance of α-amylase activity (AMY) established using Kruskal-Wallis test (K-W), Fisher test (F) and composite interval mapping (CIM) procedure in RIL-K and RIL-L mapping populations. Only statistically significant values are shown
| Population | Year | Number of genotypes | Mean AMY value | Test | Statistics | Significance | ||
|---|---|---|---|---|---|---|---|---|
| mm | U/ml | |||||||
| RIL-K | 2014 | a (541) | 42 | 11.94 | 8.12 | K-W | H = 7.24 | |
| b (Ot1–3) | 39 | 10.89 | 1.79 | |||||
| RIL-L | 2011 | a (DS2) | 17 | 15.09 | 72.80 | F | F = 5.21 | |
| b (RXL10) | 25 | 14.33 | 33.60 | |||||
| RIL-L | 2007 | a (DS2) | 17 | 15.02 | 56.63 | CIM | LOD = 2.71 | |
| b (RXL10) | 23 | 14.44 | 42.73 | |||||