| Literature DB >> 30159521 |
Guoqi Chen1,2, Wei Zhang1,2, Jiapeng Fang1,2, Liyao Dong1,2.
Abstract
Echinochloa phyllopogon proliferation seriously threatens rice production worldwide. We combined a restriction-site associated DNA (RAD) approach with Illumina DNA sequencing for rapid and mass discovery of simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers for E. phyllopogon. RAD tags were generated from the genomic DNA of two E. phyllopogon plants, and sequenced to produce 5197.7 Mb and 5242.9 Mb high quality sequences, respectively. The GC content of E. phyllopogon was 45.8%, which is high for monocots. In total, 4710 putative SSRs were identified in 4132 contigs, which permitted the design of PCR primers for E. phyllopogon. Most repeat motifs among the SSRs identified were dinucleotide (>82%), and most of these SSRs were four motif-repeats (>75%). The most frequent motif was AT, accounting for 36.3%-37.2%, followed by AG and AC. In total, 78 putative polymorphic SSR loci were found. A total of 49,179 SNPs were discovered between the two samples of E. phyllopogon, 67.1% of which were transversions and 32.9% were transitions. We used eight SSRs to study the genetic diversity of four E. phyllopogon populations collected from rice fields in China and all eight loci tested were polymorphic.Entities:
Keywords: Echinochloa phyllopogon; Polymorphic; RAD sequencing; SNP; SSR
Year: 2017 PMID: 30159521 PMCID: PMC6112297 DOI: 10.1016/j.pld.2017.08.004
Source DB: PubMed Journal: Plant Divers ISSN: 2468-2659
Characteristics of the eight primers tested for E. phyllopogon genotyping: locus name, forward (F) and reverse (R) primer sequences, motif, annealing temperature (Tm), fluorescent dye used (Fl. dye), allele size range (ASR), number of alleles amplified per sample, and number of alleles amplified among the plants of four populations sampled (Allele. total).
| Marker | Sequence | Motif | ASR (bp) | Fl. dye | Tm (°C) | No. of alleles per sample | Allele. total | ||
|---|---|---|---|---|---|---|---|---|---|
| Min. | Mean | Max. | |||||||
| EG _1 | F: GCTCCTGAACTGTGTACATTCTTGC | TG | 123–153 | TAM | 49 | 0 | 0.7 | 2 | 5 |
| R: TCGATTCACCCTTCAGCTTCTC | |||||||||
| EG_2 | F: CATCGGATTCAGATTGAAAGGG | TA | 131–159 | FAM | 51.5 | 1 | 1.7 | 3 | 7 |
| R: GGTCGTAGGTCTATAGTCCGTAGAGTCA | |||||||||
| EG_301 | F: GCGTCGTCAAGTCGTTCTTCTA | AT | 147–173 | TAM | 57 | 0 | 2.4 | 3 | 8 |
| R: TGTATTCAGCTGTCGTGCATGT | |||||||||
| EG_302 | F: ATTCGAACACCCATCAACCAAC | ATTT | 133–293 | FAM | 57 | 1 | 2.8 | 5 | 12 |
| R: GAAACAGAAGGGAGGTGTGCTG | |||||||||
| EG_305 | F: AGCCGTTCCTCTAGTCGGATTTCT | AT | 100–162 | ROX | 57 | 3 | 4.1 | 6 | 14 |
| R: TATTCAGCTGCCGTGCATGTAGTA | |||||||||
| EG_306 | F: TAAAACAAAACGACCGGCGTAA | CT | 146–167 | HEX | 57 | 1 | 1.25 | 2 | 7 |
| R: TCAATCATTTCAGCCTTCGGAT | |||||||||
| EG_307 | F: AACATTGTCATCACAAATATCATCATCA | ATC | 108–134 | TAM | 57 | 2 | 3.5 | 5 | 8 |
| R: AATCAAGGAAGCCCCTTCACTC | |||||||||
| EG_320 | F: CAACTCATAAGACAATTCAAAGGGTTT | TA | 136–153 | FAM | 57 | 2 | 3.0 | 4 | 5 |
| R: GCATCATTTAAGCATCAAAATGACA | |||||||||
FAM: 6-carboxyfluorescein, HEX: hexachloro-fluoresceine, ROX: carboxy-X-rhodamine, and TAM: 5-TAMRA (5-Carboxytetramethylrhodamine).
Summary statistics of the RAD tags sequencing via Illumina for E. phyllopogon.
| Feature | Total |
|---|---|
| Illumina reads (million) | 71.45 |
| Total base (million) | 10,440.6 |
| GC% | 45.8% |
| Q20 (%) | 94.0% |
| No. of contigs | 37,662 |
| Total length (bp) | 12,789,629 |
| Contig length range (bp) | 200–588 |
| Average contig length (bp) | 339.5 |
Length distributions of SSR motifs identified for the two samples of E. phyllopogon tested.
| Motif length | 13 | 04 |
|---|---|---|
| Dinucleotide | 1908 (83.0%) | 1998 (82.4%) |
| Trinucleotide | 329 (14.3%) | 360 (14.9%) |
| Tetranucleotide | 40 (1.7%) | 50 (2.1%) |
| Pentanucleotide | 15 (0.7%) | 13 (0.5%) |
| Hexanucleotide | 6 (0.3%) | 3 (0.1%) |
| Total | 2298 | 2424 |
Fig. 1SSR motifs with different repeat numbers for the two samples of E. phyllopogon.
SSR motifs with a frequency > 0.5% and the ranges of PCR product length (mean length) of the relative motifs for the two samples tested for E. phyllopogon.
| Motif | Count (% of total SSRs) | PCR product length (average length, bp) | ||
|---|---|---|---|---|
| 13 | 04 | 13 | 04 | |
| AT | 854 (37.2) | 880 (36.3) | 80–234 (133.5) | 80–239 (131.0) |
| AG | 562 (24.5) | 617 (25.5) | 80–208 (126.5) | 80–208 (127.1) |
| AC | 372 (16.2) | 395 (16.3) | 80–225 (130.3) | 80–234 (126.6) |
| CG | 120 (5.2) | 106 (4.4) | 80–204 (131.6) | 80–237 (124.1) |
| CCG | 99 (4.3) | 103 (4.2) | 80–172 (132.0) | 80–160 (126.9) |
| AAG | 45 (2.0) | 43 (1.9) | 85–159 (128.8) | 81–153 (121.0) |
| AAT | 28 (1.2) | 30 (1.2) | 80–160 (130.3) | 80–160 (127.5) |
| ACC | 27 (1.2) | 14 (0.6) | 80–157 (122.9) | 80–220 (134.2) |
| AAC | 25 (1.1) | 47 (1.9) | 85–155 (120.3) | 122–155 (136.9) |
| AGG | 24 (1.0) | 25 (1.0) | 81–188 (128.3) | 80–159 (132.3) |
| AGC | 23 (1.0) | 29 (1.2) | 80–157 (122.6) | 89–159 (134.2) |
| ACG | 22 (1.0) | 15 (0.6) | 86–160 (136.1) | 83–159 (121.4) |
| AGT | 22 (1.0) | 32 (1.3) | 80–160 (133.9) | 81–160 (130.8) |
| ATG | 14 (0.6) | 22 (0.9) | 91–160 (134.5) | 87–159 (127.9) |
Note: motifs with dinucleotide plus trinucleotide contributed to 97.3% of the total SSRs for both samples. Thus motifs with length >3 were not shown in this table.
The 78 putative polymorphic SSR loci found by RAD sequencing.
| Marker | Motif | Primer_F | Primer_R | Marker | Motif | Primer_F | Primer_R |
|---|---|---|---|---|---|---|---|
| EG_1 | TG | gctcctgaactgtgtacattcttgc | tcgattcacccttcagcttctc | EG_40 | GAA | aacagacaaaatacaaaagaaagcaca | gtttttcagcatcatcctgtgg |
| EG_2 | TA | catcggattcagattgaaaggg | ggtcgtaggtctatagtccgtagagtca | EG_41 | AT | tcactacgaaattatcgtttatggacaa | gcccgctccgtgtttagattat |
| EG_3 | TA | ttgctttctgcaatgccaatta | gtccatgtggagtcagggagtt | EG_42 | TA | atgggcgacaagcaagtatgat | gacggacgaaggtttgaagattt |
| EG_4 | TA | ccgttgatgattaactcgttgattt | tgatggtagctacaagcgttgg | EG_43 | GA | catcctctggctgcttctctct | gaatgtgagaatctccgctgct |
| EG_5 | TA | ttcactatgctgaaccagcagc | ctgagtccggtatcgctcctta | EG_44 | GA | acacctttctccatcctctggc | ccgctgctgctactactcttgg |
| EG_6 | AT | ccatggtcaagtcactttgtctg | tctggatctcccaaattcatgtc | EG_45 | TG | ttgtacaagcttctgagataacctga | atttcagaaactgtttgaattaggattt |
| EG_7 | AAG | catttcttaccgtcccatctgc | cctttttcagggagaagccact | EG_46 | TA | aaatggatatggcaaacgcatc | ccaagtccatcatgccaagttt |
| EG_8 | AT | ttttgtaggcctaacctgttgtgg | tttttgctatgcatgtgtctactcg | EG_47 | AT | tttgggattgtttatgaggtttga | cacacggcaaaatgaccaata |
| EG_9 | TG | tataacatccctttcgttgccatc | tgcaatgaaattcagatattcggac | EG_48 | AAG | tgctatgcatgaggagatgcag | ccttataccttggaggctcgct |
| EG_10 | AAG | taaattgcccaaacaagaaagagg | atcggagtcccactcaacaaagta | EG_49 | TA | aattctagtttgcgacgggttatt | ttgagtgaatgggatcgaaaaa |
| EG_11 | TGCA | agccggtgcaggaagacag | aagaagggaaaaggtagtcgttgg | EG_50 | TC | aaggacaaagtcgcagcgttt | atgggatttggttttggcttct |
| EG_12 | CTCTC | tttgaagccttttcggtcttga | aacaagcagtggaagacgaagg | EG_51 | GC | gccgggtgattaacggattagt | agactagctagccagcgggttg |
| EG_13 | AT | ggcccaatataatatccatgcc | ctatcaagggcagctatttggg | EG_52 | AT | aattcaacacaaccaaaggtaaaaa | tcaatgccatattgattctccc |
| EG_14 | AT | ggtggtgtgtcctgatgtgtgt | tgtttccttttgtttttgttttgtttc | EG_53 | TG | tcaaatggcaaagtatggaactca | tcattttctcaagaagcagtggtc |
| EG_15 | TA | catgaactgttctgactccaacaac | aagcattgcagctctgtcttgt | EG_54 | AT | aatattaacgtacccttgacaaatgaa | tttttgttggtacgtaagataaacaatc |
| EG_16 | TA | tcagttgagctccatcatttgttt | tcactggctgttctttaccgtact | EG_55 | AG | ccaagaaaccaactaagagccaaa | atttgtgcatgatgtgctttgc |
| EG_17 | CGG | gatagcgactcgagcgtggt | tctcgagcatggggagagac | EG_56 | AG | agcaagaaaccaactaagagccaa | aattcgtgcatgatgtgctttg |
| EG_18 | ATG | agccatattgccttgtgaccaa | ttttccttgcgcaatttttcat | EG_57 | TC | tgaaaagccagtggacagtcag | gagttcctcctgatggcaagaa |
| EG_19 | AC | ccttcagctgatgtaatcttggtaag | tccatctctcagcacctgaaaa | EG_58 | TC | tctccctccaaactttactattcacc | gctcaaaagatttgtctcgtcg |
| EG_20 | AT | gaaggtcgtgcactatggtgag | agcaagttgaagcaatccaagg | EG_59 | AT | cgtcaagtcgttcctctagtcg | tgtattcagctgtcgtgcatgt |
| EG_21 | AT | cgccgtcaagtcattcctcta | tcagctgccgtgcatgtagta | EG_60 | AT | tgccagacagtccaacaagcta | ggccgactctatattcatattagctgac |
| EG_22 | CT | cacatgatacatccgttgcgtc | atcggaggagggggaagag | EG_61 | TA | aatgcagtcaggcccttgttta | gcacgggcacatttcctagt |
| EG_23 | AG | aaaacgccgcaaaaacaaaag | cccctctaggattctcgctgtt | EG_62 | TC | cttcttcctcgcctccaattc | aaacaagttattacccggcgct |
| EG_24 | TA | acgagcacccattatgttttgg | cgagatcccagagcaaagctac | EG_63 | TA | cgattgcttaagggaataaatgg | caacattttactggtaatcctttcttg |
| EG_25 | CT | atcaaaccccctcgaattcct | gagggagagaaagctgacaggc | EG_64 | GA | tcttggctgaaaaatctatttggG | acctctcccacttgaagaagca |
| EG_26 | TA | ttcaaaaattcgatctttgctgc | aaccttttccgtggcctacct | EG_65 | AT | cccctgagcaaatttcaatcat | agggacagggaaggatcttgac |
| EG_27 | GA | gctcagcatctccaacgaactt | caaaccaattctgaatcgaaaagc | EG_66 | AT | ttcatagaggtggtgtgtcctga | tggtttccttttgtttttattatgtttc |
| EG_28 | TA | gatgacgtggctagcttgcata | cgtaggacgaaggatgaaaacg | EG_67 | AT | cgcacactggctgtaattggta | ccgagctttcagatttactcctca |
| EG_29 | CT | cctccttcctttgctgagcC | ctgcagcatgccctttctattt | EG_68 | TA | aatgcaaaataggacaccacgg | ggaacccatgaataagctgcaa |
| EG_30 | GA | aggtcgtgcatgggctagag | cggagtagcttcacgcttcagt | EG_69 | AT | ggaaattgcatctgcatcaact | cccatgcagcatactaatgtgaa |
| EG_31 | TCT | ttgagatgatgatgcattcacttg | tgggaagccatgaagaatatgg | EG_70 | AT | ttcgttcatttcgctctcatca | ttggcaatagttttcaatcttgcat |
| EG_32 | TA | gtgggctcataccttaatgccc | ggggagccatctctcttctcat | EG_71 | GA | aggaagaaaagagaagtgaggcG | cgagcacctcctctaggaatca |
| EG_33 | AT | gccgtcaagtcgttcctctagt | cagctgccgtgcatgtaatact | EG_72 | TA | ctgcgggtgacatttgtacagt | gtctgaacacgttaccacaccg |
| EG_34 | TCT | gatgatgatgcattcacttgagttg | tggatgatgtgagaggtgatgg | EG_301 | AT | gcgtcgtcaagtcgttcttcta | tgtattcagctgtcgtgcatgt |
| EG_35 | AT | tcctctagtcggatttcttaatttgc | tgtattcagctgtcgtgcatgt | EG_302 | ATTT | attcgaacacccatcaaccaac | gaaacagaagggaggtgtgctg |
| EG_36 | AG | catgaccatcaggcatcatctc | atgaagaagctactccgccgat | EG_305 | AT | agccgttcctctagtcggatttct | tattcagctgccgtgcatgtagta |
| EG_37 | TCT | tcagaaacaatatgttcctcatcatca | caaatgggtcacaagacgagaa | EG_306 | CT | taaaacaaaacgaccggcgtaa | tcaatcatttcagccttcggat |
| EG_38 | TG | ggagctggagaaactgaaggaag | cacttcgttgagggctcgatag | EG_307 | ATC | aacattgtcatcacaaatatcatcatca | aatcaaggaagccccttcactc |
| EG_39 | CA | gtggcatgtgaattgtttccct | caatcttacctcccaccttccc | EG_320 | TA | caactcataagacaattcaaagggttt | gcatcatttaagcatcaaaatgaca |
Diversity of four populations of E. phyllopogon using eight nuclear microsatellite loci.
| Population | EP13 | EP14 | EP53 | EP50 | Total |
|---|---|---|---|---|---|
| No. of alleles | 39 | 34 | 25 | 37 | 66 |
| No. of alleles per locus | 4.875 | 4.25 | 3.125 | 4.625 | 8.25 |
| No. of private alleles | 13 | 1 | 2 | 8 | / |
| Heterozygosity | 0.086 | 0.082 | 0.064 | 0.091 | 0.081 |
| Shannon's information index | 0.381 | 0.21 | 0.087 | 0.222 | 0.225 |
Analysis of molecular variance (AMOVA) showing the partitioning of genetic variation within and between regions of E. phyllopogon.
| Source | df | SS | MS | Est. var. | % | |
|---|---|---|---|---|---|---|
| Among Pops | 3 | 135.563 | 45.188 | 3.328 | 39 | <0.01 |
| Within Pops | 44 | 231.250 | 5.256 | 5.256 | 61 | <0.01 |
df = degree of freedom, SS = sum of squares, MS mean squares, Est. var. = estimate of variance, % = percentage of total variation, P-value is based on 9999 permutations.
Fig. 2Functional annotation of assembled sequences with SSR loci for the two samples of E. phyllopogon based on gene ontology (GO) terms.
Fig. 3Transitions and transversions occurring within a set of 49,179 E. phyllopogon SNPs.