| Literature DB >> 28934115 |
Jinsu Gil1, Yurry Um2, Serim Kim3, Ok Tae Kim4, Sung Cheol Koo5, Chinreddy Subramanyam Reddy6, Seong-Cheol Kim7, Chang Pyo Hong8, Sin-Gi Park9, Ho Bang Kim10, Dong Hoon Lee11, Byung-Hoon Jeong12, Jong-Wook Chung13, Yi Lee14.
Abstract
Angelica gigas Nakai is an important medicinal herb, widely utilized in Asian countries especially in Korea, Japan, and China. Although it is a vital medicinal herb, the lack of sequencing data and efficient molecular markers has limited the application of a genetic approach for horticultural improvements. Simple sequence repeats (SSRs) are universally accepted molecular markers for population structure study. In this study, we found over 130,000 SSRs, ranging from di- to deca-nucleotide motifs, using the genome sequence of Manchu variety (MV) of A. gigas, derived from next generation sequencing (NGS). From the putative SSR regions identified, a total of 16,496 primer sets were successfully designed. Among them, we selected 848 SSR markers that showed polymorphism from in silico analysis and contained tri- to hexa-nucleotide motifs. We tested 36 SSR primer sets for polymorphism in 16 A. gigas accessions. The average polymorphism information content (PIC) was 0.69; the average observed heterozygosity (HO) values, and the expected heterozygosity (HE) values were 0.53 and 0.73, respectively. These newly developed SSR markers would be useful tools for molecular genetics, genotype identification, genetic mapping, molecular breeding, and studying species relationships of the Angelica genus.Entities:
Keywords: Angelica gigas Nakai; next generation sequencing (NGS); simple sequence repeat (SSR)
Year: 2017 PMID: 28934115 PMCID: PMC5664088 DOI: 10.3390/genes8100238
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Genome sequencing of five Angelica gigas accessions.
| MV | HV | GLV | JLV | SLV | |
|---|---|---|---|---|---|
| Total Reads | 351,885,410 | 153,890,676 | 112,354,576 | 139,011,848 | 142,791,294 |
| Total Bases | 35,540,426,410 | 15,542,958,276 | 11,347,812,176 | 14,040,196,648 | 14,421,920,694 |
| Number of Scaffolds | 395,007 | 541,229 | 468,810 | 526,114 | 546,442 |
| Assembled genome size (bp) | 804,583,850 | 298,362,936 | 242,460,776 | 273,337,815 | 298,605,080 |
MV, Manchu variety; HV, Hwangje variety; GLV, Gangwon local variety; JLV, Jecheon local variety; SLV, Sancheong local variety.
Characterization of 36 polymorphic single sequence repeat (SSR) markers validated in 16 A. gigas accessions.
| Marker | GenBank No. | Motif | Primer Sequence | Annealing Temperature (°C) | Allele Size (bp) |
|---|---|---|---|---|---|
| YL-AGN tri0110 | KX138563 | (AAT) 5 | F:FAM-TATGTCGGCAACACATGC | 58 | 177–186 |
| R:GTTTGGCTAGGCTAGACTAATGTGGGT | |||||
| YL-AGN tri0221 | KX138564 | (ACC) 7 | F:FAM-CGACACCAGTGTTGATCCAT | 58 | 179–194 |
| R:GTTTGTATCGAGTCCATAACGCTCAG | |||||
| YL-AGN tri0303 | KX138565 | (AGC) 5 | F:FAM-AGGCGATAGAATCCCCAA | 58 | 157–175 |
| R:GTTTCGAGAAACAAAGCTTCAGGG | |||||
| YL-AGN tri0336 | KX138566 | (AGG) 8 | F:FAM-GTGGTGTTCTTTTCTCCACG | 58 | 185–203 |
| R:GTTTCTTGTCGTCTTCTCGTCTCTCT | |||||
| YL-AGN tri0359 | KX138567 | (AGT) 8 | F:FAM-CGTCGGCTAGTAACTGCAAA | 58 | 188–194 |
| R:GTTTATGTCGCGGTGATTTACG | |||||
| YL-AGN tri0379 | KX138568 | (ATA) 15 | F:FAM-GCTGTTCTGGCGTATGTACTTC | 58 | 161–191 |
| R:GTTTACTCCCAACAAACACTGCTC | |||||
| YL-AGN tri0537 | KX138569 | (ATG) 7 | F:FAM-GGTCCTCAGCTTTCTCAGAATC | 58 | 175–217 |
| R:GTTTCCTGATATCTCCCTGCAATC | |||||
| YL-AGN tri0638 | KX138594 | (ATT) 7 | F:PET-AGCATAGGATCCAGCTTGTG | 58 | 134–161 |
| R:GTTTTTTCGAGATGGAGTCTCAGC | |||||
| YL-AGN tri0685 | KX138570 | (CAC) 5 | F:FAM-AGTAAGCAAGTGATGCCGAG | 58 | 179–197 |
| R:GTTTGGGGGTTTTGTAGTGGTTCA | |||||
| YL-AGN tri0832 | KX138571 | (CCA) 5 | F:FAM-CCACTTTACTCCCCTGATAAGC | 58 | 163–184 |
| R:GTTTTTCCTGCCAGCTCTGATTAC | |||||
| YL-AGN tri0861 | KX138572 | (CCT) 5 | F:NED-CTGGTGGCTCTTAAGTTACTCG | 58 | 193–230 |
| R:GTTTCGGTTACTACAGTACGTTGGTTGC | |||||
| YL-AGN tri0889 | KX138573 | (CCT) 5 | F:NED-GACAAAGAAGCAGTGGCATC | 58 | 193–202 |
| R:GTTTGAGATTACGACGAGCGAGAA | |||||
| YL-AGN tri0953 | KX138595 | (CTG) 16 | F:PET-TGTTTGCACCAGCTCTCA | 58 | 180–209 |
| R:GTTTCCACCTCAAGGTTCAATGAC | |||||
| YL-AGN tri0955 | KX138596 | (CTG) 9 | F:PET-TAGCCAAGACCAGCTCAATC | 58 | 144–167 |
| R:GTTTTACTGCTACAATGGCACACC | |||||
| YL-AGN tri0957 | KX138574 | (CTG) 6 | F:NED-CAGTTCCGTTGTTCCAACTC | 58 | 188–203 |
| R:GTTTGAAAGCGAACGAGAGATGAG | |||||
| YL-AGN tri1043 | KX138575 | (GAA) 8 | F:NED-GCTGGATTATCACCTTCACG | 58 | 191–204 |
| R:GTTTGAAGAGGTAATGTGGGGTGTAG | |||||
| YL-AGN tri1092 | KX138597 | (GAA) 14 | F:PET-CTAGCTTTCCCATGTCTGAACC | 58 | 143–201 |
| R:GTTTCATCCATGCACCAATGTC | |||||
| YL-AGN tri1102 | KX138576 | (GAC) 5 | F:NED-ACTCAAAAGGACAAGTCCCC | 58 | 150–200 |
| R:GTTTCCTTCCACTCTCTGGTTGTAGAC | |||||
| YL-AGN tri1118 | KX138577 | (GAG) 5 | F:NED-AACTGATTGGGAGGAGTAGGAG | 58 | 168–183 |
| R:GTTTCCTGAAAGAGTACTAACACCCG | |||||
| YL-AGN tri1174 | KX138578 | (GAT) 6 | F:NED-AGCAACTAGCTCACTCACAACC | 58 | 196–210 |
| R:GTTTAGACGAGTTAAGGGACTTGC | |||||
| YL-AGN tri1211 | KX138579 | (GCA) 8 | F:NED-ATGCAACAATGTCTCGGC | 58 | 162–180 |
| R:GTTTCAACTTGGGTTCTGCCCTAT | |||||
| YL-AGN tri1262 | KX138580 | (GCT) 7 | F:NED-GTTACATTGAGTCCTCGTAGGG | 58 | 171–199 |
| R:GTTTATCACGAACCAGAACCCA | |||||
| YL-AGN tri1269 | KX138581 | (GCT) 5 | F:VIC-CCCTTGACACTCACTACTCTCA | 58 | 202–217 |
| R:GTTTTTCTCTCTGCTACCCTTAGAGC | |||||
| YL-AGN tri1276 | KX138598 | (GCT) 5 | F:PET-GATTGCTGCTGCTGAGTTG | 58 | 156–177 |
| R:GTTTGAACTCTCCGATGGTGTTGTAG | |||||
| YL-AGN tri1358 | KX138582 | (GTA) 6 | F:VIC-GACAGCCAGCCTTCTTCAT | 58 | 182–194 |
| R:GTTTCCCTACATTGCGTTGATCC | |||||
| YL-AGN tri1582 | KX138583 | (TCA) 8 | F:VIC-GGTACGGGAATATGAGACAGG | 58 | 183–206 |
| R:GTTTGTGATCTTGCTGATGACGG | |||||
| YL-AGN tri1708 | KX138584 | (TCT) 10 | F:VIC-GTGGTGCTGCCAATGTTT | 58 | 172–241 |
| R:GTTTGAAGATGTCGGCAGATCAGT | |||||
| YL-AGN tri1849 | KX138585 | (TGG) 7 | F:VIC-GTCCAATGATTCTGCTGCTC | 60.5 | 174–183 |
| R:GTTTATATACTGGGAGTGTTGCGGAG | |||||
| YL-AGN tri1920 | KX138586 | (TTA) 8 | F:VIC-TACAGTCGAGTTCTGGACACAC | 58 | 179–200 |
| R:GTTTCTTTCTCCGTGAACATGTCG | |||||
| YL-AGN tetra2116 | KX138587 | (AGAT) 9 | F:VIC-GTCACTAAAACACGAGACTGCC | 58 | 179–250 |
| R:GTTTCCAGAACTCGTTCCGAATC | |||||
| YL-AGN tetra2173 | KX138588 | (ATTT) 5 | F:VIC-AGGCATGCACTACTCCTCTATC | 58 | 179–191 |
| R:GTTTATCTCGCAGCTATGAGACTACC | |||||
| YL-AGN tetra2208 | KX138589 | (GATA) 6 | F:VIC-GCTGGGTTTAGGTTTCTGGA | 60.5 | 157–177 |
| R:GTTTACCGCAAACACCTAGTACTCC | |||||
| YL-AGN tetra2275 | KX138590 | (TGTA) 7 | F:PET-CATCATCTTGCAAGGTCCAC | 58 | 164–188 |
| R:GTTTTCTCCCAAACTGGTACTCTG | |||||
| YL-AGN Hexa2350 | KX138591 | (AGAATC) 6 | F:PET-GCTAGCAATAGCAGGTTGAC | 58 | 193–210 |
| R:GTTTATAGACCTGGTTTCGGGC | |||||
| YL-AGN Hexa2367 | KX138592 | (GATCTC) 5 | F:PET-ACGTGACCACCATATTGC | 58 | 187–199 |
| R:GTTTGTGGACACTGTTTCGTCACTG | |||||
| YL-AGN Hexa2374 | KX138593 | (TCATGC) 5 | F:PET-GGCAGACGTTCTGGTTTTC | 58 | 164–176 |
| R:GTTTCCGTGAGTGGTAGGGAAATA |
Diversity statistics from initial primer screening in 16 A. gigas accessions.
| No. | Marker | PIC | |||||
|---|---|---|---|---|---|---|---|
| 1 | YL-AGNtri0110 | 0.47 | 7 | 4 | 0.68 | 0.67 | 0.63 |
| 2 | YL-AGNtri0221 | 0.25 | 11 | 6 | 0.80 | 0.63 | 0.77 |
| 3 | YL-AGNtri0303 | 0.50 | 8 | 5 | 0.67 | 0.50 | 0.62 |
| 4 | YL-AGNtri0336 | 0.34 | 11 | 5 | 0.77 | 0.50 | 0.73 |
| 5 | YL-AGNtri0359 | 0.57 | 4 | 3 | 0.56 | 0.33 | 0.48 |
| 6 | YL-AGNtri0379 | 0.28 | 10 | 7 | 0.78 | 0.44 | 0.75 |
| 7 | YL-AGNtri0537 | 0.53 | 9 | 7 | 0.67 | 0.56 | 0.64 |
| 8 | YL-AGNtri0638 | 0.33 | 9 | 5 | 0.75 | 0.40 | 0.71 |
| 9 | YL-AGNtri0685 | 0.44 | 10 | 7 | 0.73 | 0.69 | 0.70 |
| 10 | YL-AGNtri0832 | 0.38 | 8 | 7 | 0.76 | 0.19 | 0.73 |
| 11 | YL-AGNtri0861 | 0.31 | 9 | 10 | 0.84 | 0.50 | 0.82 |
| 12 | YL-AGNtri0889 | 0.59 | 4 | 3 | 0.53 | 0.19 | 0.44 |
| 13 | YL-AGNtri0953 | 0.31 | 13 | 10 | 0.82 | 0.75 | 0.81 |
| 14 | YL-AGNtri0955 | 0.44 | 8 | 5 | 0.72 | 0.69 | 0.68 |
| 15 | YL-AGNtri0957 | 0.31 | 9 | 5 | 0.75 | 0.44 | 0.71 |
| 16 | YL-AGNtri1043 | 0.63 | 7 | 5 | 0.56 | 0.44 | 0.52 |
| 17 | YL-AGNtri1092 | 0.22 | 15 | 11 | 0.87 | 0.63 | 0.86 |
| 18 | YL-AGNtri1102 | 0.34 | 11 | 8 | 0.79 | 0.44 | 0.77 |
| 19 | YL-AGNtri1118 | 0.34 | 9 | 4 | 0.71 | 0.69 | 0.66 |
| 20 | YL-AGNtri1174 | 0.38 | 9 | 5 | 0.71 | 0.31 | 0.66 |
| 21 | YL-AGNtri1211 | 0.41 | 10 | 6 | 0.75 | 0.50 | 0.71 |
| 22 | YL-AGNtri1262 | 0.31 | 10 | 8 | 0.80 | 0.88 | 0.77 |
| 23 | YL-AGNtri1269 | 0.50 | 8 | 6 | 0.68 | 0.75 | 0.64 |
| 24 | YL-AGNtri1276 | 0.25 | 13 | 8 | 0.84 | 0.56 | 0.82 |
| 25 | YL-AGNtri1358 | 0.47 | 9 | 5 | 0.67 | 0.44 | 0.62 |
| 26 | YL-AGNtri1582 | 0.31 | 11 | 8 | 0.82 | 0.75 | 0.80 |
| 27 | YL-AGNtri1708 | 0.16 | 14 | 14 | 0.90 | 0.50 | 0.89 |
| 28 | YL-AGNtri1849 | 0.63 | 3 | 4 | 0.54 | 0.13 | 0.48 |
| 29 | YL-AGNtri1920 | 0.31 | 13 | 8 | 0.81 | 0.75 | 0.79 |
| 30 | YL-AGNtri2275 | 0.41 | 9 | 6 | 0.73 | 0.69 | 0.69 |
| 31 | YL-AGNtetra2116 | 0.22 | 13 | 14 | 0.88 | 0.75 | 0.87 |
| 32 | YL-AGNtetra2173 | 0.41 | 8 | 4 | 0.69 | 0.75 | 0.63 |
| 33 | YL-AGNtetra2208 | 0.41 | 7 | 5 | 0.67 | 0.25 | 0.61 |
| 34 | YL-AGNhexa2350 | 0.44 | 7 | 4 | 0.64 | 0.56 | 0.57 |
| 35 | YL-AGNhexa2367 | 0.46 | 6 | 4 | 0.64 | 0.38 | 0.57 |
| 36 | YL-AGNhexa2374 | 0.44 | 6 | 3 | 0.63 | 0.44 | 0.56 |
| Mean | 0.39 | 9.1 | 6.4 | 0.73 | 0.53 | 0.69 |
M, major allele frequency; N, number of genotypes; N, number of alleles; H, expected heterozygosity; H, observed heterozygosity; PIC, polymorphism information content.