Literature DB >> 24736399

Development and characterization of polymorphic EST-SSR and genomic SSR markers for Tibetan annual wild barley.

Mian Zhang1, Weihua Mao2, Guoping Zhang1, Feibo Wu1.   

Abstract

Tibetan annual wild barley is rich in genetic variation. This study was aimed at the exploitation of new SSRs for the genetic diversity and phylogenetic analysis of wild barley by data mining. We developed 49 novel EST-SSRs and confirmed 20 genomic SSRs for 80 Tibetan annual wild barley and 16 cultivated barley accessions. A total of 213 alleles were generated from 69 loci with an average of 3.14 alleles per locus. The trimeric repeats were the most abundant motifs (40.82%) among the EST-SSRs, while the majority of the genomic SSRs were di-nuleotide repeats. The polymorphic information content (PIC) ranged from 0.08 to 0.75 with a mean of 0.46. Besides this, the expected heterozygosity (He) ranged from 0.0854 to 0.7842 with an average of 0.5279. Overall, the polymorphism of genomic SSRs was higher than that of EST-SSRs. Furthermore, the number of alleles and the PIC of wild barley were both higher than that of cultivated barley, being 3.12 vs 2.59 and 0.44 vs 0.37. Indicating more polymorphism existed in the Tibetan wild barley than in cultivated barley. The 96 accessions were divided into eight subpopulations based on 69 SSR markers, and the cultivated genotypes can be clearly separated from wild barleys. A total of 47 SSR-containing EST unigenes showed significant similarities to the known genes. These EST-SSR markers have potential for application in germplasm appraisal, genetic diversity and population structure analysis, facilitating marker-assisted breeding and crop improvement in barley.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24736399      PMCID: PMC3988095          DOI: 10.1371/journal.pone.0094881

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Barley (Hordeum vulgare L.) is the fourth important cereal crop worldwide. With the rapid development of beer and feed industry, the demand for barley keeps increasing. However, during the long-term domestication of the cultivated barley, especially after the modern breeding and intensive cultivation, the genetic variation degraded significantly, resulting in missing lots of genes, including some rare alleles [1]. The monotonous genetic background of cultivated barley has become the bottleneck of the effectiveness of breeding, while the abundant diversity of wild barley can provide a pool of alleles for barley breeding and improvement [2], [3]. Morphological, archaeological cytogenetic and isozyme data revealed that wild barley on the Qinghai-Tibet Plateau is different from the Fertile Crescent wild barley [4]. Researches so far have shown even rich genetic diversity in Tibetan wild barley than in Ethiopian barley [5]. Novel germplasm has been identified from the Tibetan wild barley tolerant to drought, salinity and aluminum toxicity [6]–[8]. Increasing efficient molecular markers would be valuable in diversity analyses, resource conservation and beneficial alleles exploitation for wild barley. Comprehensive sets of expressed sequence tags (ESTs) sequences have been generated in many plants (http://www.ncbi.nlm.nih.gov/dbEST). The availability of increasing sequence databases enables the identification of functional genes with similar sequences in related species [9]. EST-based SSR markers (EST-SSRs) have been widely employed as powerful molecular genetic tools in a large number of cereal crop species due to their high level of transferability, close association to genes with known function, codominant inheritance, and low cost for development with available development from public databases [10]–[12]. Jaikishan et al. [13] used 25 EST-SSRs and 25 genomic SSRs to predict grain yield heterosis; multiple EST-SSRs were generated for wheat (Triticum aestivum L.) and these markers showed high transferability between wheat and the other crops, such as barley, maize, rice, and sorghum [14]–[16]. Up to date, polymorphic EST-SSRs were identified to establish Hordeum chilense evolutional relationships [17] and new EST-SSRs and genomic SSRs were complemented to the published Australian barley genetic maps [18]. However, to our knowledge, little work has been performed to develop EST-SSRs and apply them for population structure in Tibetan wild barley. In the present study, with the objective of exploiting new SSRs from EST databases and confirming the published genomic SSRs in the Tibetan wild and cultivated barley accessions, 49 EST-SSRs and 20 genomic SSRs were developed and characterized. These markers can be utilized to evaluate the genetic variation and phylogenetic relationships of 96 barley genotypes. Furthermore, polymorphism, and genetic diversity in the Tibetan wild barley accessions were evaluated which would be particularly useful for identification of novel genes with traits of interest, and marker-assisted breeding in barley.

Materials and Methods

Plant materials

A total of 96 barley accessions were used in this study including 80 Tibetan annual wild barley from Qinghai-Tibet Plateau provided by Huazhong Agricultural University barley germplasm collection, and 16 cultivars from China which were stored at the Institute of Crop Science, Zhejiang University, Hangzhou, China (Table S1). These accessions were collected on public land. And no specific permits were required for the collection. Seeds were surface sterilized with 3% H2O2 for 30 minutes and thoroughly rinsed with distilled water, followed by germination in nutrient rich soil in an incubator (22/18°C, day/night) for 10 days. Total genomic DNA was extracted from barley leaves using the Plant Genomic DNA Kit (TianGen, Beijing, China).

Sequence screening and primer designing

A total of 525999 barley ESTs were acquired from the EST database of GenBank (up to September 2012) (http://www.ncbi.nlm.nih.gov/Genbank/). Redundant sequences were removed from these ESTs using CD-HIT-EST (http://cd-hit.org) with the identity parameter of 95%. The presence of SSRs was screened using Simple Sequence Repeat Identification Tool (SSRIT) software (http://www.gramene.org/gramene/searches/ssrtool). The criteria for di-, tri-, tetra-, and penta-nucleotides were 10, 7, 5, and 4 repeat units, respectively. A total of 188 EST-SSRs were randomly selected and primers were designed using Primer5.0 with a length ranging from 18–22 bp, and product sizes of 100 to 300 bp. The reverse primers were marked with 6-FAM or HEX fluorescent dye at 5′ side for each pair. Based on the previous study of barley, 41 genomic SSR markers were selected and SSR primers were designed with the same criteria as mentioned above.

PCR amplification and sequencing

PCR amplification was performed in a total of 20 µL reaction mixture that contained 1 µL of genomic DNA, 1 U ExTaq DNA polymerase (Takara Inc.), 2 µL of 10×Ex Taq Buffer (Mg2+ Plus), 0.2 mM dNTPs mix, 0.05 µM forward primers, 0.1 µM reverse primers and fluorescent primers (FAM or HEX). The PCR protocol used was as follows: initial denaturation for 5 min at 94°C, followed by 5 cycles of denaturation for 30 s at 94°C, annealing for 30 s at 50°C, and extension for 30 s at 72°C, subsequently followed by 32 cycles of denaturation for 30 s at 94°C, annealing for 30 s at 55°C, extension for 30 s at 72°C, with a final extension for 10 min at 72°C and a 4°C holding temperature. PCR products were diluted and tested on a MegaBACE 1000 DNA analysis system (Amersham Biosciences, Piscataway, NJ) at the Center of Analysis and Measurement in Zhejiang University. The lengths of PCR fragments were calculated using the ET550-R size standard and Genetic Profiler version 2.2.

Calculation of polymorphism

The polymorphism of EST- and genomic SSR alleles were scored for the presence (1) and absence (0) for 96 accessions. Alleles with frequency less than 5% (rare alleles) in the population were removed and considered as missing data for the polymorphism calculation and population structure analysis [19]. The genetic diversity was evaluated by the number of alleles (Na), the effective number of alleles (Ne), observed heterozygosity (Ho), and expected heterozygosity (He) using POPGENE v.1.31 [20]. Polymorphism information content (PIC) was calculated by applying software PIC_CALC version 0.6.

Population structure

Population structure was assessed using the STRUCTURE software v2.3.3 based on the admixture model [21]. Models were tested for clusters (k) from 1 to 15, each with ten independent runs and 100,000 MCMC (Markov Chain Monte Carlo) iterations. The most likely number of clusters (k) was indicated by Δk, the change rate of the estimated log probability of the data (LnP[D]) [22].

Gene function blast

EST-SSRs associated unigene sequences were blasted against the GenBank non-redundant (nr) protein database using BLASTX (http://www.ncbi.nlm.nih.gov/BLAST) with an expected value (E-value) of 10−10 for the function of polymorphic EST-SSRs.

Results

Characterization of polymorphic SSRs

In total, 69 SSR primer pairs, including 49 (26% out of 188) EST-SSRs and 20 (49% out of 41) genomic SSRs (Tables 1 and 2), showed polymorphism among 96 accessions. A total of 213 alleles were generated from 69 loci with an average of 3.14 alleles per locus. The ratio of the EST-SSR repeat motifs was not equally distributed. The di-, tri-, tetra-, and penta-nucleotides accounted for 16.32%, 40.82%, 26.53%, and 16.32%, respectively. Whilst most of the genomic SSRs selected were composed of dinucleotide repeats. According to the results of POPGENE for the 69 SSRs, the observed number of alleles per locus (Na) ranged from 2 to 6 (mean = 3.14) and the effective number of alleles per locus (Ne) varied from 1.09 to 4.54 (mean = 2.30). The average Na was 3.12 and 2.59 for wild and cultivated barley, respectively (Table 3). Besides this, the polymorphic information content (PIC) ranged from 0.08 to 0.75 with a mean of 0.46, and the PIC of wild barley was higher than that of cultivars with 0.44 vs 0.37. The expected heterozygosity (He) ranged from 0.0854 to 0.7842 with an average of 0.5279, while the observed heterozygosity (Ho) ranged from 0 to 0.766 with an average of 0.1677. As an indicator of genetic diversity, the average He was 0.5098 in wild barley accessions and 0.4333 in cultivated accessions.
Table 1

Characterization of 49 polymorphic EST-SSR makers in barley (Hordeum vulgareL.).

PrimerSSR motifPrimer sequence (5′-3′)Expected size(bp)NaNeHoHePIC
F/R
P181(GAGAG)4 GTCGTCTCCCTCCCTTCA 22753.230.19790.69440.6379
CATTGCCAGCACTGTTTC
P129(GCC)7 CGAGGAGTTCGAGGTGGA 26043.260.60420.6970.635
ACTCTGCGTCCCAGTTCTT
P184(TGC)9 CCTACCAAACAACGGAATA 27643.040.10530.67460.6232
CAGCCAGAAGGTCTACGA
P50(AATC)5 ACAAGCAGATCACCGACG 21532.950.27270.66490.5868
AACCCGACTGAACAAATAAT
P91(TC)13 CGAGGCTCCTCATCTCCT 21132.890.35420.6570.5796
CCAGCATCGTCGCAAACT
P8(AG)15 TCGTTGATCCGAACTTTACC 19732.820.21880.64840.5735
CACCGCAGACGCTGAGTA
P29(ATAC)13 CTGCTTAGTTCTAGGAGGCT 14032.570.19350.61410.5391
CTCGGTTCGATTGTTCAT
P103(CTG)9 CATTTGGCATTGGTTGAT 10032.530.01040.60730.5362
AGTTCTTCTTCGCTGGAA
P32(GATG)6 GCAGAATGGCAGAAACAG 23332.60.09570.61870.5352
CAAGAATGAGCGAAAGGT
P168(TTC)7 TTCCTCCAGTCCTTCTCC 16942.490.35110.60130.5344
CTGCTGCTACCGTTCTTAT
P99(ATC)7 GATGTGATCTGATGCCATTT 27332.460.250.59660.5266
TTTCTTCGGTGTTCTTTCC
P152(CT)11 ACCAAGCCCACGAGTAGCA 25132.40.05210.58670.5186
CGACCCGAGGACGACAGAT
P144(CT)11 CTTCGTTCCCTCCTCACC 13452.190.45450.54730.5127
TCCGCTTCCACGATTGAC
P121(TACAT)4 CCCAGGAATAAGAACAGACAC 28742.250.36840.55870.4971
CACCGCCTAATAGCAACAA
P34(CTTC)6 GGCGAGGAACTGTTGTTG 25232.330.20830.57380.4898
GATCGGCTTCATCGTCTACT
P101(ATC)12 CCCCGTATAAACCACCCA 24532.180.25560.54390.4827
GGCAGAACTTCAGCACCC
P149(AGC)9 CTTGGCACGCTTTGTTTG 25932.170.1910.54310.478
ACTTTCCCACGGCATCAG
P150(GAGC)5 TAAGTAGGTTTGAGGAAGGGAA 26532.220.11490.55330.4693
CAACATAGACAAGGTGCTGGA
P83(AAGAA)4 CTCGGCAAACAGAGGACA 27842.210.20830.55060.468
TTGTAGCAGCGGATGGTC
P30(ATGT)12 ACTGCCACTCCATTTAGG 24132.160.16840.54070.4626
CTGTCGTAGGCTTGCTTT
P63(AGC)9 GGCTTGGCACGCTTTGTT 25932.070.11460.52060.4573
TTTCCCACGGCATCAGTC
P90(GAT)7 CGCAAGCCACAGAGCACA 17732.120.11460.53010.4507
TCCGTCCGTTCGTCCATC
P9(AC)11 ATCACAAACAGCCACTGTCCTA 11142.010.28120.50470.4388
GTGGTGAACCTTGCCCTTG
P3(GA)10 GCGAGGATGATGTATAAACCG 13241.950.30770.48870.4256
TGCATTCTGTGCCCTAACTAA
P45(GGTT)5 CCCACAACACCAACAAAC 22932.080.17710.52190.413
GCCCGTAGAATGAACAAGTA
P55(CTG)9 TTGATGGAGAAGGAGCAT 26431.780.03190.44190.3926
ACATAGTAGGATAGATAGACCC
P105(CCTCG)4 GCGACTACCAGGACGACAA 29731.780.06320.44150.381
CACCGACCGATACAGACAGA
P56(CTG)7 AGTGATCTGAGGCGGTAT 17621.990.18750.50070.374
CGTACGTCCAATGTTGTC
P66(CTCTT)4 CAAATGTGCCAGTAGAAA 29321.990.2340.50060.374
GGATGAGTTGCAGGTGAT
P67(TTG)12 AGAAACAAACAGACAGACCCAT 28421.970.57290.4960.3717
ATTCCACCACCGTCACCA
P180(CAG)8 ATTCTCGCCGCCAACAACT 21721.970.20.49460.371
CCACGTAGAAAGGGAGGGTCA
P80(GGTTG)4 ACTCCTGCTGCTGCTGAC 14921.950.32290.49030.3688
CGGTATTAGGCGACTCTTC
P57(AATA)5 ATAACAGCCGTTGATGAG 26021.940.26040.48690.367
GATCCGTTCCACAAACAT
P54(ATC)7 CAGCACCACTACTAATCAAGAA 24521.9300.48490.366
GCCACCAACAAGACCTCC
P137(GAAGA)4 AGAGGACAAGCCAAGGAAG 16121.910.17390.4790.3629
CACGGAAACGGAACAAAA
P106(CTG)8 CGAGCCGTTGCTTAGGTC 20621.850.13830.46120.3535
TCTACTGCCAGGGCGTGA
P139(GCAT)5 ACTCACATAGTAATCGAAGGG 28721.830.48960.45680.3512
GGGCAAGAACGAATCTCC
P186(CTGA)5 GGTAGTTCCGCCATCAGA 17721.720.26040.41970.3303
CCTCCTGTGGACGAAGAT
P187(GCACA)4 CTCGGACGACCATTTATT 20921.70.18750.41540.3278
TTCAAAGTTCAAGGGTGC
P53(CCAA)5 AGGGAAAGAAATCCTAAC 22421.630.09680.39020.3128
TTGACTTGCTTATACACCT
P13(AT)19 CACATGCGTTAGTGTCCC 29821.6300.38990.3126
GCGATTATCTTCGTCCAG
P16(TG)11 CGAGCAGGCATAGCCATAT 25631.440.0690.30970.2853
GACGCTGAGTACGTTGAGGT
P61(GCA)8 CAAATGGAGCCAAGCAAC 23521.470.18280.32040.2679
CCATCCTTGACGCACATC
P81(CTG)8 GCAGGATAGGCGACACTC 14121.380.13330.27930.2392
GAGACGGAGAAGGAGCAG
P185(CGG)8 AAACGGCTTTCACATCTCCC 20121.380.06250.27920.2392
CGCCCAAACAAGTCCTCC
P120(AGC)7 GAAATACTCCCAGGACAGC 24921.330.01060.24730.2157
AGCAAGTGCCAGTTCTACC
P100(CACG)6 CACATAAACAACCGAACCAA 24521.230.02080.18760.1693
CGACATACGCAGGGAGTG
P21(GAC)7 AACCTATGCCGCCTACTT 24121.110.04170.09930.0939
CCACCCGTCCACTCTTTT
P44(GCAA)5 AGTCCCGTAAACCTACCTGAG 16521.0900.08540.0813
TGCCGGAGAATGTAATCG

Note: Na, number of alleles; Ne, number of effective alleles; Ho, observed heterozygosity; He, expected heterozygosity; PIC, polymorphic information content.

Table 2

Characterization of 20 genomic-SSR makers in barley.

PrimerSSR motifPrimer sequence (5′-3′)Expected size (bp)NaNeHoHePIC
F/R
S40(AT)29 ACACCTTCCCAGGACAATCC 18264.540.0220.78420.748
CAGAGCACCGAAAAAGTCTGTA
S22(GT)13,(AG)19 AAGCTCTTTCTTGTATTCGTG 15854.090.05260.75950.7162
GTCCATACTCTTTAACATCCG
S18(CT)28 CTGGGATTGGATCACTCTAA 10753.90.02110.74740.7016
AAAACAAGTACTGAAAATAGGAGA
S7(AC)20 ATAGATCACCAAGTGAACCAC 17753.490.08330.71750.6776
GGTTATCACTGAGGCAAATAC
S37(CT)18 CCGACAACATGCTATGAAGC 13153.350.05210.70490.6596
CTGCAGCAAATACCCATGTG
S2(AC)7T(CA)15 (AT)9 CCATCAAAGTCCGGCTAG 21543.320.03260.7030.6504
GTCGGGCCTCATACTGAC
S11(AG)15 TCCATGATGATGTGTGCATAGA 17353.010.09090.6720.6121
CGGATCCCAACAAACACAC
S4(AT)6(AC)16 GCTATGGCGTACTATGTATGGTTG 17343.040.05490.67490.6106
TCACGATGAGGTATGATCAAAGA
S41(TG)8 AGTATGGGGAATTTATTTGG 13642.790.03120.64550.5864
GCTGCAAAGTATGACAATATG
S25(CT)24 TTTGTGACATCTCAAGAACAC 15842.770.18890.64280.5845
TGACAAACAAATAATCACAGG
S38(GA)17(GA)7 CTATCACACGACGCAACATG 16952.730.51060.63760.5828
CCTGAGAAAGAAAGCGCAAC
S30(GC)5GGG (GT)16 CAAATCAATCAAGAGGCC 15332.7400.63840.5615
TTTGAAGTGAGACATTTCCA
S21(AG)7C(AG)30-(AG)6 GGGAACTTGCTAATGAAGAG 15032.6700.62840.5546
AATGTAAGGGAGTGTCCATAG
S19(AG)19 CCCTAGCCTTCCTTGAAG 13532.460.03160.59730.5292
TTACTCAGCAATGGCACTAG
S29(GT)16 AGAATCAAGATCGACCAAAC 12442.190.02330.54640.5027
AAAAACATGAACCGATGAA
S15(CT)16 ATTCATCGATCTTGTATTAGTCC 17432.160.03190.53910.4749
ACATCATGTCGATCAAAGC
S31(CT)21 CTATTTTCTAATGCTTGGACC 14932.180.09470.54370.4647
TGTCTAGTTCATCATCATTGC
S36(CA)9 GGATTTTCTCAAGAACACTT 23932.130.7660.53240.4597
GCGTGAGTGCATAACATT
S1(AC)11 GTCCTTTACGCATGAACCGT 13832.10.03160.52560.4547
ACATACGCCAGACTCGTGTG
S8(AC)13(AT)9 GCTCTCTCTCAGAAAAATGAA 17731.630.04440.38990.3492
GAATTATTCTAGGGCTGTGAA
Table 3

Polymorphism of SSR makers in Tibetan wild and cultivated barley.

No. of allelesPICHeNo. of allelesPICHe
MarkerWildCultivatedWildCultivatedWildCultivatedMarkerWildCultivatedWildCultivatedWildCultivated
P3420.4500.1560.52590.1754P129440.5810.6580.65440.7359
P8330.5820.4820.65970.5565P137210.37400.50190
P9420.4670.1100.54540.1210P139220.3320.3660.42260.4980
P13220.2710.3710.32510.5081P144520.5230.3460.55530.4598
P16330.2560.3750.27570.4456P149320.4950.3320.56090.4345
P21210.11000.11790P150330.4600.4560.53570.5701
P29330.4840.5200.56370.6048P152330.4420.4500.50480.5222
P30330.4310.3980.49080.4758P168420.5710.2580.63880.3145
P32320.5420.3150.62010.4046P180210.37400.50190
P34320.5070.3660.58780.4980P181430.6180.4780.67310.5544
P44220.0750.1100.07830.1210P184430.5670.4680.61770.5484
P45330.3650.2940.46230.3306P185220.2290.2830.26530.3528
P50320.5450.3050.62850.3871P186220.3450.1950.44580.2258
P53220.2880.3740.35030.5149P187220.3470.1100.45000.1210
P54220.3720.2580.49810.3145S1320.4880.1950.56510.2258
P55320.3820.3590.42370.4839S2430.6550.4400.71060.5425
P56220.3710.3590.49530.4839S4430.6150.5610.67400.6587
P57220.3470.2830.45000.3528S7420.5830.3590.63180.4839
P61220.2800.1950.33890.2258S8330.2750.5280.29880.6323
P63320.4750.3230.53860.4173S11530.6400.3270.69960.3730
P66220.3640.2580.48210.3145S15330.4280.5630.49270.6621
P67220.3750.3050.50240.3871S18540.7040.5920.75110.6694
P80220.3720.3230.49810.4173S19330.4570.4120.51970.4966
P81220.2480.1950.29160.2258S21330.5210.4600.59020.5652
P83430.4730.4380.55500.5423S22550.6830.6870.73440.7581
P90330.3780.5440.44790.6371S25440.5600.5150.63070.6000
P91330.5870.3270.66500.3730S29440.4290.6070.47130.7059
P99320.5490.3050.62230.3871S30330.5280.3540.59850.4113
P100220.1100.3370.11790.4435S31320.4990.1100.58460.1210
P101320.4990.3150.56370.4046S36330.4230.5480.50210.6414
P103320.4700.3710.52520.5081S37530.5840.5550.63180.6452
P105320.3800.1950.46340.2258S38540.5100.6260.57320.7011
P106220.3290.3490.41850.4657S40530.6940.3630.74220.4203
P120220.1590.3590.17490.4839S41440.5300.4830.58090.5565
P121430.4850.3670.55840.4529
Average 3.12 2.59 0.441 0.373 0.5098 0.4333
Note: Na, number of alleles; Ne, number of effective alleles; Ho, observed heterozygosity; He, expected heterozygosity; PIC, polymorphic information content.

Gene functions of the 49 unigene sequences containing polymorphic EST-SSRs

Functions of the 49 polymorphic EST-SSRs were determined and 47 unigenes showed significant similarities to the known genes (Table 4), for instance, zinc finger protein MAGPIE, transcription factor LAF1, photosystem II reaction center PSB28 protein, xyloglucan endotransglycosylase (XET), and protein kinase APK1B. In addition, the results revealed that the most annotated proteins were from Triticum urartu (17, 36.2%), and the species Hordeum vulgare and Aegilops tauschii accounted for the same percentage (11, 23.4%).
Table 4

The putative proteins identified by BLASTX of 49 unigene sequences containing polymorphic EST-SSRs.

PrimerAccession No.Putative proteinOrganismE-value
P181CA032876.1Hypothetical protein TRIUR3_30088 Triticum urartu 4.00E-51
P129CV063130.1Putative SKP1 protein T.aestivum 1.00E-77
P184CB858539.1Hypothetical protein TRIUR3_19075 T.urartu 1.00E-46
P50DN178534.1UCW116, putative lipase H. vulgare subsp. vulgare 3.00E-125
P91FD524685.1Putative syntaxin-131 Aegilops tauschii 1.00E-93
P8AL506646.1Zinc finger protein MAGPIE T.urartu 4.00E-41
P29AV943994.1RNA polymerase sigma factor rpoD T.urartu 7.00E-116
P103CA009356.1GID1-like gibberellin receptor H. vulgare subsp. vulgare 4.00E-04
P32EX593207.1Disease resistance protein RGA2 Aegilops tauschii 8.70E-02
P168BU997138.1Hypothetical protein TRIUR3_09517 T.urartu 1.00E-04
P99GH218162.1Two-component response regulator ARR9 T.urartu 2.00E-64
P152AV938130.1Predicted protein H. vulgare subsp. vulgare 1.10E-01
P144EX598444.1No hit - -
P121CK569829.1ACC oxidase H. vulgare 9.00E-74
P34DN186304.1Predicted: UDP-glucose 6-dehydrogenase-like Brachypodium distachyon 5.00E-65
P101GH223749.1FT-like protein H. vulgare subsp. vulgare 1.00E-45
P149EX583185.1Condensin-2 complex subunit G2 T.urartu 5.00E-54
P150FD519288.1Curcuminoid synthase T.urartu 5.00E-59
P83FD527549.1Putative pectinesterase 53 Aegilops tauschii 1.00E-76
P30DN177250.1Hypothetical protein F775_31773 Aegilops tauschii 1.00E-05
P63EX577085.1Condensin-2 complex subunit G2 T.urartu 6.00E-69
P90FD528427.1Photosystem II reaction center PSB28 protein T.urartu 2.00E-83
P9AL505258.1Hypothetical protein f775_27232 Aegilops tauschii 6.00E-113
P3BJ547928.1Hypothetical protein TRIUR3_27885 T.urartu 1.00E-113
P45FD523777.1Hypothetical protein OsI_14737 Oryza sativa Indica Group3.00E-50
P55AL505545.1No hit - -
P105CA014373.1Eukaryotic translation initiation factor 1A Leymus chinensis 5.00E-72
P56EX584572.1Hypothetical protein F775_08651 Aegilops tauschii 2.00E-37
P66FD518055.1Predicted: protein LOC100843116 B.distachyon 5.00E-51
P67FD520223.1Hypothetical protein TRIUR3_27901 T.urartu 8.00E-36
P180CA030489.1Hypothetical protein TRIUR3_23016 T.urartu 4.00E-73
P80FD523499.1Casein kinase I-2-like protein A.tauschii 1.00E-75
P57EX599270.1Hypothetical protein ZEAMMB73_419738 Zea mays 7.00E-56
P54AL500476.1PM2 H. vulgare subsp. vulgare 5.00E-67
P137DN180922.1PREDICTED: protein LOC100846358 B.distachyon 2.00E-02
P106CA031374.1OSJNBa0074L08.11 Oryza sativa Japonica Group1.00E-46
P139AL501810.1GDSL esterase/lipase A.tauschii 3.00E-40
P186CB864664.1Protein kinase APK1B, chloroplastic A.tauschii 4.00E-50
P187CB864737.1Inactive ubiquitin carboxyl-terminal hydrolase 54 T.urartu 1.00E-17
P53EH090859.1TBC1 domain family member 15 A.tauschii 2.00E-57
P13CK569261.1Hypothetical protein TRIUR3_25268 T.urartu 3.50E-01
P16CB873886.1Phospholipid transfer protein precursor H. vulgare subsp. vulgare 2.00E-43
P61EX573461.1Predicted protein H. vulgare subsp. vulgare 6.00E-60
P81FD521065.1Predicted protein H. vulgare subsp. vulgare 1.00E-81
P185CB860073.1Peptide transporter PTR2 A.tauschii 5.00E-60
P120CK569159.1Xyloglucan endotransglycosylase (XET) H. vulgare subsp. vulgare 5.00E-69
P100GH216950.1Rho GDP-dissociation inhibitor 1 T.urartu 7.00E-69
P21CK122115.1Predicted protein H. vulgare subsp. vulgare 5.00E-116
P44CV063055.1Transcription factor LAF1 T.urartu 3.00E-70

Population structure and genetic distance

To detect the population structure in the 96 barley genotypes, we performed STRUCTURE program for Bayesian clustering analysis using 69 SSR markers, assuming that the number of populations (K) ranged from 1 to 15. The highest log likelihood score (Δk) was at K = 8 (Figure 1A), indicating that the most suitable number of subpopulations was eight. The frequency of each accession assigned to a subpopulation was shown in Table S1. If the threshold of frequency was set at 0.5, only six accessions were defined as admixed. However, about 80% of the accessions can be derived from the subpopulations when the threshold was at 0.7. The output of structure analysis demonstrated that wild and cultivated barleys were assigned to different subpopulations (Figure 1B). Most of the cultivated barleys were classified into the subpopulation 4, except for A74, Tadmor, B1342 and B1031. Fifty percent of the wild barley accessions studied were assigned to subpopulation 1.
Figure 1

Δk and population structure.

Estimation of the likelihood of clusters (k) for the most appropriate subpopulations (Δk) (A), and the population structure of 96 barley accessions in k = 8 clusters (B).

Δk and population structure.

Estimation of the likelihood of clusters (k) for the most appropriate subpopulations (Δk) (A), and the population structure of 96 barley accessions in k = 8 clusters (B). According to the values of genetic distance of the eight subpopulations, we get the dendrogram showing the genetic relationship of the subpopulations via UPGMA clustering analysis (Figure 2). The dendrogram showed that the subpopulation 3 was most close to the cultivated barleys (subpopulation 4) with the genetic distance of 132.188. The subpopulation 7 had the largest genetic distance (165.167) with the cultivated subpopulation.
Figure 2

The dendrogram of the eight subpopulations according to the genetic distance using UPGMA clustering analysis.

Discussion

In recent years, different kinds of molecular markers have been used widely, including marker-assisted breeding, study of genetic relationships between populations, and screening candidate genes associated with the target traits [23]. The simple sequence repeats (SSRs) are increasingly important due to their high polymorphism and convenient techniques. However, EST-SSRs are superior to genomic SSRs for their transcriptional sequence and suitable application in cross-species [24]. In the present study, we developed 49 EST-SSR and 20 genomic SSR markers for wild barley. These novel EST-derived markers will be a valuable resource for tagging and mapping of genes related to agronomic and stress-resistant traits of interest. In addition, these markers are advantageous for identifying functional diversity of unique adaptive germplasm because of their genic function. In many plants, the di- and tri-nucleotides repeat motifs were the major types, but the predominant motifs were different in various species [25], [26]. In our research, the tri-meric repeats were the most abundant motifs (40.82%), followed by the tetra-meric repeats accounted for 26.53%, and the di-meric and penta-meric repeat motifs were at the same frequency (16.32%).The polymorphism of SSRs can be divided into three degrees: high (PIC>0.5), medium (0.5>PIC>0.25) or low (PIC<0.25) [27]. In our study, the genetic diversity of genomic SSRs was higher than the EST-SSRs, with the mean PIC value of 0.57 (high) and 0.41 (medium), respectively, resulting in the general medium polymorphism (mean = 0.46). This finding was in line with previous results, and the lower level of polymorphism of EST-SSRs might be due to the selection against the variation in the conserved regions of the EST-SSRs [28]. Moreover, the expected levels of heterozygosity at EST-SSRs were also not as high as that of genomic SSRs, ranging from 0.0854 to 0.697 vs 0.3899 to 0.7842. Pompanon et al. [29] contributed the deficiency of heterozygosity to the primer problems, the deletion of alleles and appearance of invalid alleles at the annealing points. Studies of the genetic variation in barley suggested that Tibetan wild barley showed higher polymorphism than cultivated barley [30]–[32]. The results of our study were consistent with the previous studies. The number of alleles and the PIC of wild barley were both higher than that of cultivated barley, being 3.12 vs 2.59 and 0.44 vs 0.37. The expected heterozygosity (He) showed the same trend, with 0.5098 and 0.4333 for wild and cultivated barley, respectively. The richness of genetic diversity in Tibetan wild barley may be the source of novel genes contributing to the tolerance of biotic and abiotic stresses, which is important in the barley breeding. BLASTX analysis indicated that 47 (96%) of the 49 unigenes containing EST-SSRs can be matched to at least one important proteins in the NCBI nr protein database. For futher study, we can search the candidate genes of interest via association analysis referring to the function of markers in the metabolism pathways. Furthermore, these EST-SSR markers can be utilized as affirmative markers for comparative studies in the related species, for example, Triticum urartu and Aegilops tauschii. In the present investigation, the findings of population structure analysis demonstrated that the developed EST-SSRs and genomic SSRs could distinguish between the cultivated and wild barley genotypes clearly. The 96 genotypes were divided to eight subpopulations. The subpopulation 3 (XZ161, XZ163, XZ165, XZ168) was most closely related to the cultivated barley (subpopulation 4), and the subpopulation 7 (XZ120, XZ151, XZ153) and the cultivated barleys were two most genetically distant populations. The genetic relation of the subpopulations suggested that the subpopulation 3 contained the most domesticated genotypes among the studied wild barley. Futhermore, the other subpopulations of wild barley, especially subpopulation 7, may be the important germplasm resource for the improvement of cultivars tolerant of abiotic and biotic stresses. These results were consistent with recent clustering studies in the Tibetan wild barley genotype using DArT markers and SNPs[3]. This indicates that the cluster analysis using EST-SSR and SSR markers is an effective way to determine the structure of populations and can constitute a solid foundation for the genetic variation study.

Conclusion

The 49 novel EST-SSRs and 20 genomic SSR markers developed from 96 barley genotypes were highly polymorphic and could be employed to examine genetic diversity, evolution, linkage mapping, comparative genomics, and population structure. The Tibetan wild barley showed higher genetic variation than cultivated barley, and the cultivated subpopulation could be separated from the wild barley clearly. For further studies, these developed markers could be useful in identifying trait-marker association of interest in the marker-assisted breeding programs in barley. List of 96 genotypes used in this study and their inferred subpopulations with k = 8. (DOCX) Click here for additional data file.
  18 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  In silico analysis on frequency and distribution of microsatellites in ESTs of some cereal species.

Authors:  Rajeev K Varshney; Thomas Thiel; Nils Stein; Peter Langridge; Andreas Graner
Journal:  Cell Mol Biol Lett       Date:  2002       Impact factor: 5.787

3.  EST analysis in barley defines a unigene set comprising 4,000 genes.

Authors:  W Michalek; W Weschke; K-P Pleissner; A Graner
Journal:  Theor Appl Genet       Date:  2002-01       Impact factor: 5.699

Review 4.  Molecular markers from the transcribed/expressed region of the genome in higher plants.

Authors:  P K Gupta; S Rustgi
Journal:  Funct Integr Genomics       Date:  2004-04-17       Impact factor: 3.410

5.  Mining and survey of simple sequence repeats in expressed sequence tags of dicotyledonous species.

Authors:  Siva P Kumpatla; Snehasis Mukhopadhyay
Journal:  Genome       Date:  2005-12       Impact factor: 2.166

Review 6.  Genotyping errors: causes, consequences and solutions.

Authors:  François Pompanon; Aurélie Bonin; Eva Bellemain; Pierre Taberlet
Journal:  Nat Rev Genet       Date:  2005-11       Impact factor: 53.242

7.  Use of tall fescue EST-SSR markers in phylogenetic analysis of cool-season forage grasses.

Authors:  M A Rouf Mian; Malay C Saha; Andrew A Hopkins; Zeng-Yu Wang
Journal:  Genome       Date:  2005-08       Impact factor: 2.166

8.  Development of a EST dataset and characterization of EST-SSRs in a traditional Chinese medicinal plant, Epimedium sagittatum (Sieb. Et Zucc.) Maxim.

Authors:  Shaohua Zeng; Gong Xiao; Juan Guo; Zhangjun Fei; Yanqin Xu; Bruce A Roe; Ying Wang
Journal:  BMC Genomics       Date:  2010-02-08       Impact factor: 3.969

9.  A comparison of sequence-based polymorphism and haplotype content in transcribed and anonymous regions of the barley genome.

Authors:  Joanne Russell; Allan Booth; John Fuller; Brian Harrower; Peter Hedley; Gordon Machray; Wayne Powell
Journal:  Genome       Date:  2004-04       Impact factor: 2.166

10.  Tibet as a potential domestication center of cultivated barley of China.

Authors:  Xifeng Ren; Eviatar Nevo; Dongfa Sun; Genlou Sun
Journal:  PLoS One       Date:  2013-05-03       Impact factor: 3.240

View more
  11 in total

1.  Development and characterization of polymorphic EST based SSR markers in barley (Hordeum vulgare).

Authors:  Won-Sam Jo; Hye-Yeong Kim; Kyung-Min Kim
Journal:  3 Biotech       Date:  2017-07-26       Impact factor: 2.406

2.  The complete chloroplast genome of critically endangered Chimonobambusa hirtinoda (Poaceae: Chimonobambusa) and phylogenetic analysis.

Authors:  Yanjiang Liu; Xiao Zhu; Mingli Wu; Xue Xu; Zhaoxia Dai; Guangqian Gou
Journal:  Sci Rep       Date:  2022-06-10       Impact factor: 4.996

3.  Transcriptomic analysis, genic SSR development, and genetic diversity of proso millet (Panicum miliaceum; Poaceae).

Authors:  Siyu Hou; Zhaoxia Sun; Yaoshen Li; Yijie Wang; Hubin Ling; Guofang Xing; Yuanhuai Han; Hongying Li
Journal:  Appl Plant Sci       Date:  2017-07-28       Impact factor: 1.936

4.  Genome size, cytogenetic data and transferability of EST-SSRs markers in wild and cultivated species of the genus Theobroma L. (Byttnerioideae, Malvaceae).

Authors:  Rangeline Azevedo da Silva; Gustavo Souza; Lívia Santos Lima Lemos; Uilson Vanderlei Lopes; Nara Geórgia Ribeiro Braz Patrocínio; Rafael Moysés Alves; Lucília Helena Marcellino; Didier Clement; Fabienne Micheli; Karina Peres Gramacho
Journal:  PLoS One       Date:  2017-02-10       Impact factor: 3.240

5.  Association mapping unveils favorable alleles for grain iron and zinc concentrations in lentil (Lens culinaris subsp. culinaris).

Authors:  Akanksha Singh; Vinay Sharma; Harsh Kumar Dikshit; Muraleedhar Aski; Harish Kumar; Nepolean Thirunavukkarasu; Basavanagouda S Patil; Shiv Kumar; Ashutosh Sarker
Journal:  PLoS One       Date:  2017-11-21       Impact factor: 3.240

6.  Genome survey of Zanthoxylum bungeanum and development of genomic-SSR markers in congeneric species.

Authors:  Jingmiao Li; Siqiao Li; Lijuan Kong; Lihua Wang; Anzhi Wei; Yulin Liu
Journal:  Biosci Rep       Date:  2020-06-26       Impact factor: 3.840

7.  Development and technical application of SSR-based individual identification system for Chamaecyparis taiwanensis against illegal logging convictions.

Authors:  Chiun-Jr Huang; Fang-Hua Chu; Yi-Shiang Huang; Yu- Mei Hung; Yu-Hsin Tseng; Chang-En Pu; Chi-Hsiang Chao; Yu-Shyang Chou; Shau-Chian Liu; Ya Ting You; Shuo-Yu Hsu; Hsiang-Chih Hsieh; Cheng Te Hsu; Meng-Yi Chen; Ting-An Lin; Hsin-Yi Shyu; Yu-Ching Tu; Chi-Tsong Chen
Journal:  Sci Rep       Date:  2020-12-16       Impact factor: 4.379

8.  Genic microsatellite marker characterization and development in little millet (Panicum sumatrense) using transcriptome sequencing.

Authors:  Hiral Desai; Rasmieh Hamid; Zahra Ghorbanzadeh; Nishant Bhut; Shital M Padhiyar; Jasminkumar Kheni; Rukam S Tomar
Journal:  Sci Rep       Date:  2021-10-18       Impact factor: 4.379

9.  The leaf transcriptome of fennel (Foeniculum vulgare Mill.) enables characterization of the t-anethole pathway and the discovery of microsatellites and single-nucleotide variants.

Authors:  Fabio Palumbo; Alessandro Vannozzi; Nicola Vitulo; Margherita Lucchin; Gianni Barcaccia
Journal:  Sci Rep       Date:  2018-07-11       Impact factor: 4.379

10.  Capsicum chinense MYB Transcription Factor Genes: Identification, Expression Analysis, and Their Conservation and Diversification With Other Solanaceae Genomes.

Authors:  Khushbu Islam; Abdul Rawoof; Ilyas Ahmad; Meenakshi Dubey; John Momo; Nirala Ramchiary
Journal:  Front Plant Sci       Date:  2021-10-13       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.