| Literature DB >> 32010068 |
Prem Lal Kashyap1, Sudheer Kumar1, Ravi Shekhar Kumar1, Rahul Tripathi1, Palika Sharma1, Anju Sharma1, Poonam Jasrotia1, Gyanendra Pratap Singh1.
Abstract
Barley covered smut (CS) pathogen Ustilago hordei genome was mined for microsatellite distribution and their application in defining population structure and genetic variation. To dissect the molecular variation and genetic structure of U. hordei, 59 fungal isolates representing two distinct agro-ecological zones of India were analyzed by employing simple sequence repeats (SSRs). Using bioinformatic approaches, a total of 100,239 and 137,442 microsatellites were identified from 20.13 and 26.94 Mb of assembled genomic sequences of Uh364 and Uh4857-4 isolates of U. hordei, respectively. Penta-nucleotides (31.29 and 29.75%) followed by tri-nucleotide (28.27 and 29.88%) were most prevalent in both the genomes. Out of them, 15 polymorphic microsatellites showing conservancies in both the genomes were selected for exploring population genetic structure of U. hordei. An average of two alleles per microsatellite marker was generated with band size ranging from 180 to 850 bp. Polymorphic information content (PIC) varied between 0.095 and 0.37. Fifty-nine isolates were distributed in two distinct groups with about 65% genetic similarity according to UPGMA clustering and population structure analysis (K = 2). Gene flow analysis (Nm = 1.009) reflected moderate gene flow among the analyzed population. An analysis of molecular variance (AMOVA) displayed high level of genetic variation within population (87%) and low variation among populations (13%). Linkage disequilibrium (LD) analysis indicated positively significant but relatively low standardized index of association (SIA) value in both the population sets (SIA = 0.181), advocating a state of LD with epidemic population structure. In conclusion, the newly developed neutral SSR markers are highly polymorphic within U. hordei and will be useful for revealing evolutionary history and providing deep insight into the population dynamics of U. hordei in India as well as facilitating developing management strategies for CS of barley.Entities:
Keywords: barley; covered smut; gene flow; genetic diversity; genome; haplotype; microsatellite; population structure
Year: 2020 PMID: 32010068 PMCID: PMC6974468 DOI: 10.3389/fmicb.2019.02929
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1A map showing states of PZ (Uttar Pradesh, Punjab, Haryana, and Rajasthan) and HZ (Himachal Pradesh and Uttarakhand) surveyed for the collection of Ustilago hordei isolates from barley fields. PZ, plain zone; HZ, hill zone.
Genetic characteristics of 15 polymorphic simple-sequence repeat (SSR) loci in 59 Indian isolates of U. hordei.
| UHB3 | TAGCCTTTGAGGTCGATGTAGG | (GTT)5 | 246 | 225–250 | 54 | 100 | 2 | 0.0997 | 0.0948 |
| TGGGTGTCTTTCAGATGAGTTG | |||||||||
| UHB4 | CAATTACTCGTCGTCCTCCTTC | (TCC)6 | 222 | 200–225 | 54 | 100 | 2 | 0.1349 | 0.1258 |
| AGCGTTCAGCGTAAGGTAGTTC | |||||||||
| UHB5 | GCTCGTCTACCTCTGCGATACT | (GGA)5 | 241 | 240–280 | 55 | 100 | 2 | 0.1769 | 0.1612 |
| TCTGCATCTCAATCAACCAATC | |||||||||
| UHB6 | AGACATGCACCGTAACAACAAC | (CGG)6 | 204 | 210–240 | 55 | 100 | 2 | 0.4767 | 0.3631 |
| TACCCTCCATACTCTTGTCCGT | |||||||||
| UHB17 | TCTTGTGGAGTCTGCTGTTGTT | (TGC)5 | 239 | 220–250 | 54 | 100 | 2 | 0.2688 | 0.2327 |
| GTAGCTTCAGGTCGCATCACTT | |||||||||
| UHB 20 | GGTTGTTGTCATAGGGGTTGTC | (TCG)4 | 259 | 260–520 | 54 | 100 | 2 | 0.4842 | 0.367 |
| TACCAGAACATGGGTTTCAGC | |||||||||
| UHB21 | AGGTCTGGTGTGAGTGTTGATG | (TGA)4 | 258 | 225–260 | 54 | 100 | 2 | 0.233 | 0.2058 |
| CTCCTCATTGTAGTGCGTGTGT | |||||||||
| UHF 25 | TACTTCTCCTCCTCCTCCTCCT | (TATT)5 | 285 | 300–350 | 52 | 100 | 2 | 0.4904 | 0.3702 |
| UHB 26 | AGAGACCAAGTCGAATCCAAAG | (CAAGG)6 | 294 | 260–300 | 52 | 100 | 2 | 0.3866 | 0.3119 |
| UHB 27 | CATTTCAGTGTTGGACAAGCAT | (GTGTCA)4 | 251 | 200–250 | 52 | 100 | 2 | 0.2408 | 0.2118 |
| UHB 28 | CTAAGCATAAGGAGGCAACCAG | (TAAAA)5 | 271 | 280–850 | 54 | 100 | 2 | 0.4228 | 0.3334 |
| CGGAGTATTGGGAGTGAAATGT | |||||||||
| UHB 30 | GGTGATTGGAAGACCACAGAAT | (AAGCCA)5 | 227 | 180–230 | 55 | 100 | 2 | 0.2726 | 0.2354 |
| UHB 31 | CACAAACACACACACACACACA | (GCTCCC)4 | 224 | 200–230 | 54 | 100 | 2 | 0.375 | 0.3047 |
| CTGAACAGTAAAGCCTGAAGGG | |||||||||
| UHB 32 | TCCTACATTGGGATGACTGATG | (CAACGG)4 | 217 | 190–220 | 52 | 100 | 2 | 0.2008 | 0.1806 |
| UHB 36 | ATGAGGTCAAGAGTCAGCAACA | (GA)8 | 200 | 180–210 | 54 | 100 | 2 | 0.2149 | 0.1918 |
Number and distribution of SSRs in whole genome of two isolates of U. hordei.
| NCBI BioProject No. | ||
| NCBI GenBank assembly accession | ||
| Whole Genome Release date | 20.03.2018 | 20.12.2013 |
| Total size covered by examined sequences (Mb) | 20.13 | 26.94 |
| Number of SSR identified | 100,239 | 137,442 |
| Perfect SSRs | 72,865 | 98,613 |
| Imperfect SSRs | 21,633 | 31,665 |
| Compound SSRs | 5741 | 7164 |
| Total length of SSRs | 1,655,993 | 2,294,817 |
| Relative density (bp per Mb) | 82,253.64 | 85,178.03 |
| Relative abundance (SSR per Mb) | 5125.14 | 5246.27 |
Comparative account of total count length, percentage, relative abundance, and relative density of most abundant motifs in the U. hordei genome.
| A/T | 8879 | 11,910 | 71,432 | 100,935 | 12.19 | 12.08 | 8.05 | 8.47 | 441.02 | 442.07 | 3548.05 | 3746.46 |
| ACG/CGT | 5843 | 6691 | 58,590 | 66,888 | 8.02 | 6.79 | 10.03 | 10 | 290.22 | 248.35 | 2910.18 | 2482.72 |
| C/G | 3662 | 4422 | 25,275 | 30,018 | 5.03 | 4.48 | 6.9 | 6.79 | 181.89 | 164.13 | 1255.42 | 1114.2 |
| AAG/CTT | 2805 | 3224 | 27,429 | 31,446 | 3.85 | 3.27 | 9.78 | 9.75 | 139.33 | 119.67 | 1362.41 | 1167.2 |
| ACC/GGT | 2450 | 2815 | 24777 | 28,347 | 3.36 | 2.85 | 10.11 | 10.07 | 121.69 | 104.49 | 1230.68 | 1052.17 |
| AAC/GTT | 1522 | 2053 | 17,598 | 22,617 | 2.09 | 2.08 | 11.56 | 11.02 | 75.6 | 76.2 | 874.1 | 839.49 |
| AAACG/CGTTT | 814 | 994 | 8390 | 10,275 | 1.12 | 1.01 | 10.31 | 10.34 | 40.43 | 36.89 | 416.73 | 381.38 |
| AAAAG/CTTTT | 775 | 895 | 8680 | 10,105 | 1.06 | 0.91 | 11.2 | 11.29 | 38.49 | 33.22 | 431.14 | 375.07 |
| AG/CT | 664 | 838 | 12,502 | 15,328 | 0.91 | 0.85 | 18.83 | 18.29 | 32.98 | 31.1 | 620.98 | 568.94 |
| AC/GT | 501 | 540 | 10,218 | 10,816 | 0.69 | 0.55 | 20.4 | 20.03 | 24.88 | 20.04 | 507.53 | 401.46 |
| AAAAAG/CTTTTT | 384 | 514 | 6102 | 8778 | 0.53 | 0.52 | 15.89 | 17.08 | 19.07 | 19.08 | 303.09 | 325.82 |
| AAT/ATT | 291 | 419 | 3465 | 4896 | 0.4 | 0.42 | 11.91 | 11.68 | 14.45 | 15.55 | 172.11 | 181.73 |
| AAAAC/GTTTT | 285 | 349 | 3060 | 3725 | 0.39 | 0.35 | 10.74 | 10.67 | 14.16 | 12.95 | 151.99 | 138.26 |
| AAACC/GGTTT | 243 | 300 | 2445 | 3015 | 0.33 | 0.3 | 10.06 | 10.05 | 12.07 | 11.14 | 121.44 | 111.91 |
| AAAAT/ATTTT | 199 | 258 | 2275 | 2910 | 0.27 | 0.26 | 11.43 | 11.28 | 9.88 | 9.58 | 113 | 108.01 |
| AACG/CGTT | 174 | 196 | 2736 | 2996 | 0.24 | 0.2 | 15.72 | 15.29 | 8.64 | 7.28 | 135.9 | 111.2 |
| AT | 138 | 176 | 2464 | 3022 | 0.19 | 0.18 | 17.86 | 17.17 | 6.85 | 6.53 | 122.39 | 112.17 |
| AAAG/CTTT | 100 | 131 | 1452 | 1864 | 0.14 | 0.13 | 14.52 | 14.23 | 4.97 | 4.86 | 72.12 | 69.19 |
| AAAACG/CGTTTT | 91 | 115 | 1152 | 1596 | 0.12 | 0.12 | 12.66 | 13.88 | 4.52 | 4.27 | 57.22 | 59.24 |
| AAAAAT/ATTTTT | 88 | 110 | 1230 | 1464 | 0.12 | 0.11 | 13.98 | 13.31 | 4.37 | 4.08 | 61.09 | 54.34 |
| AAAAAC/GTTTTT | 78 | 107 | 1362 | 1680 | 0.11 | 0.11 | 17.46 | 15.7 | 3.87 | 3.97 | 67.65 | 62.36 |
| AAAT/ATTT | 56 | 79 | 960 | 1252 | 0.08 | 0.08 | 17.14 | 15.85 | 2.78 | 2.93 | 47.68 | 46.47 |
| AACC/GGTT | 42 | 49 | 576 | 660 | 0.06 | 0.05 | 13.71 | 13.47 | 2.09 | 1.82 | 28.61 | 24.5 |
| AAAC/GTTT | 35 | 37 | 448 | 484 | 0.05 | 0.04 | 12.8 | 13.08 | 1.74 | 1.37 | 22.25 | 17.96 |
| AAAACC/GGTTTT | 33 | 36 | 444 | 498 | 0.05 | 0.04 | 13.45 | 13.83 | 1.64 | 1.34 | 22.05 | 18.48 |
| CG | 9 | 10 | 112 | 124 | 0.01 | 0.01 | 12.44 | 12.4 | 0.45 | 0.37 | 5.56 | 4.6 |
Summary of the population diversity indices and estimation of linkage disequilibrium computed on the basis of SSR markers.
| HZ | 38 | 1.066 ± 0.319∗ | 0.766 ± 0.279 | 0.227 ± 0.167 | 0.1384 ± 0.1083 | 0.5903 | 86.67 | 0.022 (<0.001) | Yes | LD |
| PZ | 21 | 1.433 ± 0.175 | 1.140 ± 0.259 | 0.4425 ± 0.0920 | 0.2696 ± 0.0536 | 0.5271 | 100 | 0.129 (<0.001) | Yes | LD |
| Total | 59 | 0.181 | Yes | LD |
FIGURE 2UPGMA dendrogram from SSR data for 59 isolates of Ustilago hordei amplified by 15 microsatellites. Scale indicates Jaccard’s coefficient of similarity. The numbers at the branches are confidence values based on Felsenstein’s bootstrap produced by FreeTree software (Pavlícek et al., 1999). HZ, hill zone; PZ, plain zone.
FIGURE 3Population structure analysis of 59 isolates of Ustilago hordei. Analysis was carried out using STRUCTURE software with K set at 2. Inferred ancestries of the 59 UH isolates were based on two geographical distinct population.