Literature DB >> 31156587

Phylogeography and Population Structure Analysis Reveal Diversity by Gene Flow and Mutation in Ustilago segetum (Pers.) Roussel tritici Causing Loose Smut of Wheat.

Prem Lal Kashyap1, Sudheer Kumar1, Rahul Tripathi1, Ravi Shekhar Kumar1, Poonam Jasrotia1, Devendra Pal Singh1, Gyanendra Pratap Singh1.   

Abstract

Ustilago segetum (Pers.) Roussel tritici (UST) causes loose smut of wheat account for considerable grain yield losses globally. For effective management, knowledge of its genetic variability and population structure is a prerequisite. In this study, UST isolates sampled from four different wheat growing zones of India were analyzed using the second largest subunit of the RNA polymerase II (RPB2) and a set of sixteen neutral simple sequence repeats (SSRs) markers. Among the 112 UST isolates genotyped, 98 haplotypes were identified. All the isolates were categorized into two groups (K = 2), each consisting of isolates from different sampling sites, on the basis of unweighted paired-grouping method with arithmetic averages (UPGMA) and the Bayesian analysis of population structure. The positive and significant index of association (IA = 1.169) and standardized index of association (rBarD = 0.075) indicate population is of non-random mating type. Analysis of molecular variance showed that the highest variance component is among isolates (91%), with significantly low genetic differentiation variation among regions (8%) (Fst = 0.012). Recombination (Rm = 0) was not detected. The results showed that UST isolates have a clonal genetic structure with limited genetic differentiation and human arbitrated gene flow and mutations are the prime evolutionary processes determining its genetic structure. These findings will be helpful in devising management strategy especially for selection and breeding of resistant wheat cultivars.

Entities:  

Keywords:  gene flow; genetic diversity; haplotype; mutation; population structure

Year:  2019        PMID: 31156587      PMCID: PMC6529584          DOI: 10.3389/fmicb.2019.01072

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

Loose smut caused by the basidiomycete fungus Ustilago segetum (Pers.) Roussel tritici Jensen (UST), is one of the most serious diseases on wheat (Triticum aestivum L.) globally. The disease is favored by moist and cool climate during anthesis (Quijano et al., 2016). This fungus converts the spike floral tissues to fungal teliospores, causing yield losses equivalent to the percent smutted spikes (Green et al., 1968; Singh, 2018). The primary inoculum of the pathogen survives in the embryo of the wheat seeds (Kassa et al., 2015). Wilcoxon and Saari (1996) documented that the fungus can result in reductions of 5–20 per cent profit at an infection level of 1–2 per cent. Similarly, Nielsen and Thomas (1996) reported 15–30% annual yield losses as a result of UST infection. Joshi et al. (1980) reported loose smut incidence up to 10% in North Western parts of India. Besides India, 5–10% loose smut incidence was also reported from Russia, New Zealand, and USA (Thomas, 1925; Bonne, 1941; Atkins et al., 1943; Watts Padwick, 1948; Menzies et al., 2009; Kaur et al., 2014). The infection process and disease cycle of UST on wheat has been elaborately discussed by several workers (Wilcoxon and Saari, 1996; Ram and Singh, 2004). Dikaryotic spores of UST disembarked on the wheat floret, germinate and penetrate the ovary through feathery stigma during anthesis (Dean, 1964; Shinohara, 1976). Mycelia of UST stay alive within the embryo of infected seeds and move systemically through the growing point of the tillers without showing any visible symptoms (Kumar et al., 2018). The symptoms become visible on emergence of spikes from the boot. Several methods are available to manage loose smut that include use of disease free seed, seed treatment with hot water or systemic fungicides, and host resistance are highly effective in controlling loose smut of wheat (Jones, 1999; Bailey et al., 2003; Knox et al., 2014). Unfortunately, the high genetic variability in the pathogen population may develop strains resistant to fungicides and also reduces lifespan of the resistant varieties (Randhawa et al., 2009). Therefore, the understanding of the variability and mechanism causing variability in the pathogen population is important for framing effective disease management and resistance breeding strategies. Traditionally, variations in fungal pathogens have been deciphered on the basis of morphology, cultural characters, physicochemical characters, virulence pattern, mating type, and disease reaction on differential hosts (Kaur et al., 2014; Kashyap et al., 2015; Yu et al., 2016). Unfortunately, these methods are time consuming, highly influenced by environment and thus are not very precise. Recently, DNA profiling based on restriction fragment length polymorphism (RFLP), random amplified polymorphic DNA (RAPD), amplified fragment length polymorphism (AFLP) and inter simple sequence repeat (ISSR) are being extensively employed to study the population biology and genetic diversity among fungi (Bennett et al., 2005; Kashyap et al., 2016; Kumar et al., 2016; Yu et al., 2016). Karwasra et al. (2002) used RAPD, ISSR, and AFLP profiling for assessing the extent of genetic variation among the regional UST isolates collected from Haryana, India. SSR or microsatellites markers have an advantage in studying genetic diversity, population genetic structure, genetic linkage mapping, and quantitative trait locus (QTL) because of its high repeatability, transferability, co-dominance, and ubiquitous presence (Ellegren, 2004; Kumar et al., 2013a; Singh et al., 2014; Zhu et al., 2016). Recently, SSR has been used for deciphering diversity in many fungal plant pathogens, such as Puccinia triticina (Wang et al., 2009), Magnaporthe grisea (Shen et al., 2004), Rhynchosporium secalis (Bouajila et al., 2007), Phytophthora infestans (Zhao et al., 2008), Phaeosphaeria nodorum (Sommerhalder et al., 2010), Fusarium culmorum (Pouzeshimiab et al., 2014), Ustilago hordei (Yu et al., 2016), Puccinia graminis f. sp. tritici (Prasad et al., 2018), and Bipolaris oryzae (Ahmadpour et al., 2018). Storch et al. (2007) reported that protein-coded genes are generally more conserved and can be aligned with more reliability. Among protein-coding markers, second-largest subunit of nuclear RNA polymerase II (RPB2), translation elongation factor 1-alpha (Tef1), beta-tubulin (Tub2) and actin (ACT) have been most frequently used for inferring phylogenetic relationships among fungi (Stielow et al., 2015; Raja et al., 2017). Functionally, RPB2 gene is responsible for the transcription of protein-encoding genes (Sawadogo and Sentenac, 1990) and present as single-copy in all eukaryotes (Thuriaux and Sentenac, 1992). A high level of polymorphisms in this gene makes this an excellent tool to study molecular evolution and phylogenetic relationships (Matheny et al., 2007; Krimitzas et al., 2013; Wang et al., 2016; Kruse et al., 2017). Stockinger et al. (2014) reported RPB2 gene as a potential marker for adequate phylogenetic resolution to resolve fungal lineages when compared to rDNA loci. Therefore, in the present study phylogenetic analysis of the single copy of RPB2 gene was done to explore genetic differentiation of UST populations. Despite recent advances, the role of gene flow in the reproduction, dispersal and evolution of UST populations is still poorly studied. To the best of our knowledge, fingerprinting and genetic diversity in UST population using microsatellite markers have not been studied extensively on large number of UST isolates. Thus, the present investigation was undertaken to study the genetic variation in the UST isolates collected from four different agro-ecological zones of India. The specific objectives of present investigation were to: (i) analyze the genetic diversity of UST isolates of Indian origin based on geographic areas of collection by microsatellites and RPB2 gene sequence comparison (ii) investigate the possibility of random mating within their sampling sites using MULTILOCUS version 1.31 (Agapow and Burt, 2001) and (iii) determine the population genetic structure of the UST population in four different wheat growing zones by employing genetic data analysis tools like GenAlEx 6.5 (Peakall and Smouse, 2012), NTSYS-pc program V2.1 (Rohlf, 2002), DnaSP program (Tajima, 1983), STRUCTURE 2.3.4 (Pritchard et al., 2000), and Bottleneck v1.2 (Agapow and Burt, 2001).

Materials and Methods

Sampling, Isolation and Purification of Fungal Isolates

One hundred and twelve isolates of UST were collected from different agro-ecological zones of India during 2016–2018 (Table 1, Figure S1). Stratified random sampling method was adopted at spike emergence to anthesis stage at least 30 KM apart, in each field. One smutted spike was gathered per field to avoid the possibility of mixture of genotypes. Isolates were assigned into four populations and named as Central zone (CZ), North Eastern Plain Zone (NEPZ), North Hill Zone (NHZ), and North Western Plain Zone (NWPZ). Single-teliospore cultures were obtained by inoculating teliospores on half-strength potato dextrose agar (50% PDA) and incubated at 25 ± 1°C for 24 h. Single germinating teliospores were transferred to slants containing 50% PDA, incubated at 25 ± 1°C and maintained at 4°C for further use.
Table 1

Sampling details of Ustilago segetum tritici (UST) isolates used in the study.

S.N.Isolate codeRegion/StateZoneLatitude (N)Longitude (E)UTM (E)UTM (N)HGHCNCBI accession
1WLS17-PUN-1Yodhapur, PunjabNWPZ30.08476.573651593.803329136.60521MH160206
2WLS17-PUN-2Mirapur, PunjabNWPZ30.20076.492643618.003341932.4012MH160207
3WLS17-HR-3Taprapur, HaryanaNWPZ30.26777.152707042.003350276.80533MH160208
4WLS17-HR-4Gagsina, HaryanaNWPZ29.56876.886682707.803272403.20541MH160209
5WLS17-HR-5Samora, HaryanaNWPZ29.81677.006693861.703300051.80553MH160210
6WLS17-HR-6Indri, HaryanaNWPZ29.95177.048697648.503315152.90561MH160211
7WLS17-HR-7Stondi, HaryanaNWPZ29.57076.895683555.803272583.30571MH160212
8WLS15-WB-8Karimpur, West BengalNEPZ23.54588.539657067.902604691.20121MH160213
9WLS17-HR-9Gagsina, HaryanaNWPZ29.56876.886682707.803272403.20581MH160214
10WLS16-WB-10Muradpur (Jalangi), West BengalNEPZ24.08988.696672410.802665131.90131MH160215
11WLS15-WB-11Isalampur, West BengalNEPZ26.22788.146614505.402901327.10142MH160216
12WLS15-WB-12Dakashin Dinapur, West BengalNEPZ25.26488.905691837.202795509.90151MH160217
13WLS17-HR-13Budanpur, HaryanaNWPZ29.59376.914685393.003275245.30142MH160218
14WLS17-MH-14Pune, MaharashtraCZ18.48373.779371125.302044019.4012MH160219
15WLS17-MH-15Wakapur, MaharashtraCZ20.72676.967704857.902293078.0021MH160220
16WLS17-HR-16Mohay-Ud-Dinpur, HaryanaNWPZ29.64477.076700955.903281138.80591HD 2967
17WLS17-HR-17Bati, HaryanaNWPZ29.69776.297625522.103285898.80601HD 2967
18WLS17-PUN-18Langroya, PunjabNWPZ31.11176.174611948.103442538.30611MH160223
19WLS17-MH-19Washim, MaharashtraCZ20.19576.925701191.802234174.8031MH160224
20WLS17-HR-20Laha, Naraingarh, HaryanaNWPZ30.31377.066698615.603355265.50245MH160225
21WLS17-HR-21Dhanoli, Ambala, HaryanaNWPZ30.23277.054697671.803346277.90621MH160226
22WLS17-HR-22Pathreri, HaryanaNWPZ30.25477.016693922.803348650.20533MH160227
23WLS17-HR-23Pratapgarh, Kurukhestra, HaryanaNWPZ29.98276.880681336.503318261.00631MH160228
24WLS17-HR-24Ladwa, HaryanaNWPZ29.99177.038696558.403319515.30641MH160229
25WLS17-UP-25Sherkot, Uttar PradeshNWPZ29.32278.593266235.603246067.60553MH160230
26WLS17-RJ-26Badh Fatehpura, RajasthanNWPZ26.88875.559555511.302974114.30651MH160231
27WLS17-RJ-27Ramkui, RajasthanNWPZ26.96075.550554569.002982086.90661MH160232
28WLS17-RJ-28Sanganer, RajasthanNWPZ26.81975.696569125.002966539.20671MH160233
29WLS17-RJ-29Boraj, RajasthanNWPZ26.86275.448544455.902971262.00681MH160234
30WLS17-UK-30Domet, UttrakhandNHZ30.51177.854773937.403378919.90245MH160235
31WLS17-UP-31Unnao, Uttar PradeshNWPZ26.53680.486448834.202935181.00691MH160236
32WLS17-UP-32Gazipur, Uttar PradeshNWPZ25.78883.171717682.802854017.20701MH160237
33WLS17-MH-33Brahmangaon, MaharashtraCZ20.55274.302427242.102272723.1042MH160238
34WLS17-UP-34Nethaur, Uttar PradeshNWPZ29.32778.392246715.103246986.20553MH160239
35WLS17-UP-35Shahadatpura, Uttar PradeshNWPZ25.93583.569757294.602870954.70711MH160240
36WLS17-UP-36Shahganj, Uttar PradeshNWPZ26.05182.678667882.602882380.80721MH160241
37WLS17-UP-37Jaunpur, Uttar PradeshNWPZ25.74382.702670726.902848327.10731MH160242
38WLS17-MH-38Baramati, MaharashtraCZ18.13974.574454965.602005623.7051MH160243
39WLS17-UK-39Almora, UttrakhandNHZ29.79173.536358481.803296537.70251MH160244
40WLS17-UK-40Kaladungi, UttrakhandNHZ29.29579.336338358.603241810.20261MH160245
41WLS17-UK-41Quano, UttrakhandNHZ30.67777.764764845.003397092.3042MH160246
42WLS17-MH-42Jopul, MaharashtraCZ20.19073.933388515.802232865.2061MH160247
43WLS17-PUN-43Fatehgarh Sahib, PunjabNWPZ31.50976.223616103.603486677.80741MH160248
44WLS17-PUN-44Ropar, PunjabNWPZ31.24676.473640300.803457750.60751MH160249
45WLS17-PUN-45Bhalowal, PunjabNWPZ31.31876.423635408.903465683.00761MH160250
46WLS16-WB-46Petrapole, West BengalNWPZ23.03688.877692320.502548732.70161MH160251
47WLS17-PUN-47Hiyatpur, PunjabNWPZ31.14576.224616681.903446297.40771MH160252
48WLS17-PUN-48Hakimrpur Apra, PunjabNWPZ31.10675.907586525.203441703.60781MH160253
49WLS17-PUN-49Kalpi, PunjabNWPZ30.28877.000692344.903352378.30791MH160254
50WLS17-PUN-50Rupnagar, PunjabNWPZ30.99176.532646292.503429639.10801MH160255
51WLS17-PUN-51Sherpur Gill, PunjabNWPZ31.08276.265620635.203439429.50811MH160256
52WLS17-PUN-52Sibalmajra, PunjabNWPZ31.09976.242618492.203441213.70821MH160257
53WLS17-PUN-53Gurdashpur, PunjabNWPZ32.04675.383536162.003545619.30831MH160258
54WLS17-PUN-54Moga, PunjabNWPZ30.83775.158515088.803411540.20841MH160259
55WLS17-PUN-55Khabra, PunjabNWPZ31.22876.110605691.403455353.50851MH160260
56WLS17-PUN-56Dasuya, PunjabNWPZ31.81375.656562100.003519909.40861MH160261
57WLS17-PUN-57Jalandar, Punjab,NWPZ31.29875.544551727.203462736.10871MH160262
58WLS17-PUN-58Ludhiana, PunjabNWPZ30.90675.806576977.703419451.90881MH160263
59WLS17-JK-59Kathua, Jammu and KashmirNHZ32.42675.437541057.803587777.80271MH160264
60WLS17-JK-60RS Pura, Jammu and KashmirNHZ32.73674.830484087.303622074.90281MH160265
61WLS17-JK-61Chatha, Jammu and KashmirNHZ32.64074.800481229.903611389.90291MH160266
62WLS17-JK-62Barnai, Jammu and KashmirNHZ32.70974.789480216.503619064.90301MH160267
63WLS17-JK-63Udhaywall, Jammu and KashmirNHZ32.74174.821483266.303622559.80311MH160268
64WLS17-JK-64Ramghar, Jammu and KashmirNHZ32.55474.900490617.603601873.00321MH160269
65WLS17-JK-65Kana check, Jammu and KashmirNHZ32.75774.849485811.803624358.70331MH160270
66WLS17-JK-66Domana, Jammu and KashmirNHZ32.76974.817482841.903625643.60341MH160271
67WLS17-JK-67Ranbir Singh pura, Jammu and KashmirNHZ32.60674.734475034.803607688.00351MH160272
68WLS17-PUN-68Kurdan, Phillaur PunjabNWPZ31.04775.944590070.803435193.70891MH160273
69WLS17-PUN-69Khadukhera, PunjabNWPZ30.56376.464640377.703382102.30432MH160274
70WLS17-MP-70Indore, Madhya PradeshCZ22.69875.880590380.702510394.3071MH160275
71WLS17-MP-71Ujjain, Madhya PradeshCZ23.21475.786580451.802567464.0081MH160276
72WLS17-MP-72Rewa, Madhya PradeshCZ24.54181.302530568.702714113.9091MH160277
73WLS17-MP-73Bhopal, Madhya PradeshCZ23.25677.410746609.302573922.20101MH160278
74WLS17-MP-74Arjani, Madhya PradeshCZ23.82078.717267429.302636195.10111MH160279
75WLS17-RJ-75Rajpura, RajasthanNWPZ26.79576.002599620.302964152.30901MH160280
76WLS17-RJ-76Goner, RajasthanNWPZ26.82375.930592427.202967202.70911MH160281
77WLS17-HP-77Bara, Himachal PradeshNHZ31.48176.575649577.803484017.90361MH160282
78WLS17-HP-78Rudhanni, Himachal PradeshNHZ31.45376.690660539.103481039.00245MH160283
79WLS17-HP-79Kangra, Himachal PradeshNHZ32.10976.266619457.203553261.30371MH160284
80WLS17-HP-80Mandi, Himachal PradeshNHZ31.68076.940683934.703506583.40381MH160285
81WLS17-HP-81Ratti, Himachal PradeshNHZ31.60476.903680558.503498104.00391MH160286
82WLS17-HP-82Ramshehar, Himachal PradeshNHZ31.09176.790670693.403441100.20401MH160287
83WLS17-HP-83Una, Himachal PradeshNHZ31.47276.243618040.903482586.40411MH160288
84WLS17-HP-84Sundar Nagar, Himachal PradeshNHZ31.17176.853676644.103450015.20421MH160289
85WLS17-HP-85Kullu, Himachal PradeshNHZ31.95477.095698035.703537234.10432MH160290
86WLS17-UP-86Shamli, Uttar PradeshNWPZ29.62577.090702362.603279039.20921MH160291
87WLS17-UP-87Mohamdabad, Uttar PradeshNWPZ28.77278.164223090.503186033.70245MH160292
88WLS17-UP-88Rajak Nagar, Uttar PradeshNWPZ29.49577.245717686.803264898.00533MH160293
89WLS17-UP-89Faizabad, Uttar PradeshNWPZ26.76982.145613856.102961394.30931MH160294
90WLS17-HP-90Kunihar, Himachal PradeshNHZ31.08476.969687785.903440527.40441MH160295
91WLS17-HP-91Diggal, Himachal PradeshNHZ31.08476.869678280.103440428.70451MH160296
92WLS17-HP-92Jai Nagar, Himachal PradeshNHZ31.17176.853676644.103450015.20461MH160297
93WLS17-JK-93Khandwalmore, Jammu and KashmirNHZ32.40575.307528843.303585363.40472MH160298
94WLS17-JK-94Dablehar, Jammu and KashmirNHZ32.57674.760477448.103604322.60482MH160299
95WLS17-JK-95Satwari, Jammu and KashmirNHZ32.73774.831484195.203622165.00245MH160300
96WLS17-JK-96Saikalan, Jammu and KashmirNHZ32.50774.791480370.003596643.80491MH160301
97WLS17-JK-97Udhaywalla, Jammu and KashmirNHZ32.74174.821483266.303622559.80482MH160302
98WLS17-JK-98Quderpur, Jammu and KashmirNHZ32.57874.760477448.503604544.40472MH160303
99WLS17-JHK-99Dhampur, JharkhandNEPZ23.92186.134411867.502645781.20171MH160304
100WLS17-JHK-100Kannu, JharkhandNEPZ24.24787.255525877.302681561.20181MH160305
101WLS17-JHK-101Jharkhandi, JharkhandNEPZ24.34486.026401160.802692643.80191MH160306
102WLS17-JHK-102Deoghar, JharkhandNEPZ24.47186.693468928.002706386.60201MH160307
103WLS17-JHK-103Gola-Baniyatu, JharkhandNEPZ23.49885.660363208.902599294.30211MH160308
104WLS17-JHK-104Mahadebganj, JharkhandNEPZ24.34387.584559211.902692359.80221MH160309
105WLS17-JHK-105Machut, JharkhandNEPZ25.06387.620562485.702772018.10231MH160310
106WLS17-HP-106Lohar ghat, Himachal PradeshNHZ31.19076.838675153.003452112.90501MH160311
107WLS17-UK-107Kashipur, UttrakhandNHZ29.20378.968302468.503232131.60511MH160312
108WLS17-UP-108Ballia, Uttar PradeshNWPZ28.20079.361339100.503120415.90941MH160313
109WLS17-UP-109Sultanpur, Uttar PradeshNWPZ29.16079.062311507.303227320.80951MH160314
110WLS17-UP-110Mirzapur, Uttar PradeshNWPZ25.13382.551656350.602780566.60961MH160315
111WLS17-UP-111Jagdishpur, Uttar PradeshNWPZ26.74980.542454446.602958723.20971MH160316
112WLS17-UP-112Lucknow, Uttar PradeshNWPZ26.76680.977497740.902960550.80981MH160317

CZ, Central zone; NEPZ, North East Plain zone; NHZ, North hill zone; NWPZ, North Western Plain zone; HG, Haplotype group; HC, Haplotype Count.

Sampling details of Ustilago segetum tritici (UST) isolates used in the study. CZ, Central zone; NEPZ, North East Plain zone; NHZ, North hill zone; NWPZ, North Western Plain zone; HG, Haplotype group; HC, Haplotype Count.

Genomic Preparations

For genomic DNA extraction, isolates of UST were transferred, using a sterile needle, from Petri dishes to 100 ml Erlenmeyer flasks containing potato dextrose yeast (PDY) broth. The cultures were grown for 10 days in an orbital shaker (150 rpm) at 25 ± 1°C. The fungal mat harvested on sterile Whatman filter paper, frozen in liquid nitrogen, and ground to fine powder with pestle and mortar. Cetyl trimethyl-ammonium bromide (CTAB) method was used to extract genomic DNA as described by Kumar et al. (2013b). DNA was quantified by recording absorbance at 260 nm and determined purity by calculating the ratio of absorbance at 260 nm to that of 280 nm. The concentration of DNA was adjusted to 50 ng μl−1 for PCR analysis.

RNA Polymerase II Second Largest Subunit (RPB2) Gene Amplification and Sequence Analysis

A portion of the RNA polymerase II second largest subunit (RPB2) gene was amplified for all the 112 UST isolates using polymerase chain reaction (PCR) with primer RPB2F (5′- AACCACCGATTTGGAGCAGT-3′) and RPB2R (5′- ACTCATTAGATGGCGGGGAGA-3′). The primers were designed using the NCBI accession number DQ846896.1. PCR amplification was performed in Q cycler 96 (Hain Lifescience, UK). Each PCR reaction mixture (50 μl) consisting of 50 ng template DNA, 1.5 μM of each primer, 1.5 mM MgCl2, 0.2 mM of each deoxynucleotides, 1.5 unit of Taq DNA polymerase (New England Biolabs, USA), and final volume of 50 μl was maintained by adding distilled water. The amplification was done as initial denaturing at 94°C for 4 min, followed by 35 cycles at 94°C for 1 min, annealing at 55°C for 1 min, extension at 72°C for 1 min and a final extension at 72°C for 7 min. The amplified PCR products were separated on 1% agarose gel, and the desired specific band was purified by Wizard SV Gel and PCR Clean-Up System (Promega, Madison, WI, USA) according to the manufacturer's instruction. DNA was sequenced commercially at the Eurofins Genomics India Pvt. Ltd. (Bangalore, India). A consensus sequence was obtained from the sequencing of both forward and reverse strands, and further data quality were checked using Chromas 2.32 (Technelysium Pty. Ltd.). BlastN search programme was used to compare the sequences available in National Center for Biotechnology Information (NCBI) databases. The sequences were aligned using MEGA 7 (Kumar et al., 2016) and gaps and missing data were not considered for phylogenetic analysis. Evolutionary tree was drawn using neighbor-joining (NJ) method (Saitou and Nei, 1987) and evolutionary distances were determined by Kimura (1980). The nucleotide sequences of RPB2 gene were submitted to NCBI GenBank.

Microsatellite Genotyping

For the amplification of each microsatellite marker gradient PCR was performed to select the best annealing temperature. PCR amplifications were performed in Q cycler 96 (Hain Lifescience, UK) in a total volume of 10 μl containing Promega™ PCR Master Mix, additional 0.5 mM MgCl2, 0.05–0.15 μM forward primer, 0.05–0.15 μM reverse primer, and 1.0 μl template DNA (50 ng μl−1). PCR cocktail without template DNA was taken as control. PCR was programmed for initial denaturation at 94°C for 4 min; followed by 35 cycles of denaturation at 94°C for 60 s, annealing at 51, 52, 53, 54, and 55°C for 1 min, and extension at 72°C for 1 min; with a final extension step at 72°C for 7 min. PCR products were separated in 4% agarose gel stained with ethidium bromide along 100 bp ladder (Genei, Bangalore) to know the polymorphism. Test isolates were scored on the basis of amplification and non-amplification of SSR markers. The numbers of varoius alleles per locus, effective alleles per locus, private alleles and Shannon's Information Index were computed for each population using GenAlEx 6.5 (Peakall and Smouse, 2012). The polymorphic information content (PIC) value for each SSR markers was calculated using the formula: where k is the total number of alleles detected for a microsatellite; Pi is the frequency of the ith allele in the set.

Population Structure and Gene Flow

The presence (1) and absence (0) of desired amplicom for each SSR marker in all the 112 isolates were treated as binary characters and was analyzed using the NTSYS-pc program V2.1 (Rohlf, 2002). All the isolates were grouped in different clusters using Un-weighted Pair Group Method with Arithmetic average [UPGMA; (Yu et al., 2006)] in the SAHN subprogram. Dice similarity coefficient based on the proportion of shared alleles with the SIMQUAL was used to know the genetic similarity between isolates. Analysis of molecular variance (AMOVA) based on SSR markers was calculated using GenAlEx 6.5 to know the genetic diversity in different populations [Table 1; (Peakall and Smouse, 2012)]. The fixation index (Fst) of the total populations and pairwise Fst among all pairs of populations were calculated to investigate population differentiation, and significance was tested based on 1,000 bootstraps. Gene flow among populations was calculated based on the number of migrants per generation (Nm) using the formula, Population structure analysis was executed with STRUCTURE 2.3.4 (Pritchard et al., 2000) using microsatellite loci data. The optimum number of populations (K) was selected by testing K = 1 to K = 15 using five independent runs of 25,000 burn-in period length at fixed iterations of 100,000 with a model allowing for admixture and correlated allele frequencies. The optimum number of population was predicted using the simulation method of Evanno et al. (2005) in STRUCTURE HARVESTER version 0.6.92 (Earl and vonHoldt, 2012). The K value was determined by the log probability of data [Ln P(D)] based on the rate of change in LnP(D) between successive K. Bottleneck v1.2 was used to determine whether there was an excess (a recent population bottleneck) or deficit (a recent population expansion) in H (gene diversity) relative to the number of alleles present in UST populations (Piry et al., 1999). To determine whether loci displayed a significant excess or deficit in gene diversity Sign and Wilcoxon significance tests (WT) were performed (Cornuet and Luikart, 1996). The Tajima's D value was estimated using the DnaSP program (Tajima, 1983). Number of haplotypes, number of segregating sites, and the π and Φw measures of nucleotide diversity for each population were determined with clone-corrected samples whereas, Nei's haplotype diversity (Hd) was also determined before sample clone-corrected. The value of π and Φw represents the average number of pairwise nucleotide differences and the total number of segregating sites in a set of DNA sequences, respectively. MULTILOCUS version 1.31 was used to measure linkage disequilibrium among SSR loci using the index of association (IA) and rBarD index (Agapow and Burt, 2001). Tests of departure from random mating for both indices were done with 10,000 randomizations of the complete and clone-corrected MLH dataset. To dissect the recombination in UST populations, the proportion of compatible pairs of loci (PrCP) was determined using MULTILOCUS v.1.31 (Agapow and Burt, 2001). The null hypothesis of random mating was rejected if more compatible loci than expected in a randomized population were observed (P < 0.05).

Results

Gene Sequence Analysis and Haplotypic Diversity

The second largest subunit of RNA polymerase II (RPB2) gene sequences of all the 112 UST isolates were compared with those from known species of Ustilago available in the NCBI database (Figure 1). The results confirmed that all the 112 isolates are Ustilago segetum tritici. Sequences were deposited at NCBI GenBank, and got accession numbers (Table 1). Phylogenetic analysis of RPB2 gene from 112 isolates turn out from different wheat growing zones showed that they are related to each another and divergent from the isolate reported from Canada (Figure 1). Analysis of the sequences of the RPB2 gene loci in the 112 isolates by DnaSP version 5.10 identified a total of 98 haplotypes (Table 1). Eighty three haplotypes were represented by single isolate and the remaining 15 haplotypes had minimum two isolates (Table 1). Fifteen haplotypes, which contained at least two isolates, were found only in NWPZ, CZ and NHZ. Haplotype 47 (WLS17-JK-93 and WLS17-JK-98) and haplotype 48 (WLS17-JK-94 and WLS17-JK-97) were collected from the NHZ while haplotype 53 (WLS17-HR-3, WLS17-HR-22 and WLS17-UP-88) and haplotype 55 (WLS17-HR-5, WLS17-UP-25 and WLS17-UP-34) were identified in NWPZ population only (Table 1). However, haplotype 24 (WLS17-UP-87 and WLS17-HR-20, WLS17-UK-30, WLS17-HP-78, and WLS17-JK-95) and haplotype 43 (WLS17-PUN-69 and WLS17-HP-85) were shared by NWPZ and NHZ populations. Similarly, CZ (WLS17-PUN-2) and NWPZ (WLS17-MH-14) shared haplotype 1, while haplotype 14 (WLS16-WB-11 and WLS17-HR-13) shared between NWPZ and NEPZ. NHZ and CZ shared haplotype 4 (WLS17-UK-41 and WLS17-MH-37). All the four populations displayed low nucleotide diversity (>0.0001) and haplotype diversity (>0.9905) (Table 2). CZ and NWPZ population possessed the highest haplotype diversity (1.000), while NEPZ population showed highest nucleotide diversity (0.00456) (Table 2). Five mutations were detected in entire populations (Table 2). NEPZ population showed highest number of singleton mutations. Further, estimates of DNA divergence between populations based on RPB2 gene sequence analysis indicated that mutation events shared between UST population of CZ and NHZ and CZ and NWPZ (Table S1).
Figure 1

A neighbor-joining tree of RNA polymerase II gene (RPB2) sequences of the 112 isolates of UST from wheat growing states of India.

Table 2

DNA polymorphism data for Ustilago segetum tritici (UST) isolates based on RPB2 gene sequence comparisons.

DNA polymorphism parameterPopulationsOverall
CZNEPZNHZNWPZ
Number of sequences11123356112
Selected region analyzed1–6871–6871–6871–6871–687
Number of polymorphic sites (S)24235
Number of mutations (Eta)24335
Total number of singleton mutations, Eta(s)14134
Number of haplotypes (h)1112295198
Haplotype (gene) diversity (Hd)1.01.00.99050.99550.9965
Nucleotide diversity (Pi)0.002460.004560.001390.000530.00053
Theta (per sequence) from Eta0.682831.324560.739190.653080.945
Theta (per site) from Eta0.003300.006430.003570.003220.00471
Number of nucleotide differences (k)0.509090.939390.287880.107140.107
Tajima's D−0.77815−1.02271−1.39934−1.68380−1.85385
Fu and Li's D* test statistic (FLD*)−0.33034−0.45895−0.29252−3.09074−3.28229
Fu and Li's F* test statistic (FLF*)−0.45160−0.68098−0.71183−3.10485−3.20471
Fu's Fs statistic (FFs)−0.6590.560−2.415−4.521−6.254
Minimum number of recombination events (Rm)00000
A neighbor-joining tree of RNA polymerase II gene (RPB2) sequences of the 112 isolates of UST from wheat growing states of India. DNA polymorphism data for Ustilago segetum tritici (UST) isolates based on RPB2 gene sequence comparisons. The population statistic parameters revealed statistically negative values of Tajima's D (−1.68380 to −0.77815) in UST populations of different wheat growing zones and provided evidence that the dominance of purifying selection and population expansion is operating in UST isolates (Table 2). Similarly, the test statistic FLD* and FLF* reflected analogous type of results for UST isolates and highlighted the principle of operation of purifying selection and population size expansion in different wheat growing zones. The statistically significant negative value of FFs statistic except NEPZ further strongly denotes the expansion observed in CZ, NWPZ, and NHZ population (Table 2).

Marker Development, SSRs Polymorphism and Gene Diversity

To evaluate allelic diversity among UST isolates, 35 microsatellite markers were used. Twenty five SSR primer pairs produced clear single amplicons while rest did not amplify. Out of 25 primers, only 16 (Table 3) showed polymorphism and therefore used in genetic diversity analysis of 112 UST isolates originated from four geographical distinct zones of India. The polymorphism of the different SSR loci is presented in Table 3. The alleles per locus varied from two to four and allele size ranged from 170 to 750 bp. The PIC values ranged from 0.3713 to 0.6632. One SSR loci (UST31) was highly informative (PIC ≥ 0.5) and rest all loci were reasonably informative (0.5 < PIC > 0.25). The 16 polymorphic primer pairs revealed a total of 68 alleles across the 34 loci in 112 isolates, ranging from 2 to 4 alleles per isolate (Table 3). The markers were all selectively neutral according to the Ewens-Watterson test (Table S2).
Table 3

Characteristics of sixteen neutral microsatellite markers used in the study for population genetic diversity analysis of Ustilago segetum tritici (UST) isolates.

S. No.Sequence (5-3)MotifAmplicon size (bp)Number of alleleTa (°C)HePIC
1USTF 5: GCTCGTCTACCTCTGCGATACT(GGA)5215–2412550.49840.3742
USTR 5: TCTGCATCTCAATCAACCAATC
2USTF 6: AGACATGCACCGTAACAACAAC(CGG)6180–2002550.49860.3743
USTR 6: TACCCTCCATACTCTTGTCCGT
3USTF 7: AAGCATACTCAAGGCAGGGTAA(CAT)5170–2402540.48540.3676
USTR 7: GTTCTCGGATGGTCTCGTCTAC
4USTF 14: AAAAGTCATCCTCGTTTCGGTA(CCG)6180–2302550.49260.3713
USTR 14: AGATAGGGAAGCAAATCATGGA
5USTF 16: GCCTCTTCATCTCTCTCCTCAC(GAC)6220–2702520.49480.3724
USTR 16: TGACTCTTCTGCATCATATCGG
6USTF 17: TCTTGTGGAGTCTGCTGTTGTT(TGC)5230–2502520.49910.3746
USTR 17: GTAGCTTCAGGTCGCATCACTT
7USTF 23: TCGTGAAAACTAACAGAGCCAA(AAAT)4220–2502520.4980.374
USTR 23: ACACCTATTTGCGTGAAGGAGT
8USTF 25: TACTTCTCCTCCTCCTCCTCCT(TATT)5285–3002520.49970.3748
USTR 25: GAACTCGCAAAGTGGTTTCTCT
9UST 26: AGAGACCAAGTCGAATCCAAAG(CAAGG)6294–3202520.49910.3746
USTR 26: CCTTGCCTACTTCTCCCTACCT
10USTF 27: CATTTCAGTGTTGGACAAGCAT(GTGTCA)4250–3002540.49590.3729
USTR 27: AGAGAGTTTCGTAGTTGGGCAG
11USTF 30: GGTGATTGGAAGACCACAGAAT(AAGCCA)5227–2502550.49880.3744
USTR 30: GTTTTGAACTCTCTGCTTTGGG
12USTF 31: CACAAACACACACACACACACA(GCTCCC)4224–7504540.7170.6632
USTR 31: CTGAACAGTAAAGCCTGAAGGG
13USTF 32: TCCTACATTGGGATGACTGATG(CAACGG)4217–2502520.49750.3737
USTR 32: GACTCGCTTCTTGTTCTTGGTT
14USTF 33: GAAAGAGAGAGGGAGGGAAGAG(GGAGAA)4230–2502530.49910.3746
USTR 33: TGCGTATAGGTATGTGTGGCTT
15USTF 34: GAAGAAAATGCTAGAGCGAAGG(CA)8216–2502550.49730.3737
USTR 34: AGCAGAAGGTGAGAGAGCGTAT
16USTF 36: ATGAGGTCAAGAGTCAGCAACA(GA)8175–2002540.49780.3739
USTR 36: ATTCGTCAAGATGCCTTTCACT
Characteristics of sixteen neutral microsatellite markers used in the study for population genetic diversity analysis of Ustilago segetum tritici (UST) isolates.

Population Genetic Diversity

The genetic diversity indices for diverse four UST populations from different wheat growing zones are depicted in Table 4. The number of different alleles (Na), effective alleles (Ne) and expected heterozygosity (He) averaged across all loci ranged from 1.824 to 2.0, 1.695 to 1.721, and 0.373 to 0.406, respectively for the four different populations (CZ, NEPZ, NHZ, and NWPZ). The NWPZ population had the highest Na values (2.000) while the CZ population had the lowest (1.765). Expected heterogyzosity (He) and effective Alleles (Ne) values were lowest in CZ population (He = 0.373; Ne = 1.695) and the highest in NWPZ population (He = 0.406; Ne = 1.729). Similarly, unbiased gene diversity (uHe) was the lowest in NHZ population (uHe = 0.381) and the highest in NWPZ population (uHe = 0.410), with an average of 0.396. The polymorphic loci (%) fell in the range of 88.24% (NHZ and NEPZ) to 100% (NWPZ), with an average of 88.35% (Table 4). However, NHZ and NWPZ population showed the highest number (17) of polymorphic loci. Information index based on the Shannon's Index (I), the highest diversity was recorded in NWPZ (I = 0.589) population, whereas it was lowest in CZ (I = 0.531) population. Of all the analyzed populations, no private allele was found in entire population. The genetic diversity within populations (Hs) ranged from 0.0501 to 0.4943 with an average of 0.3367 (Table S2), and responsible for 91% of the total genetic diversity (HT = 0.3611). The proportion of the total genetic diversity attributable to the population differentiation (Gst) ranged from 0.0029 to 0.391 with an average of 0.0678 over all loci (Table S2).
Table 4

Summary of the population diversity indices calculated on the basis of SSR markers.

ZoneNNaNeIHeuHePL (%)Number of polymorphic bandsLeast common bands (≤5%)
CZ111.765 ± 0.136*1.695 ± 0.0880.531 ± 0.0630.373 ± 0.0450.391 ± 0.04782.351616
NEPZ121.824 ± 0.1281.721 ± 0.0850.552 ± 0.0560.387 ± 0.0410.403 ± 0.04388.241616
NHZ331.882 ± 0.0811.697 ± 0.0860.536 ± 0.0600.375 ± 0.0440.381 ± 0.04488.241716
NWPZ562.000 ± 0.0001.729 ± 0.0640.589 ± 0.0320.406 ± 0.0270.410 ± 0.027100.001717
Mean1.868 ± 0.0511.710 ± 0.0400.552 ± 0.0270.385 ± 0.0200.396 ± 0.02082.35

N, number of isolates; N.

The values after ± denote the standard errors.

CZ, Central zone; NEPZ, North East Plain zone; NHZ, North Hill zone; NWPZ, North Western Plain zone.

Summary of the population diversity indices calculated on the basis of SSR markers. N, number of isolates; N. The values after ± denote the standard errors. CZ, Central zone; NEPZ, North East Plain zone; NHZ, North Hill zone; NWPZ, North Western Plain zone.

Population Genetic Structure and Gene Flow

The AMOVA analysis comparing the four populations showed that 1% of the total variance was distributed among zones. A relatively higher proportion of the variation (91 %) was distributed within UST isolates (Table 5). Genetic variation among wheat growing zones (Fst = 0.012), isolates (Fis = 0.079) and within isolates (Fit = 0.090) was mentioned in Table 5. Pairwise Fst values of the genetic distance between different populations were low but significant (P < 0.01; Table 6).
Table 5

Summary of the analysis of molecular variance (AMOVA) results for 112 isolates of Ustilago segetum tritici (UST) (n = 112).

SourcedfSSMSEstimated variancePercentage %F-StatisticsValueP
Among population316.4575.4860.0391%Fst0.0120.606
Among individuals108388.0483.5930.2638%Fis0.0790.019
Within individuals112343.5003.0673.06791%Fit0.0900.606
Total223748.0043.369100%

Df, degree of freedom; SS, sum of squared observations; MS, mean of squared observations; F.

Table 6

Measure of the pair-wise comparisons of genetic distance (Fst), genetic flow (Nm) and Nei's unbiased genetic identity for Ustilago segetum tritici (UST) isolates collected from the four zones of India.

Population 1Population 2FstNmNei's unbiased genetic identity
SSR lociRPB2 gene
NEPZCZ0.0160.04514.9890.9358
NEPZNWPZ0.0090.0414.9340.944
NEPZNHZ0.0370.0456.4990.9514
CZNWPZ0.0000.0000.000.9907
CZNHZ0.0480.00029.1310.9781
NHZNWPZ0.0000.0000.000.9947

CZ, Central zone; NEPZ, North East Plain zone; NHZ, North hill zone; NWPZ, North Western Plain zone.

Summary of the analysis of molecular variance (AMOVA) results for 112 isolates of Ustilago segetum tritici (UST) (n = 112). Df, degree of freedom; SS, sum of squared observations; MS, mean of squared observations; F. Measure of the pair-wise comparisons of genetic distance (Fst), genetic flow (Nm) and Nei's unbiased genetic identity for Ustilago segetum tritici (UST) isolates collected from the four zones of India. CZ, Central zone; NEPZ, North East Plain zone; NHZ, North hill zone; NWPZ, North Western Plain zone. The average gene flow among populations (Nm) was ranged from 0.00 (between NWPZ and NHZ; between NWPZ and CZ) to 29.131 (between CZ and NHZ). Pairwise estimates of gene flow (Nm) indicated that Nm value was more than 1.0 in most of the population pairs suggesting gene flow between populations, although with different magnitude except NWPZ and NHZ and NWPZ and CZ (Table 6). The highest value was observed among pair of CZ and NHZ populations (Nm = 29.131; Fst = 0.048) followed by CZ and NEPZ pair (Nm = 14.989; Fst = 0.016). When population of NEPZ was compared with other populations for genetic distance (Fst) on the basis of SSR (0.009–0.037) and RPB2 gene loci (0.041–0.045) and gene flow (Nm = 4.934–14.989) indicating these populations are differentiated with low gene flow. Pair wise comparison of NHZ and NWPZ recorded the highest indices of Nei's unbiased genetic identity (0.9947), while CZ and NHZ showed lowest levels of genetic identity (0.9358) (Table 6). The dendrogram based on unweighted Neighbor-joining method grouped all the 112 isolates representing four populations into two major clusters (Figure 2). Among these, 100 and 12 isolates were grouped in cluster 1 and cluster 2. The grouping by UPGMA using genetic distances do not showed any spatial clustering among the different geographic zones (Figure 2). Several subgroups within cluster 1 were observed irrespective to populations, indicating genetic variability within and among isolates in each population. The similarity coefficient of overall isolates averaged 0.50. The substructure analysis for genetic relationship among UST isolates, excluding loci with null alleles, showed a clear ΔK peak at K = 2 (ΔK = 113.96) (Figure 3A) and K = 2 was the most likely value thus revealed that all individuals grouped into two major clusters (Figure 3B).
Figure 2

Unweighted Neighbor-joining tree using the simple matching similarity coefficient based on 16 microsatellite markers for the 112 isolates of UST isolated from wheat spikes in India.

Figure 3

Cluster analyses of UST populations from four different wheat growing zones of India (results from Structure v2.2). (A) Each isolate is represented by a bar, divided into K colors, where K = 2 is the number of clusters assumed. Individuals are sorted according to Q, the inferred clusters; (B) Magnitude of ΔK calculated for each level of K. Maximum ΔK indicates the most likely number of UST populations (K = 2).

Unweighted Neighbor-joining tree using the simple matching similarity coefficient based on 16 microsatellite markers for the 112 isolates of UST isolated from wheat spikes in India. Cluster analyses of UST populations from four different wheat growing zones of India (results from Structure v2.2). (A) Each isolate is represented by a bar, divided into K colors, where K = 2 is the number of clusters assumed. Individuals are sorted according to Q, the inferred clusters; (B) Magnitude of ΔK calculated for each level of K. Maximum ΔK indicates the most likely number of UST populations (K = 2).

Linkage Disequilibrium and Population Expansion

Linkage disequilibrium analysis was performed to infer the reproductive strategy. The values of IA (0.991–2.034) and rbarD indices (0.066–0.134) in the association tests differed significantly from zero in all the UST populations (Table 7). UST isolates sampled from different wheat growing zones rejected the null hypothesis of gametic equilibrium, this shows that isolates in each zone were not under random mating (Table 7).
Table 7

Estimation of linkage disequilibrium by the index of association (IA) and the unbiased (rd) statistic of Indian Ustilago segetum tritici (UST) isolates.

PopulationIArBarDP-Value
CZ1.3990.092< 0.001
NEPZ2.0340.134< 0.001
NHZ0.9910.066< 0.001
NWPZ1.2810.082< 0.001
Total1.1690.075< 0.001

Index of association (IA) and rBarD, a modified index of association (1); P-values were estimated from 1,000 randomizations and are identical for IA and rBarD.

Estimation of linkage disequilibrium by the index of association (IA) and the unbiased (rd) statistic of Indian Ustilago segetum tritici (UST) isolates. Index of association (IA) and rBarD, a modified index of association (1); P-values were estimated from 1,000 randomizations and are identical for IA and rBarD. The results depicted in Table 8 shows that all the UST populations evolved through stepwise mutation method (SMM). The sign tests in Bottleneck revealed a significant H deficit in 1 of the 13 loci under IAM (P = 0.00027), TPM (P = 0.00070) and SMM (P = 0.00132) model of evolution, indicating recent population expansion in CZ. Similarly, in NHZ and NWPZ population, significant H deficit in 1 of the 14 loci under IAM, TPM, and SMM model of evolution were observed (Table 8).
Table 8

Comparison of observed genotypic diversity (H) and expected genotypic diversity (HEQ) at mutation-drift equilibrium based on the observed number of alleles with infinite number of alleles.

ZoneTestParameterIAMTPMSMMMode
CZSTNumber of loci with heterozygosity excess expected6.356.867.22Shifted
Loci with heterozygosity deficiency or excess1/13 (P = 0.00027)1/13 (P = 0.00070)1/13 (P = 0.00132)
SDTT2 values (Probability)4.024 (P = 0.00003)3.492 (P = 0.00024)3.168 (P = 0.00077)
WTOne tail for H deficiencyP = 0.99997P = 0.99994P = 0.99991
One tail for H excess:P = 0.00006P = 0.00009P = 0.00015
Two tails for H excess and deficiency:P = 0.00012P = 0.00018P = 0.00031
NEPZSTNumber of loci with heterozygosity excess expected6.536.827.22Shifted
Loci with heterozygosity deficiency or excess0/14 (P = 0.00002)1/13 (P = 0.00064)1/13 (P = 0.00130)
SDTT2 values (Probability)5.195 (P = 0.00000)4.575 (P = 0.00000)4.182 (P = 0.00001)
WTOne tail for H deficiencyP = 1.00000P = 0.99997P = 0.99997
One tail for H excess:P = 0.00003P = 0.00006P = 0.00006
Two tails for H excess and deficiencyP = 0.00006P = 0.00012P = 0.00012
NHZSTNumber of loci with heterozygosity excess expected6.326.757.10Shifted
Loci with heterozygosity deficiency or excess1/14 (P = 0.00005)1/14 (P = 0.00012)1/14(P = 0.00024)
SDTT2 values (Probability)5.158 (P = 0.00000)4.635 (P = 0.00000)4.197 (P = 0.00001)
WTOne tail for H deficiencyP = 0.99997P = 0.99995P = 0.99962
One tail for H excess:P = 0.00005P = 0.00008P = 0.00050
Two tails for H excess and deficiencyP = 0.00009P = 0.00015P = 0.00101
NWPZSTNumber of loci with heterozygosity excess expected6.016.446.81Shifted
Loci with heterozygosity deficiency or excess1/14(P = 0.00003)1/14 (P = 0.00006)1/14 (P = 0.00014)
SDTT2 values (probability)5.703 (P = 0.00000)5.161 (P = 0.00000)4.728 (P = 0.00000)
WTOne tail for H deficiencyP = 0.99998P = 0.99998P = 0.99998
One tail for H excess:P = 0.00003P = 0.00003P = 0.00003
Two tails for H excess and deficiencyP = 0.00006P = 0.00006P = 0.00006

CZ, Central zone; NEPZ, North East Plain zone; NHZ, North Hill zone; NWPZ, North Western Plain zone; ST, Sign rank Test; SDT, Standardized Differences Test; WT, Wilcoxon Test; IAM, Infinite allele model; TPM, Two Phase Model; SMM, Stepwise Mutation Model of Evolution.

Probability of deviation from mutation-drift equilibrium (H>HEQ) was determined according to Cornuet and Luikart (.

Comparison of observed genotypic diversity (H) and expected genotypic diversity (HEQ) at mutation-drift equilibrium based on the observed number of alleles with infinite number of alleles. CZ, Central zone; NEPZ, North East Plain zone; NHZ, North Hill zone; NWPZ, North Western Plain zone; ST, Sign rank Test; SDT, Standardized Differences Test; WT, Wilcoxon Test; IAM, Infinite allele model; TPM, Two Phase Model; SMM, Stepwise Mutation Model of Evolution. Probability of deviation from mutation-drift equilibrium (H>HEQ) was determined according to Cornuet and Luikart (. The bottleneck analysis supported for the non-existence of any bottleneck in UST populations in recent past. The concept of heterozygosity excess works on the principle that the observed gene diversity is higher than the expected equilibrium gene diversity (Heq) in a recently bottlenecked population (Table 8). In CZ population, sign rank test under IAM mutation model, expected number of loci with heterozygotic excess was 6.35 while the observed number of loci with heterozygosity excess was 13 (Table 8). The expected and observed loci with heterozygosity excess calculated by using TPM and SMM models were 6.86 and 7.22, respectively. Similarly, the outcome for IAM, TPM, and SMM supported the absence of any bottleneck in CZ population. Similar trends were also observed for NHZ, NEPZ, and NWPZ population using TPM and SMM. Although, one locus with heterozygosity deficiency (P = 0.00002) was also observed in NEPZ population using IAM model. The SDT provided the T2 statistics equal to 4.024, 3.492, and 3.168 for IAM, TPM, and SMM, respectively in CZ population. The probability values were significant for IAM (P = 0.00003), SMM (p = 0.00024), and TPM (P = 0.00077). Thus, null hypothesis was accepted by all the models. The probability values with WT for one tail for H excess under three models IAM (p = 0.00006), TPM (p = 0.00009), and SMM (p = 0.00015) indicated acceptance of null hypothesis under all the models in CZ population. Thus, all the three tests (ST, SDT, and WT) indicated the acceptance of mutation drift equilibrium (P > 0.05) in UST populations under all the mutation models for all CZ, NHZ, NEPZ, and NWPZ populations. Another powerful test of qualitative graphical method based on the allele frequency spectra detected a normal L-shaped curve, where the alleles with the lowest frequencies (0.03–0.3) were found to be most abundant in the entire wheat growing zones (Figure S2).

Discussion

Loose smut is a monocyclic internally seed borne disease. The seed borne inocula leads to long distance rapid spread of the disease across the entire wheat growing zones of India. The Ustilago segetum tritici isolates were collected from four major wheat growing zones (NWPZ, NEPZ, CZ, and NHZ) of India. The genetic structure was analyzed by performing RPB2 gene sequence comparison and fingerprinting with newly developed SSR markers. The degree of nucleotide difference in the RPB2 region in UST populations is low. It may be due to action of concerted evolution leads to homogenizing effect. Furthermore, evidence of recombination (Rm = 0) in entire UST population was not detected. The low but significant Fst values (< 0.01 and < 0.05) and pair wise population differentiation among UST population from different zones indicate low genetic differentiation in the total populations (Fst = 0.012). The UST populations appear either of common origin or limited distribution, reproduces predominantly by asexual means, or experience substantial gene flow (from CZ to NHZ and NEPZ and later from NEPZ to NWPZ and NHZ) coupled with genetic drift. However in populations where mutation rates are high, Fst tends to fall back to zero (in case of CZ) as novel alleles are added to the population (Onaga et al., 2015). This happened because of negative dependence of Fst on diversity (Charlesworth et al., 1997) and has been reported in several pathogens; even in the absence of asexual reproduction (Couch et al., 2005). To outwit this prejudice due to mutation rates, FST has been compared with other genetic diversity indices. Analysis of molecular variance (AMOVA) showed that 91% of the total variation was due to differences among isolates within populations and the variation among populations reflected only 1% of the total variation. A low degree of differentiation among UST populations may be due to admixture among isolates from the different geographic regions. These results are also unswerving with the low Shannon's indices (0.531–0.589). The overall Shannon's index (I = 0.552) suggests that more than 50% of the genetic diversity explained by the differences between isolates. Therefore, all these results conclude that most of the genetic variation (91%) was distributed among isolates across the regions. The similar findings have been earlier reports in Ustilago maydis (Valverde et al., 2000), Mycosphaerella graminicola (Boeger et al., 1993), Phytophthora infestans (Goodwin et al., 1994), Rhynchosporium secalis (McDonald et al., 1999), and Rhizoctonia solani (Goswami et al., 2017) while analyzing their populations structure. It is worth mention here that the fungal isolates are mostly similar at the genetic level despite long distances among different wheat growing zones in India. Haplotype analysis performed in present study provides information on the number of haplotypes (h), their frequency and diversity, and genetic distances within and between RPB2 gene sequence. The Hd can range from zero to 1.0, which means no diversity to high levels of haplotype diversity (Nei and Tajima, 1981). In present study, Hd (0.104–0.473) values indicated low to moderate levels of diversity in different wheat growing zones. NWPZ revealed the maximum diversity based on the number of haplotypes (i.e., 51 haplotypes from 56 UST isolates). Few haplotypes were shared among different populations indicating role of asexual reproduction and long-distance dispersal. Contrary to this, sexual reproduction occurs at the site of infection on the spike, was apparently prevalent in Indian wheat growing areas. Besides this, two other reasons may explain the genetic variability among isolates, first there might be possibility of multiple founder populations that result in the admixture of populations. Secondly, population genetic expansion may took place due to the accumulation of different alleles in UST populations as evident from the total and shared mutations noticed in the presents study. All these are agreeable, as evidenced by the high levels of population admixture identified in Structure analysis. Mutation generates diverse regional populations of UST, creating a pool of mutants from which new, virulent isolates can emerge. Many haplotypes were shared among these populations supports multiple introductions of the same haplotype, which could be due to pronounced asexual reproductive phase, as the case with UST. Furthermore, the analysis revealed that all UST populations are admixed and contain haplotypes from multiple populations within a region or between the regions. To investigate the role of evolutionary forces on UST population, different neutrality test statistics (Tajima's D, Fu and Li's D* and F*, and Fu's Fs) were performed to examine the RPB2 sequence data for departure from neutrality. Significant and negative Tajima's D test statistics indicated that RPB2 locus is experiencing population bottlenecks, where the population is largely uniform and only a few sequences compose the new population. The biology of UST and its colonization could serve as a source of population bottlenecks. Similarly, significant and negative value of almost all D* and F* test statistics showed strong purifying selection. Overall, the RPB2 gene data displayed genetic divergence in the structure of the population among the four wheat growing zones analyzed and well-supported by the results from microsatellite loci. Moreover, 16 newly developed SSR markers used were polymorphic on all of the UST isolates. These results are comparable to earlier reports on SSR markers developed for other plant pathogens (Yang and Zhong, 2008; Pouzeshimiab et al., 2014). Thus, these SSR markers could be useful tool to study the population biology and genetics of this fungus at global level. The loose smut fungus is carried as dormant mycelium within healthy appearing seed and is spread by growing infected seed. Moreover, teliospores are easily shaken from the smutted heads and may be carried for long distances by wind, insects, or other agencies (Ram and Singh, 2004). The low differentiation among regions (8% of total variation) detected on the basis of microsatellites can be explained by different ways. Firstly, the level of gene flow (Nm) is sufficient to maintain genetic similarity. The low levels of population differentiation were observed in corresponding high values of Nm. Nm is >1, reveals little differentiation among populations, and under such circumstances migration is more important than genetic drift (McDermott and McDonald, 1993). Theoretically, average gene flow value (Nm = 21) indicates that 21 isolates would need to be exchanged each generation among populations of different regions to achieve current degree of similarity. Highest gene flow was recorded between the CZ and NHZ (Nm = 29.131) populations. Regular gene flow and random mating among isolates from various populations could result in new pathotypes with improved pathological and biological fitness traits (Mishra et al., 2006). Besides this, inbreeding coefficient (Fit = 0.090) indicated little genetic differentiation across UST populations. The corroborating results were also observed in American populations of P. nodorum (Fst = 0.004; Stukenbrock et al., 2006), North American population of Mycosphaerella graminicola (Fst = 0.08) (Zhan et al., 2003) and Septoria musiva populations (Fst = 0.20) in north-central and northeastern North America (Feau et al., 2005). In UST populations, SSR data provided no discrete clustering in different populations on the basis of structure analysis and little among-populations variance (8%) was observed in AMOVA. Therefore, no specific demarcation of genetic grouping was noticed, and results further suggest large and widespread populations with high migration rates facilitated by wind-dispersed teliospores and frequent exchange and long distance transport of infected seed material in different wheat growing zones. In present study, the plausible reasons leading to the structuration of the regional collection were not elucidated since no clear main direction of gene flow among the sampled sites, and no significant isolation by distance (P = 0.49; > 0.05) were observed. The genetic identities of the four populations evaluated in present study were close to 1. The moderate Gst (−0.0678) values indicated weak genetic differentiation and minimal geographic clustering among the populations from four different zones and yielded average Nm values 6.875 across all loci and populations, suggesting that the level of gene flow was approximately seven times greater than that needed to prevent populations from diverging by genetic drift. Moreover, absence of private alleles in all the four zone population may indicate that the observed migration levels reflect gene flow. The wind direction in India is generally from North West to North East during the wheat cropping season. This may cause migration and gene flow between populations lead to admixture among isolates from the diverse geographic origin as observed in present study. The pathogen is cosmopolitan in distribution, and telisopores are known to be disseminated over long distances by wind (Ram and Singh, 2004). Besides this, long-distance gene flow in UST was man mediated, and subsequent natural gene flow gradually reduced the isolation by distance. Mutation is the main evolutionary mechanism that generates polymorphisms, and its implications to disease management are well-known (Jolley et al., 2005). For UST, point mutations (4 mutations) in the sequence of the RPB2 gene resulted in the introduced of new alleles in the population. Similar observations have been documented by Lourenço et al. (2009), while studying the molecular diversity and evolutionary processes of Alternaria solani, a seed borne pathogen in Brazil inferred using genealogical and coalescent approaches. However, the relatively low number of singleton mutation estimated in present study for RPB2 locus in different wheat growing zones does not signify low mutation rate in whole genome. Therefore, authors felt that the evaluation of other housekeeping genes or genomic regions should be analyzed for more accurate quantification of mutation occurrence in UST populations. Further, bottleneck analysis based on three models (IAM, TPM and SMM) indicated that the observed heterozygosity excess (He) found less than the expected excess heterozygosity (Hee) in all the four wheat growing zones. Thus, the lower magnitude of He with their respective Hee reflect absence of genetic bottleneck in UST populations. Further, the negative TD-value of the UST population indicates that the UST population is undergoing demographic expansion. Further support for this hypothesis is gained from lack of private alleles in UST populations collected from all wheat growing zones. The knowledge of population genetic structure of a pathogen provides information on its potential to overcome host genetic resistance (McDonald and Linde, 2002). The results of present study showed that UST have a clonal genetic structure with limited differentiation between populations. It means variability is mainly contributed by mutation and recombination is uncommon. Therefore, wheat disease management measures, such as replacement of infected seed and fungicide-treated seeds, could help to reduce UST severity and limit gene flow. Host resistance is also economical and effective to manage loose smut of wheat (Singh, 2018), and breeding efforts in different wheat growing zones have put emphasis on exploration of more resistance sources and other gene pools to fill this gap. In addition, screening of germplasm and breeding material against genetically diverse isolates needs to be emphasized to develop durable and effective resistance cultivars. In nutshell, the current study presents a first stab to comprehend the genetic variation within and among populations of UST causing loose smut in different wheat growing regions. The results highlight that microsatellite markers can be used to analyze genotypic and genetic diversity of populations of UST.

Author Contributions

The work was conceived and designed by PK and SK. The sampling survey was performed by PK, SK, PJ, and DS. Experiments were conducted by RT, PK, and RK. Data analysis was done by PJ and RT. The manuscript was drafted by PK and SK. The final editing and proofing of manuscript was done by DS and GS. RK and RT contributed equally. The manuscript was approved by all the authors.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  3 in total

1.  Molecular diversity, haplotype distribution and genetic variation flow of Bipolaris sorokiniana fungus causing spot blotch disease in different wheat-growing zones.

Authors:  Prem Lal Kashyap; Sudheer Kumar; Anju Sharma; Ravi Shekhar Kumar; Sunita Mahapatra; Noyonika Kaul; Annie Khanna; Poonam Jasrotia; Gyanendra Pratap Singh
Journal:  J Appl Genet       Date:  2022-08-05       Impact factor: 2.653

2.  Identification of Novel Microsatellite Markers to Assess the Population Structure and Genetic Differentiation of Ustilago hordei Causing Covered Smut of Barley.

Authors:  Prem Lal Kashyap; Sudheer Kumar; Ravi Shekhar Kumar; Rahul Tripathi; Palika Sharma; Anju Sharma; Poonam Jasrotia; Gyanendra Pratap Singh
Journal:  Front Microbiol       Date:  2020-01-15       Impact factor: 5.640

3.  Genetic Diversity and Population Structure of Head Blight Disease Causing Fungus Fusarium graminearum in Northern Wheat Belt of India.

Authors:  Noyonika Kaul; Prem Lal Kashyap; Sudheer Kumar; Deepti Singh; Gyanendra Pratap Singh
Journal:  J Fungi (Basel)       Date:  2022-08-05
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.