| Literature DB >> 32355611 |
Hai D T Nguyen1, Tahera Sultana1,2, Prasad Kesanakurti1,3, Sarah Hambleton1.
Abstract
Tilletia species cause diseases on grass hosts with some causing bunt diseases on wheat (Triticum). Two of the four species infecting wheat have restricted distributions globally and are subject to quarantine regulations to prevent their spread to new areas. Tilletia indica causes Karnal bunt and is regulated by many countries while the non-regulated T. walkeri is morphologically similar and very closely related phylogenetically, but infects ryegrass (Lolium) and not wheat. Tilletia controversa causes dwarf bunt of wheat (DB) and is also regulated by some countries, while the closely related but non-regulated species, T. caries and T. laevis, both cause common bunt of wheat (CB). Historically, diagnostic methods have relied on cryptic morphology to differentiate these species in subsamples from grain shipments. Of the DNA-based methods published so far, most have focused on sequence variation among tested strains at a single gene locus. To facilitate the development of additional molecular assays for diagnostics, we generated whole genome data for multiple strains of the two regulated wheat pathogens and their closest relatives. Depending on the species, the genomes were assembled into 907 to 4633 scaffolds ranging from 24 Mb to 30 Mb with 7842 to 9952 gene models predicted. Phylogenomic analyses confirmed the placement of Tilletia in the Exobasidiomycetes and showed that T. indica and T. walkeri were in one clade whereas T. controversa, T. caries and T. laevis grouped in a separate clade. Single copy and species-specific genes were identified by orthologous group analysis. Unique species-specific genes were identified and evaluated as suitable markers to differentiate the quarantine and non-quarantine species. After further analyses and manual inspection, primers and probes for the optimum candidate genes were designed and tested in silico, for validation in future wet-lab studies.Entities:
Keywords: Comparative genomics; High throughput next generation sequencing; Phylogenomics; Tilletia caries; Tilletia controversa; Tilletia indica; Tilletia laevis; Tilletia walkeri
Year: 2019 PMID: 32355611 PMCID: PMC7184893 DOI: 10.1186/s43008-019-0011-9
Source DB: PubMed Journal: IMA Fungus ISSN: 2210-6340 Impact factor: 3.515
Genome statistics of Tilletia strains compared in this study
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|
| DAOMC culture number | DAOMC 236408 | DAOMC 236414 | DAOMC 236416 | DAOMC 236422 | DAOMC 238049 | DAOMC 236426 | DAOMC 238052 | DAOMC 238032 | DAOMC 238040 | ATCC 42080 |
| Isolation source |
|
|
|
|
|
|
|
|
|
|
| Isolation location | India | Pakistan | Pakistan | Oregon, USA | Oregon, USA | Ontario, Canada | Ontario, Canada | Idaho, USA | Australia | Washington, USA |
| Sequencing method | Illumina MiSeq | Illumina MiSeq | Illumina HiSeq | Illumina HiSeq | Illumina MiSeq | Illumina HiSeq, MiSeq, PacBio | Illumina MiSeq | Illumina HiSeq | Illumina MiSeq | Illumina MiSeq |
| Number of scaffolds | 2555 | 4624 | 3113 | 1387 | 907 | 3741 | 4243 | 4633 | 4501 | 3961 |
| Largest scaffold size (bp) | 158,844 | 75,904 | 161,394 | 260,724 | 330,910 | 114,021 | 81,512 | 72,392 | 101,287 | 101,260 |
| N50 in scaffolds | 30,694 | 11,729 | 23,649 | 45,552 | 79,486 | 14,874 | 12,786 | 11,482 | 11,569 | 13,920 |
| Gaps per 100 kb | 41.79 | 14.33 | 54.93 | 22.62 | 20.08 | 96.12 | 43.23 | 80.78 | 101.48 | 31.02 |
| GC (%) in scaffolds | 54.8 | 55.0 | 54.9 | 54.9 | 54.9 | 56.5 | 56.7 | 56.7 | 56.6 | 56.6 |
| Total length (Mb) in scaffolds | 29.7 | 29.0 | 29.0 | 24.0 | 24.3 | 29.9 | 28.6 | 28.1 | 28.3 | 28.8 |
| Coveragea | 51x | 45x | 38x | 75x | 46x | 67x | 45x | 52x | 67x | 35x |
| Number of gene models | 9410 | 9677 | 9664 | 7881 | 7842 | 9887 | 9649 | 9952 | 9651 | 9799 |
| BUSCO (complete single copy)b | 98% | 93% | 96% | 97% | 98% | 95% | 96% | 92% | 93% | 95% |
aDetermined by mapping reads with Bowtie2 and analysis with Qualimap v2.1
bBUSCO analysis on scaffolds ≥1000 bp running on the fungal dataset
List of genomes used for phylogenetic analysis in current study
| Phylum | Name | Source of Genome Data | Reference |
|---|---|---|---|
| Basidiomycota | JGI MycoCosm | Kijpornyongpan et al. | |
| Basidiomycota | JGI MycoCosm | Morin et al. | |
| Basidiomycota | JGI MycoCosm | Floudas et al. | |
| Basidiomycota | JGI MycoCosm | Kijpornyongpan et al. | |
| Basidiomycota | NCBI (BCIO01000001.1) a | Manabe et al. unpublished | |
| Basidiomycota | JGI MycoCosm | Spatafora et al. unpublished | |
| Basidiomycota | NCBI (BCHO01000001.1) a | Manabe et al. unpublished | |
| Basidiomycota | JGI MycoCosm | Kijpornyongpan et al. | |
| Basidiomycota |
| JGI MycoCosm | Xu et al. |
| Basidiomycota | JGI MycoCosm | Gioti et al. | |
| Basidiomycota | JGI MycoCosm | Kijpornyongpan et al. | |
| Basidiomycota | NCBI (BCJU01000001.1) a | Manabe et al. unpublished | |
| Basidiomycota | JGI MycoCosm | Spatafora et al. unpublished | |
| Basidiomycota | JGI MycoCosm | Toome et al. | |
| Basidiomycota | JGI MycoCosm | Lorenz et al. | |
| Basidiomycota | JGI MycoCosm | Lorenz et al. | |
| Basidiomycota | JGI MycoCosm | Konishi et al. | |
| Basidiomycota | NCBI (MAIP00000000.1) a | Geiser et al. | |
| Basidiomycota | JGI MycoCosm | Cantu et al. | |
| Basidiomycota | NCBI (MJEU00000000.1) a | Geiser et al. | |
| Basidiomycota | JGI MycoCosm | Schirawski et al. | |
| Basidiomycota | NCBI (CP010913.1) a | Taniguti et al. | |
| Basidiomycota | JGI MycoCosm | Kijpornyongpan et al. | |
| Basidiomycota | current study | ||
| Basidiomycota | current study | ||
| Basidiomycota | current study | ||
| Basidiomycota | NCBI (LAXH01000001.1) a | Wang et al. | |
| Basidiomycota | current study | ||
| Basidiomycota | current study | ||
| Basidiomycota | current study | ||
| Basidiomycota | current study | ||
| Basidiomycota | current study | ||
| Basidiomycota | current study | ||
| Basidiomycota | current study | ||
| Basidiomycota | JGI MycoCosm | Toome et al. | |
| Basidiomycota | JGI MycoCosm | Kijpornyongpan et al. | |
| Basidiomycota | JGI MycoCosm | Floudas et al. | |
| Basidiomycota | JGI MycoCosm | Laurie et al. | |
| Basidiomycota | JGI MycoCosm | Kämper et al. | |
| Basidiomycota | JGI MycoCosm | Kijpornyongpan et al. | |
| Basidiomycota | JGI MycoCosm | Zajc et al. | |
| Ascomycota | JGI MycoCosm | Galagan et al. | |
| Ascomycota | JGI MycoCosm | Goffeau et al. | |
| Ascomycota |
| JGI MycoCosm | Wood et al. |
| Blastocladiomycota | JGI MycoCosm | Nordberg et al. | |
| Blastocladiomycota | JGI MycoCosm | James et al. unpublished | |
| Blastocladiomycota | JGI MycoCosm | Mondo et al. | |
| Chytridiomycota | JGI MycoCosm | James et al. unpublished | |
| Chytridiomycota | JGI MycoCosm | James et al. unpublished | |
| Chytridiomycota | JGI MycoCosm | Haitjema et al. | |
| Cryptomycota | JGI MycoCosm | James et al. | |
| Mucoromycota | JGI MycoCosm | Uehling et al. | |
| Mucoromycota | JGI MycoCosm | Corrochano et al. | |
| Mucoromycota | JGI MycoCosm | Corrochano et al. | |
| Mucoromycota | JGI MycoCosm | Tisserant et al. | |
| Mucoromycota | JGI MycoCosm | Ma et al. | |
| Zoopagomycota | JGI MycoCosm | Mondo et al. | |
| Zoopagomycota | JGI MycoCosm | Chang et al. | |
| Zoopagomycota | JGI MycoCosm | Chang et al. |
a Denotes genomes that we annotated in this study because no annotations were available at the time of our analyses
Number of unique single copy genes of each Tilletia species after the all versus all BLAST verification step
| Number of single copy genes unique to a given species | |||
|---|---|---|---|
| Species | Strains considered | at the nucleotide level | at the amino acid level |
|
| DAOMC 238032 | 377 | 72 |
|
| DAOMC 236426, DAOMC 238052 | 35 | 2 |
|
| DAOMC 238040, ATCC 42080 | 13 | 1 |
|
| DAOMC 236416, DAOMC 236408, DAOMC 236414 | 61 | 2 |
|
| DAOMC 236422, DAOMC 238049 | 65 | 5 |
Best primers and probes sequences for targetting the best genes unique to each species to give amplicon sizes approximately between 100 bp to 150 bp
| Orthogroup putative function | Effector analysisc | |||||||
|---|---|---|---|---|---|---|---|---|
| Species | Orthogroup ID | InterProScana | NCBI blastp nr databaseb | Prediction | Probability | Forward Primer | Probe | Reverse Primer |
|
| OG0010723 | Unknown | hypothetical protein | Non-effector | 0.71 | TGTCTGCTACCTTTCTTTGGGTT | CGGTCAGCATATCTAGCGTCGCAGCCT | ACCTAGTTCGCAGGAAGAATGTT |
| OG0010724 | Unknown | GTPase | Non-effector | 0.69 | TAGTGGATCGACAACGGAAAACT | GTGATGTGGCGAAAACCATCGGGAGCC | AGCTCATCAATCAGCTCGAACAG | |
| OG0010727 | Unknown | hypothetical protein | Non-effector | 0.66 | CTGAAATTGCTGTCATCTGGGTG | ACCAGTCCTCGCCTACCTTGATAGCCA | AACTAGACTCTGGTTAGACGTGC | |
| OG0010729 | Contains signal peptide | hypothetical protein | Effector | 0.68 | TTCTTATCTGCCCTTACCTGCTG | TATCCACGTCACTCAAGCCTTGCGCCT | CAGCCATAACAATCGCATACAGG | |
| OG0010739 | Contains DNA binding domain | XRE family transcriptional regulator | Non-effector | 0.63 | CCAAAGGCGATATGAGCGTCTC | CCTGAAGCTGGTCAAGGCCGGACATCA | CTCGTCGAGGATGTCTTCAATCG | |
| OG0010773 | Contains EamA domain | EamA family transporter | Non-effector | 0.91 | TCCAAATCCAGTAGCCAATGAGG | TCGAGAAGATGGCGGCGTTGTCGATCT | GGAGATTTATGTGCGGACATTGG | |
| OG0010825 | Unknown | DUF2384 domain-containing protein | Non-effector | 0.59 | GCTCAAAACGATACCAGGCATAG | GCGGTTCCACCTTGCTGAGCACCTCTA | CCGCATGGAATCAGATGACATTC | |
| OG0010847 | Unknown | hypothetical protein | Non-effector | 0.81 | CCTGTAGCAACTTCGTCAGATCC | GTTCGCTGCCATCGACGAGCTGCTTTG | AGCAGATCATCGTTCAGAACCTT | |
| OG0010878 | Contains domain associated eicosanoid/glutathione metabolism | MAPEG family protein | Non-effector | 0.90 | CAAGATCAACCAATGGACCTTGC | GAGTGTCGGTGCTGGCAGTTTGATGGC | GGTGAGAAAGAACTCCTCATGCT | |
| OG0010879 | Contains PDZ domain | hypothetical protein | Non-effector | 0.54 | TTCATCTTCAGGCTGACCAAAGT | GCGCCACAGATCGCCGGAATTGTATGA | GGAAGCGAGACGAGCAAATATTC | |
| OG0010908 | Contains signal peptide | hypothetical protein | Effector | 0.65 | TGAACCATCGGCTTTCTTAGTCA | GCTCCCAACGACAGAAGCTGCTCCTCA | GATGTCACTTGCTACGGCTACTA | |
| OG0010935 | Unknown | hypothetical protein | Unlikely effector | 0.51 | GACACTCGACCGCTATCTTCAAG | CCGGAAAAGGGTGCCGTCACTCAGGAA | CCGAAATCAATGCACGGTCATAT | |
| OG0010940 | Unknown | hypothetical protein | Effector | 0.66 | CGTAGAGTCGCCTCGAGAAATTA | TCATGGGCCAATCCTCCAATGCGAACA | GGAGCTTATCTGTAACCGCAAAA | |
| OG0010947 | Unknown | hypothetical protein | Unlikely effector | 0.50 | GATAGCCTTGATAATGCCGATGC | GAGATCCGAGACTGCCCGGCTGACGTT | TATAATCGACGATGAGCTACGGC | |
| OG0011035 | Unknown | No significant hits found | Non-effector | 0.74 | GATTCAAAATCGCGGGACAGTAC | ATCAGGGCGAGCCTGTGATACACGGTG | TCCAAAGATCAAGCTGGCGATTA | |
| OG0011043 | ABC-2 transporter | ABC transporter permease | Non-effector | 0.89 | CATCCATTGTTGATGCTGCTCTT | CTGACGTCGATCACCTTCAGCCTCGCT | CAGGCTCAATTGCTCGAAATTCT | |
| OG0011044 | Aspartic peptidase | TIGR02281 family clan AA aspartic protease | Unlikely effector | 0.55 | CCTGAGTTCATCCGTCTGCATC | GCAGCGAACTCAGGCTCGACAAGAAGC | GATCATGCGAACCAATGTCGAG | |
|
| OG0009908 | Unknown | hypothetical protein | Effector | 0.69 | CTTGACTCTCCAGTGGTCGAAG | GGAACTCCCGTCAAGACAACAGCGACG | TTCACGTACTACCTCAAGGACAG |
|
| OG0009501 | Unknown | hypothetical protein | Effector | 0.68 | CGCAAACTGGAATGGACCATACT | ATTCTACGCGCAACGGTCTGCAAGAGG | GTTGTAAACTCGATCCGCATTCG |
| OG0009526 | Unknown | No significant hits found | Non-effector | 0.94 | GTGAGGAATTACGATCTGTCCCC | TACCCGATGAGATGCGTTCGGCTGGTC | GATCCCACATACCAGAGTTCCAG | |
|
| OG0009272 | Unknown | hypothetical protein | Non-effector | 0.92 | GAGGACCTTCAAGATCTGACAGG | ACACCTAGGCCACTCCCTATCCAGCCA | CTGATGATCTTGCCCGGTTTTAC |
|
| OG0010415 | Contains signal peptide | hypothetical protein | Unlikely effector | 0.53 | TCAACTACTTCGACTCCTCCTCC | CTTCCGTGATCCCGTCAACGTCGGACT | GCGACACCATCCTTAGTTGTGTA |
| OG0010423 | Contains signal peptide | hypothetical protein | Non-effector | 0.94 | ATCTTCTACCCAATCACCTGCAG | GCCCAGCAAGGTGTTTTTGGCCAAGGA | ATAGCATCACTGATGAGCGTCTG | |
| OG0010431 | Contains signal peptide | hypothetical protein | Non-effector | 0.52 | TTTCGATCACAGGACCAAGGATC | ACTTGAGGCCAGGGTGTCACATAGGCG | GCACTTCTACCTTTCTACCCCTT | |
aPutative function determined with Interproscan(https://www.ebi.ac.uk/interpro/search/sequence-search)
bPutative function determined from NCBI’s blastp analyses on the nr database. Only results with alignment scores of > 80 were considered
cEffector probability determined by effectorP 2.0 (http://effectorp.csiro.au)
Fig. 1Mummerplots. Genes predicted from each genome were compared by alignment and the Average Nucleotide Identity (ANI) is shown in the bottom right corner of each comparison
Fig. 2Unique and shared orthogroups between the five Tilletia species from OrthoFinder’s output
Fig. 3ASTRAL greedy consensus cladogram based on analyses of individual bootstrap trees of the 4896 single copy orthologous genes shared between the ten Tilletia strains. Support values show the percentage of bootstrap replicates that contain that branch
Fig. 4ASTRAL greedy consensus cladogram based on analyses of individual bootstrap trees for each of 192 conserved orthologous proteins placing Tilletia in Kingdom Fungi. Support values show the percentage of bootstrap replicates that contain that branch