| Literature DB >> 23316438 |
Viola A Manning1, Iovanna Pandelova, Braham Dhillon, Larry J Wilhelm, Stephen B Goodwin, Aaron M Berlin, Melania Figueroa, Michael Freitag, James K Hane, Bernard Henrissat, Wade H Holman, Chinnappa D Kodira, Joel Martin, Richard P Oliver, Barbara Robbertse, Wendy Schackwitz, David C Schwartz, Joseph W Spatafora, B Gillian Turgeon, Chandri Yandava, Sarah Young, Shiguo Zhou, Qiandong Zeng, Igor V Grigoriev, Li-Jun Ma, Lynda M Ciuffetti.
Abstract
Pyrenophora tritici-repentis is a necrotrophic fungus causal to the disease tan spot of wheat, whose contribution to crop loss has increased significantly during the last few decades. Pathogenicity by this fungus is attributed to the production of host-selective toxins (HST), which are recognized by their host in a genotype-specific manner. To better understand the mechanisms that have led to the increase in disease incidence related to this pathogen, we sequenced the genomes of three P. tritici-repentis isolates. A pathogenic isolate that produces two known HSTs was used to assemble a reference nuclear genome of approximately 40 Mb composed of 11 chromosomes that encode 12,141 predicted genes. Comparison of the reference genome with those of a pathogenic isolate that produces a third HST, and a nonpathogenic isolate, showed the nonpathogen genome to be more diverged than those of the two pathogens. Examination of gene-coding regions has provided candidate pathogen-specific proteins and revealed gene families that may play a role in a necrotrophic lifestyle. Analysis of transposable elements suggests that their presence in the genome of pathogenic isolates contributes to the creation of novel genes, effector diversification, possible horizontal gene transfer events, identified copy number variation, and the first example of transduplication by DNA transposable elements in fungi. Overall, comparative analysis of these genomes provides evidence that pathogenicity in this species arose through an influx of transposable elements, which created a genetically flexible landscape that can easily respond to environmental changes.Entities:
Keywords: ToxA; ToxB; anastomosis; copy number variation; histone H3 transduplication; wheat (Triticum aestivum)
Mesh:
Substances:
Year: 2013 PMID: 23316438 PMCID: PMC3538342 DOI: 10.1534/g3.112.004044
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1 Phylogeny of Pyrenophora tritici-repentis and symptoms induced by the three sequenced isolates. RAxML maximum likelihood phylogenetic tree (best tree -nL1505566.555282) inferred from a 100-protein superalignment comprising 93,210 amino acid positions (A). Subphyla (-mycotina) and classes (-mycetes) of the phylum Ascomycota are shown and numbers near nodes are bootstrap partitions. (B) ToxA and ToxC symptoms induced by BFP-ToxAC (on Glenlea and 6B365, respectively; top 2 leaves), ToxB symptoms induced by DW7-ToxB (on 6B662; middle leaf), and the resistant reaction produced by the nonpathogenic SD20-NP (on Auburn; bottom leaf).
Pyrenophora tritici-repentis genome assembly statistics
| Assembly statistics | BFP-ToxAC |
|---|---|
| Coverage | 6.93 (6.19 q > 20) |
| Assembly size | 37.84 Mb |
| Total contig length | 37.21 Mb |
| Scaffolds | 47 |
| Scaffold N50 | 1.99 Mb |
| Contigs | 703 |
| Contig N50 | 115.53 Kb |
| Q40 | 96.51% |
| Linkage groups | 11 |
| GC content | 50.98% |
| Protein-coding genes | 12,141 |
| tRNA genes | 129 |
q, quality score.
Pyrenophora tritici-repentis Illumina sequencing statistics
| Sequencing Statistics | DW7-ToxB | SD20-NP |
|---|---|---|
| Average depth | 62X | 111X |
| Maximum depth | 2894X | 5623X |
| Number PE reads mapped | 34,248,040 | 67,243,074 |
| PE reads mapped | 93% | 85% |
| PE pairs mapped | 98% | 97% |
| SNP found | 86% | 84% |
| Homozygous SNP | 7429 | 73,190 |
| SNPs/bp | 5008 | 508 |
| Multiallellic SNP | 1250 | 6648 |
| Small indels found | 71% | 71% |
| Small indels | 1843 | 4282 |
| Multiallellic indels | 377 | 1660 |
SNP, single-nucleotide polymorphism; PE, paired-end.
Figure 2 Mapping of sequence reads of resequenced isolates relative to the reference genome of Pyrenophora tritici-repentis. Schematic represents the reference genome scaffolds (gray bars with supercontig numbers) mapped to each chromosome as defined by the reference optical map. The border box reflects chromosome size as indicated next to the chromosome number and white spaces between the scaffolds represent gaps in the sequence assemblies. Repeat density (green) in the reference isolate, and read (gray central panel) and SNP (top panel) densities of the Illumina-sequenced isolates (DW7-ToxB-red, SD20-NP-blue), were mapped per 10 kb of the high-quality genome assembly based on the reference isolate (BFP-ToxAC) of P. tritici-repentis.
Figure 3 Repeat identity in the genome sequence of Pyrenophora tritici-repentis. The graph represents a comparison of repeat similarities between Neurospora crassa, Stagonospora nodorum, Magnaporthe oryzae, and Pyrenophora tritici-repentis. An identity was assigned to each repetitive sequence based on its similarity to other repeats in the genome (naturally grouped into one repeat family). The number shown on the y-axis is a collection of all repeats identified in the genome sequences (minimal 400 bp, 50% identity) and binned into groups according to the sequence identity present in that repeat family.
Repetitive content of Pyrenophora tritici-repentis isolates
| BFP-ToxAC | % Shared | ||||
|---|---|---|---|---|---|
| Repeat | Total Length, nt | % of Repetitive | % of Genome | DW7-ToxB | SD20-NP |
| DNA transposon | 2,295,943 | 36.4 | 6.1 | 46.5 | 16.5 |
| MITEs | 74,049 | 1.2 | 0.2 | 0.8 | 1.1 |
| LTR retrotransposon | 2,553,686 | 40.5 | 6.7 | 28.1 | 20.6 |
| Non-LTR retrotransposon | 405,556 | 6.4 | 1.1 | 7.2 | 26.3 |
| Unknown TEs | 685,925 | 10.9 | 1.8 | 17.4 | 35.5 |
| Others (genes) | 159,636 | 2.5 | 0.4 | ND | ND |
| Simple repeats | 84,534 | 1.3 | 0.2 | ND | ND |
| Low complexity | 50,671 | 0.8 | 0.1 | ND | ND |
| Total | 6,310,000 | 16.7 | |||
Represent nt covered by custom repeat library. MITE, miniature inverted-repeat transposable element; LTR, long terminal repeat; TE, transposable element; ND, not determined.
Figure 4 Conidia of Pyrenophora tritici-repentis connected by anastomosis bridges. (A) Phase-contrast microscopy of conidial anastomosis in Ptr. (B) The same conidia shown in A stained with DAPI and visualized under an ultraviolet filter to image nuclei; Note the loss of nuclei from the upper cell. (C) Nomarski optics (D) and fluorescence microscopy of anastomosed conidia of Ptr expressing green fluorescent protein. Arrows indicate the anastomosis bridges. Scale bars = approximately 5 µm.
Figure 5 Transduplication of a histone H3-like protein family in the genome of Pyrenophora tritici-repentis. (A) Two representatives of a transduplicating family of DNA transposable elements in the BFP-ToxAC reference genome are shown with the characteristic motifs of the elements indicated (brown box = terminal inverted repeats (TIR); green box = hAT transposon coding regions; purple box = hAT dimerization motif; blue box = central open reading frame (ORF) coding region; yellow box = histone H3 coding region; black boundary box = full-length element). The large, likely autonomous element is approximately 5.6 kb in length. Black lines between elements illustrate a pathogen-specific recombination event that results in a smaller element of approximately 2.3 kb. The graphs show coverage of reads obtained from the pathogenic DW7-ToxB and nonpathogenic SD20-NP isolates. Arrows and lines below the element representation indicate alignments of ESTs detected in various libraries (arrows = poly(A) tails, light gray = introns). (B) Alignment of novel histone H3-like (H3L) proteins identified in Ptr with bona fide histone H3 variants from Ptr, S. nodorum, and N. crassa. Four variants of full-length H3L exist in Ptr, all are different by a single aa change (underlined; also see Table S9). Two changes in H3L alleles result in frameshift mutations, yielding 5′ and 3′ truncated versions of H3L. Most (21) copies are identical to PTRG_00559.1 in amino acid and DNA sequence. Completely conserved residues are shown in black, similar residues in green and variable residues in gray.
Figure 6 Absence of the 145-kb ToxA-containing region in the pathogenic DW7-ToxB and non-pathogenic SD20-NP isolates of Pyrenophora tritici-repentis. Schematic includes the protein-coding regions (purple) and repeats (green) present within the 170-kb ToxA-containing genomic region in the BFP-ToxAC reference genome. The graphs show coverage of reads obtained from the pathogenic DW7-ToxB and nonpathogenic SD20-NP isolates. Coverage depth per nucleotide (nt) is indicated on the left and fold coverage is indicated on the right. Blue shading represents synteny between genomic scaffolds of Cochliobolus heterostrophus isolate C4, SD20-NP, DW7-ToxB, and flanking regions of the 170-kb ToxA-containing region of the reference. Similarly colored bars within the scaffolds indicate colinear blocks.
Figure 7 Read mapping to and de novo assembly of ToxB- and toxb-containing loci in the genome of Pyrenophora tritici-repentis. Schematic of Illumina sequence reads (line graph) of isolate DW7-ToxB mapped to the ToxB1 locus (top: ToxB1 locus; accession number: AY425480.1) and of SD20-NP mapped to the toxb locus (bottom: toxb locus; accession number: AY083456.2). Coverage depth per nt is indicated on the left and fold coverage is indicated on the right. Straight lines above the graphs depict the contigs present in the de novo assemblies of the Illumina-sequenced isolates. The arrow on the contig above the toxb locus shows how that contig extends beyond the locus.
Putative Pyrenophora tritici-repentis nonribosomal peptide synthetase genes
| Locus | Best Hit | Repeated domains | Assoc. with TE | EST | Present (SNP) | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Org. | Acc. | (%id/sim) | Rec. | DW7-ToxB | SD20-NP | |||||
| Conserved | ||||||||||
| PTRG_08276 | AAX09984.1 | ( | Yes | NPS2 (62/76) | No | No | Yes | Yes (1) | Yes (2) | |
| PTRG_01800 | AAP78735.1 | (77/87) | Yes | NPS4 (72/84) | No | No | No | Yes (1) | Yes (37) | |
| PTRG_01683 | ABI51982.1 | (77/86) | Yes | NPS6 (76/86) | No | No | No | Yes (1) | Yes (10) | |
| PTRG_00447 | AAX09992.1 | ( | Yes | NPS10 (91/96) | No | No | Yes | Yes (0) | Yes (3) | |
| PTRG_03139 | AAX09994.1 | ( | Yes | NPS12 (70/80)) | No | No | No | Yes (2) | Yes (10) | |
| PTRG_09101 | XP_001791762.1 | (72/80) | Yes | No | No | Yes | Yes (1) | Yes (39) | ||
| PTRG_09808 | XP_001797376.1 | (60/70) | Yes | No | No | No | Yes (0) | Yes (0) | ||
| PTRG_12015 | ACZ66258.1 | (39/56) | Yes | HTS1 (35/51) | No | No | Yes | Yes (2) | Yes (18) | |
| Hybrids | ||||||||||
| PTRG_00649 | XP_367899.2 | (37/50) | No | No | Yes | No | Yes (1) | Yes (6) | ||
| PTRG_04244 | XP_001905191.1 | (49/66) | No | No | No | Yes | Yes (7) | Yes (76) | ||
| NPS pathogen-specific diversification | ||||||||||
| PTRG_10128 | AAX09983.1 | ( | No | NPS1- w/o 5′&3′ (61/70) | No | Yes | No | Divergent | No | |
| PTRG_10433 | XP_001226467.1 | (28/43) | No | Yes | Yes | Yes | Divergent | No | ||
| PTRG_10437 | 10433 ( | Yes | Yes | No | No | No | ||||
| PTRG_11759 | AAX09988.1 | ( | Yes | NPS6 – w/o - 3′ (73/84) | No | Yes | No | No | No | |
| PTRG_11809 | 10433 ( | Yes | Yes | Yes | No | No | ||||
| PTRG_11818 | 11836 ( | No | Yes | Yes | No | No | ||||
| PTRG_11836 | 11818 ( | No | Yes | Yes | No | No | ||||
Ab, Alternaria brassicae; Acc., accession number; Cg, Chaetomium globosum; Ch, Cochliobolus heterostrophus; Cm, Cochliobolus miyabeanus; Fi, Fusarium incarnatum; Mg, Magnaporthe grisea; Org., organism; Pa, Podospora anserina; Rec., reciprocal best blast hit; Sn, Stagonospora nodorum; SNP, single-nucleotide polymorphism.
Best hit BlastP (Altschul ) NCBI.
SNP predicted in maq.
Comparison by Needle in EMBOSS (Needleman and Wunsch, 1970, Rice ); % identity/similarity.
At least two domains in the protein share >96% similarity.
Associated with transposable elements (TE).
Present in putative cluster.
See Cochliobolus NPS column.
Other members in cluster have EST support.
P. Martinez and L. Ciuffetti, unpublished data.
Apparent duplication in P. tritici-repentis genome.
Figure 8 NRPSs diversification in the genome of Pyrenophora tritici-repentis. Schematic shows the modular architecture of Cochliobolus heterostrophus NRPS1 (black solid bar and associated modules) and related P. tritici-repentis PtNRPSs (gray solid bars and associated modules). Brackets with straight arrows define modules in C. heterostrophus NRPS1 that have homologous and/or related regions present in PtNRPSs. The module domains (adenylation, thiolation, condensation, methyltransferase, and epimerization), and the solid bars that represent the entire protein are drawn to scale. Thin lines between gray bars extend to the region(s) of identity between proteins and the curved arrow depicts a recent duplication event.
Figure 9 Predicted secreted proteins in the genome sequence of Pyrenophora tritici-repentis. (A) Distribution of secreted proteins in the reference Ptr isolate (BFP-ToxAC) and the Illumina-sequenced pathogenic (DW7-ToxB) and nonpathogenic (SD20-NP) isolates. (B) PCR screen of pathogenic and nonpathogenic isolates for the presence of the gene and transcript for PTRG_11888, a predicted putative secreted protein. Pathogenic isolates: BFP-ToxAC; ASC1; 86-124; D308; DW2; SO3-P; Non-pathogenic isolates: SD20-NP; 90-2; 98-31-2 (see also Table S1).
The most abundant annotations in the predicted secretome of Pyrenophora tritici-repentis
| Gene Ontology (GO) and Pfam Terms | ||
|---|---|---|
| GO ID/Accession No. | GO Term/Accession Description | No. Loci with Term |
| Molecular function | ||
| GO:0004553 | Hydrolase activity, hydrolyzing O-glycosyl compounds | 112 |
| GO:0050660 | FAD binding | 43 |
| GO:0020037 | Heme binding | 37 |
| GO:0004497 | Monooxygenase activity | 34 |
| GO:0009055 | Electron carrier activity | 32 |
| GO:0003676 | Nucleic acid binding | 18 |
| GO:0016614 | Oxidoreductase activity, acting on CH-OH group of donors | 18 |
| GO:0004601 | Peroxidase activity | 16 |
| GO:0004091 | Carboxylesterase activity | 16 |
| GO:0004180 | Carboxypeptidase activity | 15 |
| GO:0008270 | Zinc ion binding | 13 |
| GO:0005515 | Protein binding | 13 |
| GO:0016810 | Hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds | 12 |
| GO:0030248 | Cellulose binding | 12 |
| GO:0004252 | Serine-type endopeptidase activity | 12 |
| GO:0016829 | Lyase activity | 12 |
| GO:0008237 | Metallopeptidase activity | 11 |
| GO:0016757 | Transferase activity, transferring glycosyl groups | 11 |
| Biological processes | ||
| GO:0055114 | Oxidation reduction | 65 |
| GO:0006508 | Proteolysis | 61 |
| GO:0006066 | Alcohol metabolic process | 24 |
| GO:0044267 | Cellular protein metabolic process | 18 |
| GO:0051704 | Multiorganism process | 16 |
| GO:0006629 | Lipid metabolic process | 16 |
| GO:0051591 | Response to cAMP | 14 |
| GO:0044262 | Cellular carbohydrate metabolic process | 14 |
| GO:0009308 | Amine metabolic process | 14 |
| GO:0034645 | Cellular macromolecule biosynthetic process | 13 |
| GO:0006139 | Nucleobase, nucleoside, nucleotide and nucleic acid metabolic process | 13 |
| GO:0043581 | Mycelium development | 12 |
| GO:0045493 | Xylan catabolic process | 11 |
| GO:0016998 | Cell wall macromolecule catabolic process | 11 |
| Pfam domains | ||
| PF00067 | Cytochrome p450 | 30 |
| PF03443 | Glycosyl hydrolase family 61 | 26 |
| PF01565 | FAD binding domain | 25 |
| PF08031 | Berberine and berberine-like | 21 |
| PF00732 | GMC oxidoreductase (binds FAD as a cofactor) | 18 |
| PF05199 | GMC oxidoreductase | 16 |
| PF00187 | Carbohydrate Binding Module 18 | 16 |
| PF00135 | Carboxylesterase | 15 |
| PF00082 | Subtilase | 11 |
| PF00734 | Carbohydrate Binding Module 1 (cellulose-binding domain) | 11 |
| PF04389 | Peptidase Family M28 (metalloprotease) | 10 |
| PF00657 | GDSL family of esterases/lipases | 10 |
| PF01083 | Cutinase (cutin + H2O ↔ cutin monomers) | 10 |
Values represent the lowest node per branch (>10 sequences) of a directed acyclical graph of the hierarchical structure of GO terms.
e-value cutoff for Pfam analysis E < 0.1.