Literature DB >> 30403741

Annotation and analysis of the mitochondrial genome of Coniothyrium glycines, causal agent of red leaf blotch of soybean, reveals an abundance of homing endonucleases.

Christine L Stone1, Reid D Frederick1, Paul W Tooley1, Douglas G Luster1, Brittany Campos2, Richard A Winegar2, Ulrich Melcher3, Jacqueline Fletcher4, Trenna Blagden4.   

Abstract

Coniothyrium glycines, the causal agent of soybean red leaf blotch, is a USDA APHIS-listed Plant Pathogen Select Agent and potential threat to US agriculture. Sequencing of the C. glycines mt genome revealed a circular 98,533-bp molecule with a mean GC content of 29.01%. It contains twelve of the mitochondrial genes typically involved in oxidative phosphorylation (atp6, cob, cox1-3, nad1-6, and nad4L), one for a ribosomal protein (rps3), four for hypothetical proteins, one for each of the small and large subunit ribosomal RNAs (rns and rnl) and a set of 30 tRNAs. Genes were encoded on both DNA strands with cox1 and cox2 occurring as adjacent genes having no intergenic spacers. Likewise, nad2 and nad3 are adjacent with no intergenic spacers and nad5 is immediately followed by nad4L with an overlap of one base. Thirty-two introns, comprising 54.1% of the total mt genome, were identified within eight protein-coding genes and the rnl. Eighteen of the introns contained putative intronic ORFs with either LAGLIDADG or GIY-YIG homing endonuclease motifs, and an additional eleven introns showed evidence of truncated or degenerate endonuclease motifs. One intron possessed a degenerate N-acetyl-transferase domain. C. glycines shares some conservation of gene order with other members of the Pleosporales, most notably nad6-rnl-atp6 and associated conserved tRNA clusters. Phylogenetic analysis of the twelve shared protein coding genes agrees with commonly accepted fungal taxonomy. C. glycines represents the second largest mt genome from a member of the Pleosporales sequenced to date. This research provides the first genomic information on C. glycines, which may provide targets for rapid diagnostic assays and population studies.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30403741      PMCID: PMC6221350          DOI: 10.1371/journal.pone.0207062

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Coniothyrium glycines (R.B. Stewart) Verkely & Gruyter is a soilborne pathogen that infects soybeans and the perennial soybean, Neonotonia wightii, causing lesions on foliage, petioles, pods and stems and eventual defoliation and premature senescence [1]. C. glycines produces melanized sclerotia that can germinate to either form infectious mycelia or produce pycnidia that in turn produce infectious conidia. The pathogen is spread locally via rain/water splash and human or animal movement, which scatter sclerotia and conidia onto neighboring plants. Leaf drop of infected leaves delivers sclerotia and pycnidia to the soil where they serve as sources of secondary inoculum. Sclerotia may also remain in the soil and restart the cycle of infection in the next growing season. There is no evidence that the fungus is seed-borne, but spread might occur from infected plant debris mixed in with untreated seed or through movement of contaminated soil. The disease red leaf blotch (RLB) occurs predominantly in central and southern Africa [2] and the incidence of the disease has increased concomitantly with increased soybean production in regions where the pathogen is found. Yield losses of up to 50% have been reported in Zambia and Zimbabwe [3][4]. While it does not currently occur within the United States, the ability of sclerotia to survive high temperatures and dry conditions suggest it could survive in soybean growing regions of the southern United States [5]. As a result, the Secretary of Agriculture has determined that C. glycines poses a significant risk to U.S. agriculture, and the pathogen is listed by USDA-APHIS as a Plant Pathogen Select Agent under 7 CFR, part 331 [6][7]. Additionally, while C. glycines has been found to naturally infect only soybean and N. wightii, there is no evidence as to the pathogen’s potential ability to infect other leguminous species, such as cultivated peanut and native, wild legumes that occur in the USA. In the early stages of disease development, RLB may not be readily distinguished from other foliar soybean diseases such as Alternaria leaf spot, brown spot, or target spot. Current methods to identify C. glycines require time-consuming examination of morphological characteristics and temperature requirements. No molecular diagnostic assay currently exists to identify C. glycines. The examination of genomic sequences such as the mtDNA may provide targets for the development of diagnostic tools and also may provide insight into the mechanisms of disease resistance. Phylogenetic analysis of the mtDNA will also be useful to clarify the taxonomy of this fungus. RLB was first observed on soybean in Ethiopia in 1955 and, based on the morphology of the pycnidial state, the causal fungus was identified as Pyrenochaeta glycines [8]. In 1964, Dactuliophora glycines was described as the cause of a leaf spot disease[9], and was subsequently identified as the sclerotial state of P. glycines [10]. Hartman and Sinclair [1] established the genus Pyrenochaeta to accommodate these synanamorphs. The fungus was re-classified as Phoma glycinicola in 2002 based on morphological characteristics[11][12], and most recently was again re-classified as Coniothyrium glycines (R.B. Stewart) Verkely & Gruyter based on sequence analysis of regions of the ITS, SSU, LSU [13]. The mt genomes of only eight other members of the class Dothidiomycete, which includes several economically important plant pathogens such as the wheat pathogen, Stagonospora nodorum, and wheat leaf blotch, Zymoseptoria tritici (M. graminicola), can currently be found in GenBank. Six of these also share membership in the order Pleosporales with C. glycines. Comparison of the mt genome of C. glycines with the mt genome of these other eight fungi may help support or clarify the recent re-classification of C. glycines, as mitochondrial genomes are considered to be effective tools for evolutionary studies because they evolve independently of and at an accelerated rate from nuclear genomes [14][15][16]. This study provides the complete mitochondrial genome of a pathogenic fungus identified as a USDA-APHIS Plant Pathogen Select Agent due to its potential impact on soybean production. Previously, the only genomic data available were specific sequences used in phylogenetic analysis of Phoma and Septoria spp [13][17]. This sequence data may provide targets for the development of a rapid diagnostic assay and will help further clarify the evolving fungal taxonomy of the genus.

Materials and methods

Fungal isolate, library construction, and sequence assembly

C. glycines-infected leaves were collected from soybean at the Rattray Arnold Research Station, Harare, Zimbabwe in March 2005 and shipped to the USDA-ARS Foreign Disease-Weed Science Research Unit at Fort Detrick, MD under Animal and Plant Health Inspection Service permit. Isolate Pg-21 was recovered from the leaves and maintained on 20% V8-juice agar at 20°C in the dark. A 10% V8-juice broth was seeded with agar plugs containing mycelium of Pg-21 and grown for several weeks in the dark at 20°C without shaking. Tissue was collected through vacuum filtration onto Whatman No. 1 filter paper in a Buchner funnel. Total DNA was extracted using the DNeasy Plant Mini kit (Qiagen, Germantown, MD). Culture identification was confirmed through sequencing of ITS fragments. The mt genome was sequenced as part of a whole genome sequencing project with Illumina sequence libraries prepared using Nextera XT. Whole genome 2×300 paired-end sequencing was performed using Illumina MiSeq instrument. Reads were filtered and trimmed using Trimmomatic v.0.32 [18]. The iMetAMOS pipeline v. 1.5[19] was used to optimize de novo assembly and perform quality checks. Elements of the pipeline include FastQC v. 0.10.0; Spades v. 3.1.1; IDBA v. 1.1.1; KmerGenie v. 1.6741; and QUAST v. 2.2 [20][21][22][23][24]. Resulting assemblies were polished using Pilon v. 1.8 [25]. Samtools v. 1.1[26] and BLAST were used to remove low coverage and contaminating contigs. Initial shotgun assembly produced 1431 contigs greater than 1kb in size, with a median size of 11kb and median depth of coverage of 274X. Contig 76 was identified as an outlier with a size of 98,482 bp and average depth of coverage of 1542X. Discontinuous MegaBLAST searches revealed homology with fungal mt genome sequences. Finishing of the mt sequence was performed using CLC Genomics Workbench Genome Finishing Module (Qiagen, Germantown, MD), mapping raw Illumina reads back to contig 76, correcting assembly errors, and extending the contig ends.

Sequence annotation

The MFannot tool (http://megasun.bch.umontreal.ca/cgi-bin/mfannot/mfannotInterface.pl) was used to annotate the mt genome using genetic code 4 [27]. Annotation of open reading frames (ORFs) was reviewed and revised by BLAST homology searches against the NCBI protein database [28]. tRNAs were further evaluated against output from tRNAscan-SE[29], Dogma (Dual Organellar GenoMe Annotator)[30], and ARAGORN [31]. RNAweasel was used to classify identified introns as group I or group II introns [32]. Repeats were identified and analyzed with the Tandem Repeats Finder [33] and Palindrome and Einverted EMBOSS programs [34]. Codon usage for concatenated ORFs of twelve protein-coding genes was determined using the codon usage tool at http://www.bioinformatics.org/sms2/codon_usage.html with genetic code 4 [35]. The physical map of the Coniothyrium mtDNA was constructing using SnapGene Viewer (GSL Biotech; available at snapgene.com). The complete mt sequence of C. glycines isolate Pg-21 has been deposited in GenBank under the accession number MH337273.

Comparative genomics

The complete mt genomes of the eight fungi belonging to the Dothidiomycetes were retrieved from GenBank (Bipolaris cookei, MF784482; Didymella pinodes, NC_029396; Parastagonospora nodorum, NC_009746; Pithomyces chartarum, KY792993; Shiraia bambusicola, NC_026869; Stemphylium lycopersici, KX453765; Zasmidium cellare, NC_030334; and Zymoseptoria tritici, NC_010222.) Mitochondrial gene content and gene order of C. glycines was compared visually to these eight fungi. Nineteen additional complete mt genomes were retrieved from GenBank for a comparison of general features, including size, GC content, core protein coding genes, rRNAs, and tRNAs, and the presence of introns.

Phylogenetic analysis

Amino acid sequences of the twelve protein-coding genes shared in common among 25 fungal mt genomes were each aligned with MUSCLE from EMBL-EBI [36], and amino acids sharing low similarity were removed by Gblocks [37]. Sequences were concatenated using Seaview [38]. A maximum likelihood tree of aligned sequences was constructed with PhyML 3.0 using LG as the evolutionary model [39]. Branch support was assessed using the PhyML default of aLRT test (SH-Like).

Results

Gene content and genome organization

The mt genome of C. glycines is a circular molecule with a length of 98,533 bp (Fig 1). The sequence is AT-rich with an overall G + C content of 29.01%, and 28.9% in the coding regions of the protein-coding genes. The RNA genes had a higher GC content of 35.1% while the intergenic spacers had a lower GC content of 24.8%.
Fig 1

Circular mapping of the mitochondrial genome of Coniothyrium glycines.

Black blocks, grey blocks, hatched blocks, stipled blocks, and bars show, respectively, protein-coding, orfs, rRNA, introns, and tRNA genes. Arrows indicate the direction of transcription.

Circular mapping of the mitochondrial genome of Coniothyrium glycines.

Black blocks, grey blocks, hatched blocks, stipled blocks, and bars show, respectively, protein-coding, orfs, rRNA, introns, and tRNA genes. Arrows indicate the direction of transcription. Protein-coding genes of the mt genome included one gene encoding for ATP-synthase complex F0 subunit (atp6), three cytochrome oxidase subunits (cox1, cox2, cox3), seven nicotinamide adenine dinucleotide ubiquinone oxireductase subunits (nad1-6, nad4L), cytochrome b (cob), one ribosomal protein (rps3), and four hypothetical proteins (orf208, orf284, orf929, and orf1407) (Fig 1 and Table 1). The mt genome also encodes for small and large subunit ribosomal RNAs (rns and rnl) and 30 tRNAs (Fig 1 and Table 1). Genes were transcribed from both DNA strands. The cox1 and cox2 genes were adjacent to each other with no intergenic spacers. Similarly, nad2 and nad3 were adjacent with no intergenic spacers and nad5 is immediately followed by nad4L with an overlap of one base (Fig 1 and Table 1).
Table 1

Gene content of the Coniothyrium glycines mitochondrial genome.

  Codon 
Genetic elementLocation (nt)StartStopSize (nt)Size (aa)
rnljoin: 1–642; 2155–2432; 3861–5119; 8396–94563240
tRNA-Thr9894–9964
tRNA-Met9989–10059
tRNA-Met10065–10137
tRNA-Glu10402–10474
tRNA-Ala10506–10577
tRNA-Phe11092–11164
tRNA-Leu11625–11707
tRNA-Gln11782–11853
tRNA-His11859–11931
tRNA-Met12192–12263
atp6join: 13717–14049; 15413–15853ATGTAG774257
tRNA-Cys15948–16017
tRNA-Phe16300–16372
cox1*join: 16743–16954; 19031–19204; 21606–21712; 23153–23368; 24739–24749; 25790–25936; 27865–27867; 29552–29738; 31122–31189; 32251–32387; 33416–34214ATG*2061687
cox2*join: 34215–34445; 35613–35976; 37630–37688; 39663–39761TTA*TAA753250
rps40260–41591ATGTAA1332443
nad5complement join: 42514–43641; 46586–46792; 47806–47952; 49229–49372; 51255–51680ATGTAA2052683
nad4Lcomplement, join: 51680–51709; 53178–53417ATGTAA26989
tRNA-Phecomplement 53450–53522
orf284complement, 53689–54543ATGTAA855284
cobcomplement, join: 55085–55422; 56540–56872; 59027–59087; 62520–62747; 63283–63328; 64621–64775ATGTAG1161386
tRNA-Valcomplement, 65040–65112
nad1complement, join: 65506–65987; 67161–67376; 69551–69827; 73205–73348ATGTAG1119372
nad4complement, 73557–75110ATGTAA1554517
tRNA-Phecomplement, 75247–75319
nad3complement, 75575–76831ATGTAA1257418
nad2complement, 76832–78580ATGTAA1749582
cox3complement, join: 78714–78890; 79358–79780; 80968–81183ATGTAA816271
orf1407/dpo81506–85729ATATAA42241407
tRNA-Ile85795–85883
orf929/rpocomplement, 85986–88775ATGTAA2790929
tRNA-Arg90683–90753
rns91029–92648
tRNA-Leu93780–93862
tRNA-Tyr94127–94211
tRNA-Asn94286–94356
nad694681–95268
tRNA-Val95545–95617
tRNA-Lys95650–95721
tRNA-Gly96262–96334
tRNA-Asp96337–96408
tRNA-Ser96658–96737
tRNA-Trp96846–96917
tRNA-Ile97024–97095
tRNA-Arg97100–97171
tRNA-Ser97271–97355
orf20897357–97983ATGTAA627208
tRNA-Pro98372–98444    

*Putative polyprotein containing both cox1 & cox2.

*Putative polyprotein containing both cox1 & cox2. Within the intergenic spacers, four open reading frames (orf208, orf284, orf929, and orf1407) were found (Fig 1 and Table 1). Putative functions could be assigned to three of the ORFs: orf1407 encodes a putative DNA polymerase type B, orf929 encodes a putative DNA-dependent RNA polymerase, and orf208 encodes a putative GIY-YIG endonuclease protein. All three showed similarity to relevant sequences in other fungi and possessed conserved domain motifs. Only orf284 contained no conserved motifs and could not be assigned a putative function, but showed similarity to hypothetical proteins from whole genome shotgun sequencing of Bipolaris maydis and B. zeicola. An additional GIY-YIG endonuclease motif was identified in the intergenic spacer between the rnl and atp6. This region showed similarity to endonucleases from other fungi, however no clear ORF could be identified suggesting that this may represent a degenerate endonuclease. Only 14.4% of the mt sequence is comprised of intergenic spacers. Within the intergenic spacers, 10 perfect or near identical tandem repeats were identified ranging in size from 12–62 bp and with 2–5 copies (S1 Table). In addition, fifteen palindromes were identified ranging in size from 10–15 bp. A single inverted repeat of 30 bp was found.

Introns

Introns made up 54.1% of the mt genome with a total of 32 introns identified within 8 of the protein-coding genes and the rnl (Fig 1 and Table 2). Thirty of the introns were classified as group I introns. One intron was classified as a group II intron (intron3 of the rnl) and one intron could not be definitively classified (intron2 of cox2). Eighteen of the identified introns were determined to contain putative intronic ORFs with either GIY-YIG or LAGLIDADG homing endonuclease (HE) motifs. An additional eleven introns showed evidence of truncated or degenerate HE motifs and one possessed degenerate N-acetyl-transferase domains. Only two introns had no identifiable ORFs and BLAST analysis revealed no homology in the NCBI protein database. All putative HEs showed significant similarity to those found in the mt genomes of other fungi and most were identified in other members of the Pezizomycotina subphylum. However, each was unique within C. glycines, showing no similarity to other intronic ORFs within the mt genome.
Table 2

Similarities of complete and truncated intron-encoded ORFs from the Coniothyrium glycines mtDNA to proteins in the non-redundant protein NCBI database (BLASTX <1e-05).

GeneIntronConserved domainE-valueSimilarityAccession
rnlIntron 1GIY-YIG endonuclease truncated1.00E-68Bipolaris cookeiYP_009445537.1
Intron 2GIY-YIG truncated7.00E-117Sclerotinia borealisYP_009072317.1
Intron 3LAGLIDADG endonuclease8.00E-86Chrysoporthe austroafricanaYP_009262060.1
atp6Intron 1LAGLIDADG0.0Bipolaris cookeiYP_009445540.1
cox1Intron 1GIY-YIG0.0Sclerotinia borealisYP_009072328.1
Intron 2LAGLIDADG truncated &2.00E-127Bipolaris cookeiYP_009445534.1
GIY-YIG truncated4.00E-46Chrysoporthe deuterocubensisYP_009262077.1
Intron 3GIY-YIG0.0Bipolaris cookeiYP_009445533.1
Intron 4LAGLIDADG0.0Bipolaris cookeiYP_009445530.1
Intron 5LAGLIDADG7.00E-157Pyronema omphalodesYP_009240548.1
Intron 6LAGLIDADG truncated6.00E-74Wickerhamomyces pijperiYP_008475104.1
Intron 7LAGLIDADG truncated &1.00E-84Juglanconis oblongaATI20220.1
rps3/HE-like fusion protein7e-33Sporothrix sp.ACV41149.1
Intron 8GIY-YIG0.0Juglanconis oblongaATI20221.1
Intron 9LAGLIDADG0.0Bipolaris cookeiYP_009445524.1
Intron 10GIY-YIG2.00E-115Bipolaris cookeiYP_009445523.1
cox2Intron 1GIY-YIG2.00E-98Pestalotiopsis ficiAFP72251.1
Intron 2GIY-YIG2.00E-162Juglanconis juglandinaATI20502.1
Intron 3GIY-YIG0.0Fusarium pseudograminearumCDL73109.1
nad5Intron 1LAGLIDADG1.00E-180Chrysoporthe deuterocubensisYP_009262101.1
Intron 2LAGLIDADG4.00E-142Bipolaris cookeiYP_009445559.1
Intron 3LAGLIDADG0.0Bipolaris cookeiYP_009445560.1
Intron 4LAGLIDADG truncated2.00E-130Annulohypoxylon stygiumYP_008964963.1
nad4LIntron 1LAGLIDADG8.00E-173Sclerotinia sclerotiorumYP_009389052.1
cobIntron 1LAGLIDADG3.00E-26Fusarium culmorumYP_009136823.1
Intron 2----
Intron 3n-acetyl-transferase truncated2.00E-94Stemphylium lycopersiciKNG52863.1
Intron 4GIY-YIG truncated &2E-41Sclerotinia borealisYP_009072335.1
LAGLIDADG truncated2E-133Cryphonectria parasiticaAMX22249.1
Intron 5LAGLIDADG truncated2.00E-122Podospora curvicollaCAB72448.1
nad1Intron 1GIY-YIG truncated &5E-133Chrysoporthe austroafricanaYP_009262069.1
LAGLIDADG0.0Bipolaris cookeiYP_009445498.1
Intron 2LAGLIDADG0.0Juglanconis oblongaATI20217.1
Intron 3GIY-YIG truncated2.00E-33Verticillium sp.ABU24266.1
cox3Intron 1LAGLIDADG truncated4.00E-150Botrytis cinereaAGN49000.1
Intron 2----

A dash indicates no significant similarity of the intron sequence to any entries in the NCBI database.

A dash indicates no significant similarity of the intron sequence to any entries in the NCBI database. The cox1 gene was the most common site for intron insertion, possessing ten of the 32 identified introns. Each of the ten introns also possessed either complete or degenerative putative HEs. Of these ten, only five were found to have high sequence identity to annotated introns found in the same location in the cox1 gene of the other Pleosporales. However, no other member of the Pleosporales possessed all five introns in common. The GIY-YIG HE of intron1 of cox1 showed 87% and 88% nucleotide identity to the corresponding introns of D. pinodes and P. chartarum, respectively. However, there was not a corresponding HE in the mt genomes of the other four Pleosporales species. Likewise, cox1 intron4, containing a LAGLIDADG HE, showed 88% nucleotide identity to the corresponding intron in B. cookei, but was found in no other Pleosporales species. The remaining five introns showed varying degrees of identity with introns from the mt genomes of more distantly related fungi, such as intron8 which showed 85% nucleotide identity with an intron from the corresponding location in Sclerotinia sclerotiorum (S2 Table). The 2041-bp intron2 of cox1 has two regions with partial LAGLIDADG HE domains that showed 95–97% nucleotide identity with the 1208bp intron that occurs in the same position in the cox1 gene of D. pinodes. However, the central 1200 bp region of cox1 intron2 possessed a truncated GIY-YIG HE domain with no significant nucleotide similarity to any other fungus (S2 Table). This central region does show amino acid identity with a GIY-YIG HE located within an intron from the cob gene of the more distantly-related Chrysoporthe deutercubensis (Table 2). While most introns showed nucleotide identity with introns inserted into the same gene in other fungi, nad4L intron1 shared identity with free standing orfs in S. sclerotiorum and P. nodorum. One intron, nad1 intron2, showed no nucleotide identity with other species from the Ascomycota, but rather showed identity with introns from two members of the Basidiomycota. This intron showed identity with an intron from the nad1 gene of Moniliophthora roreri and an intron from the cox1 gene of Fomitopsis palustris.

Codon usage and tRNA genes

Codon usage, summarized in S3 Table, shows a bias towards AT-rich codons, which reflects the high AT content of the C. glycines mt genome. Most protein coding genes start with the canonical translation initiation codon ATG with the exception of cox2 and orf1407, which appear to utilize UUA and AUA start codons, respectively. The preferred stop codon in the mt genome was TAA, occurring in 12 genes. The alternative stop codon TAG occurs in 3 genes. A traditional stop codon could not be identified for cox1. This absence, combined with the location of cox1 adjacent to cox2 with no intergenic spacers, suggested the possibility of a fused cox1-cox2 polyprotein rather than two separate proteins. Thirty tRNAs were identified and twenty of them occurred in two large clusters around the rnl, while five occurred singly between mt genes (Fig 1). The tRNAs occurred on both DNA strands.

Comparative genomics and phylogenetic analysis

Comparison of the mt genome of C. glycines with those from eight other members of the Dothidiomycetes revealed that in all nine species genes are encoded on both mtDNA strands. Comparison also found some conservation of gene order, most notably within the Order Pleosporales (Fig 2). In all nine species, nad4L and nad5 were adjacent, and in all but P. nodorum there are no intergenic spacers but rather a one base pair overlap between the two genes. Within C. glycines and the six members of the Pleosporales, cox1 and cox2 were also adjacent with no intergenic spacers. Three members of the Pleosporales possess a conserved gene block of nad5, nad4L, nad3, and nad2. C. glycines shows the same gene order, however the block is disrupted by insertion of cob, nad1, and nad4 between nad4L and nad3. C. glycines and the other Pleosporales species also lack the atp8 and atp9 genes which are typically found in fungal mt genomes, while both Capnodiales species possess both genes.
Fig 2

Mitochondrial genome rearrangements among Dothidiomycetes.

Asterisk (*) indicates reverse direction of transcription. Each gene is assigned a separate color. Gene order was obtained from GenBank: Bipolaris cookei (MF784482), Didymella pinodes (NC_029396), Parastagonospora nodorum (NC_009746), Pithomyces chartarum (KY792993), Shiraia bambusicola (NC_026869), Stemphylium lycopersici (KX453765), Zasmidium cellare (NC_030334), and Zymoseptoria tritici (NC_010222).

Mitochondrial genome rearrangements among Dothidiomycetes.

Asterisk (*) indicates reverse direction of transcription. Each gene is assigned a separate color. Gene order was obtained from GenBank: Bipolaris cookei (MF784482), Didymella pinodes (NC_029396), Parastagonospora nodorum (NC_009746), Pithomyces chartarum (KY792993), Shiraia bambusicola (NC_026869), Stemphylium lycopersici (KX453765), Zasmidium cellare (NC_030334), and Zymoseptoria tritici (NC_010222). All nine species also exhibit large clusters of tRNA genes around the rnl, and within the Pleosporales tRNA order is maintained as well. The conservation of gene and tRNA order is expanded among the Pleosporales, with six of the seven possessing a nad6-rnl-atp6 gene block with associated conserved tRNA cluster patterns (Table 3). P. chartarum possesses a similar gene block and tRNA cluster pattern, but the atp6 is displaced relative to the other Pleosporales. This conservation of tRNA gene order is carried to a lesser extent to the Capnodiales.
Table 3

Comparison of conserved gene and tRNA cluster patterns flanking the rnl in Coniothyrium glycines and other Dothidiomycetes.

SpeciesOrderFamilytRNA and gene orderbAccession
Coniothyrium glycinesPleosporalesConiothyriaceaeLYN-nad6-VKGDSWIRSP-rnl-TMM-EAFLQHM—atp6MH337273
Bipolaris cookeiPleosporalesPleosporaceaeLYN-nad6-VKGDSWIRSP-rnl-TMM-EAFLQHML-atp6MF784482
Pithomyces chartarumPleosporalesPleosporaceae-YN-nad6-VKGDSWIRSP-rnl-TMMLEAFLQHMKY792993
Stemphylium lycopersiciPleosporalesPleosporaceaeLY—nad6-VKGDSWIRSP-rnl-TMMEAFLQHMNL-atp6KX453765
Didymella pinodesPleosporalesDidymellaceaeLYN-nad6-V—DSWIRSP-rnl-TM EAFLQHM—atp6NC_029396
Shiraia bambusicolaPleosporalesPleosporales incertae sedis—N-nad6-V-GDSWIRSP-rnl-TMMEAFLQHM—atp6NC_026869
Parastagonospora nodorumPleosporalesPhaeosphaeriaceaeLYN-nad6-VKGDSWIRSP-rnl-TMMEAFLQHM—atp6NC_009746
Zasmidium cellareCapnodialesMycosphaerellaceae————GDSWI-SA-rnl—-LEFLQHMVNC_030334
Zymoseptoria triticiCapnodialesMycosphaerellaceae————GDSWI-SP-rnl-MLEAFLYQMHRMNC_010222

aThe tRNA gene order of included organisms is taken from GenBank sequences.

bCapital letters correspond to tRNA genes for: L, Leucine; Y, Tyrosine; N, Asparagine; V, Valine; K, Lysine; G, Glycine; D, Aspartic acid; S, Serine; W, Tryptophan; I, Isoleucine; R, Arginine; P, Proline; T, Threonine; M, Methionine; E, Glutamic acid; A, Alanine; F, Phenylalanine; L, Leucine; Q, Glutamine; H, Histidine.

aThe tRNA gene order of included organisms is taken from GenBank sequences. bCapital letters correspond to tRNA genes for: L, Leucine; Y, Tyrosine; N, Asparagine; V, Valine; K, Lysine; G, Glycine; D, Aspartic acid; S, Serine; W, Tryptophan; I, Isoleucine; R, Arginine; P, Proline; T, Threonine; M, Methionine; E, Glutamic acid; A, Alanine; F, Phenylalanine; L, Leucine; Q, Glutamine; H, Histidine. Comparison of the mt genome of C. glycines and the other Dothidiomycetes with those of an additional 19 ascomycetous fungal species revealed several potentially distinguishing characteristics of this class. Of the 25 mt genomes compared, fifteen carry all genes on the same strand of DNA and an additional four mt genomes show the core coding genes encoded on the same strand with only tRNAs or hypothetical proteins encoded in the opposite direction (S4 Table). However, all nine members of the Dothidiomycetes contain genes distributed on both mtDNA strands. Also, while ribosomal protein S3 or S5 occurs within an intron of the rnl in 17 of the 25 species examined, among the Pleosporales rps3/rps5 occurs as a free standing ORF and the gene appears to be absent from the two Capnodiales species (Table 4). Additionally, while atp8 and atp9 are absent from the Pleosporales species, both are found in the other species with the exception of Pseudogymnoascus pannorum which lacks only atp9 (Table 4). The proximity of cox1 and cox2, also characteristic of the Pleosporales examined to date, is not apparent among the other ascomycetous species.
Table 4

A comparison of the general features of some completely sequenced fungal mitochondrial genomes.

SpeciesSize (bp)GC content (%)Core coding genesbribosomal proteincrRNAstRNAsintronsAccession
Arthroderma otae2394324.214rps52251NC_012832
Aspergillus niger3110327.014rps5d2253NC_007445
Beauveria bassiana2996127.214rps32253NC_010652
Bipolaris cookei13579030.112rps32e3040MF784482
Botryotinia fuckeliana8221229.914rps3d2e3020KC832409
Cladophialophora bantiana2682124.514rps52222NC_030600
Coniothyrium glycines9853329.012rps323032MH337273
Didymella pinodes5597329.512rps3d22214NC_029396
Epichloe typhina8463027.014rps3d2e2418NC_032063
Glarea lozoyensis4503829.814rps32337KF169905
Hypocrea jecorina4213027.214rps52259NC_003388
Lecanicillium saksenae2591926.514rps32261NC_028330
Metarhizium anisopliae2467328.414rps32241NC_008068
Parastagonospora nodorum4976129.412rps52275NC_009746
Peltigera dolichorrhiza5115626.814rps3d2266NC_031804
Penicillium polonicum2819225.614rps32271NC_030172
Phialocephala subalpina4374228.014rps32270NC_015789
Pithomyces chartarum6892628.612rps3d2e2613KY792993
Pseudogymnoascus pannorum2691828.113rps32271NC_027422
Pyronema omphalodes19118943.014rps322522NC_029745
Sclerotinia borealis20305132.114rps323161NC_025200
Shiraia bambusicola3903025.212rps32321NC_026869
Stemphylium lycopersici7591129.612rps3d22815KX453765
Talaromyces marneffei3543825.014rps522810NC_005256
Trichophyton rubrum2698523.514rps52251NC_012824
Verticillium dahliae2718427.314rps32251NC_008248
Zasmidium cellare2374327.814-2250NC_030334
Zymoseptoria tritici4396432.014-2270NC_010222

a All fungi in this table have mt genomes with circular topology.

b Refers to the 14 conserved protein coding genes typical of fungal mitochondrial genomes: 11 genes encoding subunits of respiratory chain complexes (cob cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, and nad6) and 3 ATP synthase subunits (atp6, atp8 and atp9).

c Ribosomal protein S3 or S5, when present, occurs as an intronic orf within the rnl of all above mt genomes with the exception of C. glycines, D. pinodes, P. nodorum, P. subalpina, P. amphalodes, P. chartarum, S. bambusicola, and S. lycopersici.

d The ribosomal proteins S3 or S5 were not annotated in the available sequences, but were putatively identified by blastx analysis against the non-redundant protein database.

e Ribosomal RNAs were not annotated in the available sequences, but were putatively identified by blastn analysis against the rnl and rns of other fungal mt genomes.

a All fungi in this table have mt genomes with circular topology. b Refers to the 14 conserved protein coding genes typical of fungal mitochondrial genomes: 11 genes encoding subunits of respiratory chain complexes (cob cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, and nad6) and 3 ATP synthase subunits (atp6, atp8 and atp9). c Ribosomal protein S3 or S5, when present, occurs as an intronic orf within the rnl of all above mt genomes with the exception of C. glycines, D. pinodes, P. nodorum, P. subalpina, P. amphalodes, P. chartarum, S. bambusicola, and S. lycopersici. d The ribosomal proteins S3 or S5 were not annotated in the available sequences, but were putatively identified by blastx analysis against the non-redundant protein database. e Ribosomal RNAs were not annotated in the available sequences, but were putatively identified by blastn analysis against the rnl and rns of other fungal mt genomes. Several similarities across the species were revealed as well. The G+C content is consistent among all species, ranging from 23–32%, with the exception of Pyronema omphalodes with 43%, and all show some tRNA clustering around the rnl. In all but four species, nad4L and nad5 are adjacent with either no intergenic spacer or a single base pair overlap (S4 Table). The size of the mt genome and the presence of introns varies across all species, ranging from 23743 bp in Z. cellare with no introns to 203051 bp in Sclerotinia borealis with 61 introns. In general, a larger number of introns is reflected in a larger genome size (Table 4). Among the Pleosporales, S. bambusicola has the smallest mt genome at 39030 bp, of which only 3.2% is comprised of the one intron identified.[40] P. nodorum (49761 bp) contains five introns, which make up 13% of the mtDNA [16], while D. pinodes (55973 bp) contains 14 introns, making up 26% of its mt genome size (NC_029396). Within C. glycines, the 32 identified introns comprised 54% of total mt genome size. A phylogenetic tree was built with twelve protein-coding genes in common from 25 fungal species (Fig 3). This tree agrees with commonly accepted fungal taxonomy and supports the placement of C. glycines among the Pleosporales and recent reclassification to its own family, the Coniothyriaceae.[13]
Fig 3

Phylogenetic tree constructed from unambiguously aligned portions of concatenated protein-coding sequences of twelve protein-coding genes shared in common among 25 fungal mt genomes.

Topology shown was inferred with PhyML 3.0 using LG as the evolutionary model. Sequences were obtained from GenBank: Arthroderma otae (NC_012832); Aspergillus niger (NC_007445); Beauveria bassiana (NC_010652); Botryotinia fuckeliana (KC832409); Cladophialophora bantiana (NC_030600); Didymella pinodes (NC_029396): Epichloe typhina (NC_032063); Glarea lozoyensis (KF169905); Hypocrea jecorina (NC_003388); Lecanicillium saksenae (NC_028330); Metarhizium anisopliae (NC_008068); Parastagonospora nodorum (NC_009746); Peltigera dolichorrhiza (NC_031804); Penicillium polonicum (NC_030172); Pseudogymnoascus pannorum (NC_027422); Pyronema omphalodes (NC_029745); Sclerotinia borealis (NC_025200); Shiraia bambusicola (NC_026869); Talaromyces marneffei (NC_005256); Trichophyton rubrum (NC_012824); Verticillium dahliae (NC_008248); Zasmidium cellare (NC_030334); Zymoseptoria tritici (NC_010222); Phialocephala subalpina (NC_015789).

Phylogenetic tree constructed from unambiguously aligned portions of concatenated protein-coding sequences of twelve protein-coding genes shared in common among 25 fungal mt genomes.

Topology shown was inferred with PhyML 3.0 using LG as the evolutionary model. Sequences were obtained from GenBank: Arthroderma otae (NC_012832); Aspergillus niger (NC_007445); Beauveria bassiana (NC_010652); Botryotinia fuckeliana (KC832409); Cladophialophora bantiana (NC_030600); Didymella pinodes (NC_029396): Epichloe typhina (NC_032063); Glarea lozoyensis (KF169905); Hypocrea jecorina (NC_003388); Lecanicillium saksenae (NC_028330); Metarhizium anisopliae (NC_008068); Parastagonospora nodorum (NC_009746); Peltigera dolichorrhiza (NC_031804); Penicillium polonicum (NC_030172); Pseudogymnoascus pannorum (NC_027422); Pyronema omphalodes (NC_029745); Sclerotinia borealis (NC_025200); Shiraia bambusicola (NC_026869); Talaromyces marneffei (NC_005256); Trichophyton rubrum (NC_012824); Verticillium dahliae (NC_008248); Zasmidium cellare (NC_030334); Zymoseptoria tritici (NC_010222); Phialocephala subalpina (NC_015789).

Discussion

This research provides the first genomic information on the USDA APHIS-listed Plant Pathogen Select Agent C. glycines; data which may provide targets for rapid diagnostic assays and population studies. Additionally, C. glycines represents the second largest mt genome from a member of the Pleosporales sequenced to date. Mitochondrial genome size among fungi varies greatly from the smallest, Rozella allomyces, at 12055 bp [41] to the largest, Rhizoctonia solani, at 235849 bp [42]. At 98,533 bp, C. glycines is of larger than average size and only 23 other currently available fungal mt genomes are larger. Among the fungi there is no correlation between mtDNA size and gene content. The gene content of fungal mt genomes is largely conserved. However, it is notable that C. glycines lacked two of the core set of genes typical of fungal mt genomes: atp8 and atp9. These two genes were also absent from the mt genomes of other Pleosporales species [16][40]. While gene content may be conserved, gene order is not equally conserved and relative gene order varies both between and within major fungal phyla [43][44][45]. Alignment of the C. glycines mt genome with other members of the Dothidiomycetes identified a lack of synteny in gene order and gene orientation. However, limited conserved gene blocks were observed. The uninterrupted gene pairs of nad2-nad3 and nad4L-nad5 occurred in all nine Dothidiomycetes species, while the pairing of cox1-cox2 occurred only within all seven Pleosporales species and not the two Capnodiales species. Additionally, nad1-nad4 remain coupled in only three species from the Pleosporales. A conserved gene block nad2-nad3 and nad4L-nad5 was identified among three of the Pleosporales, but within the C. glycines mt genome this block is interrupted by three other genes. However, six of the seven Pleosporales species showed an atp6-rnl-nad6 conserved gene block, which included two large clusters of tRNAs on either side of the rnl in a relatively conserved pattern. Additionally, protein-coding and tRNA genes of C. glycines and the eight other Dothidiomycetes are encoded on both mtDNA strands, while the majority of ascomycetes species examined here have genes encoded on a single DNA strand. The pattern of gene order in mt genomes may provide a road map to trace the evolutionary route of fungal taxonomy. As additional species from the Dothidiomycetes, and the Pleosporales specifically, are analyzed, the additional mt signals will indicate if conserved gene blocks identified to date are characteristic of the Order Pleosporales and further help elucidate fungal taxonomy. Comparative genomics and phylogenetic analysis presented here supports the placement of C. glycines within the Pleosporales and its recent reclassification to its own family, the Coniothyriaceae [13]. With gene content being largely conserved, the size variation evident among fungal mt genomes is instead attributable to variations in the structure and size of intergenic spacers and the number and size of introns [46][47][48]. The larger than average mt genome size of C. glycines was attributed to the relatively high number of introns identified, with 32 introns comprising over half of the total mt genome size. This abundance of introns, most of which possess complete or degenerate HEs, may also provide valuable tools for the evaluation of evolutionary history and intron mobility [49][50][51][52][53][54]. While the cox1 gene is considered the most common insertion site for group I introns in fungal mt genomes, the number of introns inserted varies widely from zero in some fungi to the fourteen identified in Podospora anserina [55]. The present study of C. glycines found five of ten cox1 introns, which all possess either complete or truncated HE domains, shared high sequence identity with corresponding introns from the six other Pleosporales species annotated, suggesting common ancestral origin. However, it is notable that none of these five putative HEs occurred in all seven Pleosporales species. The remaining five cox1 introns showed varying degrees of identity with introns from the mt genomes of more distantly related fungi. For example, cox1 intron8 contained a GIY-YIG HE that showed 85% nucleotide identity with an intron from the corresponding location in S. sclerotiorum of the Helotiales and intron5, with its LAGLIDADG HE, shared 71% identity with an intron from Lachancea mirantina, a member of the Saccharomycotina subphylum (S2 Table). The similarity to HEs from more distantly related fungi suggest possible acquisition through horizontal transfer rather than retention from a common ancestor. Additional evidence of horizontal transfer comes from nad1 intron2 and its LAGLIDADG HE which showed no nucleotide identity with introns from other species of the Ascomycota, but rather showed identity with introns from two distantly related members of the Basidiomycota. The examination of cox1 HEs also revealed evidence of multiple insertion events during the course of evolution. While cox1 intron2 possessed end regions with truncated LAGLIDADG domains and high nucleotide identity to a single orthologous intron from D. pinodes, the central region of this intron, with a truncated GIY-YIG domain, showed only amino acid similarity to an intron from the cob gene of the more distantly-related C. deutercubensis, suggesting the insertion of a new sequence into an already present HE. It is difficult to determine the precise roles that intron retention, intron acquisition through horizontal transfer, and intron loss have played in constructing the C. glycines mt genome as it has been annotated here. The question remains if some fungal lineages possess a mechanism by which they accumulate and retain HEs while other fungal lineages appear to have lost all introns, and what that mechanism might be. However, this analysis of HEs does suggest that a complex pattern of insertions and horizontal transfers of introns are responsible for the relatively large mt genome size of C. glycines.

Repeat sequences in the Coniothyrium glycines mitochondrial genome.

(DOCX) Click here for additional data file.

Sequence similarity betwen mt introns of Coniothyrium glycines and introns of other fungal mitochondrial genomes.

(XLSX) Click here for additional data file.

Codon usage in twelve protein-coding mitochondrial genes of Coniothyrium glycines.

(XLSX) Click here for additional data file.

Gene order of the fungal mt genomes used for comparative genomics and phylogenetic analysis.

(XLSX) Click here for additional data file.
  40 in total

1.  The sequence manipulation suite: JavaScript programs for analyzing and formatting protein and DNA sequences.

Authors:  P Stothard
Journal:  Biotechniques       Date:  2000-06       Impact factor: 1.993

2.  Automatic annotation of organellar genomes with DOGMA.

Authors:  Stacia K Wyman; Robert K Jansen; Jeffrey L Boore
Journal:  Bioinformatics       Date:  2004-06-04       Impact factor: 6.937

3.  The analysis of the complete mitochondrial genome of Lecanicillium muscarium (synonym Verticillium lecanii) suggests a minimum common gene organization in mtDNAs of Sordariomycetes: phylogenetic implications.

Authors:  Vassili N Kouvelis; Dimitri V Ghikas; Milton A Typas
Journal:  Fungal Genet Biol       Date:  2004-10       Impact factor: 3.495

4.  Analysis of the complete mitochondrial genome sequences of the soybean rust pathogens phakopsora pachyrhizi and p. meibomiae.

Authors:  Christine L Stone; Martha Lucia Posada Buitrago; Jeffrey L Boore; Reid D Frederick
Journal:  Mycologia       Date:  2010 Jul-Aug       Impact factor: 2.696

5.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

6.  Genes within genes: multiple LAGLIDADG homing endonucleases target the ribosomal protein S3 gene encoded within an rnl group I intron of Ophiostoma and related taxa.

Authors:  J Sethuraman; A Majer; N C Friedrich; D R Edgell; G Hausner
Journal:  Mol Biol Evol       Date:  2009-07-13       Impact factor: 16.240

7.  Identification of protein coding regions by database similarity search.

Authors:  W Gish; D J States
Journal:  Nat Genet       Date:  1993-03       Impact factor: 38.330

8.  The Agaricus bisporus cox1 gene: the longest mitochondrial gene and the largest reservoir of mitochondrial group i introns.

Authors:  Cyril Férandon; Serge Moukha; Philippe Callac; Jean-Pierre Benedetto; Michel Castroviejo; Gérard Barroso
Journal:  PLoS One       Date:  2010-11-18       Impact factor: 3.240

9.  Dothideomycete plant interactions illuminated by genome sequencing and EST analysis of the wheat pathogen Stagonospora nodorum.

Authors:  James K Hane; Rohan G T Lowe; Peter S Solomon; Kar-Chun Tan; Conrad L Schoch; Joseph W Spatafora; Pedro W Crous; Chinappa Kodira; Bruce W Birren; James E Galagan; Stefano F F Torriani; Bruce A McDonald; Richard P Oliver
Journal:  Plant Cell       Date:  2007-11-16       Impact factor: 11.277

10.  The 203 kbp mitochondrial genome of the phytopathogenic fungus Sclerotinia borealis reveals multiple invasions of introns and genomic duplications.

Authors:  Andrey V Mardanov; Alexey V Beletsky; Vitaly V Kadnikov; Alexander N Ignatov; Nikolai V Ravin
Journal:  PLoS One       Date:  2014-09-12       Impact factor: 3.240

View more
  5 in total

1.  Comparative mitochondrial genome analyses reveal conserved gene arrangement but massive expansion/contraction in two closely related Exserohilum pathogens.

Authors:  Qingzhou Ma; Yuehua Geng; Qiang Li; Chongyang Cheng; Rui Zang; Yashuang Guo; Haiyan Wu; Chao Xu; Meng Zhang
Journal:  Comput Struct Biotechnol J       Date:  2022-03-21       Impact factor: 7.271

2.  The Mitogenomes of Ophiostoma minus and Ophiostoma piliferum and Comparisons With Other Members of the Ophiostomatales.

Authors:  Abdullah Zubaer; Alvan Wai; Nikita Patel; Jordan Perillo; Georg Hausner
Journal:  Front Microbiol       Date:  2021-02-10       Impact factor: 5.640

3.  Characterization and phylogenetic analysis of the complete mitochondrial genome of the pathogenic fungus Ilyonectria destructans.

Authors:  Piotr Androsiuk; Adam Okorski; Łukasz Paukszto; Jan Paweł Jastrzębski; Sławomir Ciesielski; Agnieszka Pszczółkowska
Journal:  Sci Rep       Date:  2022-02-11       Impact factor: 4.379

Review 4.  Fungal Mitogenomes: Relevant Features to Planning Plant Disease Management.

Authors:  Rocio Medina; Mario Emilio Ernesto Franco; Laura Cecilia Bartel; Virginia Martinez Alcántara; Mario Carlos Nazareno Saparrat; Pedro Alberto Balatti
Journal:  Front Microbiol       Date:  2020-05-29       Impact factor: 5.640

5.  Comparative Mitogenomics of Fungal Species in Stachybotryaceae Provides Evolutionary Insights into Hypocreales.

Authors:  Li-Yuan Ren; Shu Zhang; Yong-Jie Zhang
Journal:  Int J Mol Sci       Date:  2021-12-12       Impact factor: 5.923

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.