| Literature DB >> 33956596 |
Emily F Warner1,2, Natália Bohálová3,4, Václav Brázda4, Zoë A E Waller5, Stefan Bidula6.
Abstract
Fungal infections cause >1 million deaths annually and the emergence of antifungal resistance has prompted the exploration for novel antifungal targets. Quadruplexes are four-stranded nucleic acid secondary structures, which can regulate processes such as transcription, translation, replication and recombination. They are also found in genes linked to virulence in microbes, and ligands that bind to quadruplexes can eliminate drug-resistant pathogens. Using a computational approach, we quantified putative quadruplex-forming sequences (PQS) in 1359 genomes across the fungal kingdom and explored their presence in genes related to virulence, drug resistance and biological processes associated with pathogenicity in Aspergillus fumigatus. Here we present the largest analysis of PQS in fungi and identify significant heterogeneity of these sequences throughout phyla, genera and species. PQS were genetically conserved in Aspergillus spp. and frequently pathogenic species appeared to contain fewer PQS than their lesser/non-pathogenic counterparts. GO-term analysis identified that PQS-containing genes were involved in processes linked with virulence such as zinc ion binding, the biosynthesis of secondary metabolites and regulation of transcription in A. fumigatus. Although the genome frequency of PQS was lower in A. fumigatus, PQS could be found enriched in genes involved in virulence, and genes upregulated during germination and hypoxia. Moreover, PQS were found in genes involved in drug resistance. Quadruplexes could have important roles within fungal biology and virulence, but their roles require further elucidation.Entities:
Keywords: Aspergillus fumigatus; Fungi; G-quadruplexes; drug resistance; i-motifs; in-silico; virulence
Mesh:
Substances:
Year: 2021 PMID: 33956596 PMCID: PMC8209732 DOI: 10.1099/mgen.0.000570
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.G-quadruplex (G4) and i-motif (iM) structures. Representative examples of G4s and iMs. (a) Side (b) and top-down view of the human telomere DNA quadruplex in K+ solution hybrid-1 form (PDB:2HY9). (c) The basic structure of the G-tetrad. (d) Side (e) and top-down view of an intramolecular iM DNA structure with C·C+ base pairing (PDB 1A83). (f) C·C+ base pairing found in iM structures. Images were generated using the Protein Imager software.
Fig. 2.Heterogeneity of PQS across fungal phylum. The total number and frequency of PQS within 1359 fungal genomes were analysed using G4Hunter with a threshold of 1.5 and a window size of 30. The average number of PQS (a, e), PQS/kbp (b, f), PQS/GC% (c, g) and GC content (d, h) in fungi from the Ascomycota (n=1107), Basidiomycota (n=186), Mucoromycota (n=31), Zoopagomycota (n=12), Chytrdiomycota (n=12), Microsporidia (n=9) and Cryptomycota. (n=2). (e–h) contain boxplots with Tukey whiskers. The outliers are indicated by dots and the line within the boxplot is representative of the median value.
Fig. 3.PQS within the sub-divisions of the phylum Ascomycota. The total number and frequency of PQS within fungal sub-divisions of the phylum Ascomycota were quantified using G4Hunter with a threshold of 1.5 and window size of 30. The total number of PQS (a), PQS/kbp (b), PQS/GC% (c) and GC content (d) in fungi from the Taphrinomycotina (n=14), Saccharomycotina (n=332) and Pezizomycotina (n=761). (a–d) contain boxplots with Tukey whiskers. The outliers are indicated by dots and the line within the boxplot is representative of the median value. Significance was determined by one-way ANOVA with multiple comparisons.
Fig. 4.Higher genome GC-content is positively correlated with the frequency of PQS. The frequency of PQS relative to the GC content of fungi was plotted for (a) all fungal genomes in the Ascomycota, (b) Taphrinomycotina, (c) Saccharomycotina, (d) and the Pezizomycotina. The Pearson correlation coefficient was used to determine the association between PQS and GC content. P<0.05 was considered statistically significant.
Fig. 5.PQS in Aspergillus spp. are genetically conserved. (a) Aspergillus spp. were categorized into sections based upon a phylogenetic tree generated by Steenwyk et al. [46]. The frequency of PQS was shown to be closely associated with the genetic relatedness of these fungi. (b) A PQS was found in an exonic region of cyp51A from A. fumigatus. A blast search of this sequence identified conservation of this exact sequence or variations of this sequence within the section Fumigati (highlighted in red). Alignments were performed using the Clustal Omega web tool (EMBL-EBI). Sequence is reverse complemented.
Fig. 6.Pathogenic Aspergillus species have lower PQS frequencies compared to their non-pathogenic/infrequently pathogenic counterparts. The frequency of PQS/kbp and PQS/GC% were quantified and compared between species within Aspergillus spp. (a) The PQS/kbp in all species of Aspergillus in the study. The frequency of PQS/kbp (b) and PQS/GC% (c) in pathogenic and non-pathogenic/infrequently pathogenic species. Dots represent individual species within a genus. Pink bars represent pathogenic species, whilst blue bars represent non-pathogenic/infrequently pathogenic organisms. The error bars represent the sd. Significance was determined by Student’s T test. Asterisks indicate P<0.05.
Fig. 7.PQS in A. fumigatus are enriched in the repeat regions and mRNA. The location of PQS found 100 bp before, within, and 100 bp after annotated genomic features in A. fumigatus with a G4Hunter score ≥1.2 were identified. (a) The total number of PQS in known genomic features and (b) frequency of PQS comparative to the genomic length of the annotated features. (c) PQS-containing genes are primarily associated with metabolism. Associated protein classes were determined using PANTHER Protein Class v.15.0.
Fig. 8.PQS-containing genes may be involved in numerous biological/molecular functions within pathogenic fungi. GO-term enrichment analysis of PQS-containing genes was performed using the FungiFun web tool V2.2.8. White bars indicate terms associated with molecular function, pink bars indicate terms associated with biological functions, and blue bars indicate terms associated with cellular components. The first number in brackets represents the number of genes and the second is the percentage of genes represented in the annotated category. Significance was determined via Fisher’s exact test. P<0.05 was considered significant.
Fig. 9.PQS can be found in genes that are highly upregulated during fungal germination, in response to environmental stresses, and in biofilms in A. fumigatus. Transcriptomes of A. fumigatus during germination, hypoxia, iron-limiting conditions, oxidative stress and in biofilms, were analysed and the 20 most upregulated genes were further investigated using QGRS Mapper and G4Hunter. (a) Genes that were highly upregulated (compared to dormant or untreated A. fumigatus conidia) contained PQS. Data points represent genes that contain PQS. (b) The average PQS frequency of genes upregulated in hyphae and during hypoxia were significantly higher when compared to the average PQS/kbp for the entire genome (1.55 PQS/kbp, red-dashed line), when analysed with the default G4Hunter settings. Data points represent the PQS frequency of an individual gene. Boxplot represents the median, maximum and minimum values. Whether the mean frequency of PQS within the upregulated genes of each condition was significantly higher compared to the mean of the entire genome was analysed via one-sample T test. * indicates a significant difference (P<0.05).
PQS in genes associated with virulence and drug resistance in A. fumigatus
|
Identifier |
Gene |
Description |
Sequence |
|---|---|---|---|
|
Afu2g02690 |
|
Fungal specific transcription factor, putative |
|
|
Afu4g06890 |
|
14-alpha sterol demethylase |
|
|
Afu7g03740 |
|
14-alpha sterol demethylase |
|
|
Afu1g14330 |
|
Azole transporter |
|
|
Afu6g04360 |
|
Putative ABC transporter |
|
|
Afu5g04170 |
|
Heat shock protein |
|
|
Afu6g12400 |
|
Putative 1,3-beta-glucan synthase catalytic subunit, major subunit of glucan synthase |
|
|
Afu2g03700 |
|
Hydroxymethylglutaryl-CoA (HMG-CoA) reductase |
|
|
Afu1g05800 |
|
Putative mitogen-activated protein kinase kinase (MAPKK) |
|
|
Afu1g06900 |
|
C2H2-type zinc finger transcription factor involved in calcium ion homeostasis |
|
|
Afu1g10080 |
|
Putative C2H2 zinc-responsive transcriptional activator |
|
|
Afu1g10380 |
|
Non-ribosomal peptide synthetase (NRPS) |
|
|
Afu1g10880 |
|
Putative P-type calcium ATPase |
|
|
Afu1g13140 |
|
G protein-coupled receptor alpha subunit |
|
|
Afu1g14660 |
|
Involved in regulation of secondary metabolism |
|
|
Afu1g17200 |
|
Fusarinine C non-ribosomal peptide synthetase (NRPS), putative |
|
|
Afu2g07680 |
|
|
|
|
Afu2g07770 |
|
Ras family GTPase protein |
|
|
Afu2g12200 |
|
cAMP-dependent protein kinase catalytic subunit |
|
|
Afu2g13260 |
|
Putative regulator of adherence, host cell interactions and virulence |
|
|
Afu2g17600 |
|
Conidial pigment polyketide synthase alb1 |
|
|
Afu3g12890 |
|
C6 transcription factor hasA |
|
|
Afu3g12690 |
|
Putative UDP-galactopyranose mutase, enzyme in the first step of galactofuranose biosynthesis |
|
|
Afu3g11970 |
|
C2H2 finger domain transcription factor |
|
|
Afu3g03650 |
|
Putative acetyltransferase with a predicted role in iron metabolism |
|
|
Afu3g03420 |
|
Nonribosomal peptide synthetase 4 |
|
|
Afu3g03400 |
|
Siderophore biosynthesis acetylase AceI, putative |
|
|
Afu3g02270 |
|
Mycelial catalase |
|
|
Afu5g03920 |
|
bZIP transcription factor HapX |
|
|
Afu5g08570 |
|
Class II protein kinase A (PKA) |
|
|
Afu5g08890 |
|
Putative homoaconitase |
|
|
Afu3g09690 |
|
Invasin calA |
|
|
Afu5g10760 |
|
Putative alpha-1,2-mannosyltransferase with a predicted role in protein glycosylation |
|
|
Afu5g11230 |
|
Ras family GTPase protein |
|
|
Afu4g09560 |
|
Zinc transporter that functions in neutral or alkaline environments |
|
|
Afu4g10460 |
|
Homocitrate synthase, essential enzyme of the alpha-aminoadipate pathway of lysine biosynthesis |
|
|
Afu2g12630 |
|
Allergen Asp f 13 |
|
|
Afu4g12470 |
|
Transcriptional activator of the cross-pathway control system of amino acid biosynthesis |
|
|
Afu6g09660 |
|
Nonribosomal peptide synthetase gliP |
|
|
Afu6g11390 |
|
GPI-anchored 1,3-beta-glucanosyltransferase |
|
|
Afu8g02750 |
|
Nucleolar rRNA processing protein |
|
|
Afu8g01670 |
|
Putative bifunctional catalase-peroxidase |
|
|
Afu6g04820 |
|
Para-aminobenzoic acid synthetase, an enzyme catalysing a late step in the biosynthesis of folate |
|
|
Afu6g03590 |
|
Methylcitrate synthase |
|
Fig. 10.Differences between PQS frequency in genes involved in the pathogenicity of A. fumigatus and their orthologues in the close non-pathogenic relative, A. fischeri (a) The differences between the distribution of PQS in A. fumigatus (black) and A. fischeri (pink) genomes. (b) The PQS frequency in genes involved in virulence, drug resistance and gliotoxin biosynthesis in A. fumigatus (black), compared with the PQS frequency in the respective A. fischeri orthologues (pink). PQS frequencies were quantified using the default G4Hunter settings. The mean genome frequencies for A. fumigatus and A. fischeri were 1.55 PQS/kbp and 1.53 PQS/kbp, respectively.