Literature DB >> 28810711

Whole RNA-Sequencing and Transcriptome Assembly of Candida albicans and Candida africana under Chlamydospore-Inducing Conditions.

Domenico Giosa1, Maria Rosa Felice2, Travis J Lawrence3,4, Megha Gulati3, Fabio Scordino1, Letterio Giuffrè5, Carla Lo Passo2, Enrico D'Alessandro5, Giuseppe Criseo2, David H Ardell3, Aaron D Hernday3, Clarissa J Nobile3, Orazio Romeo1,2.   

Abstract

Candida albicans is the most common cause of life-threatening fungal infections in humans, especially in immunocompromised individuals. Crucial to its success as an opportunistic pathogen is the considerable dynamism of its genome, which readily undergoes genetic changes generating new phenotypes and shaping the evolution of new strains. Candida africana is an intriguing C. albicans biovariant strain that exhibits remarkable genetic and phenotypic differences when compared with standard C. albicans isolates. Candida africana is well-known for its low degree of virulence compared with C. albicans and for its inability to produce chlamydospores that C. albicans, characteristically, produces under certain environmental conditions. Chlamydospores are large, spherical structures, whose biological function is still unknown. For this reason, we have sequenced, assembled, and annotated the whole transcriptomes obtained from an efficient C. albicans chlamydospore-producing clinical strain (GE1), compared with the natural chlamydospore-negative C. africana clinical strain (CBS 11016). The transcriptomes of both C. albicans (GE1) and C. africana (CBS 11016) clinical strains, grown under chlamydospore-inducing conditions, were sequenced and assembled into 7,442 (GE1 strain) and 8,370 (CBS 11016 strain) high quality transcripts, respectively. The release of the first assembly of the C. africana transcriptome will allow future comparative studies to better understand the biology and evolution of this important human fungal pathogen.
© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  Candida africana; Candida albicans; Chlamydospores; RNA-seq; nTAR; transcriptome assembly

Mesh:

Year:  2017        PMID: 28810711      PMCID: PMC5553385          DOI: 10.1093/gbe/evx143

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Candida albicans is a diploid commensal fungus that asymptomatically colonizes the gastrointestinal and genitourinary tracts, vagina and skin of healthy humans, and other warm-blooded animals (Neville et al. 2015). However, it is also a major opportunistic fungal pathogen capable of causing a variety of superficial infections in healthy individuals as well as more severe infections, typically in immunocompromised hosts (Papon et al. 2013). Genetically, this yeast is extraordinarily versatile and its population structure is exceptionally heterogeneous, correlating with its prevalence in different clinical samples and the notable pathogenic potential of the isolates (McManus and Coleman 2014). Nevertheless, a particular group of strains appears to be better adapted at causing vaginal infections than others and represents an interesting example of divergent evolution in C. albicans (Odds 2010). These strains were originally proposed to be a new Candida species, Candida africana (Tietz et al. 2001), however comparative genetic studies indicated that the differences between C. albicans and C. africana are insufficient to support its status as a new Candida species (Odds et al. 2007; Romeo and Criseo 2011). Despite the high degree of genetic similarity shared between C. albicans and C. africana, there is considerable phenotypic diversity differentiating the two pathogens. Unlike C. albicans, C. africana is relatively limited in its abilities to assimilate distinct carbon sources, shows a low degree of virulence in disseminated infection models in mice, has a reduced growth rate, and exhibits a decreased ability to filament under standard filament-inducing conditions (Romeo et al. 2013; Pagniez et al. 2015; Felice et al. 2016). Another distinct feature of C. africana compared with C. albicans is its inability to form chlamydospores, large, spherical, thick-walled structures of unknown function that in the past were widely used as a taxonomic criterion to distinguish C. albicans from other Candida species. Chlamydospore development in C. albicans is not well understood; we, therefore, decided to explore and compare the transcriptomes of C. albicans and C. africana in response to specific culture conditions that are known to induce chlamydospore formation in C. albicans. In this study we report, for the first time, the whole-transcriptome profiles of C. albicans and C. africana biovariant strains grown under chlamydospore-inducing conditions.

Materials and Methods

Fungal Strains and Culture Conditions

In this study, we examined a typical C. albicans clinical strain (named GE1) and the reference clinical strain CBS 11016 also known as C. africana, a chlamydospore-negative biovariant of C. albicans (Romeo and Criseo 2009). Both Candida strains were isolated from vaginal samples of women suffering from vaginitis and were identified using conventional clinical laboratory and molecular methods (Romeo and Criseo 2008). The C. albicans GE1 strain shows a strong ability to produce chlamydospores when cultivated on plates containing corn meal agar (CMA) plus 1% tween 80 at 25 °C for 48–72 h, whereas the C. africana CBS 11016 strain is unable to form chlamydospores under the same culture conditions (Romeo and Criseo 2009). To stimulate chlamydospore production and induce the expression of genes associated with their formation, 15 ml of a standard overnight yeast suspension culture (containing ∼107cells/ml) was used to inoculate a 100 mm diameter Petri dish containing CMA plus 1% tween 80. The plates were incubated at 25 °C and microscopically monitored until chlamydospores were produced by the positive GE1 strain, which occurred at 48 h, and cells were collected and subjected to total RNA extraction.

RNA Extraction and Sequencing

A total of 1108 yeast cells were used for RNA extraction using the Ribopure Yeast Purification kit (Thermofisher, Italy) following the manufacturer’s instructions. RNA quantification was carried out spectrophotometrically at 260 nm and 280 nm and RNA integrity was evaluated with an Agilent 2100 Bioanalyzer instrument using the RNA 6000 Nano kit (Agilent Technologies, Italy). Before RNA-Seq, the expression levels of two genes highly expressed specifically during chlamydospore formation, CSP1 and CSP2, were examined by quantitative real time PCR as described previously (Palige et al. 2013). For cDNA synthesis, 2 µg of total RNA were digested with DNase I (Sigma-Aldrich, Italy) following manufacturer’s instructions and retrotranscribed by RevertAid First Strand cDNA Synthesis kit (Thermofisher, Italy) using oligo(dT) at 42 °C for 1 h followed by a reverse transcriptase denaturation step at 70 °C for 10 min. The mRNA levels of CSP1 and CSP2 were quantified by StepOne Plus Real Time PCR system (Applied Biosystem, Italy) using SYBR Premix Ex Taq II (Takara, Clontech, Italy) and primers listed in table 1. Actin (ACT1) was used as housekeeping gene and the ΔΔCt was calculated using the Livak and Schmittgen (2001) method. cDNA was also used to confirm the presence of two distinct “novel transcriptionally active regions” (nTARs) (table 2) detected during evaluation of the transcriptome assembly analysis. The sequences of these transcripts were retrieved from our assemblies, aligned with MEGA6 software (Tamura et al. 2013) and used to design specific oligonucleotide primers (table 1).
Table 1

Oligonucleotide Primers Used in This Study

Primer NameSequence (5′→3′)TargetAmplicon Size (bp)Reference
Ca3512_fw1ACACCACTGCAAGTATCCATATTGTGACSP1270Palige et al. (2013)
Ca3512_rev1ATCTTGTATAACCCTTTGTCGTCAAC
Ca4170_fw1GCTACTGGTGAAATTGTTGCTAATCCSP2280Palige et al. (2013)
Ca4170_rev1TCATCATCACAGTCATCGCTATC
ACT1-RT-FTCCAGAAGCTTTGTTCAGACCAGCACT1170Felice et al. (2016)
ACT1-RT-RTGCATACGTTCAGCAATACCTGGG
nTAR1_fwGTTGTTATTGTGATGGTGGTGGnTAR1197This study
nTAR1_revGGTAAGAGAGGGTTCTGCAGC
nTAR2_fwGCCAATCTTCGTATATGTGGCnTAR2278This study
nTAR2_revCTGGTGTTCTTCCCCTAGC
Table 2

Genomic Coordinates of the Two nTARs Found in This Study

nTAR NameCHRStartStopOverlap with Previous nTARConservation with Other Candida spp.GenBank
nTAR1R2,009,8282,010,096NOGEVV02006198
nTAR2R1,695,1491,695,435TFRW181aNOGEVV02005397

chromosome.

Sellam et al. (2010).

Oligonucleotide Primers Used in This Study Genomic Coordinates of the Two nTARs Found in This Study chromosome. Sellam et al. (2010). In vitro amplifications of the nTARs were performed separately using the DreamTaq PCR Master Mix (Thermofisher, Italy) plus 1 µl of cDNA and 0.5 µM of each primer pair (table 1). PCR reactions were carried out in a Bio-Rad T100 thermal cycler with preliminary denaturation at 95 °C for 4 min followed by 35 cycles of denaturation at 94 °C for 40 sec, annealing at 56 °C for 40 sec, extension at 72 °C for 45 sec and a final step of 10 min at 72 °C. Amplicons were subjected to 2% agarose gel electrophoresis to confirm the presence of amplified fragments, expected size and the absence of nonspecific products. Subsequently, PCR products were purified using the QIAquik PCR Purification kit (Qiagen, Italy) and sequenced at the Eurofins Genomics (Ebersberg, Germany; www.eurofinsgenomics.eu) using the same primers used for PCR (table 1). For whole RNA sequencing, ∼4 µg (100 ng/µl) of high quality total RNA (OD260/280 ≥ 2.0; RIN value ≥ 9.6) from each Candida strain, were sent to GATC Biotech (Konstanz, Germany) for poly-A filtering, fragmentation, random primed cDNA synthesis and library preparation for Illumina HiSeq single read sequencing (read length: 1 × 50 bp).

Transcriptome Assembly and Evaluation

Before assembling, raw reads were processed using the programs Scythe (version 0.994 BETA; https://github.com/vsbuffalo/scythe; last accessed July 28, 2017) and Sickle (version 1.33; https://github.com/najoshi/sickle; last accessed July 28, 2017) to remove adapters and sequences with low Phred-scores (cutoff: ≥20). For assembling whole transcriptomes, we used two different strategies: a de novo approach using BinPacker version 1.0 (Liu et al. 2016) and a reference based assembly using StringTie version 1.3.0 (Pertea et al. 2015) with two different alignment programs: HISAT2 version 2.0.4 (Kim et al. 2015) and Subjunc version 1.22.3 (Liao et al. 2013). Both HISAT2 and Subjunc short read aligners produced similar results. The reference genome used was that of C. albicans SC5314, version A22-s05-m05-r03, retrieved from the Candida Genome Database (CGD) (www.candidagenome.org). To evaluate the completeness of our assemblies, we used BUSCO version 1.22 (Simão et al. 2015), which quantitatively assesses the assembled transcriptomes by comparing them against a database (OrthoDB; www.orthodb.org) of universal single copy orthologs for fungi. A full description of the command-line programs, including an R script used for Subjunc alignment in the reference based assembly, is provided in the supplementary files S1 and S2, Supplementary Material online, respectively. Overall statistics of the two Candida transcriptomes assembled in this study were calculated using FAST version 1.06 (Lawrence et al. 2015) and SAMtools version 1.3.1 (Li et al. 2009).

Results

In this study, the C. albicans GE1 strain produced abundant chlamydospores after 48 h of incubation on CMA supplemented with 1% tween 80 whereas these structures were not observed in the C. africana CBS 11016 culture. To further demonstrate the lack of chlamydospore production in C. africana, we examined the induction of two specific genes, CSP1 and CSP2, which were reported to be highly up-regulated during chlamydospore development in C. albicans (Palige et al. 2013). As expected the expression levels of these two genes were significantly elevated in the GE1 strain compared with CBS 11016 strain (data not shown). Illumina whole transcriptome sequencing produced over 78 and 61 million raw reads for strains CBS 11016 and GE1, respectively (table 3). After quality filtering and trimming, ∼0.51% (CBS 11016 strain) and 0.76% (GE1 strain) of the total reads were removed, leaving a large data set of high-quality reads for both de novo and reference guided assembling (average Q-score: ∼38) (table 3). The number of complete, fragmented, and missing orthologs obtained by BUSCO (Simão et al. 2015) for all transcriptomes is shown in figure 1. Based on this result, it appears that the de novo approach yielded a more complete transcriptome by a large margin.
Table 3

Transcriptome Assembly Statistics for C. africana and C. albicans

CBS 11016GE1
Total raw reads78,360,45761,354,627
Number of reads used for assemblinga77,957,18160,885,731
Average read length50.7750.74
Total mappedb75,736,64358,898,936
Uniquely mappedc72,959,57155,128,869
Multiply mappedd2,777,0723,770,067
Unaligned2,623,8142,455,691
GC content (%)34.334.4
Total assembled contigs8,3707,442
Total assembly length (bp)10,507,12112,270,608
Number of contigs ≥ 500 bp5,5826,418
Number of contigs ≥1000 bp3,7844,723
Longest contig (bp)10,10210,652
Contig N502,0202,252
Number of contigs with ORFe7,0517,048

Adapters removed using Scythe and sequences trimmed by quality score using Sickle.

Total number of reads aligned on the C. albicans reference genome version A22-s05-m05-r03.

Total number of reads mapped to uniquely locations in the C. albicans genome.

Total number of reads mapped to multiple locations in the C. albicans genome.

Total number of contigs that contain an ORF ≥ 60 bp.

. 1.—

Number of complete, fragmented, and missing orthologs obtained by BUSCO analysis (Simão et al. 2015) using both de novo and reference-guided assemblies of C. africana and C. albicans.

Transcriptome Assembly Statistics for C. africana and C. albicans Adapters removed using Scythe and sequences trimmed by quality score using Sickle. Total number of reads aligned on the C. albicans reference genome version A22-s05-m05-r03. Total number of reads mapped to uniquely locations in the C. albicans genome. Total number of reads mapped to multiple locations in the C. albicans genome. Total number of contigs that contain an ORF ≥ 60 bp. Number of complete, fragmented, and missing orthologs obtained by BUSCO analysis (Simão et al. 2015) using both de novo and reference-guided assemblies of C. africana and C. albicans. To assess whether the transcripts were correctly assembled, an additional quality analysis was performed by mapping back the original reads to the assembled transcriptomes. The data showed that 95% of the reads mapped to both de novo assemblies produced by BinPacker while only 44.7% (C. africana) and 54% (C. albicans) of the reads mapped to the reference-based transcriptomes assembled by StringTie. The de novo assembled transcriptomes (Genbank accession number: GEVW02000000 for GE1 and GEVV02000000 for CBS 11016) resulted in 7,442 contigs (≥200 bp; largest contig 10,652 bp; total consensus length of 12,270,608 bp) for the C. albicans GE1 strain and 8,370 contigs (200 bp; largest contig 10,102 bp; total consensus length of 10,507,121 bp) for the C. africana CBS 11016 strain, constituting ∼43% and ∼37%, respectively, of the entire C. albicans genome. Overall statistics of the two Candida transcriptomes assembled in this study are summarized in table 3. A detailed list of the genes detected, including their annotations, is provided in the supplementary table S1, Supplementary Material online. From our RNA-seq data we generated over 75 (C. africana) and 58 (C. albicans) million mappable reads (table 3) and identified two novel transcriptionally active regions (nTARs) that mapped outside of the annotated regions of the C. albicans reference genome used in this study (fig. 2). The presence of the two nTARs in our cDNA samples was further verified by standard Sanger sequencing, which indicated that both C. albicans and C. africana produce these transcripts.
. 2.—

Signal tracks of RNA-seq data for the two nTARs found in this study.

Signal tracks of RNA-seq data for the two nTARs found in this study. In silico translation (http://web.expasy.org/translate; last accessed July 28, 2017) and BLAST analysis were unable to identify any homologous product currently available in the public databases and therefore our nTARs were classified as long noncoding RNAs (lncRNAs). Both lncRNAs, nTAR1 and nTAR2, map to two different unannotated regions of chromosome R (table 2 and fig. 2) between tD(GUC)7 (encoding a tRNA-Asp) and an uncharacterized ORF (CR_09460 C_A) and between SAP2 (encoding a major secreted aspartyl proteinase) and YHB5 (encoding a flavohemoglobin-related protein) (fig. 2).

Discussions and Conclusions

The biological function of chlamydospores is unknown and the reasons why only C. albicans, and its closely related species C. dubliniensis, have retained this phenotype throughout evolution remains a mystery (Palige et al. 2013; Navarathna et al. 2016). Different hypotheses have been put forth to explain the presence of these characteristic structures in these two Candida species, but none have been definitely proven. It was proposed that chlamydospores could allow the survival of the fungus in harsh environmental conditions or, given that C. albicans and C. dubliniensis are restricted to humans, could facilitate persistence or survival of these species within the mammalian host (Staib and Morschhäuser 2007). However, rarely have chlamydospores been observed in infected tissues (Heineman et al. 1961; Ho and O'Day 1981; Chabasse et al. 1988; Cole et al. 1991) although a recent study documenting their formation in the kidneys of experimentally infected mice suggested a role in pathogenesis (Navarathna et al. 2016). Evolutionarily, the assumption that chlamydospore production has persisted within C. albicans populations is not strictly true since a number of pathogenic strains, such as C. africana, have lost the ability to produce chlamydospores (Tietz et al. 2001; Romeo et al. 2013). These strains represent a group of phylogenetically distinct C. albicans strains (Odds 2010) and show many distinctive phenotypic traits, including profound changes in nutrient assimilation, reduced filamentation, and a low degree of virulence in insect and mammalian models (Borman et al. 2013; Romeo et al. 2013; Pagniez et al. 2015; Felice et al. 2016). This phenotypic diversity may be the result of a unique genetic background (Felice et al. 2016), however, the genomic sequence of C. africana has not yet been determined. For this reason, we decided to sequence the whole transcriptome of C. africana, compared with a standard C. albicans strain, in order to learn which genes are expressed under chlamydospore-inducing conditions. In this study both transcriptomes were assembled using two different methods and our data suggested that a de novo assembly approach was advantageous, despite the availability of a closely related reference genome (Huang et al. 2016). The better performance of the de novo approach may be due, in part, to the nature of our sequencing data (consisting of 50 bp single reads), since most reference based transcriptome assemblers are optimized to take advantage of the additional information contained in paired-end data. However, it is also possible that genetic differences among strains could have contributed to the superior performance of the de novo over reference-based transcriptome assemblies that we observed (Huang et al. 2016). In fact, genetically, the population structure of C. albicans is remarkably heterogeneous and C. africana represents the most evolutionary divergent lineage currently known (Odds et al. 2007; Odds 2010). The greater divergence of C. africana CBS 11016 from the reference C. albicans genome strain (SC5314) may explain why its reference-based assembly missed more orthologs (fig. 1) and mapped back fewer reads than the reference-based assembly using C. albicans GE1. Another interesting finding of this study relates to the discovery of novel and uncharacterized transcriptionally active regions whose functions are at present unknown, but are believed to be of regulatory importance. One nTAR (nTAR2; table 2), although not obviously evident by RNA-seq read coverage (fig. 2), overlaps exactly the same nTAR (TFRW181) initially described by Sellam et al. (2010) in a C. albicans genome annotation paper using high-resolution tiling arrays, and later reported by Nobile et al. (2012) using RNA-seq to study the transcriptional circuitry controlling C. albicans biofilm formation. This nTAR, as well as the new nTAR1 reported here, was not found in any other sequenced Candida species (Candida glabrata CBS 138, Candida tropicalis MYA-3404, Candida parapsilosis CDC 317, Candida orthopsilosis Co 90-125, Candida lusitaniae ATCC 42720, Candida guilliermondii ATCC 6260 and C. dubliniensis CD 36) accessible through the Candida genome database. Overall the release of the first assembly of the C. africana transcriptome represents an important milestone for Candida research because it sets the framework for future genetic studies to determine the transcriptional network underlying chlamydospore development in C. albicans, and also provides a platform for future comparative studies between these two closely related strains, whose differences in phenotypic and genetic traits are remarkably vast.

Supplementary Material

Supplementary data are available at Genome Biology and Evolution online. Click here for additional data file.
  31 in total

1.  Chlamydospores and dimorphism in Candida albicans endocarditis. Observations in a fatal superinfection during treatment of staphylococcus endocarditis.

Authors:  H S HEINEMAN; E J YUNIS; J SIEMIENSKI; A I BRAUDE
Journal:  Arch Intern Med       Date:  1961-10

2.  FAST: FAST Analysis of Sequences Toolbox.

Authors:  Travis J Lawrence; Kyle T Kauffman; Katherine C H Amrine; Dana L Carper; Raymond S Lee; Peter J Becich; Claudia J Canales; David H Ardell
Journal:  Front Genet       Date:  2015-05-19       Impact factor: 4.599

3.  Candida africana sp. nov., a new human pathogen or a variant of Candida albicans?

Authors:  H J Tietz; M Hopp; A Schmalreck; W Sterry; V Czaika
Journal:  Mycoses       Date:  2001-12       Impact factor: 4.377

Review 4.  Candida africana and its closest relatives.

Authors:  Orazio Romeo; Giuseppe Criseo
Journal:  Mycoses       Date:  2010-07-28       Impact factor: 4.377

5.  Chlamydospore-like cells of Candida albicans in the gastrointestinal tract of infected, immunocompromised mice.

Authors:  G T Cole; K R Seshan; M Phaneuf; K T Lynn
Journal:  Can J Microbiol       Date:  1991-08       Impact factor: 2.419

6.  First molecular method for discriminating between Candida africana, Candida albicans, and Candida dubliniensis by using hwp1 gene.

Authors:  Orazio Romeo; Giuseppe Criseo
Journal:  Diagn Microbiol Infect Dis       Date:  2008-07-21       Impact factor: 2.803

7.  Epidemiology, antifungal susceptibility, and pathogenicity of Candida africana isolates from the United Kingdom.

Authors:  Andrew M Borman; Adrien Szekely; Chistopher J Linton; Michael D Palmer; Phillipa Brown; Elizabeth M Johnson
Journal:  J Clin Microbiol       Date:  2013-01-09       Impact factor: 5.948

Review 8.  Candida albicans commensalism in the gastrointestinal tract.

Authors:  B Anne Neville; Christophe d'Enfert; Marie-Elisabeth Bougnoux
Journal:  FEMS Yeast Res       Date:  2015-09-06       Impact factor: 2.796

9.  BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data.

Authors:  Juntao Liu; Guojun Li; Zheng Chang; Ting Yu; Bingqiang Liu; Rick McMullen; Pengyin Chen; Xiuzhen Huang
Journal:  PLoS Comput Biol       Date:  2016-02-19       Impact factor: 4.475

10.  The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote.

Authors:  Yang Liao; Gordon K Smyth; Wei Shi
Journal:  Nucleic Acids Res       Date:  2013-04-04       Impact factor: 16.971

View more
  2 in total

1.  Whole RNA-sequencing and gene expression analysis of Trichoderma harzianum Tr-92 under chlamydospore-producing condition.

Authors:  Min Yuan; Yuanyuan Huang; Zhenhua Jia; Weina Ge; Lan Zhang; Qian Zhao; Shuishan Song; Yali Huang
Journal:  Genes Genomics       Date:  2019-04-09       Impact factor: 1.839

Review 2.  Transcriptomic and Genomic Approaches for Unravelling Candida albicans Biofilm Formation and Drug Resistance-An Update.

Authors:  Pei Pei Chong; Voon Kin Chin; Won Fen Wong; Priya Madhavan; Voon Chen Yong; Chung Yeng Looi
Journal:  Genes (Basel)       Date:  2018-11-07       Impact factor: 4.096

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.