| Literature DB >> 17487271 |
Melanie T Cushion1, A George Smulian, Bradley E Slaven, Tom Sesterhenn, Jonathan Arnold, Chuck Staben, Aleksey Porollo, Rafal Adamczak, Jarek Meller.
Abstract
Members of the genus Pneumocystis are fungal pathogens that cause pneumonia in a wide variety of mammals with debilitated immune systems. Little is known about their basic biological functions, including life cycle, since no species can be cultured continuously outside the mammalian lung. To better understand the pathological process, about 4500 ESTS derived from sequencing of the poly(A) tail ends of P. carinii mRNAs during fulminate infection were annotated and functionally characterized as unassembled reads, and then clustered and reduced to a unigene set with 1042 members. Because of the presence of sequences from other microbial genomes and the rat host, the analysis and compression to a unigene set was necessarily an iterative process. BLASTx analysis of the unassembled reads (UR) vs. the Uni-Prot and TREMBL databases revealed 56% had similarities to existing polypeptides at E values of<or=10(-6), with the remainder lacking any significant homology. The most abundant transcripts in the UR were associated with stress responses, energy production, transcription and translation. Most (70%) of the UR had similarities to proteins from filamentous fungi (e.g., Aspergillus, Neurospora) and existing P. carinii gene products. In contrast, similarities to proteins of the yeast-like fungi, Schizosaccharomyces pombe and Saccharomyces cerevisiae, predominated in the unigene set. Gene Ontology analysis using BLAST2GO revealed P. carinii dedicated most of its transcripts to cellular and physiological processes ( approximately 80%), molecular binding and catalytic activities (approximately 70%), and were primarily derived from cell and organellar compartments (approximately 80%). KEGG Pathway mapping showed the putative P. carinii genes represented most standard metabolic pathways and cellular processes, including the tricarboxylic acid cycle, glycolysis, amino acid biosynthesis, cell cycle and mitochondrial function. Several gene homologs associated with mating, meiosis, and sterol biosynthesis in fungi were identified. Genes encoding the major surface glycoprotein family (MSG), heat shock (HSP70), and proteases (PROT/KEX) were the most abundantly expressed of known P. carinii genes. The apparent presence of many metabolic pathways in P. carinii, sexual reproduction within the host, and lack of an invasive infection process in the immunologically intact host suggest members of the genus Pneumocystis may be adapted parasites and have a compatible relationship with their mammalian hosts. This study represents the first characterization of the expressed genes of a non-culturable fungal pathogen of mammals during the infective process.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17487271 PMCID: PMC1855432 DOI: 10.1371/journal.pone.0000423
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1EST Analysis and Unigene Process.
The raw sequence reads of the Expressed Sequence Tags were first purged of poor quality sequence (arrows to right of starting point). The resulting 3,896 reads were deposited to NHLBI GenBank and further analyzed for similarities to genes and gene products in the UNI-PROT_TREMBL databases using BLASTn and BLASTx and putative function with BLAST2GO and KASS. The raw reads were then processed using an iterative scheme to form the unigene set starting with a primary screen to purge sequences originating from the cloning vector, rat host and bacteria, followed by a trimming of the sequence ends to reduce poor quality sequence, using an in house program (qTrim). After trimming, the sequences were again purged of contaminants using BLASTn and BLASTx, then assembled using the CAP3 assembly program. After another round of contaminant removal, 1,632 cDNA clones representing putative unique genes were selected for full sequencing. These sequences were then subjected to the same qTrim program, then assembled by CAP3. The primer design program, Primer3 was used to design primers to close gaps in those clone sequence that did not represent full sequences. After another round of assembly, the cDNA sequences were compared to one another for sequence identity using BLASTn, to identify any redundant gene sequences. This resulted in a set of 1042 unique sequences (both contigs and singletons) of which 994 had significant similarities (E≤10−6) to genes within the UNI-PROT_TREMBL and 48 did not have significant similarities to existing genes (E≥10−6). The unigene set was then analyzed for putative functions by BLAST2GO and KAAS.
Transcript abundance.
| Unigene identifier | No. ESTs | Gene homolog | Organism | Metabolic role |
| Contig1035 | 281 | HSP70 |
| Stress related |
| Contig1031/147 | 148 | STE11 |
| Mating; nutritional stress |
| Contig1055 | 140 | SFP1 |
| Stress related; nutritional limitation |
| Contig1034/946 | 138 | COX II precursor |
| Aerobic respiration |
| Contig1039 | 89 | HSP90 |
| Stress-related |
| Contig498 | 84 | MSG |
| Variable surface antigen |
| Contig520 | 84 | CON7 |
| Sporogenesis |
| Contig1056 | 81 | PDR3 |
| Transcriptional activator; oxidative stress |
| 11d08 | 74 | Q9AVH2 |
| Senescence-associated; dehydration |
| Contig909 | 69 | SIS1 |
| Stress related |
| Contig1037/287 | 68 | MLO3 |
| mRNA export |
| Contig176 | 66 | Mitochondrial genome |
| Aerobic respiration |
| Contig1016 | 60 | PSI1 |
| Translation |
| Contig799 | 38 | BBP |
| Intron splicing |
| Contig8 | 24 | SIN1 |
| Oxidative stress related; mitosis |
| Contig298 | 22 | EF1A |
| Protein synthesis |
| Contig 947 | 21 | TFIID |
| Transcription |
Contig purged from final unigene set due to redundancy
Variant 2 major surface glycoprotein isotype
Low homology
Figure 2Binning of P. carinii EST gene homologs by general organism groups.
After trimming the raw EST reads for poor quality sequence (qTrim), the ESTs were analyzed for similarities to gene products of other organisms using BLASTx. Significance was set at E≤10−6.
Figure 3Binning of putative fungal gene homologs of P. carinii ESTs.
EST sequences with significant similarities to fungal genes with BLASTx scores of E≤10−6 were binned according to fungal species.
Figure 4Distribution of P. carinii ESTs with significant similarities to known Pneumocystis genes.
EST sequences with significant similarities to known P. carinii genes with BLASTx scores of E≤10−6
Figure 5BLAST2GO categories of ESTs.
ESTs were analyzed using the BLAST2GO software. Shown are level 2 categories for Biological Processes (A); Molecular Function (B); and Cellular Components (C).
Figure 6Binning of putative fungal gene homologs of P. carinii Unigenes.
The 918 unigene sequences with significant similarities fungal genes (BLASTx of E≤10−6 ) were binned according to fungal organism of origin.
KEGG biochemical pathway mappings for P. carinii unigenes.
| KEGG Category | Enzymes represented |
| 01110 Carbohydrate Metabolism | 48 |
| 01320 Signal Transduction | 42 |
| 01420 Cell Growth and Death | 36 |
| 01150 Amino Acid Metabolism | 33 |
| 01120 Energy Metabolism | 25 |
| 01220 Translation | 18 |
| 01190 Metabolism of Cofactors and Vitamins | 18 |
| 01430 Cell Communication | 19 |
| 01230 Folding, Sorting and Degradation | 17 |
| 01460 Immune System | 16 |
| 01130 Lipid Metabolism | 14 |
| 01140 Nucleotide Metabolism | 14 |
| 01210 Transcription | 14 |
| 01170 Glycan Biosynthesis and Metabolism | 10 |
| 01410 Cell Motility | 8 |
| 01195 Biosynthesis of Secondary Metabolites | 6 |
| 01450Behavior | 6 |
| 01510 Neurodegenerative Disorders | 6 |
| 01196 Xenobiotics Biodegradation and Metabolism | 5 |
| 01470 Nervous System | 5 |
| 01160 Metabolism of Other Amino Acids | 4 |
| 01440 Development | 4 |
| 01530 Metabolic Disorders | 2 |
| 01240 Replication and Repair | 1 |
| 01330 Signaling Molecules and Interaction | 1 |
Unigene Representation in the Glycolytic Pathway.
| Name of enzyme | Gene Present in Unigene/Genomic Database (U/G/NF) | Unigene Identifier (s) |
| Hexokinase | U/G | cDNAv1_0.fasta.screen.Contig604 |
| Phosphoglucose isomerase | G | |
| 6-phosphofructokinase | U/G | 14e02uni.f.ab1; 14e02uni.t.ab1 |
| Aldolase | NF | |
| Triose phosphate isomerase | G | |
| Glyceraldehyde 3-phosphate dehydrogenase | NF | |
| Phosphoglycerate kinase | U/G | cDNAv1_0.fasta.screen.Contig 611 |
| Phosphoglycerate mutase | U/G | cDNAv1_0.fasta.screen.Contig993 |
| Enolase | G | |
| Pyruvate kinase | G |
Homolog found in Unigene Set (U) or Genomic database (G) or Not Found (NF) in either database.
Unigenes represented in the Pentose Phosphate Pathway
| Name of enzyme | Gene Present in Unigene/Genomic Database (U/G/NF) | Unigene Identifier (s) |
|
| ||
| Glucose-6-phosphate dehydrogenase | G | |
| Gluconolactonase | G | |
| 6-phosphogluconate dehydrogenase | U/G | cDNAv1_0.fasta.screen.Contig940 |
| Ribose-phosphate pyrophosphokinase | U/G | cDNAv1_0.fasta.screen.Contig616 |
|
| ||
| Phosphoglucomutase | G | |
| Ribulose-5-phosphate 3-epimerase | U/G | cDNAv1_0.fasta.screen.Contig849 |
| 14e02uni.f.abi | ||
| 14e02uni.t.abi | ||
| Ribose-5-phosphate isomerase | G | |
| Transketolase | G | |
| Transaldolase | NF | |
| Transketolase | G | |
Homolog found in Unigene Set (U) or Genomic database (G) or Not Found (NF) in either database.
Unigenes represented in the Citrate Cycle (TCA).
| Name of enzyme | Gene Present in Unigene/Genomic Database (U/G) | Unigene Identifier (s) |
| Citrate synthase ( | U/G | cDNAv1_0.fasta.screen.Contig961 |
| ATP citrate synthase | U/G | cDNAv1_0.fasta.screen.Contig749 |
| cDNAv1_0.fasta.screen.Contig808 | ||
| Aconitate hydratase 1 (aconitase) | U/G | cDNAv1_0.fasta.screen.Contig547 |
| 15h06uni.e.abi | ||
| Isocitrate dehydrogenase | G | |
| Αlpha-ketoglutarate complex | U/G | 1h12uni.t.ab1 |
| Succinyl CoA synthetase | G | |
| Succinate dehydrogenase | U/G | cDNAv1_0.fasta.screen.Contig722 |
| Fumarase | G | |
| Malate dehydrogenase | U/G | cDNAv1_0.fasta.screen.Contig186 |
| cDNAv1_0.fasta.screen.Contig846 |
Homolog found in Unigene Set (U) or Genomic database (G) or Not Found (NF) in either database.
Gene Homologs associated with mating/meiosis.
| Fungal homolog |
| Function |
| MATMC, |
| Mating-type M-specific polypeptide Mc |
| FAR10, |
| involved in G1 cell cycle arrest in response to pheromone |
| STE3, |
| mating-type a-factor pheromone receptor |
| STE11, |
| Signal transducing MEK kinase involved in pheromone response and pseudohyphal/invasive growth pathways |
| CDC31, |
| required for spindle pole body duplication in mitosis and meiosis II |
| FUS3, |
| Mitogen-activated protein kinase involved in mating pheromone response |
| SHO1, |
| Transmembrane osmosensor, participates in activation of both the Cdc42p-and MAP kinase-dependent filamentous growth pathway; mating tip projection pathway |
| ORC3, |
| chromatin silencing at silent mating-type cassette |
| ORC5, |
| chromatin silencing at silent mating-type cassette |
| CDC10, |
| cellular morphogenesis during conjugation with cellular fusion; establishment of cell polarity (sensu Fungi); spore wall assembly (sensu Fungi) |
| SPT7, |
| Subunit of the SAGA transcriptional regulatory complex conjugation with cellular fusion |
| MEU26, |
| meiotic expression upregulated; no apparent S. |
| DEP1, |
| Transcriptional modulator involved in maintenance of telomeres, mating efficiency and sporulation |
| MAM2, |
| Pheromone P-factor receptor |