| Literature DB >> 24484775 |
Lakshmi Dhevi Nagarajha Selvan, Jyothi Embekkat Kaviyil, Raja Sekhar Nirujogi, Babylakshmi Muthusamy, Vinuth N Puttamallesh, Tejaswini Subbannayya, Nazia Syed, Aneesha Radhakrishnan, Dhanashree S Kelkar, Sartaj Ahmad, Sneha M Pinto, Praveen Kumar, Anil K Madugundu, Bipin Nair, Aditi Chatterjee, Akhilesh Pandey, Raju Ravikumar, Harsha Gowda1, Thottethodi Subrahmanya Keshava Prasad.
Abstract
BACKGROUND: Cryptococcus neoformans, a basidiomycetous fungus of universal occurrence, is a significant opportunistic human pathogen causing meningitis. Owing to an increase in the number of immunosuppressed individuals along with emergence of drug-resistant strains, C. neoformans is gaining importance as a pathogen. Although, whole genome sequencing of three varieties of C. neoformans has been completed recently, no global proteomic studies have yet been reported.Entities:
Year: 2014 PMID: 24484775 PMCID: PMC3915034 DOI: 10.1186/1559-0275-11-5
Source DB: PubMed Journal: Clin Proteomics ISSN: 1542-6416 Impact factor: 3.988
Figure 1Proteomics of var. . A) Venn diagram representing the number of proteins identified using different fractionation methods B) Scatter plot showing the normalized spectral abundance factor analysis of top 50 proteins identified in our study.
Figure 2Schematic workflow of proteogenomic analysis. Mass spectrometry derived data was searched against protein database and six-frame translated genome database of C. neoformans var. grubii. Peptides mapping to the protein database confirmed annotated proteins and annotated splice junctions. C represents the number of peptides identified mapping to exons excluding A and B. Peptides unique to six-frame translated genome database were categorized based on their mapping to intergenic regions and regions within the annotated genes. These peptides were used to refine the annotation of genome.
Summary of novel findings in proteogenomic analysis of var.
| Novel genes | 4 |
| Novel exons | 17 |
| Refinement of gene boundaries | 19 |
| Refinement of exon boundaries | 53 |
| Protein coding evidence in UTRs | 13 |
| Correction of translational frame | 8 |
| Novel protein start site | 1 |
List of novel proteins identified in var.
| 1 | CNAG_IOB_prot1 | STPLTVSEDQEDELLSIVPIK; AIVGSGLGILSVWNR | Chromosome 1: 1026293–1027893; Negative strand | GenBank: JZ152657.1 (CNAG_IOB_PROT1) | XP_777889.1 ( | Monoubiquitination related protein |
| 2 | CNAG_IOB_prot2 | DLEIGAPVSIGDTPSPEDK; SLSPEVQASLPADIR | Chromosome 2: 852949–855029; Positive strand | GenBank: JZ152658.1 (CNAG_IOB_PROT2) | XP_777044.1 ( | Integrase rve protein |
| 3 | CNAG_IOB_prot3 | LFDNDADGDDDDDQGAVNVNIR | Chromosome 5: 65512–66695; Positive strand | GenBank: JZ152659.1 (CNAG_IOB_PROT3) | XP_570553.1 ( | Hypothetical protein |
| 4 | CNAG_IOB_prot4 | WGILEGEFGK, LTFQFITSSEK, LVNEGDWEGGLSEVEDIVR | Chr 11: 13611–17959; Negative strand | GenBank: JZ152660.1 (CNAG_IOB_PROT4) | XP_776934.1 ( | Retrograde transport, endosome to Golgi-related protein |
Figure 3RT-PCR based validation of novel and revised gene models. RT-PCR validation was carried out for 47 novel and revised gene models. Transcript identifier for each gene model is indicated above each lane. These RT-PCR products were sequenced and submitted to GenBank.
Figure 4Identification of novel protein-coding genes based on peptides mapping to intergenic regions. A) GSSP mapped to intergenic region in Chromosome 1 between CNAG_00389 and CNAG_00390. Gene prediction programs predicted 358 amino acids long proteins from this region, which is homologous to monoubiquitination protein of C. neoformans var. neoformans.B) Three peptides mapped to intergenic region between CNAG_01461 and CNAG_01463. Gene prediction programs predicted protein-coding gene in this region. The predicted protein has homology to retrograde transport, endosome to Golgi related protein, XP_776934.1 (C. neoformans var. neoformans B-3501A), XP_569839.1 (C. neoformans var. neoformans JEC21), XP_003192571.1 (C. gattii WM276). C) The MS/MS spectra of intergenic peptides STPLTVSEDQEDELLSIVPIK and LTFQFITSSEK are shown.
List of novel exons identified using peptide evidence in var.
| 1 | CNAG_02122 | AcALLEGLESGLSR | Chr 6: 1136454 – 1136507; Negative | GenBank: JZ152661.1 (CNAG_02122_NE1) | XP_571070.1 ( |
| 2 | CNAG_02773 | TAITIKPALAAQAGK | Chr 3: 801827 – 801913; Negative | GenBank: JZ152662.1 (CNAG_02773_NE2) | XP_003192989.1 ( |
| 3 | CNAG_03648 | VIFVDADQIVR | Chr 2: 438306 – 438539; Positive | GenBank: JZ152663.1 (CNAG_03648_NE3) | XP_777193.1 ( |
| 4 | CNAG_03920 | YYDLGMESR | Chr 2: 1190442 – 1190507; Positive | GenBank: JZ152664.1 (CNAG_03920_NE4) | XP_569234.1 ( |
| 5 | CNAG_06526 | TYPLALDLWDSGSSVIIR; IASFPLNFISR | Chr 7: 12832–13094; Positive | GenBank: JZ152665.1 (CNAG_06526_NE5) | XP_567229.1 ( |
| 6 | CNAG_06533 | SNGYYSPITYFLAK; LLFDIIPLR | Chr 7: 36868 – 37134; Positive | GenBank: JZ152666.1 (CNAG_06533_NE6) | XP_571629.1 ( |
| 7 | CNAG_02396 | FNGPVDFER | Chr 6: 435584 – 435658; Positive | GenBank: JZ152667.1 (CNAG_02396_NE7) | XP_570777.1 ( |
| 8 | CNAG_00853 | TERYPLTLGR | Chr 1: 2257775 – 2257895; Positive | - | XP_567069.1( |
| 9 | CNAG_07863 | AELEMLVQR | Chr 10: 1004163 – 1004219; Negative | - | XP_567270.1 ( |
| 10 | CNAG_01261 | VTGQSPEDEDWLVGETLDGSHAGGFPK | Chr 5: 763635 – 763722; Negative | - | XP_570166.1 ( |
| 11 | CNAG_02407 | GGASIFEMVR; GGASIFEmVR | Chr 6: 401719 – 401783; Negative | - | XP_775427.1 ( |
| 12 | CNAG_02292 | YDIDLENKR | Chr 6: 684574 – 684630; Positive | - | XP_775540.1 ( |
| 13 | CNAG_02230 | FIDPAESGAVIPILHVNGYK | Chr 6: 853119 – 853178; Negative | - | XP_003194389.1 ( |
| 14 | CNAG_02147 | LLLRPIWKPR | Chr 6: 1080395 – 1080490; Positive | - | XP_775114.1 ( |
| 15 | CNAG_03648 | ASSVVTAAYKPLDGEGIFAPAQSTR | Chr 2: 436700–436837; Positive | - | XP_568822.1 ( |
| 16 | CNAG_04955 | GAVLVNVGR | Chr 4: 73400–73459; Negative | - | XP_568183.1 ( |
| 17 | CNAG_05059 | GLLDKFGEDR | Chr 4: 323430–323477; Positive | - | XP_568096.1 ( |
Figure 5Identification of protein start site using N-terminally acetylated peptide. A) N-terminally acetylated peptide Ac-SLAScIFcK identified upstream to CNAG_03069 corrected the annotated protein start site. B) N-terminally acetylated peptide Ac- AQVVPCLDHPSSYR found overlapping exon and intron junction confirmed the annotated protein start site and extended the first exon of CNAG_05196.