| Literature DB >> 19812769 |
Pierre Marcel Durand1, Theresa Louise Coetzer.
Abstract
Apoptosis is the phenotypic result of an active, regulated process of self-destruction. Following various cellular insults, apoptosis has been demonstrated in numerous unicellular eukaryotes, but very little is known about the genes and proteins that initiate and execute this process in this group of organisms. A bioinformatic approach presents an array of powerful methods to direct investigators in the identification of the apoptosis machinery in protozoans. In this review, we discuss some of the available computational methods and illustrate how they may be applied using the identification of a Plasmodium falciparum metacaspase gene as an example.Entities:
Keywords: apoptosis; bioinformatics; programmed cell death; protozoa
Year: 2008 PMID: 19812769 PMCID: PMC2735952 DOI: 10.4137/bbi.s430
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Key concepts.
| The most typical phenotype of PCD, characterized by specific morphological and biochemical features | |
| An evolutionary process by which genetic diversity increases over time | |
| The degree of similarity one expects to find between two genes or proteins by chance alone | |
| A probability model in which a system is modeled using the Markov process to identify hidden parameters using observable data | |
| A system whereby any given state is conditionally independent of any prior states | |
| A homologous gene or protein that has arisen as a result of speciation | |
| A homologous gene or protein that has arisen as a result of gene duplication | |
| The inherent capacity of a cell to implement an active and regulated mechanism of self-destruction | |
| An evolutionary process by which genetic diversity decreases over time |
Figure 1Four representative stages of Programmed Cell Death.
The cellular machinery involved in apoptosis.
| Second and third generic stages of PCD | Molecular components of PCD | Examples | Domains/Conserved regions |
|---|---|---|---|
| Receptors | Toll-like receptors, TNF-like receptors, FAS, IL-1 | DD, TIR, FNIII, Ig fold | |
| Transcription factors | Cell cycle control proteins, MYC, tumor suppressors | DBD, Ig fold | |
| Other intracellular and | Receptor associated signaling | MATH, CART, DD, Ig | |
| nuclear proteins | molecules, cyclin, CDKs, NF-κB, kinases, TRAF | fold | |
| Effectors | Caspases and other cysteine proteases, RIP kinase, BAX, BCL-2 | Casp, DD, CARD, DED, kinase domain | |
| Regulators | AIPs, Ap-ATPases, NTPases, serine/threonine kinases | conserved motifs in Ap-ATPases, Zn finger |
Notes: The molecular components with examples of the second (initiation) and third (execution) stages of PCD are listed. The domains and conserved regions that may be useful for the bioinformatic identification of homologs of the apoptosis machinery in unicellular eukaryotes are indicated.
Abbreviations: AIPs: apoptosis inhibitory proteins; Ap-ATPases: apoptotic ATPase; BCL-2: B cell leukemia/lymphoma 2 gene; CARD: caspase activation and recruitment domain; CART: cysteine-rich motif associated with RING and TRAF; Casp: caspase catalytic domain; CDK: cyclin dependant kinase; DBD: DNA-binding domain; DD: death domain; DED: death-effector domain; FNIII: fibronectin domain; Ig: immunoglobulin; IL-1: interleukin 1; MATH: meprin and TRAF homology domain; MYC: myelocytomatosis viral oncogene homolog; NK-κB: nuclear factor-κB; TIR: toll-interleukin-receptor domain; TNF: tumor necrosis factor; TRAF: TNF receptor associated factor.
Computational methods for identifying and investigating homologous genes and proteins.
| Method | Description | Examples |
|---|---|---|
| Homology search | Profile-based algorithm to search for homologous sequences | Gapped BLAST, PSI-BLAST |
| Multiple sequence alignment (MSA) | Alignment of multiple sequences to maximize the homology between them | CLUSTAL, Muscle, PileUp, T-Coffee, MAFFT |
| Hidden Markov model | Probability distribution across all residues in a MSA to generate a pattern of conservation or signature profile | Markov Chain Monte Carlo |
| Phylogenetics | Determination of the evolutionary relationships between taxa | Distance-matrix, maximum parsimony, maximum likelihood, Bayesian inference |
| Phylogenomics | Whole genome comparisons for identifying protein-protein interactions based on various criteria | Phylogenetic profiling, mirror tree, gene neighborhood, gene fusion |
| Sequence evolution | Identification of patterns of positive and stabilizing selection across phylogenetic lineages and codon sites | Phylogenetic analysis by maximum likelihood |
| Structure analysis | 3D structure prediction | SWISS-MODEL |
Figure 2A computational approach to the identification and characterization of apoptosis homologs in unicellular organisms.
Figure 3The bioinformatic steps taken to identify and investigate a P. falciparum metacaspase gene (PF13_0289). The mirror tree, gene fusion and gene neighborhood methods for determining protein-protein interactions, and sequence evolution analysis were not performed.
Abbreviations: CARD: caspase activation and recruitment domain; HMM: hidden Markov model; MSA: multiple sequence alignment.
Useful websites for bioinformatics researchers.
| European bioinformatics institute ( | A comprehensive resource of all bioinformatic data, includes a variety of bioinformatic tools such as programs for MSAs (CLUSTAL, MAFFT etc) and structural analysis. |
| HMM homepage ( | Downloads of available HMM software, includes a user’s guide and theoretical information. |
| HOGEMON database ( | Database of homologous sequences from fully sequenced genomes. |
| InterPro ( | Integrates information from numerous other protein databases. |
| National center for biotechnology information ( | National resource of molecular biology information, includes bioinformatic tools, PubMed. |
| Plasmodium database ( | Database of |
| Pedant phylogenetic web profiler ( | Web-based service to perform phylogenetic profiling of proteins against genomes. |
| PLATCOM ( | Integrated system for comparison of multiple genomes. |
| Protein analysis through evolutionary relationships ( | Classification of genes by function, phylogenetic trees, HMMs and multiple sequences available. |
| Phylogeny resource ( | Comprehensive list of available phylogeny software, software downloads available. |
| ProDom ( | Comprehensive set of protein domain families. |
| Pfam ( | Collection of multiple sequence alignments and HMMs covering protein families. |
| ProSite ( | Database of protein families, domains and functional sites. |
| Protein data bank ( | Archive of macromolecular structural data. |
| Reference sequence collection ( | Collection of non-redundant sequences that have been expertly reviewed. |
| STRING ( | Database of known and predicted protein-protein interactions. |
| Structural classification of proteins database ( | Classification of proteins based on structural similarities. |
| Universal protein resource ( | Catalog of proteins, integrates information from Swiss-Prot (curated expert knowledge base), TrEMBL (non-curated computer annotated supplement to Swiss-Prot) and PIR (protein sequence database), links to protein analysis tools such as SWISS-MODEL. |