| Literature DB >> 20964831 |
Aminael Sánchez-Rodríguez1, Cindy Martens, Kristof Engelen, Yves Van de Peer, Kathleen Marchal.
Abstract
BACKGROUND: Previous studies in Ascomycetes have shown that the function of gene families of which the size is considerably larger in extant pathogens than in non-pathogens could be related to pathogenicity traits. However, by only comparing gene inventories in extant species, no insights can be gained into the evolutionary process that gave rise to these larger family sizes in pathogens. Moreover, most studies which consider gene families in extant species only tend to explain observed differences in gene family sizes by gains rather than by losses, hereby largely underestimating the impact of gene loss during genome evolution.Entities:
Mesh:
Year: 2010 PMID: 20964831 PMCID: PMC3087541 DOI: 10.1186/1471-2148-10-318
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Evolutionary history in eight Ascomycete species according to DOLLOP and CAFE. The species tree was derived by NJ (Poisson correction, all nodes 100% BS-supported). Branch lengths are drawn to scale. Branches of the tree corresponding to pathogenic species are shaded in yellow while those corresponding to non-pathogenic species are shaded in green. At each time point (grey circles), the number of gains and losses is indicated as inferred by DOLLOP (black) and CAFE (red). Expansions and contractions of already existing gene families inferred by CAFE are not indicated as these cannot be inferred by DOLLOP.
Figure 2Clustering of gene families according to their phylogenetic profiles. Clusters are indicated with roman numbers for A: non-pathogens and B: pathogens. Numbers on the left hand side indicate the number of gene families contained within each cluster (i.e. families with the same normalized phylogenetic profiles). The white-to-black scale, based on the gene family normalized phylogenetic profiles, indicates the size of the families in a cluster per species relative to the family-average (columns). Gene ontology terms of the overrepresented gene families in each cluster are shown at the right. AN: Aspergillus nidulans; AO: Aspergillus oryzae; NC: Neurospora crassa; TR: Trichoderma reesei; MG: Magnaporthe grisea; FG: Fusarium graminearum; MF: Mycosphaerella fijiensis; MYG: Mycosphaerella graminicola.
Classification of gene families according to the evolutionary history that explains their origin
| Species | Description | Evolutionary events | Class | |||||
|---|---|---|---|---|---|---|---|---|
| Clustering | DOLLOP | CAFE | ||||||
| ancestors | extant | ancestors | extant | |||||
| Transmembrane transporter | X | X | X | X | X | gain; TP15; enlarged family size; significant expansion | C1; C3 | |
| Mannosidase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Peptidase activity | X | X | X | gain; enlarged family size | C1 | |||
| Transferase activity | X | X | gain; enlarged family size | C1 | ||||
| Chitin binding | X | X | X | TP15; enlarged family size | C3 | |||
| Lipid metabolism | X | X | X | X | gain; TP15 | C1; C3 | ||
| Methyltransferase activity | X | X | X | TP15; significant expansion | C2 | |||
| N-acetyl transferase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Oxidoreductase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Polyssacharide binding | X | X | gain | C1 | ||||
| Chitinase activity | X | X | gain | C1 | ||||
| Gluthatione peroxidase | X | X | gain | C1 | ||||
| Thioredoxin peroxidase | X | X | gain | C1 | ||||
| Astacin activity | X | X | TP15 | C3 | ||||
| G-coupled protein receptor | X | X | TP15 | C3 | ||||
| Tartrate dehydrogenase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Transmembrane transporters | X | X | loss | C3 | ||||
| Lipid metabolism | X | X | loss | C3 | ||||
| Polyketide biosynthetic pathway | X | X | gain | |||||
| Nuclease activity | X | X | X | gain; enlarged family size | ||||
| Chitin binding | X | X | loss | C3 | ||||
| Astacin activity | X | X | loss | C3 | ||||
| Fatty-acid synthase activity | X | X | X | gain; enlarged family size | ||||
| N-acetyl transferase activity | X | X | loss | C3 | ||||
| Mannosidase activity | X | X | loss | C3 | ||||
| G-coupled protein receptor | X | X | loss | C3 | ||||
| Oxidoreductase activity | X | X | loss | C3 | ||||
| Tartrate dehydrogenase activity | X | X | loss | C3 | ||||
| Transmembrane transporters | X | X | loss | C3 | ||||
| Lipid metabolism | X | X | loss | C3 | ||||
| Chitin binding | X | X | loss | C3 | ||||
| Astacin activity | X | X | loss | C3 | ||||
| N-acetyl transferase activity | X | X | loss | C3 | ||||
| Oxidoreductase activity | X | X | loss | C3 | ||||
| Mannosidase activity | X | X | loss | C3 | ||||
| G-coupled protein receptor | X | X | loss | C3 | ||||
| Tartrate dehydrogenase activity | X | X | loss | C3 | ||||
| Mannosidase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Peptidase activity | X | X | X | gain; enlarged family size | C1 | |||
| G-coupled protein receptor | X | X | X | X | TP15; enlarged family size | C3 | ||
| Chitin binding | X | X | X | TP15; enlarged family size | C3 | |||
| Lipid metabolism | X | X | X | TP15; enlarged family size | C3 | |||
| Transmembrane transporter | X | X | TP15 | C3 | ||||
| Subtilase activity | X | X | gain | C1 | ||||
| Cutinase activity | X | X | gain | C1 | ||||
| Polysaccharide binding | X | X | X | gain; enlarged family size | C1 | |||
| Chitinase activity | X | X | X | gain; enlarged family size | C1 | |||
| Gluthatione peroxidase | X | X | gain | C1 | ||||
| Thioredoxin peroxidase | X | X | gain | C1 | ||||
| Ribonuclease H | X | X | X | TP15; significant expansion | C2 | |||
| RNA-directed DNA polymerase | X | X | X | TP15; significant expansion | C2 | |||
| Astacin activity | X | X | TP15 | C3 | ||||
| Polyketide biosynthetic pathway | X | X | X | X | gain; TP15; significant expansion | C1, C2 | ||
| Antimicrobial peptide production | X | X | gain | |||||
| Transcription factor activity | X | X | X | gain | ||||
| Light sensing | X | X | gain | |||||
| Chitin binding | X | X | loss | C3 | ||||
| Astacin activity | X | X | loss | C3 | ||||
| Lipid metabolism | X | X | loss | C3 | ||||
| Transmembrane transporter | X | X | loss | C3 | ||||
| N-acetyl transferase activity | X | X | loss | C3 | ||||
| Mannosidase activity | X | X | loss | C3 | ||||
| G-coupled protein receptor | X | X | loss | C3 | ||||
| Oxidoreductase activity | X | X | loss | C3 | ||||
| Tartrate dehydrogenase activity | X | X | loss | C3 | ||||
| X- Pro dipeptidyl-peptidase activity | X | X | X | gain; enlarged family size | ||||
| Acid-ammonia ligase activity | X | X | X | gain; enlarged family size | ||||
| Nuclease activity | X | X | X | gain; enlarged family size | ||||
| Fatty-acid synthase activity | X | X | X | gain; enlarged family size | ||||
| Monooxygenase activity | X | X | gain | |||||
| Chitin binding | X | X | loss | C3 | ||||
| Astacin activity | X | X | loss | C3 | ||||
| Lipid metabolism | X | X | loss | C3 | ||||
| Transmembrane transporter | X | X | loss | C3 | ||||
| Positive regulation of cell growth | X | X | loss | |||||
| N-acetyl transferase activity | X | X | loss | C3 | ||||
| Mannosidase activity | X | X | loss | C3 | ||||
| G-coupled protein receptor | X | X | loss | C3 | ||||
| Oxidoreductase activity | X | X | loss | C3 | ||||
| Tartrate dehydrogenase activity | X | X | loss | C3 | ||||
| Transmembrane transporters | X | X | X | X | TP15, enlarged family size; significant expansion | C3 | ||
| Lipid metabolism | X | X | TP15 | C3 | ||||
| Mannosidase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Peptidase activity | X | X | X | gain; enlarged family size | C1 | |||
| Transferase activity | X | X | X | gain; enlarged family size | C1 | |||
| G-coupled protein receptor | X | X | X | TP15; enlarged family size | C3 | |||
| Chitin binding | X | X | X | TP15; enlarged family size | C3 | |||
| Astacin activity | X | X | X | X | gain; TP15 | C1; C3 | ||
| Proteolytic activity | X | X | gain | C1 | ||||
| Phospholipase A2 activity | X | X | X | gain; enlarged family size | C1 | |||
| Oxidoreductase activity | X | X | X | TP15; enlarged family size | C3 | |||
| N-acetyl transferase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Tartrate dehydrogenase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Nitrogen catabolism | X | X | loss | C4 | ||||
| Response to nitrogen starvation | X | X | loss | C4 | ||||
| Sensing nitrogen levels | X | X | loss | C4 | ||||
| Transmembrane transporter | X | X | X | TP15; enlarged family size | C3 | |||
| Lipid metabolism | X | X | TP15 | C3 | ||||
| Mannosidase activity | X | X | X | TP15; enlarged family size | C3 | |||
| Transferase activity | X | X | X | gain; enlarged family size | C1 | |||
| G-coupled protein receptor | X | X | X | X | TP15; enlarged family size | C3 | ||
| Metalloexopeptidase activity | X | X | X | gain; enlarged family size | C1 | |||
| Proteolytic activity | X | X | gain | C1 | ||||
| Astacin activity | X | X | X | X | gain; TP15; significant expansion | C1; C3 | ||
| Glutathione peroxidase | X | X | gain | C1 | ||||
| Thioredoxin peroxidase | X | X | gain | C1 | ||||
| Chitin binding | X | X | X | TP15; significant expansion | C3 | |||
| Nitrogen catabolism | X | X | loss | C4 | ||||
| Sensing nitrogen levels | X | X | loss | C4 | ||||
| Response to nitrogen starvation | X | X | loss | C4 | ||||
Only families for which the evolutionary history was consistently reconstructed by DOLLOP and CAFE are displayed. Description: GO description of the families. Clustering: indicates whether an above the family-average gene copy number was observed in at least one species according to the cluster analysis. Evolutionary events in non-pathogens were not categorized. Family size contractions due to gene loss events that do not imply the entire loss of the gene family were not included in the table but covered in the 'Dating gene duplication and loss events' section. Evolutionary events: 'gains' and 'losses' as inferred by DOLOP and CAFE; 'significant expansion as inferred by CAFE; 'enlarged gene family size' as inferred by clustering i.e. size of the family in the indicated species is larger than the family average. An 'X' indicates whether the event occurred in an ancestral node or in an extant species. Class: Gene families were classified according to the evolutionary event that explains their origin. The four classes (C1-C4) correspond to those shown in Figure 3 and described in the main text.
Remark 1: Non-pathogen-specific gains were never observed.
Remark 2: Although theoretically CAFE should be able to discover significant expansions or contractions of gene families in ancestral species, these were never observed. All cases of significant expansions and contractions were observed in the extant species.
Remark 3: Note that the class distinction was made based on the gene content of the families and not based on their functional annotation. This explains why families belonging to the same GO category can belong to different classes (e.g. some astacin families that were present ancestrally got lost in a non-pathogen-specific way: those belong to C3. Other astacin related families were selectively gained in some pathogens. Those belong to C1.
Figure 3Evolutionary events explaining gene copy number variations between pathogens and non-pathogens. Numbers at each node represent the presence ('1') or absence ('0') of members of a hypothetical gene family in each lineage descending from an ancestral species. A) Pathogen-specific gene family gains (Class 1). This event explains the origin of gene families that are not present in an ancestral species common to both pathogenic and non-pathogenic lineages, but appear in one extant pathogen or in an ancestral node that gives rise to exclusively pathogenic lineages. B) Pathogen-specific gene family expansions (Class 2): gene families that are already present in the last common ancestor of the extant species become expanded in a lineage specific way (denoted by a star (*)). C) Non-pathogen-specific losses (Class 3). This event results in gene families that were either present in the last common ancestor of all species or in an ancestor common to both pathogenic and non-pathogenic lineages to become lost selectively in non-pathogenic lineages. Loss of a gene family refers to selective loss of all its members in the considered species or ancestral node. In our study, non-pathogen-specific gene family loss was only observed in extant species. Note that in the pathogenic lineages where members of the considered families were not lost, sometimes a further expansion of the families was observed as denoted by a star (*). D) Pathogen-specific gene family losses (Class 4): gene families either already present in the last common ancestor of all species or in an ancestral species common to both pathogenic and non-pathogenic lineages got lost selectively in pathogenic lineages. In our study, this pathogen-specific gene family loss was also only observed in extant species.
Overview of single gene insertional mutants in Ascomycete species
| Species | description | Gene ID | Phenotype |
|---|---|---|---|
| RNA-dependent DNA replication | reduced virulence; conidiation defective | ||
| reduced virulence; conidiation defective | |||
| reduced virulence; conidiation defective | |||
| Ribonuclease H | reduced virulence; appressorium formation defective | ||
| appressorium formation defective | |||
| Polyketide biosynthetic process | effector (avirulence determinant) | ||
| Transmembrane transporters | Reduced virulence | ||
| Unaffected pathogenicity | |||
| Polyketide biosynthetic process | Unaffected pathogenicity | ||
| Unaffected pathogenicity | |||
| Unaffected pathogenicity | |||
| Unaffected pathogenicity | |||
| Unaffected pathogenicity | |||
| Unaffected pathogenicity | |||
| Unaffected pathogenicity | |||
| Involved in transmembrane transport | Reduced virulence |
Experimental data was collected from two sources: (1) Pathogen-Host Interaction database (PHI-base) [40] and (2) the work of Jeon et al. [41]. Phenotype: based on mutant deviations in conidiation and appressorium formation compared to untransformed control. Effector: gene product or signaling molecule resulting from its enzymatic activity that triggers defensive mechanisms in the plant host. Hence, effectors are usually known as avirulence determinants.
Figure 4Origin of the observed variation in gene copy number between Ascomycete species as explained by tree reconciliation. Panels A and B describe two hypothetical scenarios of how the size of a gene family increased in the extant species: A: enlarged gene family size due to a specific expansion in an extant species (species 1). B: enlarged gene family size in extant species 1 due to an expansion (duplication) in an ancestral node ('D') before the speciation event between species 1 and 2. Extant species 2 lost one of the copies ('L' in a red circle leading to a gray faded branch) C: Tree representing the phylogenetic relations between the studied species. Branches of the tree corresponding to pathogenic species are shaded in yellow. Pie charts indicate, for each of the extant species at the leave of a branch, how many of its gene families with increased size originated through scenario A and B. For instance, for a pathogen, scenario A means that there was a species-specific expansion in the pathogenic branch, while scenario B indicates the gene family in the studied species enlarged because of an ancestral duplication followed by a loss in the non-pathogen, closest to the studied pathogen (see species tree). The color level used in the pie charts (following the blue-to-red scale in below) is an indication for the fraction of enlarged gene families in the studied species, explained by either scenario A or B versus the total number of families included in the phylogenetic analysis. This shows that scenario B occurs much more frequently in absolute numbers in pathogens than in non pathogens.