| Literature DB >> 17868481 |
Mikko Arvas1, Teemu Kivioja, Alex Mitchell, Markku Saloheimo, David Ussery, Merja Penttila, Stephen Oliver.
Abstract
BACKGROUND: Several dozen fungi encompassing traditional model organisms, industrial production organisms and human and plant pathogens have been sequenced recently and their particular genomic features analysed in detail. In addition comparative genomics has been used to analyse specific sub groups of fungi. Notably, analysis of the phylum Saccharomycotina has revealed major events of evolution such as the recent genome duplication and subsequent gene loss. However, little has been done to gain a comprehensive comparative view to the fungal kingdom. We have carried out a computational genome wide comparison of protein coding gene content of Saccharomycotina and Pezizomycotina, which include industrially important yeasts and filamentous fungi, respectively.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17868481 PMCID: PMC2045113 DOI: 10.1186/1471-2164-8-325
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Genome data
| Phylum | Subphylum | Name | Abbreviation | Count of ORFs | Source | WWW | Funcat | InterPro | Protfun |
| Ascomycota | Archeascomycota | Spom | 4984 | Sanger Institute, UK | YES | ||||
| Ascomycota | Pezizomycotina | Afum | 9926 | Tigr, USA | YES | ||||
| Ascomycota | Pezizomycotina | Anid | 9541 | Broad Institute, USA | YES | YES | |||
| Ascomycota | Pezizomycotina | Anig | 11200 | Joint Genome Institute, USA | |||||
| Ascomycota | Pezizomycotina | Aory | 12062 | NITE, Japan | YES | ||||
| Ascomycota | Pezizomycotina | Bcin | 16446 | Broad Institute, USA | |||||
| Ascomycota | Pezizomycotina | Cglo | 11124 | Broad Institute, USA | YES | ||||
| Ascomycota | Pezizomycotina | Cimm | 10457 | Broad Institute, USA | |||||
| Ascomycota | Pezizomycotina | Fgra | 11640 | Broad Institute, USA | YES | YES | YES | ||
| Ascomycota | Pezizomycotina | Mgri | 11109 | Broad Institute, USA | YES | YES | |||
| Ascomycota | Pezizomycotina | Nhae | 16237 | Joint Genome Institute, USA | |||||
| Ascomycota | Pezizomycotina | Ncra | 10082 | Broad Institute, USA | YES | YES | |||
| Ascomycota | Pezizomycotina | Sscl | 14522 | Broad Institute, USA | |||||
| Ascomycota | Pezizomycotina | Snod | 16597 | Broad Institute, USA | |||||
| Ascomycota | Pezizomycotina | Tree | 9997 | Joint Genome Institute, USA | YES | YES | |||
| Ascomycota | Saccharomycotina | Agos | 4726 | Ashbya Genome Database, Switzerland | YES | ||||
| Ascomycota | Saccharomycotina | Calb | 6165 | Stanford University, UK | YES | YES | |||
| Ascomycota | Saccharomycotina | Cgla | 5181 | Genolevures Consortium, France | YES | ||||
| Ascomycota | Saccharomycotina | Cgui | 5920 | Broad Institute, USA | |||||
| Ascomycota | Saccharomycotina | Clus | 5941 | Broad Institute, USA | |||||
| Ascomycota | Saccharomycotina | Dhan | 6318 | Genolevures Consortium, France | YES | ||||
| Ascomycota | Saccharomycotina | Klac | 5327 | Genolevures Consortium, France | YES | ||||
| Ascomycota | Saccharomycotina | Ppas | 5999 | Integrated genomics, USA | |||||
| Ascomycota | Saccharomycotina | Psti | 5839 | Joint Genome Institute, USA | |||||
| Ascomycota | Saccharomycotina | Scas | 4677 | Saccharomyces Genome Database, USA | |||||
| Ascomycota | Saccharomycotina | Scer | 5888 | Saccharomyces Genome Database, USA | YES | YES | YES | ||
| Ascomycota | Saccharomycotina | Sklu | 2968 | Saccharomyces Genome Database, USA | |||||
| Ascomycota | Saccharomycotina | Ylip | 6521 | Genolevures Consortium, France | YES | YES | |||
| Basidiomycota | Hymenomycetes | Ccin | 13544 | Broad Institute, USA | |||||
| Basidiomycota | Hymenomycetes | Cneo | 7302 | Broad Institute, USA | |||||
| Basidiomycota | Hymenomycetes | Pchr | 10048 | Joint Genome Institute, USA | YES | YES | |||
| Basidiomycota | Ustilaginomycetes | Umay | 6522 | Broad Institute, USA | YES | ||||
| Zygomycota | Zygomycetes | Rory | 17467 | Broad Institute, USA |
Phylum, subphylum, abbreviation used in figures, count of ORFs in the genome, data source and internet address and information, whether Funcat annotation existed and whether InterPro or Protfun analysis was carried out, shown for each species.
Figure 1Selection of the protein clustering parameter "inflation value" (. Average sensitivity and specificity percentages (left y-axis) and total count and count of orphan clusters i.e. clusters with only a single member ORF (right y-axis) for clusterings made with different inflation values (x-axis).
Figure 2Unrecognised by InterPro or clustering. Percentage of ORFs in orphan clusters (y-axis) versus the percentage of ORFs unrecognised by InterPro (x-axis) for species which were analysed with InterProScan. Species are coloured by phyla. Names of the species, plotted as abbreviations beside the data points, are explained in Table 1.
Figure 3Genomic ORF redundancy. Percentage of ORFs in clusters containing more than one ORF from the species in question i.e. genomic ORF redundancy (y-axis) versus the size of the genome in ORFs (x-axis) for each species. See Figure 2 for further details.
Figure 4Protein clustering overview. See Figure 5 for legend.
Figure 5Legend for figure 4. A heatmap of clusters with at least ten ORFs. In the main heatmap colour intensity of a cell shows the number of ORFs shown by clusters (rows) and by species (columns). Both rows and columns are ordered by hierarchical clustering to group similar rows or columns together. The dendrogram from hierarchical clustering is shown for columns and the phylum of species is indicated by a column colour bar between the heatmap and the dendrogram. Under the heatmap each species is specified by an abbreviation explained in Table 1. On the left side of the main heatmap a black and white side heatmap shows the percentage of ORFs in a cluster that have an InterPro entry of all cluster's ORFs analysed with InterProScan ("wIPR"), cluster's stability and cluster's Saccharomycotina to Pezizomycotina ratio in a clustering where inflation value (r) was 1.1 ("S/P r 1.1"). Stability reflects the ratio of cluster size between a clustering where r = 3.1 to that where r = 1.1. As Figure 1 shows, when r is set to its minimum value (1.1), TRIBE-MCL clustering produces a minimum amount of clusters and orphan clusters. In consequence the clusters are on average larger when r = 1.1. The ratio between cluster size r = 3.1 and r = 1.1 is shown as a percentage. "S/P r 1.1" reflects the ratio of count of Saccharomycotina ORFs to Pezizomycotina ORFs in a cluster when r = 1.1. By comparing "S/P r 1.1" to the species distribution of a cluster shown on the main heatmap one can see if a cluster retains the Saccharomycotina to Pezizomycotina ratio when r = 1.1. On the right side of the main heatmap, a side heatmap shows various functional classifications for the clusters. Whether or not the cluster has a Funcat classification ("Funcat") or has an ORF found in S. cerevisiae metabolic model iMH805 is shown ("iMH805"). Whether the proteins in the cluster are predicted by Protfun to have a signal sequence directing them into either mitochondrion or secretion pathway ("TargetP"), have transmembrane domains ("TMHMM") or are predicted to be enzymes is shown ("Enz."). Clusters belonging to regions A: "Pezizomycotina abundant", B: "Pezizomycotina specific", C: "Saccharomycotina absent" and D: "Saccharomycotina unique" are specified by a vertical bar between main and right heatmap.
Enriched cluster categories in region A: "Pezizomycotina abundant"
| Software or database | Database identifier | Category | Count of clusters in region | % of clusters in region | % of all clusters with category | P-value | Author assignment | Figure 13 |
| ProtFun | - | Enzyme/nonenzyme: Enzyme | 180 | 91.0 | 5 | 1.60E-11 | - | - |
| ProtFun | - | Enzyme class: Uncategorised | 147 | 74.0 | 6 | 1.60E-17 | - | - |
| ProtFun | - | Cellular role: Uncategorised | 143 | 72.0 | 4 | 2.10E-02 | - | - |
| Funcat | 01 | Metabolism | 81 | 41.0 | 6 | 8.00E-06 | Metabolism | - |
| ProtFun | - | TargetP: Uncategorised | 71 | 36.0 | 5 | 1.40E-04 | - | - |
| Funcat | 01.05 | C-compound and carbohydrate metabolism | 41 | 21.0 | 10 | 1.30E-07 | Plant biomass degradation | - |
| ProtFun | - | TargetP: Secretion | 39 | 20.0 | 9 | 1.30E-07 | - | - |
| ProtFun | - | Cellular role: Cell envelope | 30 | 15.0 | 11 | 1.20E-08 | - | - |
| Funcat | 01.05.01 | C-compound and carbohydrate utilization | 29 | 15.0 | 10 | 1.30E-05 | Plant biomass degradation | - |
| Funcat | 32 | Cell rescue, defence and virulence | 24 | 12.0 | 7 | 6.50E-03 | Virulence/Defence | - |
| Funcat | 01.05.01.01 | C-compound, carbohydrate catabolism | 22 | 11.0 | 22 | 4.40E-11 | Plant biomass degradation | - |
| ProtFun | - | TMHMM: Uncategorised | 20 | 10.0 | 7 | 3.70E-03 | - | - |
| Funcat | 01.20 | Secondary metabolism | 19 | 10.0 | 22 | 1.40E-09 | 2ary metabolism | - |
| Funcat | 32.05 | Disease, virulence and defence | 14 | 7.0 | 23 | 1.00E-07 | Virulence/Defence | - |
| Funcat | 01.05.01.01.02 | Polysaccharide degradation | 13 | 7.0 | 27 | 3.50E-08 | Plant biomass degradation | - |
| Funcat | 32.05.05 | Virulence, disease factors | 9 | 5.0 | 45 | 3.20E-08 | Virulence/Defence | - |
| Interpro | IPR001128 | Cytochrome P450 | 8 | 4.0 | 42 | 1.30E-07 | 2ary metabolism | YES |
| Interpro | IPR002347 | Glucose/ribitol dehydrogenase | 8 | 4.0 | 21 | 4.60E-05 | 2ary metabolism | YES |
| Funcat | 32.07 | Detoxification | 7 | 4.0 | 10 | 2.20E-02 | 2ary metabolism | - |
| Interpro | IPR002198 | Short-chain dehydrogenase/reductase SDR | 7 | 3.5 | 21 | 1.60E-04 | 2ary metabolism | YES |
| Interpro | IPR006094 | FAD linked oxidase, N-terminal | 5 | 2.5 | 56 | 6.30E-06 | 2ary metabolism | YES |
| Interpro | IPR003042 | Aromatic-ring hydroxylase | 5 | 2.5 | 36 | 8.60E-05 | 2ary metabolism | YES |
| Funcat | 01.25 | Extracellular metabolism | 4 | 2.0 | 36 | 7.30E-04 | Plant biomass degradation | - |
| Funcat | 02.16 | Fermentation | 4 | 2.0 | 20 | 8.00E-03 | Metabolism | - |
| Funcat | 01.05.01.01.01 | Sugar, glucoside, polyol and carboxylate catabolism | 4 | 2.0 | 19 | 9.50E-03 | Plant biomass degradation | - |
| Funcat | 30.05 | Transmembrane signal transduction | 4 | 2.0 | 12 | 4.50E-02 | Signalling | - |
| Interpro | IPR011050 | Pectin lyase fold/virulence factor | 4 | 2.0 | 27 | 1.60E-03 | Plant biomass degradation | NO |
| Interpro | IPR000873 | AMP-dependent synthetase and ligase | 4 | 2.0 | 25 | 2.10E-03 | 2ary metabolism | YES |
| Interpro | IPR008985 | Concanavalin A-like lectin/glucanase | 4 | 2.0 | 24 | 2.60E-03 | Plant biomass degradation | NO |
| Funcat | 32.10.07 | Degradation of foreign (exogenous) polysaccharides | 3 | 2.0 | 50 | 1.20E-03 | Plant biomass degradation | - |
| Funcat | 01.01.09.05 | Metabolism of tyrosine | 3 | 2.0 | 43 | 2.10E-03 | 2ary metabolism | - |
| Funcat | 01.05.01.01.09 | Aerobic aromate catabolism | 3 | 2.0 | 43 | 2.10E-03 | Metabolism | - |
| Funcat | 32.10 | Degradation of foreign (exogenous) compounds | 3 | 2.0 | 38 | 3.30E-03 | Plant biomass degradation | - |
| Funcat | 02.25 | Oxidation of fatty acids | 3 | 2.0 | 27 | 8.80E-03 | 2ary metabolism | - |
| Interpro | IPR010730 | Heterokaryon incompatibility | 3 | 1.5 | 75 | 1.80E-04 | Mating | YES |
| Interpro | IPR003661 | Histidine kinase A, N-terminal | 3 | 1.5 | 43 | 1.40E-03 | Signalling | NO |
| Interpro | IPR006163 | Phosphopantetheine-binding | 3 | 1.5 | 43 | 1.40E-03 | 2ary metabolism | YES |
| Interpro | IPR005467 | Histidine kinase | 3 | 1.5 | 38 | 2.20E-03 | Signalling | NO |
| Interpro | IPR001789 | Response regulator receiver | 3 | 1.5 | 33 | 3.20E-03 | Signalling | NO |
| Funcat | 30.05.01.10 | Two-component signal transduction system | 2 | 1.0 | 67 | 4.90E-03 | Signalling | - |
| Funcat | 32.05.05.01 | Toxins | 2 | 1.0 | 50 | 9.50E-03 | 2ary metabolism | - |
| Funcat | 01.01.09.05.02 | Degradation of tyrosine | 2 | 1.0 | 40 | 1.50E-02 | 2ary metabolism | - |
| Funcat | 01.20.37 | Biosynthesis of peptide derived compounds | 2 | 1.0 | 40 | 1.50E-02 | 2ary metabolism | - |
| Funcat | 34.11.11 | Rhythm (e.g. circadian, ultradian) | 2 | 1.0 | 40 | 1.50E-02 | Other | - |
| Funcat | 01.01.09.04 | Metabolism of phenylalanine | 2 | 1.0 | 33 | 2.30E-02 | 2ary metabolism | - |
| Funcat | 01.25.01 | Extracellular polysaccharide degradation | 2 | 1.0 | 33 | 2.30E-02 | Plant biomass degradation | - |
| Funcat | 16.05 | Polysaccharide binding | 2 | 1.0 | 33 | 2.30E-02 | Plant biomass degradation | - |
| Funcat | 16.21.05 | FAD/FMN binding | 2 | 1.0 | 33 | 2.30E-02 | 2ary metabolism | - |
| Funcat | 01.01.11.03.02 | Degradation of valine | 2 | 1.0 | 29 | 3.10E-02 | 2ary metabolism | - |
| Funcat | 30.05.01 | Receptor enzyme mediated signalling | 2 | 1.0 | 25 | 4.00E-02 | Signalling | - |
| Funcat | 01.20.35.01 | Biosynthesis of phenylpropanoids | 2 | 1.0 | 22 | 5.00E-02 | 2ary metabolism | - |
| Interpro | IPR000675 | Cutinase | 2 | 1.0 | 100 | 1.30E-03 | Plant biomass degradation | NO |
| Interpro | IPR001077 | O-methyltransferase, family 2 | 2 | 1.0 | 100 | 1.30E-03 | Regulation | NO |
| Interpro | IPR002227 | Tyrosinase | 2 | 1.0 | 100 | 1.30E-03 | 2ary metabolism | NO |
| Interpro | IPR008922 | Di-copper centre-containing | 2 | 1.0 | 100 | 1.30E-03 | Dubious | NO |
| Interpro | IPR012951 | Berberine/berberine-like | 2 | 1.0 | 100 | 1.30E-03 | 2ary metabolism | NO |
| Interpro | IPR000743 | Glycoside hydrolase, family 28 | 2 | 1.0 | 67 | 3.70E-03 | Plant biomass degradation | NO |
| Funcat | 01.05.01.01.11 | Anaerobic aromate catabolism | 1 | 1.0 | 100 | 4.10E-02 | Metabolism | - |
| Funcat | 01.20.01.09 | Biosynthesis of aminoglycoside antibiotics | 1 | 1.0 | 100 | 4.10E-02 | 2ary metabolism | - |
| Funcat | 01.20.23 | Biosynthesis of secondary products derived from L-methionine | 1 | 1.0 | 100 | 4.10E-02 | 2ary metabolism | - |
| Funcat | 01.20.36 | Non-ribosomal peptide synthesis | 1 | 1.0 | 100 | 4.10E-02 | 2ary metabolism | - |
| Funcat | 01.20.37.05 | Biosynthesis of beta-lactams | 1 | 1.0 | 100 | 4.10E-02 | 2ary metabolism | - |
| Funcat | 02.16.03.03 | Heterofermentative pathway and fermentation of other saccharides | 1 | 1.0 | 100 | 4.10E-02 | Metabolism | - |
| Funcat | 32.05.05.03 | Bacteriocins | 1 | 1.0 | 100 | 4.10E-02 | 2ary metabolism | - |
| Funcat | 32.07.09 | Detoxification by degradation | 1 | 1.0 | 100 | 4.10E-02 | 2ary metabolism | - |
| Funcat | 34.11.01.01 | Light environment response | 1 | 1.0 | 100 | 4.10E-02 | Signalling | - |
| Funcat | 36.20.18 | Plant hormonal regulation | 1 | 1.0 | 100 | 4.10E-02 | Signalling | - |
Software or database that the categories were derived from, category database identifier, category name, count of clusters with the category, percentage of all clusters in this region, percentage of all clusters with the category and significance of enrichment are shown for categories which are enriched in the region (Figure 4) with a p < 0.01. Column "Author assignment" shows an assignment to general themes, based on the respective database, that summarise the InterPro and Funcat categories. While the other assignments are directly based on the databases, "Secondary metabolism" covers entries which are known to participate also in secondary metabolism ([61, 62], Funcat and InterPro). For InterPro entries whether the entry is found from Figure 9 is specified in column "Figure 9". InterPro entries assigned to "Dubious" are entries that InterPro itself considers unreliable.
Enriched cluster categories in region B: "Pezizomycotina specific"
| Software or database | Database identifier | Category | Count of clusters in region | % of clusters in region | % of all clusters with category | P-value | Author assignment | Figure 9 |
| ProtFun | - | TMHMM: No transmembrane | 913 | 81.0 | 21 | 5.60E-03 | - | - |
| ProtFun | - | Enzyme/nonenzyme: Nonenzyme | 175 | 16.0 | 25 | 1.30E-03 | - | - |
| ProtFun | - | TargetP: Secretion | 128 | 11.0 | 28 | 1.90E-05 | - | - |
| ProtFun | - | Cellular role: Cell envelope | 93 | 8.0 | 35 | 9.50E-09 | - | - |
| ProtFun | - | Cellular role: Central intermediary metabolism | 84 | 7.0 | 26 | 1.20E-02 | - | - |
| Interpro | IPR001138 | Fungal transcriptional regulatory protein, N-terminal | 41 | 3.6 | 51 | 7.10E-10 | Transcription regulation | YES |
| ProtFun | - | Cellular role: Regulatory functions | 37 | 3.0 | 39 | 1.80E-05 | - | - |
| Interpro | IPR007219 | Fungal specific transcription factor | 25 | 2.2 | 37 | 1.30E-03 | Transcription regulation | YES |
| ProtFun | - | Cellular role: Purines and pyrimidines | 27 | 2.0 | 28 | 4.30E-02 | - | - |
| Interpro | IPR001810 | Cyclin-like F-box | 13 | 1.2 | 62 | 4.00E-05 | Macromolecule interaction | YES |
| Interpro | IPR000637 | HMG-I and HMG-Y, DNA-binding | 7 | 0.6 | 88 | 9.70E-05 | Chromatin | NO |
| Interpro | IPR001087 | Lipolytic enzyme, G-D-S-L | 6 | 0.5 | 75 | 1.40E-03 | Metabolism | NO |
| Interpro | IPR006710 | Glycoside hydrolase, family 43 | 6 | 0.5 | 75 | 1.40E-03 | Plant biomass degradation | NO |
| Interpro | IPR005302 | MOCO sulphurase C-terminal | 3 | 0.3 | 100 | 8.50E-03 | Metabolism | NO |
| Interpro | IPR006209 | EGF-like | 3 | 0.3 | 100 | 8.50E-03 | Extracellular | NO |
| Funcat | 36.20.35 | Response to environmental stimuli | 3 | 0.0 | 75 | 4.20E-02 | Signalling | - |
See Table 2 for legend
Enriched cluster categories in region C: " Saccharomycotina absent"
| Software or database | Database identifier | Category | Count of clusters in region | % of clusters in region | % of all clusters with category | P-value | Author assignment | Figure 9 |
| ProtFun | - | TMHMM: No transmembrane | 825 | 82.0 | 19 | 8.60E-04 | - | - |
| ProtFun | - | Enzyme/nonenzyme: Enzyme | 747 | 74.0 | 19 | 4.50E-02 | - | - |
| ProtFun | - | Cellular role: Uncategorised | 722 | 72.0 | 20 | 2.20E-06 | - | - |
| ProtFun | - | TargetP: Mitochondrion | 81 | 8.0 | 22 | 4.80E-02 | - | - |
| ProtFun | - | Cellular role: Replication and transcription | 9 | 1.0 | 38 | 2.10E-02 | - | - |
| Funcat | 01.01.11.04.02 | Degradation of leucine | 5 | 1.0 | 63 | 1.30E-02 | 2ary metabolism | - |
| Funcat | 01.01.11.04 | metabolism of leucine | 5 | 1.0 | 50 | 3.90E-02 | 2ary metabolism | - |
| Interpro | IPR001623 | Heat shock protein DnaJ, N-terminal | 9 | 0.9 | 43 | 7.90E-03 | Macromolecule interaction | NO |
| Interpro | IPR000195 | RabGAP/TBC | 6 | 0.6 | 55 | 7.30E-03 | Secretion | NO |
| Interpro | IPR000683 | Oxidoreductase, N-terminal | 6 | 0.6 | 55 | 7.30E-03 | Metabolism | NO |
| Interpro | IPR006091 | Acyl-CoA dehydrogenase, central region | 4 | 0.4 | 100 | 1.10E-03 | Metabolism | NO |
| Interpro | IPR006092 | Acyl-CoA dehydrogenase, N-terminal | 4 | 0.4 | 100 | 1.10E-03 | Metabolism | NO |
| Funcat | 01.01.11.02.02 | Degradation of isoleucine | 4 | 0.0 | 67 | 2.00E-02 | 2ary metabolism | - |
| Funcat | 01.01.11.03.02 | Degradation of valine | 4 | 0.0 | 57 | 3.90E-02 | 2ary metabolism | - |
| Funcat | 01.01.03.01.02 | Degradation of glutamine | 2 | 0.0 | 100 | 4.40E-02 | 2ary metabolism | - |
See Table 2 for legend
Enriched cluster categories in region C: "Saccharomycotina unique"
| Software or database | Database identifier | Category | Count of clusters in region | % of clusters in region | % of all clusters with category | P-value | Author assignment | Figure 9 |
| ProtFun | - | TMHMM: No transmembrane | 380 | 82.0 | 9 | 2.90E-02 | - | - |
| ProtFun | - | TargetP: No splice site | 306 | 66.0 | 10 | 1.30E-04 | - | - |
| Funcat | 11 | Transcription | 105 | 23.0 | 11 | 2.60E-02 | Transcription | - |
| ProtFun | - | Enzyme/nonenzyme: Nonenzyme | 95 | 20.0 | 13 | 7.80E-07 | - | - |
| ProtFun | - | Cellular role: Translation | 87 | 19.0 | 23 | 6.20E-20 | - | - |
| Funcat | 11.02 | RNA synthesis | 70 | 15.0 | 12 | 4.70E-02 | Transcription | - |
| ProtFun | - | Enzyme class: Ligase (EC 6.-.-.-) | 68 | 15.0 | 15 | 2.40E-06 | - | - |
| ProtFun | - | TargetP: Mitochondrion | 54 | 12.0 | 14 | 4.20E-05 | - | - |
| ProtFun | - | Cellular role: Central intermediary metabolism | 43 | 9.0 | 13 | 2.00E-03 | - | - |
| ProtFun | - | Cellular role: Amino acid biosynthesis | 37 | 8.0 | 35 | 6.80E-15 | - | - |
| Funcat | 42.16 | Biogenesis of cellular compartment: mitochondrion | 34 | 7.0 | 21 | 5.30E-06 | Mitochondrion | - |
| Funcat | 12.01.01 | Ribosomal proteins | 27 | 6.0 | 14 | 3.80E-02 | Protein translation | - |
| ProtFun | - | Cellular role: Energy metabolism | 20 | 4.0 | 17 | 1.80E-03 | - | - |
| Funcat | 70.16 | Subcellular localisation: mitochondrion | 18 | 4.0 | 16 | 2.40E-02 | Mitochondrion | - |
| Funcat | 11.04.01 | rRNA processing | 17 | 4.0 | 15 | 3.90E-02 | Transcription | - |
| ProtFun | - | Cellular role: Regulatory functions | 15 | 3.0 | 16 | 1.10E-02 | - | - |
| Interpro | IPR011046 | WD40-like | 14 | 3.0 | 18 | 5.70E-03 | Dubious | YES |
| ProtFun | - | Enzyme class: Isomerase (EC 5. -. -.-) | 14 | 3.0 | 45 | 5.30E-08 | - | - |
| Funcat | 11.02.01 | rRNA synthesis | 11 | 2.0 | 19 | 2.40E-02 | Transcription | - |
| ProtFun | - | Cellular role: Biosynthesis of cofactors | 7 | 2.0 | 37 | 6.00E-04 | - | - |
| ProtFun | - | Cellular role: Replication and transcription | 7 | 2.0 | 29 | 2.80E-03 | - | - |
| Funcat | 18.02.09 | Regulator of transcription factor | 5 | 1.0 | 31 | 1.50E-02 | Transcription regulation | - |
| Interpro | IPR006596 | Nucleotide binding protein, PINc | 3 | 0.6 | 60 | 5.20E-03 | Nucleotide binding | NO |
| Interpro | IPR000992 | Stress-induced protein SRP1/TIP1 | 2 | 0.4 | 100 | 7.10E-03 | Stress response | NO |
| Interpro | IPR007757 | MT-A70 | 2 | 0.4 | 100 | 7.10E-03 | Nucleotide binding | NO |
See Table 2 for legend
Figure 6The KEGG metabolic pathway "Valine, leucine and isoleucine degradation". Enzymes found in S. cerevisiae according to SGD and KEGG are filled with green and in M. grisea according to KEGG circled in orange. Enzymes found in clusters corresponding to Funcat categories 01.01.11.02 -01.01.11.04 enriched in the region C: "Saccharomycotina absent" in Table 3 are filled with pink.
Figure 7PCA of counts of ORFs with an InterPro entry. Positions of species analysed with InterProScan on the two PCs that explain the largest amount of variation in the counts of ORFs with an InterPro entry. Species abbreviations are explained in Table 1 and data points are coloured by phyla.
Figure 8PCA loadings of InterPro entries and InterPro entry structures. PCA loadings of InterPro entries (a) or InterPro entry structures (b) of the two PCs that explain the largest amount of variation in the counts of ORFs with an InterPro entry. The PCA for InterPro entries (a) is shown in picture 11, for InterPro entry structures (b) data not shown. 100 InterPro entries or InterPro entry structures having the most extreme PCA loadings on the two PCs shown are coloured with orange (TOP 100), while the rest are 4373 InterPro entries and 16319 InterPro structures are in blue. InterPro entries identifiers are shown for 20 most extreme PCA loadings.
Figure 9TOP 100 InterPro entries. See Figure 10 for legend.
Figure 10Legend for figure 9. A heatmap of counts of ORFs with an InterPro entry for the TOP 100 entries from Figure 8a. In the main heatmap colour intensity of a cell shows the number of ORFs with an InterPro entry shown by entries (rows) and by species (columns). Both rows and columns are ordered by hierarchical clustering to group similar rows or columns together. Columns were clustered with counts of ORFs while rows were clustered with the entry PCA loadings (Left side heatmap and Figure 8a). The dendrogram from hierarchical clustering is shown for columns and the phylum of species is indicated by a column colour bar between the heatmap and the dendrogram. Under the heatmap each species is specified by an abbreviation explained in Table 1. Left side heatmap shows the loading of the entry as in Figure 8a. Interpro entry identifier ("IPR id.", "IPR0" removed from beginning), name ("IPR name") and "Author assignment" are shown for each entry. The "Author assignment" is an assignment to general themes that summarise the individual categories based on the InterPro database. While the other assignments are directly based on the InterPro, "Secondary metabolism" covers entries which are known to participate also in secondary metabolism ([61, 62] and InterPro). InterPro entries assigned to "Dubious" are entries that InterPro itself considers unreliable.
Figure 11Schema and screenshots of the browsable fungal comparative genomics database. The schema on the upper right corner shows links between different browser views to the database. Additionally two example screenshots are shown. See text for further details.