| Literature DB >> 21501500 |
Christian P Kubicek1, Alfredo Herrera-Estrella, Verena Seidl-Seiboth, Diego A Martinez, Irina S Druzhinina, Michael Thon, Susanne Zeilinger, Sergio Casas-Flores, Benjamin A Horwitz, Prasun K Mukherjee, Mala Mukherjee, László Kredics, Luis D Alcaraz, Andrea Aerts, Zsuzsanna Antal, Lea Atanasova, Mayte G Cervantes-Badillo, Jean Challacombe, Olga Chertkov, Kevin McCluskey, Fanny Coulpier, Nandan Deshpande, Hans von Döhren, Daniel J Ebbole, Edgardo U Esquivel-Naranjo, Erzsébet Fekete, Michel Flipphi, Fabian Glaser, Elida Y Gómez-Rodríguez, Sabine Gruber, Cliff Han, Bernard Henrissat, Rosa Hermosa, Miguel Hernández-Oñate, Levente Karaffa, Idit Kosti, Stéphane Le Crom, Erika Lindquist, Susan Lucas, Mette Lübeck, Peter S Lübeck, Antoine Margeot, Benjamin Metz, Monica Misra, Helena Nevalainen, Markus Omann, Nicolle Packer, Giancarlo Perrone, Edith E Uresti-Rivera, Asaf Salamov, Monika Schmoll, Bernhard Seiboth, Harris Shapiro, Serenella Sukno, Juan Antonio Tamayo-Ramos, Doris Tisch, Aric Wiest, Heather H Wilkinson, Michael Zhang, Pedro M Coutinho, Charles M Kenerley, Enrique Monte, Scott E Baker, Igor V Grigoriev.
Abstract
BACKGROUND: Mycoparasitism, a lifestyle where one fungus is parasitic on another fungus, has special relevance when the prey is a plant pathogen, providing a strategy for biological control of pests for plant protection. Probably, the most studied biocontrol agents are species of the genus Hypocrea/Trichoderma.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21501500 PMCID: PMC3218866 DOI: 10.1186/gb-2011-12-4-r40
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Genome assembly and annotation statistics
|
|
|
| |
|---|---|---|---|
| Genome size, Mbp | 36.1 | 38.8 | 34.1 |
| Coverage | 8.26× | 8.05× | 9.00× |
| Assembly gaps, Mbp | 0.1 (0.16%) | 0.2(0.4%) | 0.05 (0.1%) |
| Number of scaffolds | 50 | 135 | 89 |
| Number of predicted genes | 11865 | 12518 | 9143 |
| Gene length, bp | 1747.06 | 1710.05 | 1793,25 |
| Protein length, amino acids | 471.54 | 478.69 | 492,27 |
| Exons per gene | 2,93 | 2,98 | 3,06 |
| Exon length, bp | 528.17 | 506.13 | 507,81 |
| Intron length, bp | 104.20 | 104.95 | 119,64 |
| Supported by homology, NR | 10,219 (92%) | 10,915 (94%) | 8409 (92%) |
| Supported by homology, Swissprot | 8,367(75%) | 8,773 (75%) | 6763 (74%) |
| Has PFAM domain | 5,883 (53%) | 6,267 (54%) | 5096 (56%) |
NR, non-redundant database; PFAM, protein families.
Figure 1Distribution of orthologues of . The Venn diagram shows the distribution found for the three species of Trichoderma.
Occurrence of orthologues, paralogues and singletons in the genomes of the three Trichoderma spp
| Genome | Synteny | Total genes | Non-orthologs | ||
|---|---|---|---|---|---|
| Syntenic | 9,350 | 7,326 | 2,024 | 2.2e-16 | |
| Non-syntenic | 2,515 | 1,265 | 1,250 | ||
| Syntenic | 9,828 | 7,326 | 2,502 | 2.2e-16 | |
| Non-syntenic | 2,690 | 1,532 | 1,158 | ||
| Syntenic | 8,776 | 7,326 | 1,450 | 2.2e-16 | |
| Non-syntenic | 367 | 153 | 214 |
aOrthologs that are in all three genomes. bNull hypothesis that the proportion of non-orthologs that are syntenic is less than the proportion of non-orthologs that are non-syntenic. P-value: null hypothesis that the proportion of paralogs that are syntenic is less than the proportion of paralogs that are non-syntenic.
The major classes of transposable elements found in the Trichoderma genomes
|
|
|
| ||||
|---|---|---|---|---|---|---|
| Class | Copy number | Total length (bp) | Copy number | Total length (bp) | Copy number | Total length (bp) |
| DNA | 372 | 39,899 | 446 | 50,448 | 370 | 52,358 |
| LTR | 533 | 64,534 | 559 | 76,482 | 541 | 67,484 |
| Helitrons | 40 | 9,235 | 45 | 9,962 | 34 | 8,547 |
| LINE | 561 | 65,202 | 530 | 54,928 | 349 | 59,414 |
| Totala | 178,870 (0.49%) | 191,820 (0.57%) | 187,803 (0.48%) | |||
aTotal in base pairs and percentage of genome of transposable elements found in the genomes. LINE, long interspersed nuclear element; LTR, long terminal repeat.
Repeat-induced point mutation ratios for four of the most abundant transposable element families in the three Trichoderma species
| Sequence | TA/AT ratio | CT+AT/AC+GT ratio | |
|---|---|---|---|
| 0.70 | 1.35 | ||
| LTR Copia | 0.42 | 1.50 | |
| LTR Gypsy | 0.97 | 1.21 | |
| LINE R1 | 1.86 | 1.67 | |
| LINE Tad1 | 0.82 | 1.32 | |
| 0.71 | 1.28 | ||
| LTR Copia | 1.04 | 1.31 | |
| LTR Gypsy | 1.01 | 1.28 | |
| LINE R1 | 0.99 | 2.40 | |
| LINE Tad1 | 0.33 | 1.30 | |
| 0.71 | 1.33 | ||
| LTR Copia | 0.77 | 1.48 | |
| LTR Gypsy | 0.95 | 1.16 | |
| LINE R1 | 0.75 | 2.14 | |
| LINE Tad1 | 1.33 | 0.99 | * |
aThe asterisk indicates the family Tad1 from T. virens in which the RIP ratios fall within values that are typically associated with RIP. LINE, long interspersed nuclear element; LTR, long terminal repeat; RIP, repeat-induced point mutation; TE, transposable element.
Presence of genes in Trichoderma known to be required in N. crassa for repeat-induced point mutation
| Functiona | |||||
|---|---|---|---|---|---|
|
|
|
| |||
| RID | XP_959047.1 | Putative DMT, essential for RIP and for MIP | |||
| Dim-5 | XP_957479.2 | Histone 3-K9 HMT essential for RIP; RdRP | 152017 | 55211 | 515216 |
| QDE-1 | XP_959047.1 | RdRP, essential for quelling | 361 | 64774 | 67742 |
| QDE-2 | XP_960365.2 | Argonaute-like protein, essential for quelling | 79413 | 20883 | 49832 |
| QDE-3 | XP_964030.2 | RecQ helicase, essential for quelling | 91316 | 30057 | 102458 |
| DCL1 | XP_961898.1 | Dicer-like protein, involved in quelling | 20162 | 20212 | 69494 |
| DCL2 | XP_963538.2 | Dicer-like protein, involved in quelling | 318 | 47151 | 79823 |
| QIP | CAP68960.1 | Putative exonuclease protein, involved in quelling | 14588 | 41043 | 57424 |
| SAD-1 | XP_964248.2 | RdRP essential for MSUD | 465 | 28428 | 103470 |
| SAD-2 | XP_961084.1 | Essential for MSUD | No | No | No |
aN. crassa gene information and abbreviations taken from [36]. DMT, cytosine DNA methyltransferase; HMT, histone methyltransferase; MIP, methylation induced premeotically; MSUD, meiotic silencing of unpaired DNA; RdRP, RNA-dependent RNA-polymerase.
Major paralogous gene expansions in Trichoderma
| PFAM domain |
|
|
| |
|---|---|---|---|---|
| 19 | 4 | |||
| 20 | 5,1 | |||
| Peptidase S8 subtilisin cluster 1-4 | 10 | 9,6 | ||
| 13 | 3,4 | |||
| 20 | 32 | 34 | 4,7 | |
| 12 | 5 | |||
| 10 | 5,8 | |||
| 9 | 6 | |||
| 10 | 17 | 19 | 8 | |
| 11 | 24 | 18 | 10,8 | |
| 6 | 2 | |||
| 2 | 4,7 | |||
| Unknown major facilitator subfamily (PF07690) domain | 9 | 15 | 15 | 5,5 |
| F-box domain protein | 7 | 10 | 11 | 1,7 |
| Ankyrin domain protein with protein kinase domain | 6 | 8 | 11 | 2,7 |
| Amidase | 4 | 11 | 11 | 2,8 |
| Epoxide hydrolase (PF06441) plus AB hydrolase_1 (PF00561) | 5 | 14 | 11 | 3,2 |
| FAD_binding_4, plus HET and berberine bridge enzymes (08031) domain | 5 | 13 | 11 | 6,1 |
| FMN oxidoreductases | 2 | 8 | 10 | 2,5 |
| Unknown protein with DUF84 (NTPase) and NmrA domain | 5 | 19 | 10 | 3,7 |
| Protein with GST_N and GST_C domains | 6 | 12 | 10 | 4,6 |
| 6 | 8 | 9 | 1,1 | |
| 6 | 7 | 9 | 1,2 | |
| 2 | 0,2 | |||
| 5 | 9 | 8 | 1,3 | |
| 5 | 11 | 8 | 2,2 | |
| 1 | 1,4 | |||
| GFO_IDH_MocA (01408 and 02894) oxidoreductase | 3 | 9 | 7 | 1,5 |
| 4 | 6 | 6 | 0,7 | |
| 3 | 4 | 5 | 1 | |
| 3 | 4 | 5 | 1 | |
| 3 | 5 | 5 | 1,1 | |
| 3 | 5 | 5 | 1,3 | |
| Nitrilase | 3 | 6 | 5 | 2,2 |
| 4 | 4 | 4 | 0,8 | |
| AAA-family ATPase (PF00004) | 4 | 3 | 4 | 1 |
| Pyridoxal phosphate dependent decarboxylase (00282) | 2 | 3 | 4 | 1,2 |
| Unknown protein | 3 | 4 | 4 | 1,3 |
aResults are from MCL analysis of the three Trichoderma species (Tr, Ta, Tv) and mean values from ten other ascomycetes whose genomes are present in the JGI database [63]. Eurotiomycetes: Aspergillus carbonarius, Aspergillus niger. Sordariomycetes: Thielavia terrerstris, Chaetomium globosum, Cryphonectria parasitica, Neurospora discreta, Neurospora tetrasperma. Dothidiomycetes: Mycospherella graminicola, Mycospherella fijiensis, Cochliobolus heterostrophus. The number of genes present in the "other fungi" is averaged. Data were selected from a total of 28,919 clusters, average cluster number 5.8 (standard deviation 15.73). PFAM categories printed in bold specify those that are significantly (P < 0.05) expanded in all three Trichoderma species; numbers in bold and italics specify genes that are significantly more abundant in Ta and Tv versus Tr (P < 0.05). GH, glycosyl hydrolase family; GST, glutahionine-S transferase; SSCP, small secreted cystein-rich protein.
Glycosyl hydrolase families involved in chitin/chitosan and β-1,3 glucan hydrolysis that are expanded in mycoparasitic Trichoderma species
| Glycosyl hydrolase family | ||||||||
|---|---|---|---|---|---|---|---|---|
| Taxonomy | GH18 | GH75 | GH17 | GH55 | GH64 | GH81 | 217 | |
| S | 5 | 2 | 18 | |||||
| S | 4 | 1 | 18 | |||||
| S | 4 | 2 | 15 | |||||
| | S | 2 | 6 | 5 | 2 | 1 | 14 | |
| | S | 19 | 1 | 6 | 3 | 2 | 1 | 12 |
| | S | 12 | 1 | 4 | 6 | 2 | 1 | 13 |
| | S | 20 | 1 | 4 | 7 | 1 | 1 | 13 |
| | S | 14 | 1 | 7 | 3 | 1 | 2 | 13 |
| | E | 19 | 2 | 5 | 6 | 0 | 1 | 12 |
| | E | 14 | 2 | 5 | 3 | 0 | 1 | 9 |
| | E | 9 | 1 | 5 | 3 | 2 | 1 | 11 |
| | P | 5 | 1 | 4 | 2 | 0 | 9 | |
| | SM | 2 | 0 | 4 | 0 | 0 | 2 | 6 |
| | SS | 1 | 0 | 1 | 0 | 0 | 1 | 2 |
| | A | 11 | 0 | 2 | 2 | 0 | 0 | 4 |
| | A | 10 | 0 | 4 | 2 | 0 | 0 | 6 |
| | A | 20 | 0 | 4 | 6 | 0 | 0 | 10 |
aMain substrates for the respective enzyme families. bNumber of all enzymes that can act on ß-glucan as a substrate. Taxonomy abbreviations: S, Sordariomycetes; E, Eurotiomycetes; P, Pezizomycetes; S, Saccharomycetes; SS, Schizosaccharomycetes; A, Agaricomycetes. The bold numbers indicate glycosyl hydrolase (GH) families that have a statistically significant expansion in Trichoderma (P < 0.05) or Ta and Tv (GH18). This support was obtained only when N. haematococca and T. melanosporum were not included in the analysis of GH18 and GH81, respectively.
The number of polyketide synthases and non-ribosomal peptide synthetases of Trichoderma compared to other fungi
| Fungal species | PKS | NRPS | PKS-NRPS | Total |
|---|---|---|---|---|
| 26 | 14 | 4 | 44 | |
| 26 | 13 | 1 | 40 | |
| 23 | 11 | 2 | 36 | |
| 20 | 6 | 8 | 34 | |
| 14 | 19 | 1 | 34 | |
| 12 | 16 | 3 | 31 | |
| 17 | 10 | 2 | 29 | |
| 13 | 13 | 1 | 27 | |
| 12 | 12 | 1 | 25 | |
| 7 | 3 | 0 | 10 |
Number of PFAM domains that are enriched among paralogous genes in non-syntenic areas
|
|
|
| |
|---|---|---|---|
| Zn2Cys6 transcription factors | 9 | 69 | |
| WD40 domains | 1 | 11 | 14 |
| Sugar transporters | 18 | 13 | |
| Proteases | 2 | 28 | 23 |
| Cytochrome P450 | 7 | 15 | |
| NmrA-domains | 2 | 19 | 21 |
| Major facilitator superfamily | 7 | 52 | 60 |
| HET domains | 3 | 26 | 27 |
| Glycoside hydrolases | 3 | 33 | 26 |
| FAD-binding proteins | 2 | 28 | 24 |
| Ankyrins | 4 | 44 | 37 |
| Alcohol dehydrogenases | 51 | 71 | |
| α/ß-fold hydrolases | 2 | 26 | 15 |
| ABC transporters | 4 | 14 | 3 |
| Number of genes | 50 | 485 | 418 |
| Total gene number in NS areas | 92 | 686 | 1012 |
Boxed numbers are those that are significantly (p < 0.05) different from the two other species when related to the genome size. PFAM, protein family; NS, non-syntenic; HET, heteroincompatibility.
Figure 2Mycoparasitism is an ancient life style of . (a) Position of Ta, Tv and Tr within the genus Hypocrea/Trichoderma. The positions of Tr, Tv and Ta are 4, 29 and 97, respectively - shown in bold), and a few hallmark species are given by their names. For the identities of the other species, see the gene accession numbers (Materials and methods). (b) Bayesian phylogram based on the analysis of amino acid sequences of 100 orthologous syntenic proteins (MCMC, 1 million generations, 10,449 characters) in Tr, Tv, Ta, Gibberella zeae and Chaetomium globosum. Circles above nodes indicate 100% posterior probabilities and significant bootstrap coefficients. The numbers in the boxes between (a) and (b) indicate the genome sizes and gene counts and percentage net gain regarding Ta. Photoplates show the mycoparasitic reaction after the contact between Trichoderma species and Rhizoctonia solani. Trichoderma species are always on the left side; dashed lines indicate the position of Trichoderma overgrowth of R. solani.