| Literature DB >> 29145801 |
Raúl Castanera1, Gúmer Pérez1, Leticia López-Varas1, Joëlle Amselem2, Kurt LaButti3, Vasanth Singan3, Anna Lipzen3, Sajeet Haridas3, Kerrie Barry3, Igor V Grigoriev3, Antonio G Pisabarro1, Lucía Ramírez4.
Abstract
BACKGROUND: Coniophora olivacea is a basidiomycete fungus belonging to the order Boletales that produces brown-rot decay on dead wood of conifers. The Boletales order comprises a diverse group of species including saprotrophs and ectomycorrhizal fungi that show important differences in genome size.Entities:
Keywords: Annotation; Basidiomycete; Boletales; Brown-rot; Genome; Retrotransposon; Transposable elements
Mesh:
Substances:
Year: 2017 PMID: 29145801 PMCID: PMC5689174 DOI: 10.1186/s12864-017-4243-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of C. olivacea genome assembly and annotation
| Feature |
|
|
|---|---|---|
| Genome assembly size (Mb) | 39.07 | 42.97 |
| Sequencing coverage depth | 137.7× | 49.5× |
| Number of scaffolds | 863 | 210 |
| Scaffold N50 a | 80 | 7 |
| Scaffold L50 (Mb) b | 0.14 | 2.40 |
| N° scaffold gaps | 127 | 412 |
| Genome assembly gaps (%) | 0.24 | 2.57 |
| Assembly completeness (%) | 99.78 | Unknown |
| Repeat content (%) c | 2.91 | 4.68 |
| GC content (%) | 52.82 | 52.4 |
| Number of genes | 14,928 | 13,761 |
| Gene density (genes/Mb) | 382.07 | 320.26 |
| Predicted secreted proteins | 470 (3.1%) | 504 (3.7%) |
a N50 indicates the number of scaffolds that account for 50% of the total assembled sequence
b L50 indicates that 50% of the total sequence is assembled in scaffolds larger than this size
c Includes TE, simple repeats and low complexity regions
Fig. 1Maximum-likelihood phylogeny of 17 agaricomycetes inferred from 1677 genes. Branch labels indicate the results of 100 bootstraps
Fig. 2a Synteny dot plot showing a fraction of the whole-genome alignment between C. puteana and C. olivacea. Every grid line in the y-axes represents the end of one scaffold and the beginning of the next. Forward matches are displayed in red, while reverse matches are displayed in blue. b Histogram of similarity of the 39,506 aligned regions. c Venn diagram summarizing the amount of genes shared by the two genomes based on reciprocal best hit (RBH) and tBLASTN is shown in panel C
GO terms significantly enriched in the predicted secretome of C. olivacea
| Molecular Function | Description | GO/Secretome | GO/Genome |
|
|---|---|---|---|---|
| GO:0030600 | Feruloyl esterase activity | 6/470 | 9/14,928 | 0.000171 |
| GO:0042500 | Aspartic endopeptidase activity, intra membrane cleaving | 11/470 | 20/14,928 | 0.000192 |
| GO:0008843 | Endochitinase activity | 8/470 | 14/14,928 | 0.000194 |
| GO:0004568 | Chitinase activity | 8/470 | 14/14,928 | 0.000194 |
| GO:0004650 | Polygalacturonase activity | 11/470 | 15/14,928 | 0.000354 |
| GO:0004806 | Triglyceridelipase activity | 11/470 | 29/14,928 | 0.000376 |
| GO:0016160 | Amylase activity | 25/470 | 40/14,928 | 0.000737 |
| GO:0008933 | Lytic transglycosylase activity | 25/470 | 40/14,928 | 0.000737 |
| GO:0015927 | Trehalase activity | 25/470 | 40/14,928 | 0.000737 |
| GO:0015925 | Galactosidase activity | 25/470 | 40/14,928 | 0.000737 |
| GO:0015924 | Mannosyl-oligosaccharide mannosidase activity | 25/470 | 40/14,928 | 0.000737 |
| GO:0015929 | Hexosaminidase activity | 25/470 | 40/14,928 | 0.000737 |
| GO:0015928 | Fucosidase activity | 25/470 | 40/14,928 | 0.000737 |
| GO:0008810 | Cellulase activity | 9/470 | 11/14,928 | 0.00089 |
| GO:0015926 | Glucosidase activity | 25/470 | 41/14,928 | 0.000948 |
| GO:0015923 | Mannosidase activity | 25/470 | 41/14,928 | 0.000948 |
| GO:0004620 | Phospholipase activity | 9/470 | 32/14,928 | 0.000968 |
| GO:0004553 | Hydrolase activity hydrolyzing O-glycosyl compounds | 44/470 | 99/14,928 | 0.00105 |
| GO:0004194 | Obsolete pepsin A activity | 17/470 | 42/14,928 | 0.00121 |
| GO:0005199 | Structural constituent of cell wall | 16/470 | 33/14,928 | 0.00129 |
| GO:0030246 | Carbohydrate binding | 9/470 | 25/14,928 | 0.00143 |
| GO:0004190 | Aspartic-type endopeptidaseactivity | 20/470 | 44/14,928 | 0.00193 |
| GO:0004099 | Chitin deacetylase activity | 5/470 | 9/14,928 | 0.00803 |
| GO:0004185 | Serine-type carboxypeptidase activity | 5/470 | 12/14,928 | 0.0467 |
| Biological Process | ||||
| GO:0000272 | Polysaccharide catabolic process | 5/470 | 6/14,928 | 0.000414 |
| GO:0006508 | Proteolysis | 43/470 | 189/14,928 | 0.00128 |
| GO:0005975 | Carbohydrate metabolic process | 65/470 | 161/14,928 | 0.00176 |
| GO:0006629 | Lipid metabolic process | 10/470 | 50/14,928 | 0.00674 |
| Cellular Component | ||||
| GO:0005576 | Extracellular region | 7/470 | 15/14,928 | 0.000354 |
| GO:0005618 | Cell wall | 18/470 | 35/14,928 | 0.00224 |
a Bonferroni corrected, Fisher p-value
Size and functional annotation of C. olivacea predicted gene families targeted to the secretory pathway
| Gene family | SignalP | SECRETOOL | Functional annotation |
|---|---|---|---|
| Cluster_1 | 59 | 3 | Cytochrome P450 |
| Cluster_2 | 33 | 0 | Unknown |
| Cluster_3 | 32 | 17 | Hydrophobin |
| Cluster_4 | 19 | 11 | Aspartic peptidase |
| Cluster_5 | 18 | 12 | Carboxylesterase |
| Cluster_6 | 17 | 0 | Unknown |
| Cluster_7 | 15 | 0 | Unknown |
| Cluster_8 | 14 | 12 | Peptidase G1 |
| Cluster_9 | 14 | 9 | RlpA-likelipoprotein |
| Cluster_10 | 13 | 0 | Pheromone mating factor, STE3 |
| Cluster_11 | 13 | 0 | Unknown |
| Cluster_12 | 12 | 3 | Peptidase S8/S53 |
| Cluster_13 | 11 | 9 | Unknown |
| Cluster_14 | 10 | 0 | CAZy:GH18 |
| Cluster_15 | 10 | 9 | Cytochrome P450 |
| Cluster_16 | 10 | 6 | Unknown/lytic polysaccharide monooxygenase (LPMO/ CAZy:AA9) |
| Cluster_17 | 9 | 5 | Asparticpeptidase |
| Cluster_18 | 9 | 5 | CAZy:CE4 Carbohydrate Esterase Family 4 |
| Cluster_19 | 9 | 0 | CAZy:GH16 |
| Cluster_20 | 9 | 2 | Peptidase S10 |
| Cluster_21 | 9 | 5 | Sugar transporter |
| Cluster_22 | 9 | 4 | Unknown/putative lipoprotein |
| Cluster_23 | 8 | 6 | Fungal lipase |
| Cluster_24 | 8 | 0 | Isoprenylcysteinecarboxylmethyltransferase |
| Cluster_25 | 8 | 0 | Monooxygenase, FAD-binding |
| Cluster_26 | 7 | 7 | Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family |
| Cluster_27 | 7 | 0 | Unknown |
| Cluster_28 | 7 | 1 | Unknown |
| Cluster_29 | 6 | 5 | CAZy:GH128 |
| Cluster_30 | 6 | 0 | CAZy:GH28 |
| Cluster_31 | 6 | 3 | CAZy:GH3 |
| Cluster_32 | 6 | 2 | Peptidase M28 |
| Cluster_33 | 6 | 6 | Thaumatin |
| Cluster_34 | 6 | 2 | Unknown |
| Cluster_35 | 6 | 6 | Unknown |
| Cluster_36 | 5 | 1 | Aspartic peptidase |
| Cluster_37 | 5 | 2 | CAZy:AA1_1 |
| Cluster_38 | 5 | 4 | CAZy:AA5_1 |
| Cluster_39 | 5 | 5 | CAZy:AA9 |
| Cluster_40 | 5 | 1 | CAZy:CBM5 |
| Cluster_41 | 5 | 4 | CAZy:GH12 |
| Cluster_42 | 5 | 5 | CAZy:GH30_3 |
| Cluster_43 | 5 | 0 | CAZy:GH47 |
| Cluster_44 | 5 | 2 | CAZy:GH71 |
| Cluster_45 | 5 | 0 | Monooxygenase |
| Cluster_46 | 5 | 2 | Unknown |
| Cluster_47 | 5 | 0 | Unknown |
| Cluster_48 | 5 | 5 | Unknown/xylanase inhibitor tl-xi |
| Cluster_49 | 5 | 4 | Unknown |
| Cluster_50 | 5 | 4 | Unknown |
| Cluster_51 | 5 | 4 | Unknown |
| Cluster_52 | 5 | 0 | Unknown |
| Cluster_53 | 5 | 4 | Unknown |
| Cluster_54 | 5 | 0 | Unknown |
| Cluster_55 | 5 | 0 | Unknown |
| Cluster_56 | 4 | 0 | CAZy:GH18, CAZy:CBM5 |
| Cluster_57 | 4 | 0 | CAZy:GH31 |
| Cluster_58 | 4 | 3 | CAZy:GH55 |
| Cluster_59 | 4 | 4 | Flavin monooxygenase-like |
| Cluster_60 | 4 | 3 | GOLD |
| Cluster_61 | 4 | 2 | Histidine phosphatase superfamily, clade-2 |
| Cluster_62 | 4 | 3 | Lysophospholipase |
| Cluster_63 | 4 | 1 | Peptidase S28 |
| Cluster_64 | 4 | 0 | Proteolipid membrane potential modulator |
| Cluster_65 | 4 | 3 | RlpA-like, ceratoplatanin |
| Cluster_66 | 4 | 1 | Thioredoxin-like fold |
| Cluster_67 | 4 | 3 | Unknown |
| Cluster_68 | 4 | 3 | Unknown |
| Cluster_69 | 4 | 3 | Unknown |
| Cluster_70 | 4 | 0 | Unknown |
Protein IDs of each cluster are shown in Additional file 5: Table S3
Fig. 3TE content and genome size in five Boletales species. TE content is shown as a histogram, and genome size as a green line in panel A. Panel B shows a histogram representing the number of TE families found in each species
Summary of TE content in four Boletales genome assemblies
|
|
|
|
|
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (43) families | (108) families | (87) families | (230) families | (432) families | |||||||||||
| Copies | Full copies | Coverage (%) | Copies | Full copies | Coverage (%) | Copies | Full copies | Coverage (%) | Copies | Full copies | Coverage (%) | Copies | Full copies | Coverage (%) | |
| Class I | |||||||||||||||
| LINE | 30 | 4 | 0.03 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 317 | 41 | 0.80 |
| LINE (unknown) | 29 | 5 | 0.02 | 11 | 3 | 0.01 | 23 | 3 | 0.02 | 0 | 0 | 0.00 | 14 | 1 | 0.01 |
| SINE | 0 | 0 | 0.00 | 6 | 2 | 0.00 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 9 | 1 | 0.00 |
| LTR/Copia | 36 | 7 | 0.09 | 441 | 27 | 0.83 | 267 | 10 | 0.72 | 3,773 | 86 | 6.04 | 1,617 | 101 | 2.43 |
| LTR/Gypsy | 394 | 13 | 0.93 | 299 | 28 | 0.54 | 767 | 42 | 2.13 | 6,949 | 268 | 16.27 | 8,434 | 575 | 19.28 |
| LTR/LARD | 0 | 0 | 0.00 | 60 | 8 | 0.08 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 361 | 2 | 0.53 |
| LTR/TRIM | 15 | 4 | 0.02 | 136 | 4 | 0.08 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 576 | 93 | 0.20 |
| DIRS | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 361 | 36 | 0.58 |
| Penelope | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 69 | 11 | 0.15 | 0 | 0 | 0.00 |
| Class II | |||||||||||||||
| Helitron | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 21 | 4 | 0.04 | 260 | 25 | 0.43 | 1,386 | 38 | 2.01 |
| TIR/DDE | 361 | 28 | 0.52 | 362 | 38 | 0.68 | 1,089 | 55 | 2.16 | 2,148 | 166 | 3.04 | 3,366 | 255 | 4.25 |
| TIR (unknown) | 143 | 34 | 0.18 | 720 | 115 | 1.10 | 323 | 49 | 0.36 | 736 | 67 | 1.55 | 1,115 | 40 | 1.85 |
| MITE | 410 | 85 | 0.30 | 702 | 264 | 0.56 | 121 | 43 | 0.12 | 539 | 98 | 0.62 | 1,102 | 227 | 0.59 |
| Maverick (putative) | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 56 | 3 | 0.21 |
| Unknown | 67 | 4 | 0.07 | 99 | 9 | 0.06 | 865 | 133 | 0.99 | 1,138 | 167 | 1.34 | 8,611 | 708 | 8.44 |
| TOTAL | 1,485 | 184 | 2.15 | 2,836 | 498 | 3.94 | 3,476 | 339 | 6.54 | 15,612 | 888 | 29.45 | 27,325 | 2,121 | 41.17 |
Fig. 4Abundance and structure of a Chromoviridae LTR-retrotransposon family of C. olivacea. The upper panel shows the mapping of the annotated genome copies of this family onto their consensus sequence. The lower panel shows a scheme of the structural and functional domains of this family: long terminal repeats (LTRs) are represented as blue rectangles; the internal domains shown are (from left to right): aspartate protease, reverse transcriptase, RNase, integrase, chromatin organization modifier
Fig. 5Maximum likelihood phylogeny of the Gypsy reverse-transcriptases found in the C. olivacea, C. puteana, S. lacrymans, H. pinastri and P. tinctorius (blue) genomes. SH (Shimodaira-Hasegawa) local support values are shown in branches. The reverse-transcriptase from Oryza sativa ATLANTIS-I family consensus (Repbase) was used as outgroup
Fig. 6Estimated insertion age of the LTR-retrotransposons found in C. olivacea, C. puteana, S. lacrymans, H. pinastri and P. tinctorius. MYA = million years ago