| Literature DB >> 32066798 |
Habibu Aliyu1, Olga Gorte2, Pieter de Maayer3, Anke Neumann2, Katrin Ochsenreither4.
Abstract
Trichosporonaceae incorporates six genera of physiologically and ecologically diverse fungi including both human pathogenic taxa as well as yeasts of biotechnological interest, especially those oleagenic taxa that accumulate large amounts of single cell oils (SCOs). Here, we have undertaken comparative genomic analysis of thirty-three members of the family with a view to gain insight into the molecular determinants underlying their lifestyles and niche specializations. Phylogenomic analysis revealed potential misidentification of three strains which could impact subsequent analyses. Evaluation of the predicted proteins coding sequences showed that the free-living members of the family harbour greater numbers of carbohydrate active enzymes (CAZYmes), metallo- and serine peptidases compared to their host-associated counterparts. Phylogenies of selected lipid biosynthetic enzymes encoded in the genomes of the studied strains revealed disparate evolutionary histories for some proteins inconsistent with the core genome phylogeny. However, the documented oleagenic members distinctly cluster based on the constitution of the upstream regulatory regions of genes encoding acetyl-CoA carboxylase (ACC), ATP-citrate synthase (ACS) and isocitrate dehydrogenase [NADP] (ICDH), which are among the major proteins in the lipid biosynthetic pathway of these yeasts, suggesting a possible pattern in the regulation of these genes.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32066798 PMCID: PMC7026411 DOI: 10.1038/s41598-020-59672-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Genome features of thirty-three Trichosporonaceae and two outgroup species included in the analysis.
| Strains | Accession | BioProject | Isolation source/locality | Sequencing technology | Assembly Size (bp) | # Scaffolds | N50 (bp) | % G + C | # Genes | # tRNA | # Proteins | Unique Proteins | Prots atleast 1 orthologue | Comp* | Dup¥ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BCJI00000000 | PRJDB3695 | Cabbage purchased in a market/Tokyo, Japan | HiSeq. 2500 | 23,647,732 | 16 | 2,410,059 | 56.47 | 8,334 | 422 | 7,912 | 1,204 | 6,708 | 91.3 | 0.2 | |
| BCFW00000000 | PRJDB3573 | Rotten wooden sideboard (pneumonitis patient), Japan | HiSeq. 2500 | 24,510,922 | 28 | 3,306,260 | 58.52 | 8,536 | 379 | 8,157 | 582 | 7,575 | 91.1 | 0.6 | |
| BCJN00000000 | PRJDB3703 | Moist humus around roots/Colombia | HiSeq. 2500 | 24,609,388 | 29 | 3,284,570 | 61.14 | 8,608 | 385 | 8,223 | 1,628 | 6,595 | 96.3 | 1.2 | |
| BCJO00000000 | PRJDB3704 | Sour milk/Germany | HiSeq. 2500 | 24,114,851 | 17 | 4,554,691 | 59.23 | 9,556 | 918 | 8,638 | 1,677 | 6,961 | 90.2 | 0.5 | |
| BCKV00000000 | PRJDB3730 | Soil | HiSeq. 2500 | 30,616,633 | 26 | 2,819,049 | 59.63 | 11,708 | 1,530 | 10,178 | 1,886 | 8,292 | 92.8 | 0.5 | |
| BCFV00000000 | PRJDB3572 | Water purification tank/Montevideo, Uruguay | HiSeq. 2500 | 24,872,216 | 61 | 1,972,470 | 58.17 | 8,550 | 375 | 8,175 | 599 | 7,576 | 91.9 | 0.4 | |
| RSCE00000000 | PRJNA506377 | Grassland /Germany | PacBio | 25,479,456 | 32 | 1,376,709 | 59.15 | 9,729 | 576 | 9,153 | 555 | 8,598 | 96.7 | 0.9 | |
| BCJG00000000 | PRJDB3693 | Exudate of Taxus baccata/Hamburg, Germany | HiSeq. 2500 | 25,989,348 | 37 | 3,276,928 | 59.05 | 9,816 | 546 | 9,270 | 662 | 8,608 | 97.2 | 0.9 | |
| BCKJ00000000 | PRJDB3717 | Buffalo dung/Minturno, Campania, Italy | HiSeq. 2500 | 31,617,680 | 35 | 2,551,142 | 59.59 | 10,127 | 1,019 | 9,108 | 1,778 | 7,330 | 92.2 | 0.7 | |
| LTAL00000000 | PRJNA313001 | Hangzhou oil refinery/China | Illumina MiSeq | 30,717,177 | 21 | 5,629,136 | 57.13 | 9,586 | 878 | 8,708 | 1,386 | 7,322 | 94.1 | 0.4 | |
| PQXP00000000 | PRJNA428315 | Ryzosphere/Las Yungas rainforest, Argentina | Illumina MiSeq | 30,042,643 | 1067 | 53,521 | 60.69 | 11,492 | 1,292 | 10,200 | 1,746 | 8,454 | 90.5 | 4.3 | |
| BEDW00000000 | PRJDB5900 | Urine of chronic renal failure patient/Hokkaido, Japan | Illumina MiSeq | 19,894,493 | 28 | 1,599,305 | 60.59 | 7,697 | 143 | 7,554 | 1,357 | 6,197 | 88.7 | 0.5 | |
| BCJH00000000 | PRJDB3694 | Sputum of tubercular patient/The Netherlands | HiSeq. 2500 | 18,637,344 | 75 | 571,590 | 57.89 | 7,137 | 151 | 6,986 | 293 | 6,693 | 88.4 | 0.3 | |
| LDEP00000000 | PRJNA281029 | Sputum of tubercular patient/Netherlands | Illumina GAIIx | 16,443,618 | 354 | 62,631 | 59.41 | 6,628 | 151 | 6,477 | 246 | 6,231 | 80.8 | 0.6 | |
| LRUG00000000 | PRJNA310294 | Soil/Nanchang, Jiangxi, China | Illumina HiSeq | 38,696,417 | 592 | 116,762 | 60.30 | 14,851 | 448 | 14,403 | 1,004 | 13,399 | 94.1 | ||
| BCKU00000000 | PRJDB3729 | Probably human clinical specimen | HiSeq. 2500 | 23,155,501 | 98 | 744,446 | 61.97 | 9,635 | 399 | 9,236 | 1,756 | 7,480 | 88.3 | 0.4 | |
| BEDZ00000000 | PRJDB5903 | Cyanide contaminated soil /Sasolburg, South Africa | Illumina MiSeq | 19,941,766 | 90 | 582,111 | 58.01 | 7,086 | 150 | 6,936 | 993 | 5,943 | 87.5 | 0.2 | |
| BEDX00000000 | PRJDB5901 | Skin/Germany | Illumina MiSeq | 17,225,847 | 12 | 2,186,752 | 60.98 | 8,162 | 199 | 7,963 | 1,697 | 6,266 | 85.2 | 0.5 | |
| BCKR00000000 | PRJDB3725 | Human infected skin/Tübingen, Germany | HiSeq. 2500 | 23,337,637 | 37 | 2,690,106 | 60.05 | 8,945 | 297 | 8,648 | 511 | 8,137 | 92.1 | 0.3 | |
| BCJT00000000 | PRJDB3710 | Meningitis case/Belgium | HiSeq. 2500 | 40,783,511 | 84 | 1,668,497 | 60.14 | 15,434 | 507 | 14,927 | 1,016 | 13,911 | 94.4 | ||
| MATS00000000 | PRJNA327102 | Dairy plant | PacBio | 19,908,169 | 16 | 2,509,747 | 60.59 | 8,302 | 266 | 8,036 | 169 | 7,867 | 89.6 | 0.4 | |
| JZUH00000000 | PRJNA239490 | - | Illumina | 19,835,558 | 180 | 216,041 | 60.74 | 8,267 | 267 | 8,000 | 197 | 7,803 | 89.1 | 0.5 | |
| BCLT00000000 | PRJDB3696 | Human, Japan | HiSeq. 2500 | 24,687,929 | 36 | 2,256,092 | 59.45 | 9,439 | 502 | 8,937 | 210 | 8,727 | 94.4 | 1.1 | |
| ALBS00000000 | PRJNA164647 | Trichosporia cutis proriasiformis progressiva, Japan | 454; Illumina | 24,540,311 | 78 | 1,660,894 | 59.01 | 9,364 | 497 | 8,867 | 287 | 8,580 | 93.2 | 0.5 | |
| AMBO00000000 | PRJNA172216 | Maize cobs§ Illinois Peoria USA | 454; Illumina | 25,299,608 | 194 | 3,223,897 | 58.91 | 9,816 | 518 | 9,298 | 810 | 8,488 | 93.7 | 0.7 | |
| JXYL00000000 | PRJDB3697 | Head lesion caused by beesting/Turrialba, Costa Rica | Illumina HiSeq | 42,353,277 | 190 | 1,468,092 | 59.60 | 15,969 | 908 | 15,061 | 2,104 | 12,957 | 95.3 | ||
| JXYK00000000 | PRJDB3698 | Human feces | Illumina HiSeq | 24,653,913 | 32 | 3,676,950 | 60.21 | 9,494 | 471 | 9,023 | 1,233 | 7,790 | 93.6 | 0.7 | |
| JXYM00000000 | PRJDB3701 | Human | Illumina HiSeq | 20,339,538 | 18 | 2,739,924 | 62.69 | 7,700 | 318 | 7,382 | 654 | 6,728 | 89.3 | 0.2 | |
| JXYN00000000 | PRJDB3702 | White piedra infected scalp | Illumina HiSeq | 40,322,879 | 116 | 2,824,449 | 60.22 | 15,163 | 584 | 14,579 | 2,470 | 12,109 | 96.4 | ||
| BCJX00000000 | PRJDB3714 | Meadow soil/Renswoude, the Netherlands | HiSeq. 2500 | 33,698,914 | 35 | 2,162,681 | 59.17 | 11,097 | 1,032 | 10,065 | 3,277 | 6,788 | 91.9 | 1.0 | |
| BEDY00000000 | PRJDB5902 | Strawberry /Market in Akihabara, Tokyo, Japan | Illumina MiSeq | 20,263,689 | 21 | 1,928,959 | 58.21 | 8,282 | 297 | 7,985 | 1,620 | 6,365 | 94 | 0.7 | |
| BCJF00000000 | PRJDB3692 | Soil | HiSeq. 2500 | 22,653,840 | 10 | 3,082,120 | 62.74 | 9,255 | 279 | 8,976 | 495 | 8,481 | 93.7 | 0.4 | |
| BFAH00000000 | PRJDB6593 | Soil | Illumina HiSeq. 2500 | 22,628,423 | 46 | 1,340,400 | 62.76 | 9,270 | 296 | 8,974 | 483 | 8,491 | 93.5 | 0.4 | |
| BCKT00000000 | PRJDB3728 | Leaf of Lagerstroemia calyculata/Thailand | HiSeq. 2500 | 25,142,362 | 31 | 1,454,648 | 54.94 | 9,382 | 54 | 9,328 | 3,020 | 6,308 | 95.1 | 0.2 | |
| BCKS00000000 | PRJDB3726 | Stream water/Japan | HiSeq. 2500 | 22,370,450 | 44 | 1,186,041 | 44.66 | 6,989 | 59 | 6,930 | 1,035 | 5,895 | 90 | 0.2 |
*Completeness and ¥duplication determined using BUSCO based on basidiomycota_odb9. §Human pathogenic (superficial and systemic).
Figure 1Phylogenomic analysis of members of the family Trichosporonaceae. The maximum likelihood (ML) tree was inferred from the concatenated protein alignment (223,082 amino acids) of 405 proteins present in single copies among the haploid genomes and only in duplicate copies in the hybrid genomes. The phylogeny was generated using IQ-TREE version 1.6.7 based on the LG + F + R10 model. The ML was generated with confidence values based on 1,000 bootstrap replicates. The documented oil accumulating members of the family are indicated in blue fonts. The labels ‘_1’ and ‘_2’ indicate the two sets of single copy orthologs (SCOs) in the hybrid genomes, where the letter shows higher amino acid similarity to the closest haploid genome.
Figure 2Comparison of number of proteins associated with (a) CAZymes and (b) MEROPS among thirty-three strains of Trichosporonaceae. CAZymes; AA: auxillary activities, CBM: carbohydrate-binding modules, CE: carbohydrate esterases, GH: glycoside hydrolases and GT: glycosyltransferases. MEROPS; A: aspartic peptidases, C: cysteine peptidases, M: metallo-peptidases, N: asparagine peptide lyases, S: serine peptidases, T: threonine peptidases, and I: protease inhibitors.
Figure 3Illustration of the initiation of the biochemical oil production in yeasts showing the steps within the pathway catalysed by the studied enzymes under nitrogen limitation. ACC, acetyl-CoA carboxylase, AMPD, AMP deaminase, ACS, ATP-citrate synthase, FASI & II, fatty acid synthase subunits alpha and beta and ICDH, isocitrate dehydrogenase [NADP]. × and ↑ indicates the inhibition of ICDH and increased activity of AMPD under nitrogen limitation. Modified from[7]
Figure 4Evolutionary analyses of the ICDH protein and the upstream region of its gene among thirty-three strains of Trichosporonaceae. (a) ML tree of ICDH (380 amino acids long trimmed alignment) generated using IQ-TREE version 1.6.7 with confidence values based on 1,000 bootstrap replicates. (b) Distribution of predicted transcription factor binding sites 600 nucleotide bases upstream of the transcription initiation site of ICDH gene clustered using hierarchical clustering on principal components (HCPC) in R. The documented oil accumulating members of the family are indicated in blue fonts in the phylogeny and with blue arrows in the HCPC.
Figure 5Evolutionary analyses of the ACS protein and the upstream region of its gene among thirty-three strains of Trichosporonaceae. (a) ML tree of ACS (1,097 amino acids long trimmed alignment) generated using IQ-TREE version 1.6.7 with confidence values based on 1,000 bootstrap replicates. (b) Distribution of predicted transcription factor binding sites 600 nucleotide bases upstream of the transcription initiation site of ACS gene clustered using hierarchical clustering on principal components (HCPC) in R. The documented oil accumulating members of the family are indicated in blue fonts in the phylogeny and with blue arrows in the HCPC.
Figure 6Evolutionary analyses of the ACC protein and the upstream region of its gene among thirty-three strains of Trichosporonaceae. (a) ML tree of ACC (2094 amino acids long trimmed alignment) generated using IQ-TREE version 1.6.7 with confidence values based on 1,000 bootstrap replicates. (b) Distribution of predicted transcription factor binding sites 600 nucleotide bases upstream of the transcription initiation site of ACC gene clustered using hierarchical clustering on principal components (HCPC) in R. The documented oil accumulating members of the family are indicated in blue fonts in the phylogeny and with blue arrows in the HCPC.