| Literature DB >> 27829365 |
Nicolás Bellora1, Martín Moliné1, Márcia David-Palma2, Marco A Coelho2, Chris Todd Hittinger3, José P Sampaio2, Paula Gonçalves2, Diego Libkind4.
Abstract
BACKGROUND: The class Tremellomycete (Agaricomycotina) encompasses more than 380 fungi. Although there are a few edible Tremella spp., the only species with current biotechnological use is the astaxanthin-producing yeast Phaffia rhodozyma (Cystofilobasidiales). Besides astaxanthin, a carotenoid pigment with potent antioxidant activity and great value for aquaculture and pharmaceutical industries, P. rhodozyma possesses multiple exceptional traits of fundamental and applied interest. The aim of this study was to obtain, and analyze two new genome sequences of representative strains from the northern (CBS 7918T, the type strain) and southern hemispheres (CRUB 1149) and compre them to a previously published genome sequence (strain CBS 6938). Photoprotection and antioxidant related genes, as well as genes involved in sexual reproduction were analyzed.Entities:
Keywords: Aquaculture; Basidiomycete; Mating type; Mycosporines; Photoprotection; Phylogenomics; Type strain; Xanthophyllomyces dendrorhous; Yeast
Mesh:
Substances:
Year: 2016 PMID: 27829365 PMCID: PMC5103461 DOI: 10.1186/s12864-016-3244-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Detailed information on the three strains of P. rhodozyma included in this study
| Strain | Alternative codes | Origin and isolation reference | Lineagea | Genome reference |
|---|---|---|---|---|
| CBS 6938 | UCD 77-61 | Sap of | C2 | Sharma et al., 2015 [ |
| CBS 7918T | VKM Y-2786, JCM 9681, KCTC 17160, NCYC 2774 | Exudate of | C2 | This study |
| CRUB 1149 | CBS 10596 | Water from Lake Ilon, surrounded by | A | This study |
aBased on David-Palma et al. [6] phylogenetic grouping using MLST
Comparative analysis of genomes, assemblies, and genes statistics for P. rhodozyma and other fungi
| Density | Introns | Intron | Exon | % correct | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| # | Species | Strain | Size (Mb) | Scaf. | N50 (Kb) | %N | %GC | %repeat | Genes | Gene/Mb | ILG | %ILG | /Gene | Length | Length | CDS |
| 1 |
| CBS 7966 | 9.0 | 67 | 654.6 | 0.00 | 52.06 | 0.98 | 4195 | 468.3 | 2030 | 48.39 | 2.1 | 81 | 508 | 99.7 |
| 2 |
| strain 521 | 19.7 | 274 | 127.5 | 0.00 | 54.03 | 2.9 | 6454 | 327.9 | 4640 | 71.89 | 1.7 | 229 | 583 | 97.9 |
| 3 |
| SY62 | 18.4 | 74 | 445.6 | 0.04 | 56.51 | 1.21 | 6313 | 342.3 | 4703 | 74.5 | 1.7 | 230 | 573 | 99.6 |
| 4 |
| CBS 633.66 | 9.8 | 56 | 337.4 | 0.17 | 40.01 | 0.79 | 5062 | 515.7 | 374 | 7.39 | 3.2 | 56 | 345 | 99.8 |
| 5 |
| DSM-1558 | 28.6 | 45 | 1622.7 | 2.28 | 46.73 | 6.52 | 8975 | 313.4 | 1101 | 12.27 | 5.2 | 129 | 287 | 99 |
| 6 |
| JEC21 | 19.1 | 14 | 1439.0 | 0.01 | 48.54 | 5.33 | 6933 | 363.9 | 343 | 4.95 | 5.4 | 69 | 257 | 100 |
| 7 |
| WM276 | 18.4 | 14 | 1333.1 | 0.07 | 47.88 | 4.53 | 6474 | 352.3 | 228 | 3.52 | 5.4 | 70 | 260 | 99.7 |
| 8 |
| CBS 569 | 25.3 | n.a. | n.a. | 0.24 | n.a. | n.a. | 7702 | n.a. | 456 | 5.92 | 5.9 | n.a. | n.a. | n.a. |
| 9 |
| CBS 10117 | 23.9 | n.a. | n.a. | 0.02 | n.a. | n.a. | 8426 | n.a. | 554 | 6.57 | 5.6 | n.a. | n.a. | n.a. |
| 10 |
| CBS 10737 | 20.8 | n.a. | n.a. | 0.06 | n.a. | n.a. | 7667 | n.a. | 403 | 5.26 | 5.7 | n.a. | n.a. | n.a. |
| 11 |
| CBS 10118 | 24.4 | n.a. | n.a. | 0.06 | n.a. | n.a. | 8834 | n.a. | 671 | 7.6 | 5.8 | n.a. | n.a. | n.a. |
| 12 |
| CBS 10435 | 22.7 | n.a. | n.a. | 0.22 | n.a. | n.a. | 8242 | n.a. | 507 | 6.15 | 5.8 | n.a. | n.a. | n.a. |
| 13 |
| NRRL Y-50378 | 22.8 | 712 | 71.4 | 0.02 | 58.47 | 0.82 | 8588 | 376.8 | 701 | 8.16 | 5.5 | 63 | 239 | 94.6 |
| 14 |
| RY1 | 19.1 | 1152 | 32.4 | 0.00 | 56.13 | 1 | 7288 | 380.7 | 435 | 5.97 | 5.5 | 63 | 240 | 96.7 |
| 15 |
| CBS 8904 | 25.3 | 194 | 3223.9 | 1.00 | 59.5 | 1.52 | 8520 | 336.8 | 1513 | 17.76 | 3.2 | 118 | 353 | 97.7 |
| 16 |
| ANT03-052 | 19.7 | 50 | 1080.6 | 0.22 | 52.93 | 0.74 | 6421 | 326.1 | 248 | 3.86 | 6 | 79 | 243 | 99.9 |
| 17 |
| CBS 6938 | 19.5 | 257 | 2088.2 | 1.65 | 47.31 | 2.52 | 6260 | 321 | 183 | 2.92 | 7.4 | 112 | 199 | 97.5 |
| 18 |
| CBS 7918 T | 18.7 | 343 | 104.1 | 0.00 | 47.21 | 2.37 | 5980 | 319 | 97 | 1.62 | 7.5 | 110 | 199 | 98.3 |
| 19 |
| CRUB 1149 | 18.9 | 305 | 132.9 | 0.01 | 47.11 | 2.47 | 6016 | 318 | 108 | 1.8 | 7.5 | 110 | 199 | 98.4 |
| 20 |
| SK-4 | 30.5 | 167 | 1718.1 | 0.00 | 53.7 | 7.46 | 9335 | 306.3 | 229 | 2.45 | 8.2 | 118 | 177 | 99.4 |
| 21 |
| H4-8 | 38.7 | 25 | 2560.2 | 0.15 | 57.53 | 4.49 | 12999 | 336.1 | 953 | 7.33 | 5.5 | 85 | 243 | 99.9 |
| 22 |
| okayama7#130 | 36.2 | 67 | 3468.1 | 0.00 | 51.67 | 5.06 | 12265 | 339.3 | 1017 | 8.29 | 5.5 | 80 | 251 | 99.9 |
| 23 |
| FP-91666 SS1 | 46.5 | 159 | 1799.0 | 1.86 | 51.31 | 1.66 | 13115 | 282 | 929 | 7.08 | 6.2 | 99 | 226 | 99.6 |
Scaf. scaffolds, ILG intron less genes, n.a. not available due to the unpublished condition of the assemblies
Fig. 1Genome sequence divergence using sliding window analysis of 3 strains of P. rhodozyma. Strains CBS 7918T (red) and CRUB 1149 (blue) are plotted against the 17 largest scaffolds of CBS 6938 (x axis), and nucleotide sequence divergence (left y axis) is relative to the genome of this latter strain. Colored numbers represent the mean divergence for each of the two strains for each scaffold. Grey areas indicate regions of the CBS 6938 genome enriched in Ns (percentage of Ns in right y axis)
Fig. 2Level of genomic sequence divergence of 3 P. rhodozyma strains and other cases of closely related yeasts using alignment-free method (Kr). Cneo, Cryptococcus neoformans
Fig. 3Cladogram of the Tremellomycetes based on 248 protein sequences belonging to CEGs and the presence of photoprotective and antioxidant related genes. * lack of mycosporines was confirmed experimentally in this study. Numbers in brackets correspond to proteins predicted to have similar functions. All branches had maximum a posteriori support values
Fig. 4Phylogenetic tree based on protein sequences of 3 different clades of catalases from P. rhodozyma CBS 7918T and other fungi. C.: Cryptococcus; Co.: Coprionopsis; K.: Kwoniella; M.: Mrakia; S.: Stereum; Sc.: Schizophyllum; T.: Thichosporon; Tr.: Tremella; W.: Wallemia
Fig. 5Gene content and organization around the putative mating type genes of P. rhodozyma. a Genomic organization of the DNA regions with putative mating pheromone receptors (Ste3-1 and Ste3-2), pheromone precursors (Mfa1 and Mfa2) and b homeodomain transcription factors (HD1 and HD2). Genes are indicated by arrows showing the direction of transcription. Arrows in grey indicate putative mating related genes while those in white depict additional genes apparently unrelated to mating. Hyp1-2 are hypothetical predicted proteins of uncertain function. The end of a scaffold is indicated by a solid vertical bar. The contiguity of the scaffolds harboring the two sets of pheromone and receptor genes was confirmed by PCR with primers MP100 and MP103. c Sequence alignment of the putative pheromone precursors from P. rhodozyma (CBS 7918T) and C. neoformans var. neoformans (Cnn Mfα1 - XP_570122.1; Cnn Mfa1- AAG42766.1). The predicted mature pheromones are shaded in grey and the CAAX-motifs for C-terminal processing are underlined