| Literature DB >> 22844193 |
Shu-Hsi Lin1, Miwa Yoshimoto, Ping-Chiang Lyu, Chuan-Yi Tang, Masanori Arita.
Abstract
Aspergillus species are industrially and agriculturally important as fermentors and as producers of various secondary metabolites. Among them, fungal polyketides such as lovastatin and melanin are considered a gold mine for bioactive compounds. We used a phylogenomic approach to investigate the distribution of iterative polyketide synthases (PKS) in eight sequenced Aspergilli and classified over 250 fungal genes. Their genealogy by the conserved ketosynthase (KS) domain revealed three large groups of nonreducing PKS, one group inside bacterial PKS, and more than 9 small groups of reducing PKS. Polyphyly of nonribosomal peptide synthase (NRPS)-PKS genes raised questions regarding the recruitment of the elegant conjugation machinery. High rates of gene duplication and divergence were frequent. All data are accessible through our web database at http://metabolomics.jp/wiki/Category:PK.Entities:
Keywords: database; fungi; phylogeny ; polyketide synthase; secondary metabolite
Year: 2012 PMID: 22844193 PMCID: PMC3399418 DOI: 10.4137/EBO.S9796
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Labels for iterative PKS genes.
| Label | Explanation | Reasoning method |
|---|---|---|
| 6-MSAS | 6-methylsalicylic acid synthase | KS-AT-DH-KR-ACP in this order |
| NR-PKS | Non-reducing | No DH, KR, and ER domains |
| PR-PKS | Partially-reducing | No ER domain and not 6-MSAS type |
| R-PKS | Reducing | With DH, KR, or ER domain but not 6-MSAS type |
| NRPS-PKS | Hybrid synthase with nonribosomal peptide synthase (NRPS) | NCBI annotation and literature |
| bMSAS | Bacterial 6-MSAS | Experimental evidence from the literature |
| bPR-PKS | Bacterial PR-PKS | Experimental evidence from the literature |
| FAS | Fatty acid synthase | Outgroup |
The number of PKS genes in sequenced Aspergillus species.
| Species | CADRE | Aspergillus book (Ref. | This study | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PKS | NRPS-PKS | Total | PKS | PKS-like NRPS | Total | 6-MSAS | NR-PKS | PR-PKS | R-PKS | PKS total | NRPS-PKS | Total | |
| 16 | 3 | 19 | 16 | 1 | 17F | 2 | 3 | 3 | 9 | 17 | 4 | 21 | |
| 25 | 2 | 27 | 35 | – | 35Y | 0 | 14 | 9 | 10 | 33 | 3 | 36 | |
| 11 | 2 | 13 | 13 | 1 | 14F | 1 | 5 | 1 | 5 | 12 | 2 | 14 | |
| 13 | 0 | 13 | 13 | 2 | 15F | 0 | 6 | 3 | 4 | 13 | 2 | 15 | |
| – | – | – | – | – | – | 0 | 8 | 0 | 0 | 8 | 0 | 8 | |
| 26 | 1 | 27 | 27 | – | 27N | 0 | 14 | 3 | 13 | 30 | 1 | 31 | |
| 13 | 0 | 13 | 34 | 7 | 41P | 1 | 5 | 7 | 25 | 38 | 6 | 44 | |
| 29 | 0 | 29 | 32 | – | 32R | 1 | 11 | 7 | 10 | 29 | 2 | 31 | |
| – | – | – | – | – | – | 2 | 10 | 4 | 13 | 29 | 1 | 30 | |
| – | – | – | – | – | – | 1 | 3 | 1 | 1 | 6 | 0 | 6 | |
| 15 | 2 | 17 | 17 | 1 | 18F | 2 | 6 | 0 | 7 | 15 | 2 | 17 | |
Notes: Numbers include putative genes and are intended as upper bounds. The numbers for the CADRE database were obtained from release 3. For terreus, the database (as of April 2012) did not provide accurate information on PKS genes. Superscripts indicate reference sources: F, Ref. 34; N, Ref. 30; P, Ref. 32; R, Ref. 33; Y, Ref. 31.
Figure 1The phylogenetic tree of the KS domain for representative sequences (ie, the concise tree) computed with the maximum parsimony method.
Notes: There are three large categories: R-PKS, bacterial PKS, and NR-PKS. R-PKS is separated into many small groups. Bacterial PKS contains the Nested Fungal Clade (NFC; see text). NR-PKS contains three large groups. The tree topology is slightly different from the full tree containing 471 sequences due to the selection of sequences. Also see the Methods section for the selection method of sequences.
Figure 2Overview of the KS domain phylogeny.
Figure 3Statistics of the number of domains and their length predicted by the MAPSI tool.
Notes: The horizontal axis is the amino acid length and the vertical axis is the number of sequences. The KS domain is the longest and also the best conserved.
Well conserved orthologous genes in Aspergillus species.
| Type | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 6-MSAS I | A1CLJ5 | B8NPY7 | Q5BCE6 | An11g09720 | AO090102000166 | Q0C9L7 | |||
| R-PKS I(1) | Q4WAY3 | Q4WAY3 | Q5AX96 | AO090011000015 | Q0CNL6 | Q0CNL6 | |||
| R-PKS V (lovastatin and t-toxin) | C5FYS0 | B8NFE8 | Q5BEJ4 | An09g01930 | AO090701000826 | Q8J0F5 | |||
| R-PKS IV | P0CJ32 | P0CJ32 | P85915 | P85915 | AO090009000052 | Q0CB46 | P85915 | ||
| bacMSAS 6-MSAS (NFC) | P22367 | B8NYX0 | An10g00140 | AO090206000074 | P22367 | ||||
| NR-PKS I(2) | P0C8J3 | An04g09530 | AO090023000444 | ||||||
| NR-PKS I(7) | An09g05730 | ||||||||
| NR-PKS I(3) | Q2U886 | Q4WA61 | Q4WA61 | Q2U886 | D7PHZ2 | Q2U886 | Q4WA61 |
Notes: This putative orthology was determined by the KS sequences and not by their gene structures. Large groups covering more than 5 species with more than 75 bootstrapping values are listed with duplicate genes. The 6-digit alphanumeric codes are UniRef50 identifiers (as of April 2012). The longer codes for niger and oryzae are standard genomic IDs provided by their sequencing communities, which are also accessible at our website. Bold identifiers are referred to in the main text.