| Literature DB >> 16737526 |
Margaret I Butler1, Jeremy Gray, Timothy J D Goodwin, Russell T M Poulter.
Abstract
BACKGROUND: We recently described a mini-intein in the PRP8 gene of a strain of the basidiomycete Cryptococcus neoformans, an important fungal pathogen of humans. This was the second described intein in the nuclear genome of any eukaryote; the first nuclear encoded intein was found in the VMA gene of several saccharomycete yeasts. The evolution of eukaryote inteins is not well understood. In this report we describe additional PRP8 inteins (bringing the total of these to over 20). We compare and contrast the phylogenetic distribution and evolutionary history of the PRP8 intein and the saccharomycete VMA intein, in order to derive a broader understanding of eukaryote intein evolution. It has been suggested that eukaryote inteins undergo horizontal transfer and the present analysis explores this proposal.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16737526 PMCID: PMC1508164 DOI: 10.1186/1471-2148-6-42
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Insertions in fungal PRP8 genes encode full-length inteins. The splicing and homing endonuclease domain motifs of ascomycete VMA inteins aligned with PRP8 inteins present in public databases. A. Alignment of the N-terminal splicing domains (blocks A and B; see InBase [62]. B. Alignment of the C-terminal splicing domains (F and G). C. An alignment of the four homing endonuclease domains (blocks C, D, E, H; see InBase) of the full-length PRP8 inteins and the active ascomycete VMA inteins. A region of variable length is indicated between blocks C and D together with the number of residues removed from the alignment. Accession numbers in the NCBI/protein database for the VMA inteins are SceVMA PXBYVA; ScarVMA, CAC86344; ZbaVMA, CAC86348.1; CtrVMA, A46080. Sequences of PRP8 inteins can be found at InBase and/or from accession data in Table 1. The taxonomic relationships of the species are also summarised in Table 1.
Data describing PRP8 inteins from ascomycetes and basidiomycetes
| Intein | Species | Strain | Intein size (aa) | C-termin. Intron | GenBank accession |
| Order: Eurotiales | |||||
| Ani_PRP8 | FGSC_A4 | 605 | 177bp | ||
| Ani_PRP8 | R20 | 605 | 177bp | ||
| Agi_PRP8 | NRRL6136 | 167 | 53bp | ||
| Section Fumigati | |||||
| Nfi_PRP8 | FRR0181 | 517 | 55bp | ||
| Afu_PRP8 | FRR0163 | 819 | 51bp | ||
| Afu_PRP8 | Af293 | 819 | 51bp | ||
| Afu_PRP8 | NRRL5109 | 819 | 51bp | ||
| Abr_PRP8 | FRR2439 | 165 | no | ||
| Avi_PRP8 | FRR0576 | 169 | no | ||
| Nsp_PRP8 | FRR4595 | 169 | no | ||
| Ngl_PRP8 | FRR2163 | 153 | no | ||
| Nfi/Nps_PRP8 | FRR0186 | 169 | no | ||
| Nau_PRP8 | NRRL4378 | 164 | no | ||
| Nqu_PRP8 | NRRL4175 | 169 | no | ||
| Nfe_PRP8 | NRRL5534 | 155 | no | ||
| Order: Onygales | |||||
| Hca_PRP8 | G217_B | 534 | no | see text | |
| Hca_PRP8 | G186_AR | 534 | no | see text | |
| Hca_PRP8 | WU24 | 535 | no | ||
| Ure_PRP8 | 1704 | 180 | no | ||
| Pbr_PRP8 | Pb18 | 573 | no | ||
| Order: Helotiales | |||||
| Bci_PRP8 | B05.10 | 838 | 62bp | ||
| Phylum: Basidiomycota | |||||
| Cne_PRP8 | IUM93-3231 | 172 | 52bp | ||
| Cne_PRP8 | PHLS8104 | 171 | 53bp | ||
| Cba_PRP8 | WM02.98 | 170 | 65bp | ||
| Cla_PRP8 | CBS139 | 522 | no | ||
Figure 2Full-length and mini-inteins encoded within fungal PRP8 genes, drawn to scale. Numbers within the boxes represent intein length (amino acid residues); boxes within BciPRP8 and AfuPRP8 contain the extra insertion (206 and 203 residues, respectively) near the end of the N-terminal splicing domain. The taxonomic relationships of the species are summarised in Table 1.
Figure 3An alignment of ascomycete and basidiomycete PRP8 inteins. Similar residue types have the same colour background. Accession numbers for the PRP8 intein sequences are in Table 1. The box diagram at the base of the alignment indicates the general shape of the largest inteins in the alignment. The arrangement of the splicing domains (blocks A, B, F and G) and endonuclease domains (blocks C, D, E and H) are indicated.
Data describing nucleotide substitution patterns within the conserved regions of the homing endonuclease domains of pairs of full-length PRP8 inteins. Sd is the number of observed synonymous substitutions; Nd is the number of observed non-synonymous substitutions; S is the number of potential synonymous substitutions; N is the number of potential non synonymous substitutions; pS is the proportion of observed synonymous substitutions; pN is the proportion of observed non-synonymous substitutions; dS is the Jukes-Cantor correction for multiple hits of pS; dN is the Jukes-Cantor correction for multiple hits of pN. Inteins are identified using strain names as shown in Table 1.
| Hca186AR | Hca217B | 2.00 | 1.00 | 47.17 | 165.83 | 0.04 | 0.01 | 0.04 | 0.01 | 7.21 |
| Hca186AR | BciPRP8 | 31.62 | 40.38 | 47.17 | 165.83 | 0.67 | 0.24 | 1.68 | 0.29 | 5.72 |
| Hca186AR | PbrPRP8 | 36.00 | 32.00 | 48.67 | 164.33 | 0.74 | 0.19 | 3.22 | 0.23 | 14.27 |
| Hca186AR | AniPRP8 | 41.88 | 38.12 | 46.50 | 166.50 | 0.90 | 0.23 | NA | 0.27 | NA |
| Hca186AR | Afu0163 | 38.12 | 35.88 | 48.67 | 164.33 | 0.78 | 0.22 | NA | 0.26 | NA |
| Hca186AR | Afu5109 | 38.12 | 36.88 | 48.33 | 164.67 | 0.79 | 0.22 | NA | 0.27 | NA |
| Hca186AR | Nfi0181 | 36.38 | 33.62 | 48.33 | 164.67 | 0.75 | 0.20 | NA | 0.24 | NA |
| Hca186AR | ClaPRP8 | 39.75 | 62.25 | 48.67 | 161.33 | 0.82 | 0.39 | NA | 0.54 | NA |
| Hca217B | BciPRP8 | 31.12 | 39.88 | 47.67 | 165.33 | 0.65 | 0.24 | 1.53 | 0.29 | 5.27 |
| Hca217B | PbrPRP8 | 35.50 | 31.50 | 49.17 | 163.83 | 0.72 | 0.19 | 2.47 | 0.22 | 11.10 |
| Hca217B | AniPRP8 | 42.75 | 37.25 | 47.00 | 166.00 | 0.91 | 0.22 | NA | 0.27 | NA |
| Hca217B | Afu0163 | 37.62 | 35.38 | 49.17 | 163.83 | 0.77 | 0.22 | NA | 0.25 | NA |
| Hca217B | Afu5109 | 37.62 | 36.38 | 48.83 | 164.17 | 0.77 | 0.22 | NA | 0.26 | NA |
| Hca217B | Nfi0181 | 35.88 | 33.12 | 48.83 | 164.17 | 0.73 | 0.20 | 2.92 | 0.24 | 12.41 |
| Hca217B | ClaPRP8 | 39.00 | 60.00 | 49.17 | 160.83 | 0.79 | 0.37 | NA | 0.52 | NA |
| BciPRP8 | PbrPRP8 | 43.50 | 27.50 | 49.17 | 163.83 | 0.88 | 0.17 | NA | 0.19 | NA |
| BciPRP8 | AniPRP8 | 39.00 | 24.00 | 47.00 | 166.00 | 0.83 | 0.14 | NA | 0.16 | NA |
| BciPRP8 | Afu0163 | 39.50 | 16.50 | 49.17 | 163.83 | 0.80 | 0.10 | NA | 0.11 | NA |
| BciPRP8 | Afu5109 | 39.50 | 17.50 | 48.83 | 164.17 | 0.81 | 0.11 | NA | 0.11 | NA |
| BciPRP8 | Nfi0181 | 35.50 | 16.50 | 48.83 | 164.17 | 0.73 | 0.10 | 2.61 | 0.11 | 24.21 |
| BciPRP8 | ClaPRP8 | 36.75 | 46.25 | 49.50 | 160.50 | 0.74 | 0.29 | 3.45 | 0.36 | 9.48 |
| PbrPRP8 | AniPRP8 | 38.88 | 31.12 | 48.50 | 164.50 | 0.80 | 0.19 | NA | 0.22 | NA |
| PbrPRP8 | Afu0163 | 38.25 | 23.75 | 50.67 | 162.33 | 0.75 | 0.15 | NA | 0.16 | NA |
| PbrPRP8 | Afu5109 | 38.25 | 24.75 | 50.33 | 162.67 | 0.76 | 0.15 | NA | 0.17 | NA |
| PbrPRP8 | Nfi0181 | 40.25 | 22.75 | 50.33 | 162.67 | 0.80 | 0.14 | NA | 0.15 | NA |
| PbrPRP8 | ClaPRP8 | 38.88 | 48.12 | 51.00 | 159.00 | 0.76 | 0.30 | NA | 0.39 | NA |
| AniPRP8 | Afu0163 | 36.62 | 22.38 | 48.50 | 164.50 | 0.76 | 0.14 | NA | 0.15 | NA |
| AniPRP8 | Afu5109 | 35.62 | 22.38 | 48.17 | 164.83 | 0.74 | 0.14 | 3.21 | 0.15 | 21.44 |
| AniPRP8 | Nfi0181 | 38.62 | 20.38 | 48.17 | 164.83 | 0.80 | 0.12 | NA | 0.14 | NA |
| AniPRP8 | ClaPRP8 | 44.50 | 43.50 | 48.83 | 161.17 | 0.91 | 0.27 | NA | 0.33 | NA |
| Afu0163 | Afu5109 | 1.00 | 0.00 | 50.33 | 162.67 | 0.02 | 0.00 | 0.02 | 0.00 | NA |
| Afu0163 | Nfi0181 | 11.00 | 2.00 | 50.33 | 162.67 | 0.22 | 0.01 | 0.26 | 0.01 | 20.84 |
| Afu0163 | ClaPRP8 | 40.50 | 47.50 | 51.00 | 159.00 | 0.79 | 0.30 | NA | 0.38 | NA |
| Afu5109 | Nfi0181 | 12.00 | 2.00 | 50.00 | 163.00 | 0.24 | 0.01 | 0.29 | 0.01 | 23.38 |
| Afu5109 | ClaPRP8 | 41.50 | 47.50 | 50.67 | 159.33 | 0.82 | 0.30 | NA | 0.38 | NA |
| Nfi0181 | ClaPRP8 | 39.62 | 47.38 | 50.67 | 159.33 | 0.78 | 0.30 | NA | 0.38 | NA |
Data describing nucleotide substitution patterns within the conserved regions of the homing endonuclease domains of pairs of VMA inteins. Sd is the number of observed synonymous substitutions; Nd is the number of observed non-synonymous substitutions; S is the number of potential synonymous substitutions; N is the number of potential non-synonymous substitutions; pS is the proportion of observed synonymous substitutions; pN is the proportion of observed non-synonymous substitutions; dS is the Jukes-Cantor correction for multiple hits of pS; dN is the Jukes-Cantor correction for multiple hits of pN. EN describes the experimentally determined activity of the homing endonuclease [9], A= active, I= inactive. The activities of the endonucleases from C. glabrata and D. hansenii have not been determined, but are likely to be inactive. The species in which VMA inteins occur are labelled: Scerev, Saccharomyces cerevisiae; SspDH1A, a wild isolate of S. cerevisiae; Scarioc, S. cariocanus; Tglobosa, Torulaspora globosa; Zbailli, Zygosaccharomyces bailli; Tpret, T. pretoriensis; Zbisporus, Zygosaccharomyces bisporus; Zrouxii, Z. rouxii; Kpolysp, Kluyveromyces polysporus; Sunisp, S. (Kazachstania) unisporus; Sexiguus, S. (Kazachstania) exiguus; Ctropicalis, Candida tropicalis; Sdair, S. (Naumovia) dairenensis; Scastelli, S. (Naumovia) castellii; Cglabrata, C. glabrata; Klactis, K. lactis; Dhansenii, Debaryomyces hansenii. Inteins with active homing endonucleases are marked by asterisks*.
| Sd | Nd | S | N | pS | pN | dS | dN | dS/dN | EN | ||
| *Scerev | *SspDH1A | 9.00 | 0.00 | 52.00 | 158.00 | 0.17 | 0.00 | 0.20 | 0.00 | NA | AA |
| *Scerev | *Scarioc | 13.00 | 4.00 | 52.33 | 157.67 | 0.25 | 0.03 | 0.30 | 0.03 | 11.69 | AA |
| *Scerev | Tglobosa | 31.88 | 36.12 | 52.50 | 157.50 | 0.61 | 0.23 | 1.24 | 0.27 | 4.54 | AI |
| *Scerev | *Zbailli | 31.50 | 34.50 | 51.50 | 158.50 | 0.61 | 0.22 | 1.27 | 0.26 | 4.93 | AA |
| *Scerev | Tpret | 34.62 | 41.38 | 52.83 | 157.17 | 0.66 | 0.26 | 1.55 | 0.32 | 4.79 | AI |
| *Scerev | Zbisporus | 27.00 | 38.00 | 49.33 | 151.67 | 0.55 | 0.25 | 0.98 | 0.30 | 3.22 | AI |
| *Scerev | Zrouxii | 22.38 | 41.62 | 48.50 | 152.50 | 0.46 | 0.27 | 0.72 | 0.34 | 2.11 | AI |
| *Scerev | Kpolysp | 30.88 | 57.12 | 48.00 | 162.00 | 0.64 | 0.35 | 1.46 | 0.48 | 3.07 | AI |
| *Scerev | Sdair | 34.75 | 61.25 | 51.50 | 158.50 | 0.67 | 0.39 | 1.72 | 0.54 | 3.18 | AI |
| *Scerev | Sunisp | 27.62 | 70.38 | 52.17 | 157.83 | 0.53 | 0.45 | 0.92 | 0.68 | 1.36 | AI |
| *Scerev | Sexiguus | 24.00 | 71.00 | 50.67 | 159.33 | 0.47 | 0.45 | 0.75 | 0.68 | 1.11 | AI |
| *Scerev | Ctropicalis | 28.75 | 70.25 | 52.17 | 157.83 | 0.55 | 0.45 | 1.00 | 0.68 | 1.47 | AI |
| *Scerev | Scastelli | 35.88 | 56.12 | 51.00 | 159.00 | 0.70 | 0.35 | 2.08 | 0.48 | 4.37 | AI |
| *Scerev | Cglabrata | 25.38 | 67.62 | 45.83 | 143.17 | 0.55 | 0.47 | 1.01 | 0.75 | 1.35 | AI |
| *Scerev | Klactis | 33.12 | 70.88 | 49.50 | 151.50 | 0.67 | 0.47 | 1.67 | 0.73 | 2.28 | AI |
| *Scerev | Dhansenii | 32.62 | 50.38 | 52.83 | 154.17 | 0.62 | 0.33 | 1.30 | 0.43 | 3.03 | AI |
| *SspDH1A | *Scarioc | 15.00 | 4.00 | 52.33 | 157.67 | 0.29 | 0.03 | 0.36 | 0.03 | 13.99 | AA |
| *SspDH1A | Tglobosa | 28.12 | 35.88 | 52.50 | 157.50 | 0.54 | 0.23 | 0.94 | 0.27 | 3.46 | AI |
| *SspDH1A | *Zbailli | 29.25 | 34.75 | 51.50 | 158.50 | 0.57 | 0.22 | 1.06 | 0.26 | 4.09 | AA |
| *SspDH1A | Tpret | 35.38 | 41.62 | 52.83 | 157.17 | 0.67 | 0.26 | 1.67 | 0.33 | 5.13 | AI |
| *SspDH1A | Zbisporus | 23.75 | 38.25 | 49.33 | 151.67 | 0.48 | 0.25 | 0.77 | 0.31 | 2.51 | AI |
| *SspDH1A | Zrouxii | 23.62 | 41.38 | 48.50 | 152.50 | 0.49 | 0.27 | 0.79 | 0.34 | 2.33 | AI |
| *SspDH1A | Kpolysp | 30.38 | 56.62 | 48.00 | 162.00 | 0.63 | 0.35 | 1.39 | 0.47 | 2.96 | AI |
| *SspDH1A | Sdair | 33.50 | 61.50 | 51.50 | 158.50 | 0.65 | 0.39 | 1.51 | 0.55 | 2.77 | AI |
| *SspDH1A | Sunisp | 26.88 | 71.12 | 52.17 | 157.83 | 0.52 | 0.45 | 0.87 | 0.69 | 1.26 | AI |
| *SspDH1A | Sexiguus | 23.75 | 71.25 | 50.67 | 159.33 | 0.47 | 0.45 | 0.74 | 0.68 | 1.08 | AI |
| *SspDH1A | Ctropicalis | 27.00 | 70.00 | 52.17 | 157.83 | 0.52 | 0.44 | 0.88 | 0.67 | 1.31 | AI |
| *SspDH1A | Scastelli | 34.62 | 56.38 | 51.00 | 159.00 | 0.68 | 0.35 | 1.77 | 0.48 | 3.68 | AI |
| *SspDH1A | Cglabrata | 27.12 | 67.88 | 45.83 | 143.17 | 0.59 | 0.47 | 1.17 | 0.75 | 1.56 | AI |
| *SspDH1A | Klactis | 33.88 | 71.12 | 49.50 | 151.50 | 0.68 | 0.47 | 1.83 | 0.74 | 2.48 | AI |
| *SspDH1A | Dhansenii | 32.62 | 50.38 | 52.83 | 154.17 | 0.62 | 0.33 | 1.30 | 0.43 | 3.03 | AI |
| *Scarioc | Tglobosa | 31.50 | 35.50 | 52.83 | 157.17 | 0.60 | 0.23 | 1.19 | 0.27 | 4.42 | AI |
| *Scarioc | *Zbailli | 32.88 | 33.12 | 51.83 | 158.17 | 0.63 | 0.21 | 1.40 | 0.25 | 5.71 | AA |
| *Scarioc | Tpret | 38.12 | 41.88 | 53.17 | 156.83 | 0.72 | 0.27 | 2.34 | 0.33 | 7.10 | AI |
| *Scarioc | Zbisporus | 27.50 | 37.50 | 49.33 | 151.67 | 0.56 | 0.25 | 1.02 | 0.30 | 3.40 | AI |
| *Scarioc | Zrouxii | 28.12 | 38.88 | 48.50 | 152.50 | 0.58 | 0.25 | 1.11 | 0.31 | 3.57 | AI |
| *Scarioc | Kpolysp | 27.88 | 59.12 | 48.33 | 161.67 | 0.58 | 0.37 | 1.10 | 0.50 | 2.19 | AI |
| *Scarioc | Sdair | 34.62 | 63.38 | 51.83 | 158.17 | 0.67 | 0.40 | 1.66 | 0.57 | 2.90 | AI |
| *Scarioc | Sunisp | 31.12 | 69.88 | 52.50 | 157.50 | 0.59 | 0.44 | 1.17 | 0.67 | 1.75 | AI |
| *Scarioc | Sexiguus | 25.75 | 72.25 | 51.00 | 159.00 | 0.50 | 0.45 | 0.84 | 0.70 | 1.20 | AI |
| *Scarioc | Ctropicalis | 27.38 | 68.62 | 52.50 | 157.50 | 0.52 | 0.44 | 0.89 | 0.65 | 1.37 | AI |
| *Scarioc | Scastelli | 38.88 | 57.12 | 51.33 | 158.67 | 0.76 | 0.36 | NA | 0.49 | NA | AI |
| *Scarioc | Cglabrata | 23.50 | 69.50 | 45.83 | 143.17 | 0.51 | 0.49 | 0.86 | 0.78 | 1.10 | AI |
| *Scarioc | Klactis | 35.12 | 72.88 | 49.50 | 151.50 | 0.71 | 0.48 | 2.19 | 0.77 | 2.85 | AI |
| *Scarioc | Dhansenii | 29.75 | 49.25 | 52.83 | 154.17 | 0.56 | 0.32 | 1.04 | 0.42 | 2.50 | AI |
| Tglobosa | *Zbailli | 24.50 | 13.50 | 52.00 | 158.00 | 0.47 | 0.09 | 0.74 | 0.09 | 8.18 | IA |
| Tglobosa | Tpret | 28.50 | 19.50 | 53.33 | 156.67 | 0.53 | 0.12 | 0.93 | 0.14 | 6.87 | II |
| Tglobosa | Zbisporus | 26.12 | 35.88 | 49.33 | 151.67 | 0.53 | 0.24 | 0.92 | 0.28 | 3.23 | II |
| Tglobosa | Zrouxii | 22.50 | 31.50 | 48.50 | 152.50 | 0.46 | 0.21 | 0.72 | 0.24 | 2.99 | II |
| Tglobosa | Kpolysp | 24.50 | 65.50 | 48.50 | 161.50 | 0.51 | 0.41 | 0.84 | 0.58 | 1.44 | II |
| Tglobosa | Sdair | 32.00 | 64.00 | 52.00 | 158.00 | 0.62 | 0.41 | 1.29 | 0.58 | 2.21 | II |
| Tglobosa | Sunisp | 29.50 | 65.50 | 52.67 | 157.33 | 0.56 | 0.42 | 1.03 | 0.61 | 1.70 | II |
| Tglobosa | Sexiguus | 28.12 | 73.88 | 51.17 | 158.83 | 0.55 | 0.47 | 0.99 | 0.73 | 1.36 | II |
| Tglobosa | Ctropicalis | 32.25 | 72.75 | 52.67 | 157.33 | 0.61 | 0.46 | 1.27 | 0.72 | 1.77 | II |
| Tglobosa | Scastelli | 35.50 | 52.50 | 51.50 | 158.50 | 0.69 | 0.33 | 1.89 | 0.44 | 4.31 | II |
| Tglobosa | Cglabrata | 26.00 | 82.00 | 45.50 | 143.50 | 0.57 | 0.57 | 1.08 | 1.08 | 1.00 | II |
| Tglobosa | Klactis | 33.75 | 73.25 | 49.50 | 151.50 | 0.68 | 0.48 | 1.80 | 0.78 | 2.32 | II |
| Tglobosa | Dhansenii | 31.25 | 55.75 | 53.00 | 154.00 | 0.59 | 0.36 | 1.16 | 0.49 | 2.34 | II |
| *Zbailli | Tpret | 32.75 | 21.25 | 52.33 | 157.67 | 0.63 | 0.13 | 1.35 | 0.15 | 9.08 | AI |
| *Zbailli | Zbisporus | 30.62 | 33.38 | 48.50 | 152.50 | 0.63 | 0.22 | 1.38 | 0.26 | 5.35 | AI |
| *Zbailli | Zrouxii | 31.75 | 31.25 | 47.67 | 153.33 | 0.67 | 0.20 | 1.64 | 0.24 | 6.91 | AI |
| *Zbailli | Kpolysp | 30.75 | 65.25 | 47.50 | 162.50 | 0.65 | 0.40 | 1.49 | 0.57 | 2.59 | AI |
| *Zbailli | Sdair | 31.62 | 63.38 | 51.00 | 159.00 | 0.62 | 0.40 | 1.31 | 0.57 | 2.31 | AI |
| *Zbailli | Sunisp | 30.38 | 68.62 | 51.67 | 158.33 | 0.59 | 0.43 | 1.15 | 0.65 | 1.78 | AI |
| *Zbailli | Sexiguus | 29.00 | 73.00 | 50.17 | 159.83 | 0.58 | 0.46 | 1.10 | 0.70 | 1.57 | AI |
| *Zbailli | Ctropicalis | 35.75 | 74.25 | 51.67 | 158.33 | 0.69 | 0.47 | 1.92 | 0.74 | 2.61 | AI |
| *Zbailli | Scastelli | 31.25 | 53.75 | 50.50 | 159.50 | 0.62 | 0.34 | 1.31 | 0.45 | 2.92 | AI |
| *Zbailli | Cglabrata | 21.62 | 76.38 | 44.67 | 144.33 | 0.48 | 0.53 | 0.78 | 0.92 | 0.85 | AI |
| *Zbailli | Klactis | 28.88 | 71.12 | 48.67 | 152.33 | 0.59 | 0.47 | 1.17 | 0.73 | 1.61 | AI |
| *Zbailli | Dhansenii | 34.75 | 59.25 | 52.00 | 155.00 | 0.67 | 0.38 | 1.66 | 0.53 | 3.11 | AI |
| Tpret | Zbisporus | 28.38 | 36.62 | 50.00 | 151.00 | 0.57 | 0.24 | 1.06 | 0.29 | 3.62 | II |
| Tpret | Zrouxii | 26.00 | 37.00 | 49.17 | 151.83 | 0.53 | 0.24 | 0.92 | 0.29 | 3.11 | II |
| Tpret | Kpolysp | 27.38 | 59.62 | 48.83 | 161.17 | 0.56 | 0.37 | 1.03 | 0.51 | 2.02 | II |
| Tpret | Sdair | 27.62 | 62.38 | 52.33 | 157.67 | 0.53 | 0.40 | 0.91 | 0.56 | 1.62 | II |
| Tpret | Sunisp | 27.62 | 71.38 | 53.00 | 157.00 | 0.52 | 0.45 | 0.89 | 0.70 | 1.27 | II |
| Tpret | Sexiguus | 32.00 | 71.00 | 51.50 | 158.50 | 0.62 | 0.45 | 1.32 | 0.68 | 1.94 | II |
| Tpret | Ctropicalis | 30.62 | 65.38 | 53.00 | 157.00 | 0.58 | 0.42 | 1.10 | 0.61 | 1.82 | II |
| Tpret | Scastelli | 29.00 | 55.00 | 51.83 | 158.17 | 0.56 | 0.35 | 1.03 | 0.47 | 2.20 | II |
| Tpret | Cglabrata | 29.88 | 80.12 | 46.50 | 142.50 | 0.64 | 0.56 | 1.46 | 1.04 | 1.40 | II |
| Tpret | Klactis | 38.38 | 77.62 | 50.17 | 150.83 | 0.76 | 0.51 | NA | 0.87 | NA | II |
| Tpret | Dhansenii | 34.25 | 57.75 | 53.33 | 153.67 | 0.64 | 0.38 | 1.45 | 0.52 | 2.79 | II |
| Zbisporus | Zrouxii | 18.12 | 25.88 | 47.83 | 153.17 | 0.38 | 0.17 | 0.53 | 0.19 | 2.76 | II |
| Zbisporus | Kpolysp | 27.38 | 55.62 | 45.67 | 155.33 | 0.60 | 0.36 | 1.20 | 0.49 | 2.47 | II |
| Zbisporus | Sdair | 29.25 | 59.75 | 48.67 | 152.33 | 0.60 | 0.39 | 1.21 | 0.56 | 2.18 | II |
| Zbisporus | Sunisp | 27.38 | 61.62 | 49.00 | 152.00 | 0.56 | 0.41 | 1.02 | 0.58 | 1.76 | II |
| Zbisporus | Sexiguus | 19.25 | 62.75 | 48.50 | 152.50 | 0.40 | 0.41 | 0.57 | 0.60 | 0.95 | II |
| Zbisporus | Ctropicalis | 26.75 | 66.25 | 49.33 | 151.67 | 0.54 | 0.44 | 0.96 | 0.65 | 1.47 | II |
| Zbisporus | Scastelli | 30.50 | 55.50 | 48.33 | 152.67 | 0.63 | 0.36 | 1.38 | 0.50 | 2.78 | II |
| Zbisporus | Cglabrata | 24.88 | 71.12 | 44.50 | 144.50 | 0.56 | 0.49 | 1.03 | 0.80 | 1.28 | II |
| Zbisporus | Klactis | 33.38 | 76.62 | 48.83 | 152.17 | 0.68 | 0.50 | 1.82 | 0.83 | 2.18 | II |
| Zbisporus | Dhansenii | 29.88 | 58.12 | 50.33 | 150.67 | 0.59 | 0.39 | 1.18 | 0.54 | 2.17 | II |
| Zrouxii | Kpolysp | 24.00 | 58.00 | 44.83 | 156.17 | 0.54 | 0.37 | 0.94 | 0.51 | 1.83 | II |
| Zrouxii | Sdair | 28.50 | 63.50 | 47.83 | 153.17 | 0.60 | 0.41 | 1.19 | 0.60 | 1.97 | II |
| Zrouxii | Sunisp | 27.88 | 68.12 | 48.17 | 152.83 | 0.58 | 0.45 | 1.11 | 0.68 | 1.64 | II |
| Zrouxii | Sexiguus | 24.50 | 64.50 | 47.67 | 153.33 | 0.51 | 0.42 | 0.87 | 0.62 | 1.40 | II |
| Zrouxii | Ctropicalis | 25.88 | 66.12 | 48.50 | 152.50 | 0.53 | 0.43 | 0.93 | 0.65 | 1.44 | II |
| Zrouxii | Scastelli | 24.50 | 56.50 | 47.50 | 153.50 | 0.52 | 0.37 | 0.87 | 0.51 | 1.72 | II |
| Zrouxii | Cglabrata | 29.38 | 79.62 | 44.33 | 144.67 | 0.66 | 0.55 | 1.61 | 0.99 | 1.62 | II |
| Zrouxii | Klactis | 31.38 | 76.62 | 48.00 | 153.00 | 0.65 | 0.50 | 1.54 | 0.83 | 1.86 | II |
| Zrouxii | Dhansenii | 29.75 | 55.25 | 49.50 | 151.50 | 0.60 | 0.36 | 1.21 | 0.50 | 2.43 | II |
| Kpolysp | Sdair | 19.25 | 69.75 | 48.33 | 164.67 | 0.40 | 0.42 | 0.57 | 0.62 | 0.91 | II |
| Kpolysp | Sunisp | 19.88 | 77.12 | 49.17 | 163.83 | 0.40 | 0.47 | 0.58 | 0.74 | 0.78 | II |
| Kpolysp | Sexiguus | 21.12 | 74.88 | 47.67 | 165.33 | 0.44 | 0.45 | 0.67 | 0.69 | 0.97 | II |
| Kpolysp | Ctropicalis | 20.25 | 78.75 | 49.00 | 164.00 | 0.41 | 0.48 | 0.60 | 0.77 | 0.78 | II |
| Kpolysp | Scastelli | 28.50 | 71.50 | 47.67 | 165.33 | 0.60 | 0.43 | 1.20 | 0.64 | 1.86 | II |
| Kpolysp | Cglabrata | 20.12 | 70.88 | 42.50 | 146.50 | 0.47 | 0.48 | 0.75 | 0.78 | 0.96 | II |
| Kpolysp | Klactis | 28.38 | 80.62 | 45.83 | 155.17 | 0.62 | 0.52 | 1.31 | 0.89 | 1.48 | II |
| Kpolysp | Dhansenii | 30.12 | 70.88 | 48.67 | 158.33 | 0.62 | 0.45 | 1.31 | 0.68 | 1.92 | II |
| Sdair | Sunisp | 19.25 | 48.75 | 53.50 | 165.50 | 0.36 | 0.29 | 0.49 | 0.37 | 1.31 | II |
| Sdair | Sexiguus | 21.62 | 59.38 | 52.00 | 167.00 | 0.42 | 0.36 | 0.61 | 0.48 | 1.26 | II |
| Sdair | Ctropicalis | 26.12 | 48.88 | 52.67 | 163.33 | 0.50 | 0.30 | 0.81 | 0.38 | 2.13 | II |
| Sdair | Scastelli | 30.38 | 25.62 | 51.00 | 162.00 | 0.60 | 0.16 | 1.19 | 0.18 | 6.67 | II |
| Sdair | Cglabrata | 23.75 | 76.25 | 44.67 | 144.33 | 0.53 | 0.53 | 0.93 | 0.91 | 1.01 | II |
| Sdair | Klactis | 29.25 | 75.75 | 48.83 | 152.17 | 0.60 | 0.50 | 1.20 | 0.82 | 1.47 | II |
| Sdair | Dhansenii | 30.38 | 55.62 | 52.00 | 155.00 | 0.58 | 0.36 | 1.13 | 0.49 | 2.32 | II |
| Sunisp | Sexiguus | 17.12 | 61.88 | 53.17 | 165.83 | 0.32 | 0.37 | 0.42 | 0.52 | 0.82 | II |
| Sunisp | Ctropicalis | 21.50 | 59.50 | 53.50 | 162.50 | 0.40 | 0.37 | 0.58 | 0.50 | 1.15 | II |
| Sunisp | Scastelli | 29.88 | 57.12 | 51.83 | 161.17 | 0.58 | 0.35 | 1.10 | 0.48 | 2.29 | II |
| Sunisp | Cglabrata | 23.00 | 65.00 | 45.00 | 144.00 | 0.51 | 0.45 | 0.86 | 0.69 | 1.24 | II |
| Sunisp | Klactis | 27.88 | 67.12 | 49.17 | 151.83 | 0.57 | 0.44 | 1.06 | 0.67 | 1.58 | II |
| Sunisp | Dhansenii | 28.50 | 69.50 | 52.67 | 154.33 | 0.54 | 0.45 | 0.96 | 0.69 | 1.39 | II |
| Sexiguus | Ctropicalis | 21.25 | 65.75 | 52.33 | 163.67 | 0.41 | 0.40 | 0.58 | 0.58 | 1.02 | II |
| Sexiguus | Scastelli | 35.00 | 55.00 | 50.33 | 162.67 | 0.70 | 0.34 | 1.96 | 0.45 | 4.37 | II |
| Sexiguus | Cglabrata | 17.88 | 73.12 | 44.83 | 144.17 | 0.40 | 0.51 | 0.57 | 0.85 | 0.67 | II |
| Sexiguus | Klactis | 29.12 | 81.88 | 48.67 | 152.33 | 0.60 | 0.54 | 1.20 | 0.95 | 1.27 | II |
| Sexiguus | Dhansenii | 25.50 | 65.50 | 51.50 | 155.50 | 0.50 | 0.42 | 0.81 | 0.62 | 1.31 | II |
| Ctropicalis | Scastelli | 36.25 | 55.75 | 51.67 | 161.33 | 0.70 | 0.35 | 2.06 | 0.46 | 4.44 | II |
| Ctropicalis | Cglabrata | 29.38 | 74.62 | 45.50 | 143.50 | 0.65 | 0.52 | 1.48 | 0.89 | 1.67 | II |
| Ctropicalis | Klactis | 32.25 | 78.75 | 49.50 | 151.50 | 0.65 | 0.52 | 1.52 | 0.89 | 1.72 | II |
| Ctropicalis | Dhansenii | 32.00 | 66.00 | 52.67 | 154.33 | 0.61 | 0.43 | 1.25 | 0.63 | 1.97 | II |
| Scastelli | Cglabrata | 27.88 | 83.12 | 44.83 | 144.17 | 0.62 | 0.58 | 1.32 | 1.10 | 1.21 | II |
| Scastelli | Klactis | 31.50 | 78.50 | 48.50 | 152.50 | 0.65 | 0.51 | 1.51 | 0.87 | 1.73 | II |
| Scastelli | Dhansenii | 33.38 | 53.62 | 51.83 | 155.17 | 0.64 | 0.35 | 1.47 | 0.46 | 3.17 | II |
| Cglabrata | Klactis | 26.38 | 61.62 | 44.83 | 144.17 | 0.59 | 0.43 | 1.15 | 0.63 | 1.82 | II |
| Cglabrata | Dhansenii | 22.75 | 81.25 | 46.50 | 142.50 | 0.49 | 0.57 | 0.79 | 1.07 | 0.74 | II |
| Klactis | Dhansenii | 34.12 | 71.88 | 50.50 | 150.50 | 0.68 | 0.48 | 1.73 | 0.76 | 2.28 | II |
Fungal species without an intein in PRP8. Detected by a survey of PRP8 intein insertion sites of species in public databases using TBLASTN with a partial, intein-less, sequence of the Aspergillus fumigatus PRP8 as the query. The presence/absence of an intron almost immediately downstream of the empty intein insertion site is also noted. Included are species shown not to have an intein by Liu & Yang [25] and Butler & Poulter [20]. Note that the data represent only one isolate from each species.
| Species | Order | Accession | intron |
| ASCOMYCOTA | |||
| Schizosaccharomycetales | NC_003424.1 | no | |
| Pneumocystidales | contig_130 | no | |
| Ascomycotina | |||
| Saccharomycetales | AAFM01000018 | no | |
| " | AACE01000391.1 | no | |
| " | AAFN01000124.1 | no | |
| " | AACQ01000072.1 | no | |
| " | NC_006032.1 | no | |
| " | AABY01000045.1 | no | |
| " | NC_001140.4 | no | |
| " | AACA01000109.1 | no | |
| " | AABZ01000010.1 | no | |
| " | AADM01000296.1 | no | |
| " | NC_006037.1 | no | |
| " | NC_005783.2 | no | |
| " | NC_006047.1 | no | |
| " | NC_006069.1 | no | |
| " | AAFT01000064 | no | |
| Pezizomycotina | |||
| Eurotiales | AAJN01000238.1 | no | |
| " | AAKD01000005.1 | yes | |
| " | AAIH01001496.1 | no | |
| " | Liu & Yang 2004 | ? | |
| " | Liu & Yang 2004 | ? | |
| " | Liu & Yang 2004 | ? | |
| " | Liu & Yang 2004 | ? | |
| " | This work | yes | |
| Pleosporales | AAGI01000010 | yes | |
| Sordariales | AABX01000169 | no | |
| Sordariales | AAFU01000976 | yes | |
| Sordariales | contig_159 | yes | |
| Sordariales | AACU01000805 | no | |
| Hypocreales | AAIL01001072 | no | |
| Hypocreales | AACM01000127 | no | |
| Hypocreales | contig 2.11 | no | |
| Onygenales | CF811886 | no | |
| Onygenales | AAEC01000088 | no | |
| Helotiales | supercontig_1.4 | yes | |
| BASIDIOMYCOTA | |||
| Aphyllophorales | AADS01000435.1 | no | |
| Agaricales | AACS01000175 | no | |
| Ustilaginales | AACP01000056.1 | no | |
| Tremellales | Butler & Poulter 2005 | yes | |
| " | Butler & Poulter 2005 | yes | |
| " | Butler & Poulter 2005 | yes | |
| " | Butler & Poulter 2005 | no | |
| " | Butler & Poulter 2005 | no | |
| ZYGOMYCOTA | |||
| supercontig_1.5 | no | ||
Figure 4Phylogenetic tree based on an alignment of PRP8 proteins. The tree was constructed by the neighbour-joining method using PAUP* [61] and is a consensus derived from 1000 bootstrap replicates. The numbers indicate the percentage bootstrap support (only nodes with >50% support are shown). Accession numbers for the PRP8 sequences are: Caenorhabditis elegans, NM_066384; Mus musculus, AB047391; Homo sapiens, BAA22563; Xenopus laevis, AAH45266; Anopheles gambiae, EAA04255; Drosophila melanogaster, NP610735; Oryza sativa, AB023482; Arabidopsis thaliana, AC009322; Schizosaccharomyces pombe, T38841; Neurospora crassa, EAA33717; Aspergillus nidulans, BK001316; Saccharomyces cerevisiae, S34670; S. paradoxus, AABY01000045 (ORF: 35077-42312); S. mikatae, AABZ01000010 (ORF: 37550-44800); Neurospora crassa (EAA33717); Dictyostelium discoideum, AAL92617. PRP8 sequences of Candida glabrata (gene name CAGL0I02266g), Debaryomyces hansenii (gene name DEHA0E02717g) and Kluyveromyces lactis (gene name KLLA0A05280g) were obtained from the Génolevures website. Predicted PRP8 protein sequences for the remaining species were generated using data from their respective genome sequencing web sites and are available from the authors. Accession numbers refer to data which include the PRP8 encoding regions; Cryptococcus neoformans var. grubii (AACO01000004), Cryptococcus neoformans var.neoformans (AAEY01000001), Cryptococcus gattii R265 (AAFP01000516), Coprinopsis cinerea (AACS01000175), Gibberella zeae (AACM01000127), Ashbya gossypii (AAS50980), Aspergillus fumigatus (AAHF01000008), Aspergillus clavatus (AAKD01000005), Aspergillus terreus (AAJN01000238), Neosartorya fischeri (AAKE01000005.1), Uncinocarpus reesii (AAIW01000130.1), Coccidioides immitis (AAEC02000024), Trichoderma reesei (AAIL01001072), Magnaporthe grisea (AACU02000289), Chaetomium globosum (AAFU01000976), Phaeosphaeria nodorum (AAGI01000010), Ustilago maydis (AACP01000056), Phanerochaete chrysosporium (AADS01000435), Botrytis cinerea (AAID01001534, AAID01001533), Sclerotinia sclerotiorum (AAGT01000110), Rhizopus oryzae (AACW02000011), and Histoplasma capsulatum (AAJI01001309).
Figure 5Phylogenetic tree of inteins based on an alignment of the four protein splicing domain motifs. Phylogenetic tree based on an alignment of the four protein splicing motifs from the PRP8 inteins and the VMA inteins, together with splicing domains from inteins found in eukaryote viruses and plastids. The tree was constructed by the neighbour-joining method using PAUP* [61] and is a consensus derived from 1000 bootstrap replicates. The numbers indicate the percentage bootstrap support (only nodes with >50% support are shown). Accession numbers of the intein sequences are as in Figures 1 and 3 or can be obtained from InBase [62]. S. dairenensis and S. castellii are now included in the newly described genus Naumovia; S. exiguus and S. unisporus are now included in the genus Kazachstania [70].
Figure 6Phylogenetic trees based on alignments of the homing endonuclease domains. Phylogenetic trees based on alignments of the whole homing endonuclease from:A. PRP8 full-length inteins. B. VMA inteins. Asterisks denote the VMA intein homing endonucleases known to be active [9]. The trees were constructed by the neighbour-joining method using PAUP* [61] and each represents a consensus derived from 100 bootstrap replicates. The numbers indicate the percentage bootstrap support (only nodes with >50% support are shown). Accession numbers of the intein sequences are as in Figures 1 and 3 or can be obtained from InBase [62]. S. dairenensis and S. castellii are now included in the newly described genus Naumovia; S. exiguus and S. unisporus are now included in the genus Kazachstania [70].
Primer sequences used to amplify PRP8 intein encoding sequences from euascomycetes.
| Primer name | Relevant species | Primer sequence |
| Aspfum-F1 | 5' acagatgtcatccaagc 3' | |
| Aspfum-F2 | (including | 5' tgggaaagagcatgccttgc 3' |
| Aspfum-F4 | 5' gaaccaggaaatggagacg 3' | |
| Aspfum-Rv1 | and | 5' ttcaacggtatcgtagcg 3' |
| Aspfum-Rv4 | 5' acagagtgaaccgacg 3' | |
| Liu_1 | Fumigati spp. and | 5' atgaagagcaa [tc]cc [agct]tt [tc]tggtggac 3' |
| Liu_2 | 5' gcattcgtgag [tc]tt [ct]tt [ag]aa [tc]ttcat 3' | |
| Ani-Fn | 5' agcttgtcttgccaac 3' | |
| Ani-Fs | 5' acaaagacagaccaaccaataaacggattg 3' | |
| Ani-Ra | 5' actgttatgcagtacaacatag 3' | |
| Ani-Rsn | 5' tctcccaggagtctcgacgctaga 3' | |
| Pb_Fwd1 | 5' catatctctggaactgtggcag 3' | |
| Pb_Fwd2 | 5' gcagttgggtattgacacagtgc 3' | |
| Pb_Rev1 | 5' gcactgtgtcaatacccaactgc 3' | |
| Pb_Rev2 | 5' gcgttcgtcaacttcttgaacttcat 3' | |
| Pb_Rev3 | 5' gctttgtgctgcctcgttaacg 3' | |
| Para_b-F3 | 5' gatagaagtcgcacg 3' | |
| Para_b-Rv4 | 5' gggtcggtttatgc 3' |