| Literature DB >> 23770768 |
Debashish Bhattacharya1, Dana C Price, Cheong Xin Chan, Huan Qiu, Nicholas Rose, Steven Ball, Andreas P M Weber, Maria Cecilia Arias, Bernard Henrissat, Pedro M Coutinho, Anagha Krishnan, Simone Zäuner, Shannon Morath, Frédérique Hilliou, Andrea Egizi, Marie-Mathilde Perrineau, Hwan Su Yoon.
Abstract
The limited knowledge we have about red algal genomes comes from the highly specialized extremophiles, Cyanidiophyceae. Here, we describe the first genome sequence from a mesophilic, unicellular red alga, Porphyridium purpureum. The 8,355 predicted genes in P. purpureum, hundreds of which are likely to be implicated in a history of horizontal gene transfer, reside in a genome of 19.7 Mbp with 235 spliceosomal introns. Analysis of light-harvesting complex proteins reveals a nuclear-encoded phycobiliprotein in the alga. We uncover a complex set of carbohydrate-active enzymes, identify the genes required for the methylerythritol phosphate pathway of isoprenoid biosynthesis, and find evidence of sexual reproduction. Analysis of the compact, function-rich genome of P. purpureum suggests that ancestral lineages of red algae acted as mediators of horizontal gene transfer between prokaryotes and photosynthetic eukaryotes, thereby significantly enriching genomes across the tree of photosynthetic life.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23770768 PMCID: PMC3709513 DOI: 10.1038/ncomms2931
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Analysis of the P. purpureum genome.
(a) Transmission electron microscopy image of a P. purpureum cell showing the central pyrenoid (Py), cell membrane (CM), starch granules (S) and plastid thylakoids (Th). (b) Percentage of single protein RAxML trees (raw numbers shown in the bars) that support the monophyly of P. purpureum (bootstrap ≥90%) solely with other Plantae members (exclusive), or in combination with non-Plantae taxa that interrupt this clade (non-exclusive). These latter groups of trees are primarily explained by red/green algal EGT into the nuclear genome of chromalveolates. For each of these algal lineages, the set of trees with different numbers of taxa (x) ≥4, ≥10, ≥20, ≥30 and ≥40 in a tree are shown. Each tree has ≥3 phyla. The Plantae-only groups are reds-greens-glaucophytes (RGGl) and reds-greens (RG).
Summary of over-represented gene ontology terms.
| 0000325 | Plant-type vacuole | C | 5.70 × 10−2 | 6.80 × 10−5 |
| 0009536 | Plastid | C | 5.90 × 10−2 | 9.90 × 10−5 |
| 0044434 | Chloroplast part | C | 5.90 × 10−2 | 1.10 × 10−4 |
| 0003006 | Developmental process involved in reproduction | P | 5.90 × 10−4 | 1.40 × 10−7 |
| 0022414 | Reproductive process | P | 1.80 × 10−2 | 8.30 × 10−6 |
| 0000003 | Reproduction | P | 2.60 × 10−2 | 2.00 × 10−5 |
| 0048608 | Reproductive structure development | P | 2.60 × 10−2 | 2.40 × 10−5 |
| 0051704 | Multi-organism process | P | 5.90 × 10−2 | 1.10 × 10−4 |
| 0009793 | Embryo development ending in seed dormancy | P | 5.90 × 10−2 | 1.20 × 10−4 |
| 0010154 | Fruit development | P | 7.00 × 10−2 | 1.80 × 10−4 |
| 0048316 | Seed development | P | 7.00 × 10−2 | 1.80 × 10−4 |
C, cellular component; FDR, false discovery rate; GO, gene ontology; P, biological process.
Over- represented GO terms in Plantae-associated genes in comparison to all genes in P. purpureum, with the associated type shown as well as the C and P, FDR and P-value from Fisher’s exact test.
Figure 2Phylogenetic analysis of proteins on contig 2035b in P. purpureum.
These are all RAxML trees (WAG + Γ + I + F model) with the results of 100 bootstrap replicates shown on the branches. (a) Tree inferred from a squalene monooxygenase-like protein involved in sterol biosynthesis that shows the expected monophyly of red algae and of plants within the eukaryote tree of life. (b) Tree inferred from a tyrosine kinase/lipopolysaccharide-modifying enzyme that shows a complex phylogenetic relationship between red algae and chromalveolates. (c) Tree inferred from a glycosyltransferase of bacterial origin that is consistent with the monophyly of red algae and glaucophytes and a shared history of the gene in these taxa with chromalveolates, potentially via secondary EGT. (d) Tree inferred from an unknown protein in the aminotransferase superfamily that is present only in red algae and originated through HGT presumably from a proteobacterial source. The unit of branch length in each tree is the number of substitutions per site. The GenBank GI and Joint Genome Institute (JGI) accession codes (where available) are shown after each taxon name.
Comparison of molecular weights based on sequence prediction.
| 2142.1 | 22.0 | 22.68 |
| 2493.3/Lhcr2 | 21.0 | 21.8 |
| 3421.1 | 19.5 | 22.81 |
| 435.19 | 19.5 | 23.56 |
| 4406.7 | 23.5 | 22.79 |
| 491.7/Lhcr1 | 23.0 | 23.08 |
| 776.1 | 22.5 | 22.42 |
This was achieved using http://www.bioinformatics.org/sms/prot_mw.html
†The authors suggest that Lhcr1 (contig 491.7) is either the 23.0 or the 22.5 band. Here we assume, based on the predicted weight, that it was the 23.0 kDa band.
‡Based on the process of elimination and predicted weights, it is likely that this is the 22.5 kDa band.
CAZymes present in the P. purpureum genome.
| 31 | 83 | 0 | 2 | 40 | |
| 21 | 61 | 0 | 2 | 16 | |
| 30 | 69 | 0 | 3 | 23 | |
| 41 | 85 | 0 | 3 | 30 | |
| 37 | 77 | 0 | 2 | 29 | |
| 74 | 203 | 0 | 2 | 51 | |
| 51 | 172 | 0 | 5 | 25 | |
| 84 | 128 | 3 | 2 | 24 |
CBM, carbohydrate-binding modules; CE, carbohydrate esterases; GH, glycoside hydrolases; GT, glycosyltransferases; PL, polysaccharide lyases.
Figure 3Analysis of a transporter in P. purpureum.
Schematic image showing the putative sodium–potassium ATPase and sodium:glucose cotransporter identified in the P. purpureum genome data.
Figure 4Analysis of CYPs and sphingolipid metabolism in P. purpureum.
(a) Maximum likelihood (RAxML; LG + Γ + I + F model) tree of CYP sequences from P. purpureum and other eukaryotes. Support for internal branches was assessed using 100 bootstrap replicates. The major known CYP clans are indicated. (b) Putative sphingolipid synthesis pathway in P. purpureum deduced from analyses of vascular plants (for example, Arabidopsis). Modifications without candidate genes in the P. purpureum draft assembly are indicated in red. Cer, ceramide; GCS, glucosylceramide synthase; GIPC, glucosyl inositol phosphoryl ceramide; GlcCer, glucosylceramide; IPC, inositol phosphoryl ceramide; LCB, long chain base; and VLCFA, very long chain fatty acid.
Identification of the meiotic toolkit genes in P. purpureum.
| SPO11-3 | Contig_2255.5 | 1 × 10−150 | |
| SPO11-2 | Contig_3410.8 | 3 × 10−23 | |
| DMC1 | Contig_4438.25 | 2 × 10−95 | |
| HOP1 | Contig_667.2 | 5 × 10−23 | |
| HOP2 | Contig_4415.5 | 8 × 10−26 | |
| MSH4 | Contig_4404.10 | 2 × 10−31 | |
| MSH5 | Contig_2159.1 | 2 × 10−41 | |
| MND1 | Contig_2121.23 | 5 × 10−43 | |
| MER3 | Contig_4401.1 | 6 × 10−59 |