| Literature DB >> 15938755 |
Jing-Ke Weng1, Milos Tanurdzic, Clint Chapple.
Abstract
BACKGROUND: The lycophyte Selaginella moellendorffii is a member of one of the oldest lineages of vascular plants on Earth. Fossil records show that the lycophyte clade arose 400 million years ago, 150-200 million years earlier than angiosperms, a group of plants that includes the well-studied flowering plant Arabidopsis thaliana. S. moellendorffii has a genome size of approximately 100 Mbp, as small or smaller than that of A. thaliana. S. moellendorffii has the potential to provide significant comparative information to better understand the evolution of vascular plants.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15938755 PMCID: PMC1184070 DOI: 10.1186/1471-2164-6-85
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1A simplified version of the plant phylogenetic tree simplified and condensed from Pryer et al. [11]. The tree shows that lycophytes (highlighted) diverged from other vascular plant lineages soon after plants colonized the terrestrial environment. Representative species were chosen from sub-clades within the clades listed, and illustrate major developments in plant evolution including the colonization of land (land plants, L), the development of vasculature (vascular plants, V) and true leaves (euphyllophytes, E), and the evolution of flowers (flowering plants, F), and seeds (seed plants, S).
Figure 2The morphology of (a) A greenhouse grown S. moellendorffii. (b) A close up of an aerial branch of S. moellendorffii indicating the bulbils (white arrows) that can be used for clonal propagation and sporangia (black arrows) containing microspores and megaspores for sexual propagation.
Figure 3Distribution of ESTs were clustered into putative unigene sets using StackPack v. 2.2, and the number of cluster members of each size category was plotted relative to their abundance within the EST collection.
The most abundantly represented ESTs in the S. moellendorffii cDNA library.
| Cluster | Number of ESTs | Top BLASTX hit in non-redundant protein database | |||
| Accession Number | Best Identity Description | E-value | |||
| 1 | SmoC-1_cn126 | 105 | - | Novel | - |
| 2 | SmoC-1_cn125 | 46 | - | Novel | - |
| 3 | SmoC-1_cn018 | 31 | SP:P16031 | Ribulose bisphosphate carboxylase small subunit) [ | 8E-51 |
| 4 | SmoC-1_cn121 | 25 | SP:P04669 | Ferredoxin, chloroplast precursor [ | 2E-26 |
| 5 | SmoC-1_cn106 | 17 | PIR:S16294 | chlorophyll a/b-binding protein [ | 9E-99 |
| 6 | SmoC-1_cn107 | 17 | GB:AAM46780 | latex plastidic aldolase-like protein [ | 1E-164 |
| 7 | SmoC-1_cn171 | 17 | PIR:S31863 | chlorophyll a/b-binding protein [ | 1E-106 |
| 8 | SmoC-1_cn011 | 14 | GB:AAC78107 | photosystem-1 H subunit GOS5 [ | 8E-30 |
| 9 | SmoC-1_cn233 | 13 | SP:Q9SXW9 | Plastocyanin, chloroplast precursor [ | 2E-37 |
| 10 | SmoC-1_cn025 | 11 | SP:P51118 | glutamine synthetase cytosolic isoenzyme 1 [ | 1E-152 |
| 11 | SmoC-1_cn089 | 11 | GB:AAG17036 | S-adenosylmethionine synthetase [ | 7E-17 |
| 12 | SmoC-1_cn195 | 11 | SP:P11432 | Early light-induced protein, chloroplast precursor (ELIP) [ | 1E-32 |
| 13 | SmoC-1_cn023 | 9 | SP:P82977 | Subtilisin-chymotrypsin inhibitor [ | 4E-11 |
| 14 | SmoC-1_cn145 | 9 | - | Novel | - |
| 15 | SmoC-1_cn179 | 9 | SP:P30361 | Cytochrome B6-F complex iron-sulfur subunit 1, chloroplast precursor [ | 3E-74 |
| 16 | SmoC-1_cn189 | 9 | - | Novel | - |
| 17 | SmoC-1_cn006 | 8 | GB:AAG59875 | PSII subunit PsbW [ | 5E-13 |
| 18 | SmoC-1_cn078 | 8 | SP:O48560 | Catalase 3 [ | 0 |
| 19 | SmoC-1_cn211 | 8 | SP:P23993 | Photosystem I reaction center subunit XI, chloroplast precursor [ | 2E-55 |
| 20 | SmoC-1_cn226 | 8 | PDB:1EKJA | Carbonic Anhydrase [Pisum Sativum] | 2E-63 |
| 21 | SmoC-1_cn019 | 7 | REF:NP_175963 | photosystem I reaction center subunit V, chloroplast, [ | 2E-34 |
| 22 | SmoC-1_cn108 | 7 | PIR:T23512 | hypothetical protein K08H10.2a [ | 1E-12 |
| 23 | SmoC-1_cn215 | 7 | GB:AAB88617 | ubiquitin conjugating enzyme [ | 3E-82 |
| 24 | SmoC-1_cn218 | 7 | SP:P27494 | Chlorophyll a-b binding protein 36, chloroplast precursor [ | 1E-127 |
| 25 | SmoC-1_cn013 | 6 | PIR:T06471 | core protein [ | 1E-20 |
| 26 | SmoC-1_cn016 | 6 | SP:Q9SLQ8 | Oxygen-evolving enhancer protein 2, chloroplast precursor [ | 1E-79 |
| 27 | SmoC-1_cn033 | 6 | GB:AAM97011 | expressed protein [ | 6E-40 |
| 28 | SmoC-1_cn136 | 6 | GB:AAO49652 | photosystem I-N subunit [ | 2E-37 |
| 29 | SmoC-1_cn139 | 6 | DBJ:BAC66946 | chloroplastic iron superoxide dismutase [ | 3E-69 |
| 30 | SmoC-1_cn180 | 6 | EMB:CAB71293 | chloroplast ferredoxin-NADP+ oxidoreductase precursor [ | 1E-139 |
| 31 | SmoC-1_cn208 | 6 | SP:P54773 | Photosystem II 22 kDa protein, chloroplast precursor [ | 5E-61 |
| 32 | SmoC-1_cn250 | 6 | - | Novel | - |
Non-redundant protein database includes all non-redundant GenBank CDS translations (GB)+ RefSeq Proteins (REF) +PDB + SwissProt (SP) + PIR + PRF. The identities of ESTs were putatively described by the top BLASTX hit (with lowest E-value) of the assembled EST contigs.
The GO categorization of S. moellendorffii ESTs by biological process, molecular function, and cellular component.
| Gene Ontology term | Representation | Representation percentage | |
| Biological process | Metabolism | 312 | 74% |
| Biosynthesis | 64 | 15% | |
| Protein metabolism | 57 | 14% | |
| Catabolism | 22 | 5% | |
| Nucleic acid metabolism | 20 | 5% | |
| Cell growth and/or maintenance | 53 | 13% | |
| Transport | 44 | 10% | |
| Response to stimulus and stress | 19 | 5% | |
| Photosynthesis | 16 | 4% | |
| Cell communication | 15 | 4% | |
| Signal transduction | 12 | 3% | |
| Homeostasis | 3 | 1% | |
| Development | 1 | <1% | |
| Cell death | 1 | <1% | |
| Molecular function | Catalytic activity | 132 | 36% |
| Hydrolase activity | 36 | 10% | |
| Transferase activity | 25 | 7% | |
| Oxidoreductase activity | 22 | 6% | |
| Kinase activity | 12 | 3% | |
| Binding | 107 | 29% | |
| Nucleotide binding | 65 | 18% | |
| Metal ion binding | 20 | 5% | |
| Transporter activity | 64 | 18% | |
| Electron transporter activity | 16 | 4% | |
| Carrier activity | 12 | 3% | |
| Structural molecule activity | 40 | 11% | |
| Translation regulator activity | 10 | 3% | |
| Signal transducer activity | 4 | 1% | |
| Chaperone activity | 3 | 1% | |
| Enzyme regulator activity | 2 | 1% | |
| Motor activity | 1 | <1% | |
| Transcription regulator activity | 1 | <1% | |
| Cellular component | Intracellular | 135 | 75% |
| Membrane | 45 | 25% |
Note that one gene product may be assigned to more than one GO terms, and one children term can fit into multiple parental categories. The representation means the number of non-redundant ESTs that can be mapped to a certain GO term. The representation percentage is based on the total number of GO mappings in each of the three major ontologies (biological process: 420, molecular function: 364, cellular component: 180).
Figure 4Representations of Gene Ontology (GO) mapping results for (a) Biological process (b) Molecular function (c) Cellular component.
Comparison of GO assignments between A. thaliana ESTs and S. moellendorffii ESTs.
| Gene Ontology term | Categories | Representation percentage | |
| Biological process | Metabolism | 74% | 39% |
| Cell growth and/or maintenance | 13% | 13% | |
| Response to stimulus and stress | 5% | 16% | |
| Photosynthesis | 4% | <1% | |
| Cell communication | 4% | 6% | |
| Homeostasis | 1% | 1% | |
| Development | <1% | 6% | |
| Cell death | <1% | 1% | |
| Molecular function | Catalytic activity | 36% | 41% |
| Binding | 29% | 32% | |
| Transporter activity | 18% | 8% | |
| Structural molecule activity | 11% | 2% | |
| Translation regulator activity | 3% | 1% | |
| Signal transducer activity | 1% | 1% | |
| Chaperone activity | 1% | 2% | |
| Enzyme regulator activity | 1% | 1% | |
| Motor activity | <1% | 1% | |
| Transcription regulator activity | <1% | 7% | |
| Cellular component | Intracellular | 75% | 70% |
| Membrane | 25% | 19% | |
The GO assignments for A. thaliana ESTs were obtained from TIGR [38]. The percentage of GO assignments for A. thaliana was calculated based on the total numbers of GO mappings in each of the three major ontologies with the number of unknown terms deducted from them (biological process: 20185, molecular function: 23680, cellular component: 6309). The functional categories present in A. thaliana but not in S. moellendorffii were not listed in the table.
Figure 5A Venn diagram showing the distribution of The 1301 translated S. moellendorffii non-redundant ESTs were used as queries in homology searches against A. thaliana and P. patens EST databases, respectively. The two inner circles contain the numbers and percentages of S. moellendorffii ESTs that share tBLASTX similarity with A. thaliana or P. patens ESTs. The region between inner circles and outer circle represents S. moellendorffii ESTs without tBLASTX matches.
Top 20 S. moellendorffii EST tBLASTX hits for A. thaliana ESTs that are not present within the P. patens EST database.
| Non-redundant EST | tBLASTX E-value | Best BLASTX Descriptor in | Accession Number | |
| 1 | SmoC-1_01_H05 | 1E-107 | expressed protein | REF:NP_194688 |
| 2 | SmoC-1_02_C05 | 1E-99 | oligopeptide transporter OPT family protein | REF:NP_192815 |
| 3 | SmoC-1_01_L23 | 2E-99 | putative Mg-protoporphyrin IX chelatase | REF:NP_196867 |
| 4 | SmoC-1_05_G17 | 2E-99 | putative caffeoyl-CoA 3-O-methyltransferase | REF:NP_195131 |
| 5 | SmoC-1_05_K13 | 5E-90 | chloroplast membrane protein (ALBINO3) | REF:NP_180446 |
| 6 | SmoC-1_01_E02 | 1E-89 | cullin family protein | REF:NP_567243 |
| 7 | SmoC-1_05_G03 | 7E-87 | putative UDP-galactose/UDP-glucose transporter | REF:NP_563949 |
| 8 | SmoC-1_05_I19 | 6E-86 | expressed protein | REF:NP_566060 |
| 9 | SmoC-1_02_N15 | 9E-86 | nicotinate phosphoribosyltransferase family protein | REF:NP_179923 |
| 10 | SmoC-1_03_I01 | 7E-80 | glycoside hydrolase family 77 protein | REF:NP_181616 |
| 11 | SmoC-1_cn293 | 4E-77 | amine oxidase family protein | REF:NP_181830 |
| 12 | SmoC-1_03_C24 | 9E-73 | uridylyltransferase-related protein | REF:NP_564010 |
| 13 | SmoC-1_02_P14 | 5E-70 | expressed protein | REF:NP_199542 |
| 14 | SmoC-1_06_P21 | 2E-69 | RNase L inhibitor protein-related | REF:NP_196569 |
| 15 | SmoC-1_05_G10 | 3E-69 | expressed protein | REF:NP_191746 |
| 16 | SmoC-1_03_C14 | 1E-66 | putative isoflavone reductase | REF:NP_565107 |
| 17 | SmoC-1_03_N06 | 6E-65 | transducin / WD-40 repeat family protein | REF:NP_190148 |
| 18 | SmoC-1_06_M11 | 1E-63 | dehydration stress-induced protein | GB:AAM62648 |
| 19 | SmoC-1_06_B20 | 1E-60 | putative membrane protein | REF:NP_849987 |
| 20 | SmoC-1_05_O21 | 2E-60 | paired amphipathic helix repeat-containing protein | REF:NP_186781 |
The tBLASTX E-value of an EST varies with its BLASTX E-value in a small range (e.g. SmoC-1_01_H05 has a tBLASTX E-value of 1E-107 against its homologous A. thaliana EST and a BLASTX E-value of 2E-94 against the translated full length A. thaliana cDNA.). The homology ranking was based on the tBLASTX E-value. The identities of ESTs were putatively described by the A. thaliana protein with the lowest E-value in the BLASTX analysis.