| Literature DB >> 20157482 |
L Wissler1, E Dattolo, A D Moore, T B H Reusch, J L Olsen, M Migliaccio, E Bornberg-Bauer, G Procaccini.
Abstract
As ecosystem engineers, seagrasses are angiosperms of paramount ecological importance in shallow shoreline habitats around the globe. Furthermore, the ancestors of independent seagrass lineages have secondarily returned into the sea in separate, independent evolutionary events. Thus, understanding the molecular adaptation of this clade not only makes significant contributions to the field of ecology, but also to principles of parallel evolution as well. With the use of Dr. Zompo, the first interactive seagrass sequence database presented here, new insights into the molecular adaptation of marine environments can be inferred. The database is based on a total of 14 597 ESTs obtained from two seagrass species, Zostera marina and Posidonia oceanica, which have been processed, assembled and comprehensively annotated. Dr. Zompo provides experimentalists with a broad foundation to build experiments and consider challenges associated with the investigation of this class of non-domesticated monocotyledon systems. Our database, based on the Ruby on Rails framework, is rich in features including the retrieval of experimentally determined heat-responsive transcripts, mining for molecular markers (SSRs and SNPs), and weighted key word searches that allow access to annotation gathered on several levels including Pfam domains, GeneOntology and KEGG pathways. Well established plant genome sites such as The Arabidopsis Information Resource (TAIR) and the Rice Genome Annotation Project are interfaced by Dr. Zompo. With this project, we have initialized a valuable resource for plant biologists in general and the seagrass community in particular. The database is expected to grow together with more data to come in the near future, particularly with the recent initiation of the Zostera genome sequencing project.The Dr. Zompo database is available at http://drzompo.uni-muenster.de/Entities:
Year: 2009 PMID: 20157482 PMCID: PMC2790305 DOI: 10.1093/database/bap009
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Dr. Zompo: screenshot of numerous ways to access the seagrass EST sequences and their annotation. Browse/Search results view, interactive flash charts of EST statistics, Pfam domain annotation represented as a tag cloud, KEGG pathway annotation with highlighted seagrass genes.
Figure 2.Dr. Zompo: extract from the detailed view of an exemplary contig. Large body of functional annotation (SwissProt, GeneOntology, Pfam, KEGG) plus visual representation of assembled reads and sequence properties (ORF, domains, polymorphisms).
Summarizing statistics on the EST sequence preprocessing steps and sequence analysis for both Zostera marina and Posidonia oceanica EST collections
| Number of raw EST sequences | 9412 | 5185 |
| Number of clean sequences after pregap4 | 7876 | 3089 |
| Number of clean sequences after crossmatch | 7416 | 3079 |
| Number of sequences in contigs after EST assembly | 5024 | 2238 |
| Number of singletons after EST assembly | 2392 | 841 |
| Number of contigs after EST assembly | 995 | 378 |
| Number of unigenes (contigs plus singletons) | 3387 | 1219 |
| Number of unigenes with a putative ORF | 3387 | 1219 |
| … based on homology to NCBI nr (BLASTX, | 2467 | 963 |
| … without homology but longest polypeptide >20aa | 920 | 256 |
| Number of unigenes containing SSRs | 585 | 346 |
| Total number of SSRs | 730 | 498 |
| Number of unigenes containing potential SNPs | 115 | 18 |
| Total number of potential SNPs | 393 | 436 |
Putative ORFs were predicted based on homology to known proteins or by identifying the longest stretch of DNA that could be translated into more than 20 amino acids; potential SNPs were determined using QualitySNP (Tang,J., Vosman,B., Voorrips,R.E., van der Linden,C.G., and Leunissen,J.A.M. (2006) QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species. BMC Bioinformatics, 7, 438), and SSRs were detected using SSRIT (Temnykh,S., DeClerck,G., Lukashova,A., Lipovich,L., Cartinhour,S. and McCouch,S. (2001) Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genome Res, 11, 1441–1452).
List of Zostera–Posidonia ortholog pairs that were found to have significantly different expression levels (i.e. number of EST reads)
| Expected | Observed | χ2 | SwissProt annotation | ||
|---|---|---|---|---|---|
| Zoma_ZMD12058 | Pooc_Contig125 | 24.8 : 16.2 | 1 : 40 | 1129.59 | Oxygen-evolving enhancer protein 2-2,chloroplast precursor |
| Zoma_Contig902 | Pooc_Contig3 | 24.8 : 16.2 | 10 : 31 | 436.04 | Ribulose bisphosphate carboxylase small chainSSU5B, chloroplast precursor |
| Zoma_Contig740 | Pooc_Contig287 | 21.1 : 13.9 | 33 : 2 | 281.26 | Photosystem II 10 kDa polypeptide, chloroplastprecursor |
| Zoma_ZME03108 | Pooc_Contig281 | 10.9 : 7.1 | 1 : 17 | 194.94 | Probable glutathione S-transferase GSTU6 |
| Zoma_Contig116 | Pooc_Contig279 | 12.7 : 8.3 | 3 : 18 | 187.59 | Light-regulated protein precursor |
| Zoma_Contig825 | Pooc_PC010E04 | 11.5 : 7.5 | 18 : 1 | 85.11 | 40S ribosomal protein S11 |
| Zoma_Contig630 | Pooc_PC017D04 | 9.1 : 5.9 | 14 : 1 | 48.8 | Actin-depolymerizing factor 4 |
| Zoma_Contig609 | Pooc_Contig167 | 12.1 : 7.9 | 17 : 3 | 48.4 | 60S ribosomal protein L10 |
| Zoma_Contig14 | Pooc_Contig14 | 23.6 : 15.4 | 28 : 11 | 39.47 | Glyceraldehyde-3-phosphate dehydrogenase, cytosolic |
| Zoma_Contig680 | Pooc_PC019A01 | 7.9 : 5.2 | 12 : 1 | 34.4 | 40S ribosomal protein S18 |
| Zoma_Contig421 | Pooc_Contig242 | 12.1 : 7.9 | 16 : 4 | 30.72 | – |
| Zoma_Contig263 | Pooc_PC028C07 | 7.3 : 4.8 | 11 : 1 | 28.15 | Peptidyl-prolyl cis-trans isomerase |
| Zoma_Contig896 | Pooc_Contig170 | 11.5 : 7.5 | 15 : 4 | 24.83 | Translationally-controlled tumor protein homolog |
| Zoma_ZMC06047 | Pooc_Contig217 | 4.2 : 2.8 | 1 : 6 | 20.84 | – |
| Zoma_Contig8 | Pooc_Contig92 | 10.9 : 7.1 | 14 : 4 | 19.56 | Chlorophyll a-b binding protein 40, chloroplastprecursor |
| Zoma_Contig643 | Pooc_Contig193 | 9.1 : 5.9 | 6 : 9 | 18.73 | Nascent polypeptide-associated complex subunitalpha-like protein 1 |
| Zoma_Contig977 | Pooc_Contig19 | 6.0 : 4.0 | 3 : 7 | 18.49 | Photosystem I reaction center subunit XI, chloroplast precursor |
| Zoma_Contig679 | Pooc_Contig8 | 6.6 : 4.4 | 4 : 7 | 13.99 | ADP-ribosylation factor 1 |
| Zoma_ZME01071 | Pooc_Contig256 | 3.6 : 2.4 | 1 : 5 | 13.77 | 40S ribosomal protein S19 |
| Zoma_Contig828 | Pooc_PC046B10 | 5.4 : 3.6 | 8 : 1 | 13.15 | ATP synthase subunit d, mitochondrial |
| Zoma_Contig636 | Pooc_PC027E02 | 5.4 : 3.6 | 8 : 1 | 13.15 | – |
| Zoma_Contig195 | Pooc_PC034B02 | 5.4 : 3.6 | 8 : 1 | 13.15 | Adenosine kinase 2 |
| Zoma_Contig880 | Pooc_Contig271 | 8.5 : 5.5 | 11 : 3 | 12.94 | Inosine-5'-monophosphate dehydrogenase |
| Zoma_Contig748 | Pooc_Contig278 | 5.4 : 3.6 | 3 : 6 | 11.87 | Peroxiredoxin Q, chloroplast precursor |
| Zoma_Contig239 | Pooc_PC023A07 | 4.8 : 3.2 | 7 : 1 | 9.4 | – |
| Zoma_Contig69 | Pooc_Contig180 | 7.9 : 5.2 | 10 : 3 | 9.22 | Thioredoxin H-type 1 |
| Zoma_ZMF05139 | Pooc_Contig367 | 3.0 : 2.0 | 1 : 4 | 8.16 | 60S ribosomal protein L18-2 |
| Zoma_ZME04139 | Pooc_Contig4 | 3.0 : 2.0 | 1 : 4 | 8.16 | – |
| Zoma_ZMC13030 | Pooc_Contig83 | 3.0 : 2.0 | 1 : 4 | 8.16 | Pollen-specific protein C13 precursor |
| Zoma_ZMA09041 | Pooc_Contig264 | 3.0 : 2.0 | 1 : 4 | 8.16 | Nucleoside diphosphate kinase 1 |
| Zoma_Contig913 | Pooc_Contig362 | 4.8 : 3.2 | 3 : 5 | 6.71 | Zeaxanthin epoxidase, chloroplast precursor |
| Zoma_Contig736 | Pooc_Contig211 | 4.8 : 3.2 | 3 : 5 | 6.71 | Acyl carrier protein 1, chloroplast precursor |
| Zoma_Contig488 | Pooc_Contig61 | 4.8 : 3.2 | 3 : 5 | 6.71 | 60S ribosomal protein L7-4 |
Expression levels were compared using a χ2-test with P ≤ 0.01, where the expected number of reads was determined by averaging the overall number of reads in both species considering their different library sizes. All ortholog pairs which contained Zostera unigenes that have been found to be responsive to the heat stress treatment were removed.