Literature DB >> 17881371

SmedGD: the Schmidtea mediterranea genome database.

Sofia M C Robb1, Eric Ross, Alejandro Sánchez Alvarado.   

Abstract

The planarian Schmidtea mediterranea is rapidly emerging as a model organism for the study of regeneration, tissue homeostasis and stem cell biology. The recent sequencing, assembly and annotation of its genome are expected to further buoy the biomedical importance of this organism. In order to make the extensive data associated with the genome sequence accessible to the biomedical and planarian communities, we have created the Schmidtea mediterranea Genome Database (SmedGD). SmedGD integrates in a single web-accessible portal all available data associated with the planarian genome, including predicted and annotated genes, ESTs, protein homologies, gene expression patterns and RNAi phenotypes. Moreover, SmedGD was designed using tools provided by the Generic Model Organism Database (GMOD) project, thus making its data structure compatible with other model organism databases. Because of the unique phylogenetic position of planarians, SmedGD (http://smedgd.neuro.utah.edu) will prove useful not only to the planarian research community, but also to those engaged in developmental and evolutionary biology, comparative genomics, stem cell research and regeneration.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17881371      PMCID: PMC2238899          DOI: 10.1093/nar/gkm684

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Schmidtea mediterranea is a freshwater planarian of the phylum Platyhelminthes that is rapidly becoming a model system for the investigation of regeneration, tissue homeostasis and stem cell biology (1). Interest is being spurred on by the remarkable biology of these animals and the expanding repertoire of tools available to interrogate their biology (2). Like other non-parasitic flatworms, S. mediterranea has the ability to regenerate complete animals from small, excised body fragments. If a planarian is decapitated, both head and trunk fragments regenerate the missing body parts, i.e. the body and the head, respectively. The process takes seven days and results in the full, functional integration of the newly regenerated tissues and organs to the pre-existing structures. This remarkable developmental plasticity is made possible by a population of somatic stem cells known as neoblasts found throughout the body of planarians. Because of their abundance and characteristic undifferentiated state, neoblasts are both easy to identify and amenable to experimental manipulation. In addition, planarians belong to the Lophotrochozoa, a large group of understudied animals that is sister to the Ecdysozoa (e.g. Drosophila and Caenorhabditis elegans) and the Deuterostomes (e.g. non-mammalian and mammalian vertebrates). Presently, the Lophotrochozoa are poorly represented among currently sequenced genomes, and much of their molecular and developmental biology remain unexplored. Hence, the study of S. mediterranea is likely to both complement ongoing studies in available model systems and to expand our knowledge in a large number of long-standing and fundamental problems relevant to human health and biology (e.g. tissue homeostasis and regeneration) not readily studied in well-established model systems such as Drosophila and C. elegans. Recently, many methodologies have been introduced to analyze planarian biology in depth. Prominent among these are the availability of ∼78 000 ESTs, the study of gene function via robust and reproducible RNAi methodologies (3,4) and a sequenced, assembled and annotated genome. Because these growing resources were devoid of an integrative tool capable of coordinating the inflow of genomic and functional genomic data, we set out to create an easy to use, yet comprehensive database to house and mine this information. The result of this effort is the Schmidtea mediterranea Genome Database (SmedGD), which was constructed using tools from the Generic Model Organism Database (GMOD; http://www.gmod.org) project, and populated with GMOD-compliant annotation data from MAKER (5), as well as information collected from a wide variety of sources such as Gene Ontology (GO), PFAM, SwissProt, SMART and others. Deploying GMOD tools to construct SmedGD facilitates and ensures: (1) interoperability of SmedGD with other genome databases (e.g. WormBase and FlyBase) to allow comparative genomic studies; (2) short- and long-term curation of gene models using Apollo (6); and (3) future expansions to include microarray gene expression data, in situ expression patterns, and PubMed references, for example. In its present form, members of the biomedical and planarian research communities can use SmedGD to find genes of interest and their homologs in other species, download sequences, link to other databases, and find RNAi phenotypes. SmedGD should prove a key resource in furthering the development and integration of S. mediterranea as an important model system for current studies of metazoan biology and human disease.

SmedGD architecture

In order to achieve interoperability between SmedGD and other organism databases such as FlyBase and WormBase, we constructed SmedGD using tools and components from GMOD, a NIH-funded effort aimed at providing generic software to build new genome/organism databases. SmedGD consists of two major components: a GFF database and a GBrowse generic framework (7) capable of autogenerating a generic genome browser from the database. Data in GFF3 format and compliant with Sequence Ontology (8), is uploaded to a database using a perl script distributed with Bioperl (bp_seqfeature_load.pl) (9). The script converts GFF3 formatted annotations to Bio::SeqFeatureI objects, and generates a database schema that is GMOD-compliant. We used this mySQL-based schema instead of the Chado schema provided by GMOD because of the length of time it took to both load the S. mediterranea data into Chado (days instead of hours) and to query the resulting PostgresSQL database (minutes instead of seconds). Attempts were made to increase the efficiency of loading and querying, but not enough improvement was noted. Although Chado is a very robust relational database that has been successfully implemented in the development of BeetleBase (10) and ParameciumDB (11), we ascribe the underperformance of this schema in our hands to the sheer size of the S. mediterranea data (∼900 Mb), i.e. 12.5× the size of ParameciumDB and 4.5× the size of BeetleBase. We made sure that the mySQL database generated from the GFF3 files conformed to GMOD standards. Therefore, we parsed the database using GBrowse, which implemented a Bio::DB::SeqFeature::Store database adaptor to access and display rows of data or tracks that are mapped to specific locations in the genome (Figure 1). Customizations to the standard GMOD distribution were also made to accommodate additional database searching and sequence retrieval. CGI scripts that interface with SmedGD's mySQL database using DBI, a perl module, enable specialized queries of GO terms and RNAi phenotypes, and uniquely formatted protein homology search results. Changes made to the GBrowse configuration file allow for the linking to a CGI script that uses ‘fastacmd’, free software distributed by the NCBI (http://www.ncbi.nlm.nih.gov), to retrieve sequences from specially formatted fasta files and displays them when the ‘Name’ of an mRNA is selected in the mRNA ‘Details Page’ (see below). The web page displaying the retrieved sequence includes a link to the NCBI BLAST web server, which when selected will auto fill the NCBI BLAST forms with the retrieved planarian sequence.
Figure 1.

Screen capture of SmedGD displaying genomic contig v31.019651. This contig has only one predicted gene, which has 5 exons and a 3′UTR. The tracks displayed include the gene model, its corresponding predicted transcript, and the relevant biological evidence associated with this model (see text for detailed explanations of each track). From this data, users can see the details of the gene model and its evidence (all of the predicted exons are supported by EST and protein evidence), and that the gene is likely coding for a histone deacetylase. Double-stranded RNA has been used to silence this gene and the resulting phenotypes are listed. cDNA Microarray data is not yet available, but a sample of how this information will be viewed is presented. An arrow pointing down indicates down-regulation of the gene in the experimental group.

Screen capture of SmedGD displaying genomic contig v31.019651. This contig has only one predicted gene, which has 5 exons and a 3′UTR. The tracks displayed include the gene model, its corresponding predicted transcript, and the relevant biological evidence associated with this model (see text for detailed explanations of each track). From this data, users can see the details of the gene model and its evidence (all of the predicted exons are supported by EST and protein evidence), and that the gene is likely coding for a histone deacetylase. Double-stranded RNA has been used to silence this gene and the resulting phenotypes are listed. cDNA Microarray data is not yet available, but a sample of how this information will be viewed is presented. An arrow pointing down indicates down-regulation of the gene in the experimental group. Physically, SmedGD is housed on an Apple Xserve G5 computer with 3 GB of RAM. The operating system used is the Mac OS X Server operating system—version 10.4 and the web server software used is Apache—version 1.3.

SmedGD contents

SmedGD is a storehouse of valuable data that is easily accessible and conveniently mapped to the planarian genome sequence. The layers of data mapped to the genome sequence are MAKER annotations, ESTs, 454 cDNA sequences, protein homology, protein domains, Gene Ontology terms, RNAi phenotypes, mRNA expression patterns, human curated genes and microRNAs (Table 1).
Table 1.

Types and number of entries that comprise SmedGD

Data typeCount
Sequence type
    Genomic contigs43 294
    Predicted genes:31 955
    MAKER30 437
    SNAP1518
    Predicted mRNAs32 448
    Human curated genes1000
    ESTs78 101
    EST contigs15 043
    454 contigs9071
    MicroRNAs71
Protein homology
    Swissprot25 733
    C. elegans Proteome5319
    H. Sapiens Proteome10 746
    M. musculus Proteome14 713
    D. melanogaster4301
    BLASTX with reference proteomes88 293
Protein Domains
    PFAM14 442
    SMART5711
GO terms
    Molecular Function25 075
    Cellular Location12 294
    Biological Process23 842
Experimental data
    RNAi phenotypes303
    mRNA expression patterns123
Types and number of entries that comprise SmedGD

Assembly and annotation data

Sequencing and assembly data was provided by the Washington University Genome Sequencing Center in St. Louis, MO. The current assembly version 3.1 (v31), consists of ∼900 Mb of sequence distributed over 43 294 supercontigs. SmedGD also houses the annotation of v31, which was performed using a recently developed automated annotation pipeline named MAKER (5). MAKER implements an algorithm that uses protein homology and collectively assembled ESTs from various S. mediterranea EST projects (12,13) using CAP3 (14) to predict genes, many with 5′ and 3′ UTRs. The predicted genes (31 955) and the accompanying transcripts and splice variants are mapped to the genome sequence. The protein homology and the EST alignments that were used for gene prediction are also mapped to the genome to give the investigator the ability to judge the accuracy of the gene prediction.

cDNA data

Additional evidence is provided to encourage critical analysis of the predictions. The ∼78 000 ESTs that were assembled and used by MAKER were individually aligned to the genome sequence using BLAT (15). We also aligned ∼9000 mRNAs sequenced using 454 technology (16). These alignments enable the user to identify the genomic location of previously published ESTs and grade the positions of exons and splice sites.

Protein homology data

To assist in determining gene function, homologs of the MAKER-predicted protein sequences are included in SmedGD. This set of data was obtained by comparing the MAKER proteins to the PFAM (17) and SMART (18) portions of the CDD database (19), and to Swiss-Prot (20) using RPS-BLAST (21) and BLAST (22). To further validate the predicted genes and identify missing exons, the proteomes of C. elegans, Drosophila melanogaster, Homo sapiens, and Mus musculus were individually subjected to TBLASTN comparisons (22) against the planarian genome translated into six frames.

Gene ontology

Gene Ontology terms (23–25) associated with each of the homologous database entries were extracted and entered into SmedGD. The addition of GO terms makes searching the database for genes with a specific association to a biological process, molecular function or cellular location possible (see below).

RNAi and whole-mount in situ hybridization data

In 2005, the results of an extensive RNAi screen in S. mediterranea were published (3). This work has been incorporated into SmedGD. RNAi phenotypes can be searched and the results link to the genome browser. The ‘Details Page’ displays images and descriptions of phenotypes. In addition to RNAi experimental data, mRNA expression patterns are also included in the database. If an in situ hybridization of an mRNA exists, images are correlated with the position of the corresponding gene in the genome and can be accessed for viewing directly from the genome browser (see below).

Additional data

Additional features mapped to the genome include human-curated gene annotations and mature microRNA sequences (26). Because of the modular nature of GMOD, it will not be difficult to add additional data as it is collected. Currently, we are testing implementations to incorporate microarray expression data into SmedGD.

Using SmedGD

Overview

SmedGD can be navigated from the top tool bar, where links to the ‘Genome Browser’, sequence search interfaces ‘Blast’, ‘Blat’ and the text search interface ‘Search’ can be found (Figure 1). In order to facilitate large-scale, high-throughput mining of the data in SmedGD, we established a standard naming nomenclature for all genes and associated data (Table 2). Each name consists of up to four fields separated by a period. The first field identifies the type of data being viewed (assembly version, annotation run, EST or 454 cDNA contigs). The second field indicates the number of the genomic or EST/454 contig in which the data can be found. The third and fourth fields indicate the identification number of the feature (e.g. gene, transcript) associated with the genomic contig. A description of the nomenclature with examples is presented in Table 2.
Table 2.

Standardized nomenclature strategy to denote individual entries in SmedGD

TypeExample nameDescription of field
Genomic contigsv31.019651Assembly version 3.1Contig No*
Genesmk4.019651.00Maker Run 4Contig No.*Gene No.**
Transcriptsmk4.019651.00.01Maker Run 4Contig No.*Gene No.**Transcript No.**
EST contigsec1.00159.004EST contig Version 1EST contig No.***No. of reads ****
454 contigsfc1.08641.005454 contig Version 1454 contig No.***No. of reads ****

*padded to 6 digits.

**padded to 2 digits.

***padded to 5 digits.

****padded to 3 digits.

Standardized nomenclature strategy to denote individual entries in SmedGD *padded to 6 digits. **padded to 2 digits. ***padded to 5 digits. ****padded to 3 digits.

Genome browser page

From within the genome browser the ‘Search Landmark or Region’ function can be used to query the database to find features mapped to specific regions of the genome. Examples of search terms are protein homology (piwi), contig names (v31.019651), and gene names (mk4.019651.00). The ‘Overview Panel’ provides a simplified view of the entire genomic contig (Figure 1). The red box indicates the area of the contig which is being viewed in the ‘Details’ panel. The red box can be recentered on a different area of the contig by clicking in the ‘Overview’ panel. The size of the box and therefore the number of base pairs viewed in the details panel can be altered by changing the parameters in the ‘Scroll/Zoom’ drop down menu or by adding the desired region to be viewed in the ‘Search Landmark or Region’ box of the contig. (e.g. v31.019651:1375..2445). The ‘Details’ panel contains the information that aligns to the genome, ranging from predicted genes to RNAi phenotypes. The information displayed in the ‘Details’ panel is controlled by the selection of tracks. Tracks are biologically and bioinformatically obtained data that have been aligned to the genomic contigs. The types of data are subdivided into the groups ‘Genes’, ‘Phenotype/Expression’, ‘Sequence Similarity’, ‘Species TBLASTN’ and ‘Non-Coding RNA’.

Genes

Within this group, genes/mRNAs predicted by MAKER that are supported by High Quality evidence (e.g. ESTs and protein homology), and ‘Human-Curated Genes/mRNAs’ (human-edited genes/mRNAs) are displayed. Presently, and using Apollo (6), only one investigator has manually curated gene models (Dr Alejandro Sánchez Alvarado). To identify the provenance of the curation, the edited gene models are identified by the three letter designation, e.g. ASA, followed by a gene number and transcript number (ASA.00084.01). It is expected that others will join efforts in the curation of genes. Such edits will be identified by a unique three letter designation corresponding to the name of the person or laboratory responsible for the editing.

Phenotype/expression

Display of RNAi Phenotypes and in situ data are controlled by this group of tracks. The RNAi Phenotypes appear as a bar that spans the length of the EST used to disrupt gene function by RNA interference. The description of the resulting phenotype is located under the bar (RNAi:AY967490). If an annotation has an mRNA in situ hybidization associated with it, a hyperlinked thumbnail is displayed that can be selected to obtain more detailed information (AY967481:in situ).

Sequence similarity

EST and 454 cDNA contigs, individual ESTs and protein sequences have been aligned to the genome and their display is controlled in this section of the ‘Tracks’. In the ‘454 contigs aligned with BLAT’, 454 contigs prepared from 454 sequencing reads were aligned to the genome using the Blat algorithm and standard psl output. BLAT is designed to quickly find sequences of 90% and greater similarity with a score of 30. In the ‘BLASTX Hits track’ nucleotide to protein comparisons, via WU-BLASTX six-frame translation (http://blast.wustl.edu) was used to find similarity hits of genomic sequence searched against a reference protein dataset comprising of the proteomes of E. coli, Saccharomyces cerevisiae, D. melanogaster, C. elegans, Platyhelminthes, Ciona intestinalis, mouse and human. Statistical cutoffs of 40% identity and expectation value of 1e−5 were used. The ‘EST BLASTN’ track is made up of contigs of S. mediterranea ESTs that were aligned to the S. mediterranea genome using cutoffs of 85% identity and expectation value of 1e−10. Similarly, ‘ESTs aligned with BLAT’ is made up of all the individual S. mediterranea ESTs aligned to the S. mediterranea genome using WU-BLAST formatted output from BLAT with expectation value cutoffs of 1e−95 and with at least 45% alignment of the EST. There are three tracks that align whole proteins or protein domains to the MAKER predictions. The first of these, ‘BLASTP to Swissprot’ are alignments of predicted proteins from MAKER gene models against Swissprot. Next is the ‘MAKER predictions RPS-BLAST to PFAM’. These are alignments of predicted proteins from MAKER gene models against PFAM using NCBI RPS-Blast to identify known functional motifs. The last of the three tracks is the ‘MAKER predictions RPS-BLAST to SMART’. These are alignments of predicted proteins from MAKER gene models against SMART using NCBI RPS-Blast. An expectation value cutoff of 1e−3 and 40% alignment requirement with the protein hit was used in filtering the results of these three protein homology searches. The final option in the ‘Sequence Similarity’ grouping is ‘Repeat Regions’, which when selected displays areas of the genome containing interspersed repeats and low complexity DNA sequences as identified by RepeatMasker (http://www.repeatmasker.org/) using a S. mediterranea-specific repeat library (5).

Species TBLASTN

This track displays alignments of the proteomes of individual species to the planarian genome using TBLASTN. The proteomes of C. elegans, D. melanogaster, H. sapiens and M. musculus were aligned and results with expectation values equal to or less than 1e−5 and 30% alignment were selected for display in the browser. These pre-computed sets of data allow users to determine quickly if homologs to the gene of interest exist in the genomes of established genetic model systems.

Non-coding RNA

This track maps published mature miRNA sequences (26) to the genome (sme-miR-2b, for example). Efforts are underway to populate this track with miRNA gene predictions as well. In all of the above cases, the images of the features in the ‘Details’ area of the genome browser that align to the genome can be selected to retrieve a ‘Details Page’. The information in the ‘Details Page’ of different feature types displays feature-specific information and links to tools and associated websites. For example, in the mRNA ‘Details Page’, the mRNA name is hyperlinked and selecting it will retrieve protein and nucleotide sequence, while in the miRNA ‘Details Page’, the name will link to the selected record in miRBase (http://microrna.sanger.ac.uk).

BLAST, BLAT and Search pages

The Search tools provide methods for users to directly query the databases of SmedGD. Sequences can be used to find homologs in S. mediterranea by using BLAST and BLAT. Both nucleotide and protein sequences can be used with BLAST, while BLAT will search the whole genome with nucleotide sequences only. The BLAT result page contains a button that will link the results to an auto-generated track in the browser, such that the queried sequence will be aligned visually to the genome and will be in correct alignment with the other data tracks. In the Search page a user can query the text data stored in SmedGD. Protein homology is queried in the Swissprot, SMART, PFAM and proteome hits to the genome (Figure 2A and B). Gene Ontology terms can be searched by cellular location, molecular function or biological process (Figure 3A). Finally RNAi phenotypes can also be queried with an assortment of checkboxes and drop down menus in an additive ‘AND’ fashion (Figure 3B).
Figure 2.

(A) The ‘Protein Homology’ Search interface. In this example, the search term ‘piwi-like’ is being submitted. Each of the hits from SwissProt, SMART, PFAM and the species-specific databases are searched for the user query term. (B) Results of the search are sorted by genomic contig and location. When more than one result is found on one contig, the matches are grouped and the background will be similarly colored. When there is more than one protein match for one genomic location, it is often due to this sequence matching more than one database. When there is only one result per contig the background is colored white. The contig and location are hyperlinked to the genome browser for further inspection.

Figure 3.

(A) ‘Gene Ontology’ Search interface. Users can search for terms, such as ‘stem cell’ in one of three Gene Ontology (GO) categories (e.g. Biological Process). Any protein homology hit to the genome that had GO terms associated with it will be searched and the corresponding genomic contig and location will be returned in a fashion similar to the ‘Protein Homology’ results page. (B) The ‘RNAi Phenotype’ search interface is used by selecting phenotypes from the five categories listed. More information about these categories and phenotypes can be found in (3). The selections are additive, therefore each of the records returned will contain all of the chosen phenotypes.

(A) The ‘Protein Homology’ Search interface. In this example, the search term ‘piwi-like’ is being submitted. Each of the hits from SwissProt, SMART, PFAM and the species-specific databases are searched for the user query term. (B) Results of the search are sorted by genomic contig and location. When more than one result is found on one contig, the matches are grouped and the background will be similarly colored. When there is more than one protein match for one genomic location, it is often due to this sequence matching more than one database. When there is only one result per contig the background is colored white. The contig and location are hyperlinked to the genome browser for further inspection. (A) ‘Gene Ontology’ Search interface. Users can search for terms, such as ‘stem cell’ in one of three Gene Ontology (GO) categories (e.g. Biological Process). Any protein homology hit to the genome that had GO terms associated with it will be searched and the corresponding genomic contig and location will be returned in a fashion similar to the ‘Protein Homology’ results page. (B) The ‘RNAi Phenotype’ search interface is used by selecting phenotypes from the five categories listed. More information about these categories and phenotypes can be found in (3). The selections are additive, therefore each of the records returned will contain all of the chosen phenotypes.

Curation and future expansions of SmedGD

Since SmedGD is based on GMOD tools, updates, expansion and gene model curation are significantly streamlined. This is a key feature of a GMOD-compliant organism/genome database as genome annotation and the generation of evidence to curate such models is a community-wide ongoing effort. As such, curation of individual genes and data associated with their function will need to be updated regularly. Apollo (http://fruitfly.org/annot/apollo/) will be used to edit gene annotations, since this software has been adopted by GMOD as its annotation workbench, and its outputs are GMOD compliant. Currently under development is the incorporation of microarray data to SmedGD. Expression data associated with over 10 000 unique ESTs will be mapped to each corresponding alignment on the genome, allowing users of SmedGD to correlate genome location, expression pattern and functional data in a single window. In addition, we are exploring the implementation of Textpresso (www.textpresso.org) (27) for literature searching of both current and historical publications associated with planarians, regeneration, tissue homeostasis and stem cells. Finally, as the information accrued increases in complexity, we will also implement a BioMart-based data management system in order to facilitate data exports and simplify database management tasks.
  27 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  The Gene Ontology Annotation (GOA) project: implementation of GO in SWISS-PROT, TrEMBL, and InterPro.

Authors:  Evelyn Camon; Michele Magrane; Daniel Barrell; David Binns; Wolfgang Fleischmann; Paul Kersey; Nicola Mulder; Tom Oinn; John Maslen; Anthony Cox; Rolf Apweiler
Journal:  Genome Res       Date:  2003-03-12       Impact factor: 9.043

3.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.

Authors:  Brigitte Boeckmann; Amos Bairoch; Rolf Apweiler; Marie-Claude Blatter; Anne Estreicher; Elisabeth Gasteiger; Maria J Martin; Karine Michoud; Claire O'Donovan; Isabelle Phan; Sandrine Pilbout; Michel Schneider
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

4.  The Bioperl toolkit: Perl modules for the life sciences.

Authors:  Jason E Stajich; David Block; Kris Boulez; Steven E Brenner; Stephen A Chervitz; Chris Dagdigian; Georg Fuellen; James G R Gilbert; Ian Korf; Hilmar Lapp; Heikki Lehväslaiho; Chad Matsalla; Chris J Mungall; Brian I Osborne; Matthew R Pocock; Peter Schattner; Martin Senger; Lincoln D Stein; Elia Stupka; Mark D Wilkinson; Ewan Birney
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

5.  Ingestion of bacterially expressed double-stranded RNA inhibits gene expression in planarians.

Authors:  Phillip A Newmark; Peter W Reddien; Francesc Cebrià; Alejandro Sánchez Alvarado
Journal:  Proc Natl Acad Sci U S A       Date:  2003-08-13       Impact factor: 11.205

6.  MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes.

Authors:  Brandi L Cantarel; Ian Korf; Sofia M C Robb; Genis Parra; Eric Ross; Barry Moore; Carson Holt; Alejandro Sánchez Alvarado; Mark Yandell
Journal:  Genome Res       Date:  2007-11-19       Impact factor: 9.043

7.  The Schmidtea mediterranea database as a molecular resource for studying platyhelminthes, stem cells and regeneration.

Authors:  Alejandro Sánchez Alvarado; Phillip A Newmark; Sofia M Robb; Réjeanne Juste
Journal:  Development       Date:  2002-12       Impact factor: 6.868

8.  The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology.

Authors:  Evelyn Camon; Michele Magrane; Daniel Barrell; Vivian Lee; Emily Dimmer; John Maslen; David Binns; Nicola Harte; Rodrigo Lopez; Rolf Apweiler
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

9.  The Pfam protein families database.

Authors:  Alex Bateman; Lachlan Coin; Richard Durbin; Robert D Finn; Volker Hollich; Sam Griffiths-Jones; Ajay Khanna; Mhairi Marshall; Simon Moxon; Erik L L Sonnhammer; David J Studholme; Corin Yeats; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

Review 10.  Apollo: a sequence annotation editor.

Authors:  S E Lewis; S M J Searle; N Harris; M Gibson; V Lyer; J Richter; C Wiel; L Bayraktaroglu; E Birney; M A Crosby; J S Kaminker; B B Matthews; S E Prochnik; C D Smithy; J L Tupy; G M Rubin; S Misra; C J Mungall; M E Clamp
Journal:  Genome Biol       Date:  2002-12-23       Impact factor: 13.583

View more
  146 in total

1.  A chemical genetics approach reveals H,K-ATPase-mediated membrane voltage is required for planarian head regeneration.

Authors:  Wendy S Beane; Junji Morokuma; Dany S Adams; Michael Levin
Journal:  Chem Biol       Date:  2011-01-28

2.  Reference genes for qPCR assays in toxic metal and salinity stress in two flatworm model organisms.

Authors:  Michelle Plusquin; Olivier DeGheselle; Ann Cuypers; Ellen Geerdens; Andromeda Van Roten; Tom Artois; Karen Smeets
Journal:  Ecotoxicology       Date:  2011-11-12       Impact factor: 2.823

3.  cDNA cloning of heat shock protein 90 gene and protein expression pattern in response to heavy metal exposure and thermal stress in planarian Dugesia japonica.

Authors:  Ke-Xue Ma; Guang-Wen Chen; De-Zeng Liu
Journal:  Mol Biol Rep       Date:  2012-03-06       Impact factor: 2.316

4.  Inhibition of planar cell polarity extends neural growth during regeneration, homeostasis, and development.

Authors:  Wendy S Beane; Ai-Sun Tseng; Junji Morokuma; Joan M Lemire; Michael Levin
Journal:  Stem Cells Dev       Date:  2012-03-23       Impact factor: 3.272

Review 5.  The cell biology of schistosomes: a window on the evolution of the early metazoa.

Authors:  R Alan Wilson
Journal:  Protoplasma       Date:  2012-07       Impact factor: 3.356

6.  The evolution of thrombospondins and their ligand-binding activities.

Authors:  Amber A Bentley; Josephine C Adams
Journal:  Mol Biol Evol       Date:  2010-04-28       Impact factor: 16.240

7.  PRMT5 and the role of symmetrical dimethylarginine in chromatoid bodies of planarian stem cells.

Authors:  Labib Rouhana; Ana P Vieira; Rachel H Roberts-Galbraith; Phillip A Newmark
Journal:  Development       Date:  2012-02-08       Impact factor: 6.868

8.  Quantitative measures for the management and comparison of annotated genomes.

Authors:  Karen Eilbeck; Barry Moore; Carson Holt; Mark Yandell
Journal:  BMC Bioinformatics       Date:  2009-02-23       Impact factor: 3.169

9.  Preparation of the planarian Schmidtea mediterranea for high-resolution histology and transmission electron microscopy.

Authors:  John L Brubacher; Ana P Vieira; Phillip A Newmark
Journal:  Nat Protoc       Date:  2014-02-20       Impact factor: 13.491

10.  Zebrafish Ciliopathy Screen Plus Human Mutational Analysis Identifies C21orf59 and CCDC65 Defects as Causing Primary Ciliary Dyskinesia.

Authors:  Christina Austin-Tse; Jan Halbritter; Maimoona A Zariwala; Renée M Gilberti; Heon Yung Gee; Nathan Hellman; Narendra Pathak; Yan Liu; Jennifer R Panizzi; Ramila S Patel-King; Douglas Tritschler; Raqual Bower; Eileen O'Toole; Jonathan D Porath; Toby W Hurd; Moumita Chaki; Katrina A Diaz; Stefan Kohl; Svjetlana Lovric; Daw-Yang Hwang; Daniela A Braun; Markus Schueler; Rannar Airik; Edgar A Otto; Margaret W Leigh; Peadar G Noone; Johnny L Carson; Stephanie D Davis; Jessica E Pittman; Thomas W Ferkol; Jeffry J Atkinson; Kenneth N Olivier; Scott D Sagel; Sharon D Dell; Margaret Rosenfeld; Carlos E Milla; Niki T Loges; Heymut Omran; Mary E Porter; Stephen M King; Michael R Knowles; Iain A Drummond; Friedhelm Hildebrandt
Journal:  Am J Hum Genet       Date:  2013-10-03       Impact factor: 11.025

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.