Literature DB >> 15608275

openSputnik--a database to ESTablish comparative plant genomics using unsaturated sequence collections.

Stephen Rudd1.   

Abstract

The public expressed sequence tag collections are continually being enriched with high-quality sequences that represent an ever-expanding range of taxonomically diverse plant species. While these sequence collections provide biased insight into the populations of expressed genes available within individual species and their associated tissues, the information is conceivably of wider relevance in a comparative context. When we consider the available expressed sequence tag (EST) collections of summer 2004, most of the major plant taxonomic clades are at least superficially represented. Investigation of the five million available plant ESTs provides a wealth of information that has applications in modelling the routes of plant genome evolution and the identification of lineage-specific genes and gene families. Over four million ESTs from over 50 distinct plant species have been collated within an EST analysis pipeline called openSputnik. The ESTs were resolved down into approximately one million unigene sequences. These have been annotated using orthology-based annotation transfer from reference plant genomes and using a variety of contemporary bioinformatics methods to assign peptide, structural and functional attributes. The openSputnik database is available at http://sputnik.btk.fi.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15608275      PMCID: PMC539994          DOI: 10.1093/nar/gki040

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Complete genome sequencing has become the standard modus operandi for bacterial genomics, and tens of eukaryotic genomes have also been completely sequenced (see http://www.genomesonline.org). Plant genomics is, however, frequently hindered by the typically large and repetitive nature of the genome. Certain plant species have genome sizes that dwarf the human genome; the 1C genome size for broad bean (Vicia faba) is at least 26 000 Mb (Plant DNA C-values database), or over eight times the size of the human genome. The selection of candidate plant genomes for complete sequencing is, therefore, based on the scientific and anthropocentric value of the plant and the feasibility of a meaningful sequencing and assembly strategy. While several diverse plant species [Arabidopsis thaliana (1), Oryza sativa (2,3) and Populus trichocarpa] have been or will shortly be completely sequenced, the majority of plant genomes remain largely inaccessible. Arabidopsis and rice are certainly model plant systems but, are neither truly representative of any other given species nor are they general indicators for gene content across the whole plant kingdom. The first forays into comparative plant genomics using Arabidopsis and rice as reference genomes have demonstrated that there is a remarkable degree of underlying sequence diversity between these species (2,3). This firmly advocates the need to at least sample the protein-coding component of more taxonomically ‘exotic’ plant genomes. cDNA preparation and expressed sequence tag (EST) sequencing remain a dominant methodology for accessing the protein coding (and expressed) portion of the genome. Many laboratories are independently sequencing very large numbers of sequences from a broad and bio-diverse spectrum of plant species (Figure 1). EST sequences retain their exalted status for several reasons [for a review see (4)].
Figure 1

A depiction of the phylogenetic relationships among the major plant lineages as published previously (23). The evolutionary tree has been overlaid with the names of plant species having large EST collections (>5000 sequences) that are available in the current release of openSputnik. The symbol ‘**’ denotes the plant groups where either small EST collections (>1000 ESTs) are available or as-yet unreleased sequences are known to exist. This figure reveals the taxonomic distribution of large plant EST collections, but also highlights the strong bias towards the agriculturally important species.

They are technically simple to produce and cheap to sequence. ESTs provide a robust approximation of the expressed gene content of the parental genome under given sampling conditions and can be used for primitive expression profiling between tissues (5). The extensive redundancy typical of EST collections also allows for the selection of putative molecular markers (6,7). cDNAs may be used as a substrate for arraying, to create cDNA microarrays; this allows for true gene expression profiling (8). With an excess of 5.4 million sequences from over 320 species, the current public plant EST sequence databases (EMBL release 80) (9) are a valuable and contextually rich but under-utilized resource. If we consider just the large EST collections with over 5000 ESTs, 5.1 million ESTs from 74 species are represented. These species, while highly biased towards the key plant taxonomic clades of the rosids, asterids and monocots, still contain representative species, from other key taxonomic groups. The species represented contain representatives of single cellularity—the red and brown algae and lower plants—gymnosperms, basal angiosperms and the angiosperms. With such a wealth of signals for investigation of the underlying genomic changes in gene-content, protein structures and domain composition, the EST collections surely deserve detailed analysis and investigation. The openSputnik database has been designed as an interim platform for the exhaustive annotation and analysis of EST sequences in a comparative context. In addition to clustering sequences, a peptide sequence is identified, thus, providing a more sensitive target for the identification of functional and structural features. Sequences are placed in context with the currently available complete plant genomes and are associated with other clustered EST collections. The openSputnik database, thus, creates a platform upon which the intricate patterns of generalist house-keeping genes and lineage-specific gene families may be teased apart. The completed EST project annotations are available as a searchable web resource. While the provision of an integrated resource containing a diverse mixture of clustered and contextually placed unigene sequences is not unique [e.g. TIGR Gene Indices database (10), NCBI Unigenes (11) or PlantGDB at Iowa State University (12)], the openSputnik database is currently distinct in its focus towards functionally describing unigene sequences on the basis of both orthologous gene annotations and the application of bioinformatics methods for ab initio annotations.

IMPLEMENTATION AND STARTING MATERIAL

The openSputnik database has been programmed using the Java programming language and utilizes the PostgreSQL relational database management system to archive and retrieve sequences and their annotations. Therefore, openSputnik is largely platform-independent and has been implemented using a server–client model to allow for calculation in a distributed and heterogeneous computational environment. The methods implemented within openSputnik are described as functional objects and the analytical pathway is described as a directed acyclic graph (Figure 2). The current version of openSputnik utilizes the complete public plant EST collection that was available from the European Molecular Biology Laboratory (EMBL) at the start of Spring 2004 (EMBL release 78). A rule was imposed so that EST collections of at least 4500 sequences would be included. Over four million EST sequences representing 55 distinct plant species were identified using this rule. These sequences were loaded onto the openSputnik database schema.
Figure 2

A simplification of the directed acyclic graph that describes the analytical pipeline used to build the openSputnik database. As starting material, species-specific EMBL flat files are imported and all annotations are retained. This creates a sequence source ‘EST collection’. This source is used to derive two other annotative sources, the ‘UNIGENE collection’ and the ‘PEPTIDE collection’ (sources shown in red). When the sources have been built, they are annotated using a variety of methods highlighted in green. The analyses anchored to the schema are used to create derived annotations including Funcat and GO terms (shown in orange). All analyses are made available to the database user via the openZputnik interface.

SEQUENCE CLUSTERING

Prior to sequence clustering, ESTs were aggressively trimmed of any likely residual vector or polylinker sequences using the Crossmatch application (P. Green, unpublished data) and the National Center for Bioinformatics Information (NCBI) UniVec database. Sequences <55 nt in length were excluded at this stage. To prevent the aggregation of sequences on the basis of low complexity sequence islands, all low complexity sequences were masked using the RepeatBeater algorithm (Biomax informatics, Martinsried, Germany). The masked sequences were clustered into pools of related sequences using a suffix tree based approach (HPT2 algorithm; Biomax informatics). To encourage the aggregation of sequences, HPT2 was run using a similarity threshold of 0.7 and a number of network iterations equalling the number of masked ESTs. The resulting clusters were assembled into unigene sequences using the CAP3 algorithm with standard settings. Within the larger EST collections, some HPT2 identified clusters contain many members. To simplify the analysis, larger clusters were truncated to an arbitrary threshold of a maximum of 2500 ESTs. Some individual ESTs representing the most highly expressed genes were absent from their cognate unigenes.

PEPTIDE PREDICTION

It is probable that each derived unigene sequence represents an expressed and properly spliced mRNA. Extensive amounts of either 5′-untranslated region (5′-UTR) or 3′-UTR may exist within the unigene sequences. The identification of a meaningful peptide sequence lends value to the dataset by allowing us to exclude sequences of low protein-coding potential, and additionally allows the use of peptide-annotation algorithms. ESTScan (13) models have been trained for each of the underlying species. Training data were produced by identifying probable open reading frame (ORF) sequences from a BLASTX (14) analysis against the Swiss-Prot (15) database arbitrarily filtered at 1E−10. ESTScan was used with the derived model to predict the most likely peptide for each unigene sequence. The numbers of ESTs, unigenes and peptides are shown for each of the 55 openSputnik plant species along with estimates of actual coding potential and redundancy across the individual libraries (Table 1).
Table 1.

Table summarizing the sequence content of the openSputnik database

Organism nameNo. of ESTsEST sequence (bp)No. of singletonsNo. of assembiesUnigene sequence (bp)RedundancyPeptide sequence (aa)Protein coding potential
Allium cepa19 58213 016 289725240208 544 7471.52 531 51988.9
Arabidopsis thaliana190 74184 128 06517 67520 10922 482 6883.76 135 20281.9
Beta vulgaris20 15110 184 665924437067 368 7911.42 015 99082.1
Brassica napus37 15921 438 036804154478 389 2172.62 403 18485.9
Capsicum annuum22 43310 226 020732630565 496 9511.91 477 08080.6
Chlamydomonas reinhardtii154 60082 230 38218 21110 98923 178 7553.52 388 59630.9
Citrus sinensis23 33712 738 998531134165 474 7952.31 473 29480.7
Cryptomeria japonica71283 624 193320212032 457 7841.5579 83470.8
Cycas rumphii59522 873 07922306971 597 2821.8349 00165.5
Eschscholzia californica54682 529 15031467411 908 9621.3564 14788.7
Glycine max344 524158 703 38428 96324 89233 585 0324.78 648 79277.3
Gossypium arboreum38 91526 139 86710 007607613 043 9192.02 958 83568.1
Gossypium hirsutum13 5718 414 112593419145 367 0831.61 334 90174.6
Hedyotis centranthoides54162 476 00935956412 022 0871.2450 94366.9
Hedyotis terminalis48752 228 28433135301 830 0941.2402 30665.9
Helianthus annuus59 84125 553 02811 90060508 654 9473.02 086 80672.3
Helianthus argophyllus12 7874 929 193464610292 309 0892.1516 76367.1
Helianthus paradoxus10 3404 149 627384410121 997 1152.1458 46568.9
Hordeum vulgare372 431198 114 71725 40523 03337 345 5655.39 139 51573.4
Ipomoea nil25 89915 289 506457248296 252 2582.41 682 96580.8
Lactuca sativa68 18835 969 88912 427799813 090 2182.73 527 51480.8
Lotus corniculatus36 31113 987 475764642485 529 9082.51 635 21488.7
Lycopersicon esculentum150 22875 468 37113 17814 87019 372 9693.95 380 40383.3
Lycopersicon pennellii83463 842 35824089011 770 9212.2503 01485.2
Medicago truncatula187 763101 662 46319 44817 18927 597 7083.76 630 34272.1
Mesembryanthemum crystallinum25 80315 782 659483131375 941 2452.71 541 78677.9
Nicotiana tabacum10 3235 104 49987106304 738 1481.1952 83960.3
Oryza minuta52682 367 83227565911 658 5721.4452 96381.9
Oryza sativa260 901136 090 82130 97120 93434 467 8153.98 593 18574.8
Phaeodactylum tricornutum12 1217 911 359304315263 439 5902.3894 96078.1
Phaseolus coccineus20 1208 487 980441924313 269 0962.6886 98681.4
Physcomitrella patens102 21954 477 83310 11413 30915 177 6963.63 521 52569.6
Pinus pinaster15 7197 679 661497424524 209 2911.81 036 69973.9
Pinus taeda110 62251 626 00314 63211 61015 972 2153.23 945 83274.1
Poncirus trifoliata63904 107 970164412092 220 6091.8568 75876.8
Populus alba10 4465 769 749385614803 192 0531.8862 94981.1
Populus balsamifera30 29614 140 412703136645 503 9102.61 522 33083.0
Populus tremula70 09130 629 34614 699795411 475 1262.73 192 05483.5
Populus tremuloides13 0506 174 206263422182 413 5732.6706 58587.8
Porphyra yezoensis20 9799 801 783277420452 853 6513.4681 73171.7
Prunus persica11 4526 496 591320615883 135 2882.1883 16584.5
Saccharum officinarum246 301156 538 94229 89525 08945 845 4063.411 003 16272.0
Saccharum spp.88074 377 943478411553 165 6111.4788 52074.7
Secale cereale91944 313 461379313462 687 8301.6662 34273.9
Solanum tuberosum94 52551 346 134665115 98316 752 8953.14 715 29984.4
Sorghum bicolor161 76683 411 68416 95517 70423 132 7743.66 004 63077.9
Sorghum propinquum21 3879 750 610537135074 673 2862.11 209 82277.7
Stevia rebaudiana55483 242 04524987132 048 9651.6578 30384.7
Theobroma cacao65622 607 87119887531 103 7762.4276 18875.1
Triticum aestivum511 732257 643 80149 17133 66651 549 0495.012 964 65275.5
Triticum monococcum99734 956 308394116813 212 8691.5810 91075.7
Vitis hybrid65333 604 678103210521 385 9392.6349 25075.6
Vitis vinifera135 71274 769 503961612 89316 019 1024.74 176 66578.2
Zea mays384 391173 945 69824 26625 72529 187 8086.07 017 86872.1
Zinnia elegans97834 896 796653614564 140 8241.2890 00464.5

A total of 55 plant species are included in the current release, and represent a broad taxonomic distribution of species. Shown are the number of ESTs and the total nucleotide length for all EST sequences. The number of resulting singleton unigenes and multi-member assemblies is shown, along with the summed length of all available unigene sequence. The difference between total nucleotide length in EST and unigene sequences is summarized as apparent redundancy. Since peptide sequences have been prepared for each of the unigenes the length of all derived peptide is also shown and a measure of apparent coding potential across the whole unigene set is also shown.

DATABASE CONTENTS

The unigene sequences and peptides from each of the included species have been annotated using a selection of bioinformatics tools that are relevant to comparative genomics and biological understanding. Sequences are annotated for structural and functional characters using InterPro domains (16), TMHMM for the identification of transmembrane domains (17), TargetP for the prediction of organellar targeting (18) and SignalP for subcellular localization (19). The blast algorithm is used to reflect similarities of individual sequences with known proteins in the Swiss-Prot database, predicted proteins in the UniProt database (20) and to organism specific sets of proteins not restricted to A.thaliana, O.sativa or aggregated plant proteins. The complete sequence collections are summarized using the MIPS catalogue of functionally annotated proteins (Funcat) (21) and Gene Ontology terms (22). A collection of methods has been implemented to provide the typical figures and charts that are often seen in EST collection publications. Graphical representation of sequence lengths, number of ESTs within unigenes and clone-library representation are all included. Also included are reports summarizing the functional distribution of unigenes using both GOSlims and the MIPS Funcat.

DATABASE ACCESS

A query interface to the openSputnik database is provided by a web application product written for the Zope web application server. The openZputnik portal at http://sputnik.btk.fi provides access to all core EST collections through a single unified interface. Selecting EST projects will display a list of all available projects. When an openSputnik collection is selected, an interface that provides routes to the underlying data will be displayed. Different methods are included for EST sequences, unigene sequences and peptide sequences. Additionally, a page is included to access sequences on the basis of pre-computed reports and a BLAST server is included so that sequences may be identified on the basis of similarity to a known sequence. Sequences may be identified on the basis of a variety of criteria not restricted to GC content, length, name or predicted function. When a sequence is selected, a single page summary report is displayed for the sequence. This summarizes key information that includes wherever appropriate, the best BLAST matches, functional information and physical attributes. Navigation tabs are provided so that a user may access all primary information derived or associated with a single sequence.

DATA AVAILABILITY AND FUTURE DIRECTIONS

All data within the openSputnik database is freely available to the scientific community. Please contact the author to request the inclusion of additional methods. The analytical pipeline may be applied to novel and proprietary sequence collections as either a collaboration with, or as a service of, the Bioinformatics Core facility provided at the Turku Centre for Biotechnology. The openSputnik SQL schema and complete database dumps are available upon request. The source code to the openSputnik engine and core reporting architecture is being open-sourced and released to Source Forge (www.sourceforge.com). The openSputnik group will prepare one or two releases of the clustered plant unigenes per year. Additional plant species will be included into the pipeline as they exceed our arbitrary size threshold. Additional groups of organisms will be integrated in the future with a comparative mammalian unigene database planned for spring 2005. Additional emphasis is being placed on the creation of generic reports that can distil the essence of large and heterogeneous sequence collections. Further synchronization of the completed resources with the Gene Ontology and dynamic integration and comparison of groups of species is in progress. The challenge is to stay abreast with the ever-growing collections of sequences and the novel bioinformatics methodologies that offer us the ability to better understand the nuances within our sequence collections.
  23 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences.

Authors:  C Iseli; C V Jongeneel; P Bucher
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1999

3.  cDNA screening by array hybridization.

Authors:  R Drmanac; S Drmanac
Journal:  Methods Enzymol       Date:  1999       Impact factor: 1.600

4.  Deciding among green plants for whole genome studies.

Authors:  Kathleen M Pryer; Harald Schneider; Elizabeth A Zimmer; Jo Ann Banks
Journal:  Trends Plant Sci       Date:  2002-12       Impact factor: 18.313

5.  MIPS: a database for genomes and protein sequences.

Authors:  H W Mewes; D Frishman; U Güldener; G Mannhaupt; K Mayer; M Mokrejs; B Morgenstern; M Münsterkötter; S Rudd; B Weil
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

6.  Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.

Authors: 
Journal:  Nature       Date:  2000-12-14       Impact factor: 49.962

7.  The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species.

Authors:  J Quackenbush; J Cho; D Lee; F Liang; I Holt; S Karamycheva; B Parvizi; G Pertea; R Sultana; J White
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

8.  A draft sequence of the rice genome (Oryza sativa L. ssp. indica).

Authors:  Jun Yu; Songnian Hu; Jun Wang; Gane Ka-Shu Wong; Songgang Li; Bin Liu; Yajun Deng; Li Dai; Yan Zhou; Xiuqing Zhang; Mengliang Cao; Jing Liu; Jiandong Sun; Jiabin Tang; Yanjiong Chen; Xiaobing Huang; Wei Lin; Chen Ye; Wei Tong; Lijuan Cong; Jianing Geng; Yujun Han; Lin Li; Wei Li; Guangqiang Hu; Xiangang Huang; Wenjie Li; Jian Li; Zhanwei Liu; Long Li; Jianping Liu; Qiuhui Qi; Jinsong Liu; Li Li; Tao Li; Xuegang Wang; Hong Lu; Tingting Wu; Miao Zhu; Peixiang Ni; Hua Han; Wei Dong; Xiaoyu Ren; Xiaoli Feng; Peng Cui; Xianran Li; Hao Wang; Xin Xu; Wenxue Zhai; Zhao Xu; Jinsong Zhang; Sijie He; Jianguo Zhang; Jichen Xu; Kunlin Zhang; Xianwu Zheng; Jianhai Dong; Wanyong Zeng; Lin Tao; Jia Ye; Jun Tan; Xide Ren; Xuewei Chen; Jun He; Daofeng Liu; Wei Tian; Chaoguang Tian; Hongai Xia; Qiyu Bao; Gang Li; Hui Gao; Ting Cao; Juan Wang; Wenming Zhao; Ping Li; Wei Chen; Xudong Wang; Yong Zhang; Jianfei Hu; Jing Wang; Song Liu; Jian Yang; Guangyu Zhang; Yuqing Xiong; Zhijie Li; Long Mao; Chengshu Zhou; Zhen Zhu; Runsheng Chen; Bailin Hao; Weimou Zheng; Shouyi Chen; Wei Guo; Guojie Li; Siqi Liu; Ming Tao; Jian Wang; Lihuang Zhu; Longping Yuan; Huanming Yang
Journal:  Science       Date:  2002-04-05       Impact factor: 47.728

9.  A draft sequence of the rice genome (Oryza sativa L. ssp. japonica).

Authors:  Stephen A Goff; Darrell Ricke; Tien-Hung Lan; Gernot Presting; Ronglin Wang; Molly Dunn; Jane Glazebrook; Allen Sessions; Paul Oeller; Hemant Varma; David Hadley; Don Hutchison; Chris Martin; Fumiaki Katagiri; B Markus Lange; Todd Moughamer; Yu Xia; Paul Budworth; Jingping Zhong; Trini Miguel; Uta Paszkowski; Shiping Zhang; Michelle Colbert; Wei-lin Sun; Lili Chen; Bret Cooper; Sylvia Park; Todd Charles Wood; Long Mao; Peter Quail; Rod Wing; Ralph Dean; Yeisoo Yu; Andrey Zharkikh; Richard Shen; Sudhir Sahasrabudhe; Alun Thomas; Rob Cannings; Alexander Gutin; Dmitry Pruss; Julia Reid; Sean Tavtigian; Jeff Mitchell; Glenn Eldredge; Terri Scholl; Rose Mary Miller; Satish Bhatnagar; Nils Adey; Todd Rubano; Nadeem Tusneem; Rosann Robinson; Jane Feldhaus; Teresita Macalma; Arnold Oliphant; Steven Briggs
Journal:  Science       Date:  2002-04-05       Impact factor: 47.728

10.  Database resources of the National Center for Biotechnology.

Authors:  David L Wheeler; Deanna M Church; Scott Federhen; Alex E Lash; Thomas L Madden; Joan U Pontius; Gregory D Schuler; Lynn M Schriml; Edwin Sequeira; Tatiana A Tatusova; Lukas Wagner
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

View more
  17 in total

1.  Comparative mapping of DNA sequences in rye (Secale cereale L.) in relation to the rice genome.

Authors:  B Hackauf; S Rudd; J R van der Voort; T Miedaner; P Wehling
Journal:  Theor Appl Genet       Date:  2008-10-25       Impact factor: 5.699

2.  Gene expression and metabolite profiling of Populus euphratica growing in the Negev desert.

Authors:  Mikael Brosché; Basia Vinocur; Edward R Alatalo; Airi Lamminmäki; Thomas Teichmann; Eric A Ottow; Dimitar Djilianov; Dany Afif; Marie-Béatrice Bogeat-Triboulot; Arie Altman; Andrea Polle; Erwin Dreyer; Stephen Rudd; Lars Paulin; Petri Auvinen; Jaakko Kangasjärvi
Journal:  Genome Biol       Date:  2005-12-02       Impact factor: 13.583

3.  Analysis of DNA polymorphisms in sugar beet (Beta vulgaris L.) and development of an SNP-based map of expressed genes.

Authors:  Katharina Schneider; Dagmar Kulosa; Thomas Rosleff Soerensen; Silke Möhring; Martin Heine; Gregor Durstewitz; Andreas Polley; Eberhard Weber; Jens Lein; Uwe Hohmann; Emma Tahiro; Bernd Weisshaar; Britta Schulz; Georg Koch; Christian Jung; Martin Ganal
Journal:  Theor Appl Genet       Date:  2007-07-11       Impact factor: 5.699

4.  Analysis of EST sequences suggests recent origin of allotetraploid colonial and creeping bentgrasses.

Authors:  David Rotter; Arvind K Bharti; Huaijun Michael Li; Chongyuan Luo; Stacy A Bonos; Suleiman Bughrara; Geunhwa Jung; Joachim Messing; William A Meyer; Stephen Rudd; Scott E Warnke; Faith C Belanger
Journal:  Mol Genet Genomics       Date:  2007-05-12       Impact factor: 2.980

5.  Phytome: a platform for plant comparative genomics.

Authors:  Stefanie Hartmann; Dihui Lu; Jason Phillips; Todd J Vision
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

6.  Optimization and comparison of different methods for RNA isolation for cDNA library construction from the reindeer lichen Cladonia rangiferina.

Authors:  Sini Junttila; Kean-Jin Lim; Stephen Rudd
Journal:  BMC Res Notes       Date:  2009-10-05

7.  Gene-based microsatellites for cassava (Manihot esculenta Crantz): prevalence, polymorphisms, and cross-taxa utility.

Authors:  Adebola Aj Raji; James V Anderson; Olufisayo A Kolade; Chike D Ugwu; Alfred Go Dixon; Ivan L Ingelbrecht
Journal:  BMC Plant Biol       Date:  2009-09-11       Impact factor: 4.215

8.  The plant transcriptome-from integrating observations to models.

Authors:  Björn Usadel; Alisdair R Fernie
Journal:  Front Plant Sci       Date:  2013-03-11       Impact factor: 5.753

9.  Characterization of an 18,166 EST dataset for cassava (Manihot esculenta Crantz) enriched for drought-responsive genes.

Authors:  Y Lokko; J V Anderson; S Rudd; A Raji; D Horvath; M A Mikel; R Kim; L Liu; A Hernandez; A G O Dixon; I L Ingelbrecht
Journal:  Plant Cell Rep       Date:  2007-05-31       Impact factor: 4.964

10.  EST2uni: an open, parallel tool for automated EST analysis and database creation, with a data mining web interface and microarray expression data integration.

Authors:  Javier Forment; Francisco Gilabert; Antonio Robles; Vicente Conejero; Fernando Nuez; Jose M Blanca
Journal:  BMC Bioinformatics       Date:  2008-01-07       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.