Literature DB >> 17445272

CGKB: an annotation knowledge base for cowpea (Vigna unguiculata L.) methylation filtered genomic genespace sequences.

Xianfeng Chen1, Thomas W Laudeman, Paul J Rushton, Thomas A Spraggins, Michael P Timko.   

Abstract

BACKGROUND: Cowpea [Vigna unguiculata (L.) Walp.] is one of the most important food and forage legumes in the semi-arid tropics because of its ability to tolerate drought and grow on poor soils. It is cultivated mostly by poor farmers in developing countries, with 80% of production taking place in the dry savannah of tropical West and Central Africa. Cowpea is largely an underexploited crop with relatively little genomic information available for use in applied plant breeding. The goal of the Cowpea Genomics Initiative (CGI), funded by the Kirkhouse Trust, a UK-based charitable organization, is to leverage modern molecular genetic tools for gene discovery and cowpea improvement. One aspect of the initiative is the sequencing of the gene-rich region of the cowpea genome (termed the genespace) recovered using methylation filtration technology and providing annotation and analysis of the sequence data. DESCRIPTION: CGKB, Cowpea Genespace/Genomics Knowledge Base, is an annotation knowledge base developed under the CGI. The database is based on information derived from 298,848 cowpea genespace sequences (GSS) isolated by methylation filtering of genomic DNA. The CGKB consists of three knowledge bases: GSS annotation and comparative genomics knowledge base, GSS enzyme and metabolic pathway knowledge base, and GSS simple sequence repeats (SSRs) knowledge base for molecular marker discovery. A homology-based approach was applied for annotations of the GSS, mainly using BLASTX against four public FASTA formatted protein databases (NCBI GenBank Proteins, UniProtKB-Swiss-Prot, UniprotKB-PIR (Protein Information Resource), and UniProtKB-TrEMBL). Comparative genome analysis was done by BLASTX searches of the cowpea GSS against four plant proteomes from Arabidopsis thaliana, Oryza sativa, Medicago truncatula, and Populus trichocarpa. The possible exons and introns on each cowpea GSS were predicted using the HMM-based Genscan gene predication program and the potential domains on annotated GSS were analyzed using the HMMER package against the Pfam database. The annotated GSS were also assigned with Gene Ontology annotation terms and integrated with 228 curated plant metabolic pathways from the Arabidopsis Information Resource (TAIR) knowledge base. The UniProtKB-Swiss-Prot ENZYME database was used to assign putative enzymatic function to each GSS. Each GSS was also analyzed with the Tandem Repeat Finder (TRF) program in order to identify potential SSRs for molecular marker discovery. The raw sequence data, processed annotation, and SSR results were stored in relational tables designed in key-value pair fashion using a PostgreSQL relational database management system. The biological knowledge derived from the sequence data and processed results are represented as views or materialized views in the relational database management system. All materialized views are indexed for quick data access and retrieval. Data processing and analysis pipelines were implemented using the Perl programming language. The web interface was implemented in JavaScript and Perl CGI running on an Apache web server. The CPU intensive data processing and analysis pipelines were run on a computer cluster of more than 30 dual-processor Apple XServes. A job management system called Vela was created as a robust way to submit large numbers of jobs to the Portable Batch System (PBS).
CONCLUSION: CGKB is an integrated and annotated resource for cowpea GSS with features of homology-based and HMM-based annotations, enzyme and pathway annotations, GO term annotation, toolkits, and a large number of other facilities to perform complex queries. The cowpea GSS, chloroplast sequences, mitochondrial sequences, retroelements, and SSR sequences are available as FASTA formatted files and downloadable at CGKB. This database and web interface are publicly accessible at http://cowpeagenomics.med.virginia.edu/CGKB/.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17445272      PMCID: PMC1868039          DOI: 10.1186/1471-2105-8-129

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  16 in total

1.  Differential methylation of genes and retrotransposons facilitates shotgun sequencing of the maize genome.

Authors:  P D Rabinowicz; K Schutz; N Dedhia; C Yordan; L D Parnell; L Stein; W R McCombie; R A Martienssen
Journal:  Nat Genet       Date:  1999-11       Impact factor: 38.330

2.  Enrichment of gene-coding sequences in maize by genome filtration.

Authors:  C A Whitelaw; W B Barbazuk; G Pertea; A P Chan; F Cheung; Y Lee; L Zheng; S van Heeringen; S Karamycheva; J L Bennetzen; P SanMiguel; N Lakey; J Bedell; Y Yuan; M A Budiman; A Resnick; S Van Aken; T Utterback; S Riedmuller; M Williams; T Feldblyum; K Schubert; R Beachy; C M Fraser; J Quackenbush
Journal:  Science       Date:  2003-12-19       Impact factor: 47.728

3.  The genome of black cottonwood, Populus trichocarpa (Torr. & Gray).

Authors:  G A Tuskan; S Difazio; S Jansson; J Bohlmann; I Grigoriev; U Hellsten; N Putnam; S Ralph; S Rombauts; A Salamov; J Schein; L Sterck; A Aerts; R R Bhalerao; R P Bhalerao; D Blaudez; W Boerjan; A Brun; A Brunner; V Busov; M Campbell; J Carlson; M Chalot; J Chapman; G-L Chen; D Cooper; P M Coutinho; J Couturier; S Covert; Q Cronk; R Cunningham; J Davis; S Degroeve; A Déjardin; C Depamphilis; J Detter; B Dirks; I Dubchak; S Duplessis; J Ehlting; B Ellis; K Gendler; D Goodstein; M Gribskov; J Grimwood; A Groover; L Gunter; B Hamberger; B Heinze; Y Helariutta; B Henrissat; D Holligan; R Holt; W Huang; N Islam-Faridi; S Jones; M Jones-Rhoades; R Jorgensen; C Joshi; J Kangasjärvi; J Karlsson; C Kelleher; R Kirkpatrick; M Kirst; A Kohler; U Kalluri; F Larimer; J Leebens-Mack; J-C Leplé; P Locascio; Y Lou; S Lucas; F Martin; B Montanini; C Napoli; D R Nelson; C Nelson; K Nieminen; O Nilsson; V Pereda; G Peter; R Philippe; G Pilate; A Poliakov; J Razumovskaya; P Richardson; C Rinaldi; K Ritland; P Rouzé; D Ryaboy; J Schmutz; J Schrader; B Segerman; H Shin; A Siddiqui; F Sterky; A Terry; C-J Tsai; E Uberbacher; P Unneberg; J Vahala; K Wall; S Wessler; G Yang; T Yin; C Douglas; M Marra; G Sandberg; Y Van de Peer; D Rokhsar
Journal:  Science       Date:  2006-09-15       Impact factor: 47.728

4.  Transposons, DNA methylation and gene control.

Authors:  R Martienssen
Journal:  Trends Genet       Date:  1998-07       Impact factor: 11.639

5.  Nested retrotransposons in the intergenic regions of the maize genome.

Authors:  P SanMiguel; A Tikhonov; Y K Jin; N Motchoulskaia; D Zakharov; A Melake-Berhan; P S Springer; K J Edwards; M Lee; Z Avramova; J L Bennetzen
Journal:  Science       Date:  1996-11-01       Impact factor: 47.728

6.  Retrotransposons in the flanking regions of normal plant genes: a role for copia-like elements in the evolution of gene structure and expression.

Authors:  S E White; L F Habera; S R Wessler
Journal:  Proc Natl Acad Sci U S A       Date:  1994-12-06       Impact factor: 11.205

7.  The distribution of 5-methylcytosine in the nuclear genome of plants.

Authors:  L M Montero; J Filipski; P Gil; J Capel; J M Martínez-Zapater; J Salinas
Journal:  Nucleic Acids Res       Date:  1992-06-25       Impact factor: 16.971

8.  Maize genome sequencing by methylation filtration.

Authors:  Lance E Palmer; Pablo D Rabinowicz; Andrew L O'Shaughnessy; Vivekanand S Balija; Lidia U Nascimento; Sujit Dike; Melissa de la Bastide; Robert A Martienssen; W Richard McCombie
Journal:  Science       Date:  2003-12-19       Impact factor: 47.728

9.  Active maize genes are unmodified and flanked by diverse classes of modified, highly repetitive DNA.

Authors:  J L Bennetzen; K Schrick; P S Springer; W E Brown; P SanMiguel
Journal:  Genome       Date:  1994-08       Impact factor: 2.166

10.  The Pfam protein families database.

Authors:  Alex Bateman; Lachlan Coin; Richard Durbin; Robert D Finn; Volker Hollich; Sam Griffiths-Jones; Ajay Khanna; Mhairi Marshall; Simon Moxon; Erik L L Sonnhammer; David J Studholme; Corin Yeats; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

View more
  15 in total

Review 1.  Breeding of Vegetable Cowpea for Nutrition and Climate Resilience in Sub-Saharan Africa: Progress, Opportunities, and Challenges.

Authors:  Tesfaye Walle Mekonnen; Abe Shegro Gerrano; Ntombokulunga Wedy Mbuma; Maryke Tine Labuschagne
Journal:  Plants (Basel)       Date:  2022-06-15

2.  The genetics of domestication of rice bean, Vigna umbellata.

Authors:  Takehisa Isemura; Akito Kaga; Norihiko Tomooka; Takehiko Shimizu; Duncan Alexander Vaughan
Journal:  Ann Bot       Date:  2010-09-29       Impact factor: 4.357

3.  Tobacco transcription factors: novel insights into transcriptional regulation in the Solanaceae.

Authors:  Paul J Rushton; Marta T Bokowiec; Shengcheng Han; Hongbo Zhang; Jennifer F Brannock; Xianfeng Chen; Thomas W Laudeman; Michael P Timko
Journal:  Plant Physiol       Date:  2008-03-12       Impact factor: 8.340

Review 4.  Salinity stress response and 'omics' approaches for improving salinity stress tolerance in major grain legumes.

Authors:  Uday Chand Jha; Abhishek Bohra; Rintu Jha; Swarup Kumar Parida
Journal:  Plant Cell Rep       Date:  2019-01-12       Impact factor: 4.570

5.  Construction of a genetic linkage map and genetic analysis of domestication related traits in mungbean (Vigna radiata).

Authors:  Takehisa Isemura; Akito Kaga; Satoshi Tabata; Prakit Somta; Peerasak Srinives; Takehiko Shimizu; Uken Jo; Duncan A Vaughan; Norihiko Tomooka
Journal:  PLoS One       Date:  2012-08-02       Impact factor: 3.240

Review 6.  Genomics-assisted breeding in four major pulse crops of developing countries: present status and prospects.

Authors:  Abhishek Bohra; Manish K Pandey; Uday C Jha; Balwant Singh; Indra P Singh; Dibendu Datta; Sushil K Chaturvedi; N Nadarajan; Rajeev K Varshney
Journal:  Theor Appl Genet       Date:  2014-04-08       Impact factor: 5.699

7.  Global changes in gene expression during compatible and incompatible interactions of cowpea (Vigna unguiculata L.) with the root parasitic angiosperm Striga gesnerioides.

Authors:  Kan Huang; Karolina E Mellor; Shom N Paul; Mark J Lawson; Aaron J Mackey; Michael P Timko
Journal:  BMC Genomics       Date:  2012-08-17       Impact factor: 3.969

8.  TOBFAC: the database of tobacco transcription factors.

Authors:  Paul J Rushton; Marta T Bokowiec; Thomas W Laudeman; Jennifer F Brannock; Xianfeng Chen; Michael P Timko
Journal:  BMC Bioinformatics       Date:  2008-01-25       Impact factor: 3.169

9.  Sequencing and analysis of the gene-rich space of cowpea.

Authors:  Michael P Timko; Paul J Rushton; Thomas W Laudeman; Marta T Bokowiec; Edmond Chipumuro; Foo Cheung; Christopher D Town; Xianfeng Chen
Journal:  BMC Genomics       Date:  2008-02-27       Impact factor: 3.969

10.  Transcriptional analysis of highly syntenic regions between Medicago truncatula and Glycine max using tiling microarrays.

Authors:  Lei Li; Hang He; Juan Zhang; Xiangfeng Wang; Sulan Bai; Viktor Stolc; Waraporn Tongprasit; Nevin D Young; Oliver Yu; Xing-Wang Deng
Journal:  Genome Biol       Date:  2008-03-19       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.