Literature DB >> 19498102

The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes.

Kim D Pruitt1, Jennifer Harrow, Rachel A Harte, Craig Wallin, Mark Diekhans, Donna R Maglott, Steve Searle, Catherine M Farrell, Jane E Loveland, Barbara J Ruef, Elizabeth Hart, Marie-Marthe Suner, Melissa J Landrum, Bronwen Aken, Sarah Ayling, Robert Baertsch, Julio Fernandez-Banet, Joshua L Cherry, Val Curwen, Michael Dicuccio, Manolis Kellis, Jennifer Lee, Michael F Lin, Michael Schuster, Andrew Shkeda, Clara Amid, Garth Brown, Oksana Dukhanina, Adam Frankish, Jennifer Hart, Bonnie L Maidak, Jonathan Mudge, Michael R Murphy, Terence Murphy, Jeena Rajan, Bhanu Rajput, Lillian D Riddick, Catherine Snow, Charles Steward, David Webb, Janet A Weber, Laurens Wilming, Wenyu Wu, Ewan Birney, David Haussler, Tim Hubbard, James Ostell, Richard Durbin, David Lipman.   

Abstract

Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers. Importantly, the project coordinates on manually reviewing inconsistent protein annotations between sites, as well as annotations for which new evidence suggests a revision is needed, to progressively converge on a complete protein-coding set for the human and mouse reference genomes, while maintaining a high standard of reliability and biological accuracy. To date, the project has identified 20,159 human and 17,707 mouse consensus coding regions from 17,052 human and 16,893 mouse genes. Three evaluation methods indicate that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS. The CCDS database thus centralizes the function of identifying well-supported, identically-annotated, protein-coding regions.

Entities:  

Mesh:

Year:  2009        PMID: 19498102      PMCID: PMC2704439          DOI: 10.1101/gr.080531.108

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  25 in total

1.  VARSPLIC: alternatively-spliced protein sequences derived from SWISS-PROT and TrEMBL.

Authors:  P Kersey; H Hermjakob; R Apweiler
Journal:  Bioinformatics       Date:  2000-11       Impact factor: 6.937

2.  Segmental duplications: organization and impact within the current human genome project assembly.

Authors:  J A Bailey; A M Yavor; H F Massa; B J Trask; E E Eichler
Journal:  Genome Res       Date:  2001-06       Impact factor: 9.043

3.  The Ensembl analysis pipeline.

Authors:  Simon C Potter; Laura Clarke; Val Curwen; Stephen Keenan; Emmanuel Mongin; Stephen M J Searle; Arne Stabenau; Roy Storey; Michele Clamp
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

Review 4.  An overview of Ensembl.

Authors:  Ewan Birney; T Daniel Andrews; Paul Bevan; Mario Caccamo; Yuan Chen; Laura Clarke; Guy Coates; James Cuff; Val Curwen; Tim Cutts; Thomas Down; Eduardo Eyras; Xose M Fernandez-Suarez; Paul Gane; Brian Gibbins; James Gilbert; Martin Hammond; Hans-Rudolf Hotz; Vivek Iyer; Kerstin Jekosch; Andreas Kahari; Arek Kasprzyk; Damian Keefe; Stephen Keenan; Heikki Lehvaslaiho; Graham McVicker; Craig Melsopp; Patrick Meidl; Emmanuel Mongin; Roger Pettett; Simon Potter; Glenn Proctor; Mark Rae; Steve Searle; Guy Slater; Damian Smedley; James Smith; Will Spooner; Arne Stabenau; James Stalker; Roy Storey; Abel Ureta-Vidal; K Cara Woodwark; Graham Cameron; Richard Durbin; Anthony Cox; Tim Hubbard; Michele Clamp
Journal:  Genome Res       Date:  2004-04-12       Impact factor: 9.043

5.  Identification of a novel SNF2/SWI2 protein family member, SRCAP, which interacts with CREB-binding protein.

Authors:  H Johnston; J Kneer; I Chackalaparampil; P Yaciuk; J Chrivia
Journal:  J Biol Chem       Date:  1999-06-04       Impact factor: 5.157

6.  Sequencing and comparison of yeast species to identify genes and regulatory elements.

Authors:  Manolis Kellis; Nick Patterson; Matthew Endrizzi; Bruce Birren; Eric S Lander
Journal:  Nature       Date:  2003-05-15       Impact factor: 49.962

7.  Human-mouse alignments with BLASTZ.

Authors:  Scott Schwartz; W James Kent; Arian Smit; Zheng Zhang; Robert Baertsch; Ross C Hardison; David Haussler; Webb Miller
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

8.  The Vertebrate Genome Annotation (Vega) database.

Authors:  J L Ashurst; C-K Chen; J G R Gilbert; K Jekosch; S Keenan; P Meidl; S M Searle; J Stalker; R Storey; S Trevanion; L Wilming; T Hubbard
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

9.  The Mouse Genome Database (MGD): from genes to mice--a community resource for mouse biology.

Authors:  Janan T Eppig; Carol J Bult; James A Kadin; Joel E Richardson; Judith A Blake; A Anagnostopoulos; R M Baldarelli; M Baya; J S Beal; S M Bello; W J Boddy; D W Bradt; D L Burkart; N E Butler; J Campbell; M A Cassell; L E Corbani; S L Cousins; D J Dahmen; H Dene; A D Diehl; H J Drabkin; K S Frazer; P Frost; L H Glass; C W Goldsmith; P L Grant; M Lennon-Pierce; J Lewis; I Lu; L J Maltais; M McAndrews-Hill; L McClellan; D B Miers; L A Miller; L Ni; J E Ormsby; D Qi; T B K Reddy; D J Reed; B Richards-Smith; D R Shaw; R Sinclair; C L Smith; P Szauter; M B Walker; D O Walton; L L Washburn; I T Witham; Y Zhu
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  Database resources of the National Center for Biotechnology Information.

Authors:  Eric W Sayers; Tanya Barrett; Dennis A Benson; Stephen H Bryant; Kathi Canese; Vyacheslav Chetvernin; Deanna M Church; Michael DiCuccio; Ron Edgar; Scott Federhen; Michael Feolo; Lewis Y Geer; Wolfgang Helmberg; Yuri Kapustin; David Landsman; David J Lipman; Thomas L Madden; Donna R Maglott; Vadim Miller; Ilene Mizrachi; James Ostell; Kim D Pruitt; Gregory D Schuler; Edwin Sequeira; Stephen T Sherry; Martin Shumway; Karl Sirotkin; Alexandre Souvorov; Grigory Starchenko; Tatiana A Tatusova; Lukas Wagner; Eugene Yaschenko; Jian Ye
Journal:  Nucleic Acids Res       Date:  2008-10-21       Impact factor: 16.971

View more
  306 in total

1.  Consistent annotation of gene expression arrays.

Authors:  Benoît Ballester; Nathan Johnson; Glenn Proctor; Paul Flicek
Journal:  BMC Genomics       Date:  2010-05-11       Impact factor: 3.969

2.  IFT56 regulates vertebrate developmental patterning by maintaining IFTB complex integrity and ciliary microtubule architecture.

Authors:  Daisy Xin; Kasey J Christopher; Lewie Zeng; Yong Kong; Scott D Weatherbee
Journal:  Development       Date:  2017-03-06       Impact factor: 6.868

Review 3.  Massively parallel sequencing and rare disease.

Authors:  Sarah B Ng; Deborah A Nickerson; Michael J Bamshad; Jay Shendure
Journal:  Hum Mol Genet       Date:  2010-09-15       Impact factor: 6.150

4.  Whole-exome sequencing identifies compound heterozygous mutations in WDR62 in siblings with recurrent polymicrogyria.

Authors:  David R Murdock; Gary D Clark; Matthew N Bainbridge; Irene Newsham; Yuan-Qing Wu; Donna M Muzny; Sau Wai Cheung; Richard A Gibbs; Melissa B Ramocki
Journal:  Am J Med Genet A       Date:  2011-08-10       Impact factor: 2.802

Review 5.  Vision from next generation sequencing: multi-dimensional genome-wide analysis for producing gene regulatory networks underlying retinal development, aging and disease.

Authors:  Hyun-Jin Yang; Rinki Ratnapriya; Tiziana Cogliati; Jung-Woong Kim; Anand Swaroop
Journal:  Prog Retin Eye Res       Date:  2015-02-07       Impact factor: 21.198

6.  An analysis of exome sequencing for diagnostic testing of the genes associated with muscle disease and spastic paraplegia.

Authors:  Cristina Dias; Murat Sincan; Praveen F Cherukuri; Rosemarie Rupps; Yan Huang; Hannah Briemberg; Kathryn Selby; James C Mullikin; Thomas C Markello; David R Adams; William A Gahl; Cornelius F Boerkoel
Journal:  Hum Mutat       Date:  2012-02-28       Impact factor: 4.878

7.  Long runs of homozygosity are enriched for deleterious variation.

Authors:  Zachary A Szpiech; Jishu Xu; Trevor J Pemberton; Weiping Peng; Sebastian Zöllner; Noah A Rosenberg; Jun Z Li
Journal:  Am J Hum Genet       Date:  2013-06-06       Impact factor: 11.025

8.  Exome sequencing finds a novel PCSK1 mutation in a child with generalized malabsorptive diarrhea and diabetes insipidus.

Authors:  Michael Yourshaw; R Sergio Solorzano-Vargas; Lindsay A Pickett; Iris Lindberg; Jiafang Wang; Galen Cortina; Anna Pawlikowska-Haddal; Howard Baron; Robert S Venick; Stanley F Nelson; Martín G Martín
Journal:  J Pediatr Gastroenterol Nutr       Date:  2013-12       Impact factor: 2.839

9.  Identification of TP53 as an acute lymphocytic leukemia susceptibility gene through exome sequencing.

Authors:  Bradford C Powell; Lichun Jiang; Donna M Muzny; Lisa R Treviño; Zoann E Dreyer; Louise C Strong; David A Wheeler; Richard A Gibbs; Sharon E Plon
Journal:  Pediatr Blood Cancer       Date:  2012-12-19       Impact factor: 3.167

10.  The UCSC Genome Browser.

Authors:  Donna Karolchik; Angie S Hinrichs; W James Kent
Journal:  Curr Protoc Bioinformatics       Date:  2009-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.