Literature DB >> 12952880

Assessment of genome-wide protein function classification for Drosophila melanogaster.

Huaiyu Mi1, Jody Vandergriff, Michael Campbell, Apurva Narechania, William Majoros, Suzanna Lewis, Paul D Thomas, Michael Ashburner.   

Abstract

The functional classification of genes on a genome-wide scale is now in its infancy, and we make a first attempt to assess existing methods and identify sources of error. To this end, we compared two independent efforts for associating proteins with functions, one implemented by FlyBase and the other by PANTHER at Celera Genomics. Both methods make inferences based on sequence similarity and the available experimental evidence. However, they differ considerably in methodology and process. Overall, assuming that the systematic error across the two methods is relatively small, we find the protein-to-function association error rate of both the FlyBase and PANTHER methods to be <2%. The primary source of error for both methods appears to be simple human error. Although homology-based inference can certainly cause errors in annotation, our analysis indicates that the frequency of such errors is relatively small compared with the number of correct inferences. Moreover, these homology errors can be minimized by careful tree-based inference, such as that implemented in PANTHER. Often, functional associations are made by one method and not the other, indicating that one of the greatest challenges lies in improving the completeness of available ontology associations.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12952880      PMCID: PMC403707          DOI: 10.1101/gr.771603

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  19 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

3.  A comparison of whole-genome shotgun-derived mouse chromosome 16 and the human genome.

Authors:  Richard J Mural; Mark D Adams; Eugene W Myers; Hamilton O Smith; George L Gabor Miklos; Ron Wides; Aaron Halpern; Peter W Li; Granger G Sutton; Joe Nadeau; Steven L Salzberg; Robert A Holt; Chinnappa D Kodira; Fu Lu; Lin Chen; Zuoming Deng; Carlos C Evangelista; Weiniu Gan; Thomas J Heiman; Jiayin Li; Zhenya Li; Gennady V Merkulov; Natalia V Milshina; Ashwinikumar K Naik; Rong Qi; Bixiong Chris Shue; Aihui Wang; Jian Wang; Xin Wang; Xianghe Yan; Jane Ye; Shibu Yooseph; Qi Zhao; Liansheng Zheng; Shiaoping C Zhu; Kendra Biddick; Randall Bolanos; Arthur L Delcher; Ian M Dew; Daniel Fasulo; Michael J Flanigan; Daniel H Huson; Saul A Kravitz; Jason R Miller; Clark M Mobarry; Knut Reinert; Karin A Remington; Qing Zhang; Xiangqun H Zheng; Deborah R Nusskern; Zhongwu Lai; Yiding Lei; Wenyan Zhong; Alison Yao; Ping Guan; Rui-Ru Ji; Zhiping Gu; Zhen-Yuan Wang; Fei Zhong; Chunlin Xiao; Chia-Chien Chiang; Mark Yandell; Jennifer R Wortman; Peter G Amanatides; Suzanne L Hladun; Eric C Pratts; Jeffery E Johnson; Kristina L Dodson; Kerry J Woodford; Cheryl A Evans; Barry Gropman; Douglas B Rusch; Eli Venter; Mei Wang; Thomas J Smith; Jarrett T Houck; Donald E Tompkins; Charles Haynes; Debbie Jacob; Soo H Chin; David R Allen; Carl E Dahlke; Robert Sanders; Kelvin Li; Xiangjun Liu; Alexander A Levitsky; William H Majoros; Quan Chen; Ashley C Xia; John R Lopez; Michael T Donnelly; Matthew H Newman; Anna Glodek; Cheryl L Kraft; Marc Nodell; Feroze Ali; Hui-Jin An; Danita Baldwin-Pitts; Karen Y Beeson; Shuang Cai; Mark Carnes; Amy Carver; Parris M Caulk; Angela Center; Yen-Hui Chen; Ming-Lai Cheng; My D Coyne; Michelle Crowder; Steven Danaher; Lionel B Davenport; Raymond Desilets; Susanne M Dietz; Lisa Doup; Patrick Dullaghan; Steven Ferriera; Carl R Fosler; Harold C Gire; Andres Gluecksmann; Jeannine D Gocayne; Jonathan Gray; Brit Hart; Jason Haynes; Jeffery Hoover; Tim Howland; Chinyere Ibegwam; Mena Jalali; David Johns; Leslie Kline; Daniel S Ma; Steven MacCawley; Anand Magoon; Felecia Mann; David May; Tina C McIntosh; Somil Mehta; Linda Moy; Mee C Moy; Brian J Murphy; Sean D Murphy; Keith A Nelson; Zubeda Nuri; Kimberly A Parker; Alexandre C Prudhomme; Vinita N Puri; Hina Qureshi; John C Raley; Matthew S Reardon; Megan A Regier; Yu-Hui C Rogers; Deanna L Romblad; Jakob Schutz; John L Scott; Richard Scott; Cynthia D Sitter; Michella Smallwood; Arlan C Sprague; Erin Stewart; Renee V Strong; Ellen Suh; Karena Sylvester; Reginald Thomas; Ni Ni Tint; Christopher Tsonis; Gary Wang; George Wang; Monica S Williams; Sherita M Williams; Sandra M Windsor; Keriellen Wolfe; Mitchell M Wu; Jayshree Zaveri; Kabir Chaturvedi; Andrei E Gabrielian; Zhaoxi Ke; Jingtao Sun; Gangadharan Subramanian; J Craig Venter; Cynthia M Pfannkoch; Mary Barnstead; Lisa D Stephenson
Journal:  Science       Date:  2002-05-31       Impact factor: 47.728

4.  Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes.

Authors:  Samuel Aparicio; Jarrod Chapman; Elia Stupka; Nik Putnam; Jer-Ming Chia; Paramvir Dehal; Alan Christoffels; Sam Rash; Shawn Hoon; Arian Smit; Maarten D Sollewijn Gelpke; Jared Roach; Tania Oh; Isaac Y Ho; Marie Wong; Chris Detter; Frans Verhoef; Paul Predki; Alice Tay; Susan Lucas; Paul Richardson; Sarah F Smith; Melody S Clark; Yvonne J K Edwards; Norman Doggett; Andrey Zharkikh; Sean V Tavtigian; Dmitry Pruss; Mary Barnstead; Cheryl Evans; Holly Baden; Justin Powell; Gustavo Glusman; Lee Rowen; Leroy Hood; Y H Tan; Greg Elgar; Trevor Hawkins; Byrappa Venkatesh; Daniel Rokhsar; Sydney Brenner
Journal:  Science       Date:  2002-07-25       Impact factor: 47.728

5.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

6.  The FlyBase database of the Drosophila genome projects and community literature.

Authors: 
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

7.  Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.

Authors: 
Journal:  Nature       Date:  2000-12-14       Impact factor: 49.962

8.  The genome sequence of Schizosaccharomyces pombe.

Authors:  V Wood; R Gwilliam; M-A Rajandream; M Lyne; R Lyne; A Stewart; J Sgouros; N Peat; J Hayles; S Baker; D Basham; S Bowman; K Brooks; D Brown; S Brown; T Chillingworth; C Churcher; M Collins; R Connor; A Cronin; P Davis; T Feltwell; A Fraser; S Gentles; A Goble; N Hamlin; D Harris; J Hidalgo; G Hodgson; S Holroyd; T Hornsby; S Howarth; E J Huckle; S Hunt; K Jagels; K James; L Jones; M Jones; S Leather; S McDonald; J McLean; P Mooney; S Moule; K Mungall; L Murphy; D Niblett; C Odell; K Oliver; S O'Neil; D Pearson; M A Quail; E Rabbinowitsch; K Rutherford; S Rutter; D Saunders; K Seeger; S Sharp; J Skelton; M Simmonds; R Squares; S Squares; K Stevens; K Taylor; R G Taylor; A Tivey; S Walsh; T Warren; S Whitehead; J Woodward; G Volckaert; R Aert; J Robben; B Grymonprez; I Weltjens; E Vanstreels; M Rieger; M Schäfer; S Müller-Auer; C Gabel; M Fuchs; A Düsterhöft; C Fritzc; E Holzer; D Moestl; H Hilbert; K Borzym; I Langer; A Beck; H Lehrach; R Reinhardt; T M Pohl; P Eger; W Zimmermann; H Wedler; R Wambutt; B Purnelle; A Goffeau; E Cadieu; S Dréano; S Gloux; V Lelaure; S Mottier; F Galibert; S J Aves; Z Xiang; C Hunt; K Moore; S M Hurst; M Lucas; M Rochet; C Gaillardin; V A Tallada; A Garzon; G Thode; R R Daga; L Cruzado; J Jimenez; M Sánchez; F del Rey; J Benito; A Domínguez; J L Revuelta; S Moreno; J Armstrong; S L Forsburg; L Cerutti; T Lowe; W R McCombie; I Paulsen; J Potashkin; G V Shpakovski; D Ussery; B G Barrell; P Nurse; L Cerrutti
Journal:  Nature       Date:  2002-02-21       Impact factor: 49.962

Review 9.  Genome sequence of the nematode C. elegans: a platform for investigating biology.

Authors: 
Journal:  Science       Date:  1998-12-11       Impact factor: 47.728

10.  The sequence of the human genome.

Authors:  J C Venter; M D Adams; E W Myers; P W Li; R J Mural; G G Sutton; H O Smith; M Yandell; C A Evans; R A Holt; J D Gocayne; P Amanatides; R M Ballew; D H Huson; J R Wortman; Q Zhang; C D Kodira; X H Zheng; L Chen; M Skupski; G Subramanian; P D Thomas; J Zhang; G L Gabor Miklos; C Nelson; S Broder; A G Clark; J Nadeau; V A McKusick; N Zinder; A J Levine; R J Roberts; M Simon; C Slayman; M Hunkapiller; R Bolanos; A Delcher; I Dew; D Fasulo; M Flanigan; L Florea; A Halpern; S Hannenhalli; S Kravitz; S Levy; C Mobarry; K Reinert; K Remington; J Abu-Threideh; E Beasley; K Biddick; V Bonazzi; R Brandon; M Cargill; I Chandramouliswaran; R Charlab; K Chaturvedi; Z Deng; V Di Francesco; P Dunn; K Eilbeck; C Evangelista; A E Gabrielian; W Gan; W Ge; F Gong; Z Gu; P Guan; T J Heiman; M E Higgins; R R Ji; Z Ke; K A Ketchum; Z Lai; Y Lei; Z Li; J Li; Y Liang; X Lin; F Lu; G V Merkulov; N Milshina; H M Moore; A K Naik; V A Narayan; B Neelam; D Nusskern; D B Rusch; S Salzberg; W Shao; B Shue; J Sun; Z Wang; A Wang; X Wang; J Wang; M Wei; R Wides; C Xiao; C Yan; A Yao; J Ye; M Zhan; W Zhang; H Zhang; Q Zhao; L Zheng; F Zhong; W Zhong; S Zhu; S Zhao; D Gilbert; S Baumhueter; G Spier; C Carter; A Cravchik; T Woodage; F Ali; H An; A Awe; D Baldwin; H Baden; M Barnstead; I Barrow; K Beeson; D Busam; A Carver; A Center; M L Cheng; L Curry; S Danaher; L Davenport; R Desilets; S Dietz; K Dodson; L Doup; S Ferriera; N Garg; A Gluecksmann; B Hart; J Haynes; C Haynes; C Heiner; S Hladun; D Hostin; J Houck; T Howland; C Ibegwam; J Johnson; F Kalush; L Kline; S Koduru; A Love; F Mann; D May; S McCawley; T McIntosh; I McMullen; M Moy; L Moy; B Murphy; K Nelson; C Pfannkoch; E Pratts; V Puri; H Qureshi; M Reardon; R Rodriguez; Y H Rogers; D Romblad; B Ruhfel; R Scott; C Sitter; M Smallwood; E Stewart; R Strong; E Suh; R Thomas; N N Tint; S Tse; C Vech; G Wang; J Wetter; S Williams; M Williams; S Windsor; E Winn-Deen; K Wolfe; J Zaveri; K Zaveri; J F Abril; R Guigó; M J Campbell; K V Sjolander; B Karlak; A Kejariwal; H Mi; B Lazareva; T Hatton; A Narechania; K Diemer; A Muruganujan; N Guo; S Sato; V Bafna; S Istrail; R Lippert; R Schwartz; B Walenz; S Yooseph; D Allen; A Basu; J Baxendale; L Blick; M Caminha; J Carnes-Stine; P Caulk; Y H Chiang; M Coyne; C Dahlke; A Deslattes Mays; M Dombroski; M Donnelly; D Ely; S Esparham; C Fosler; H Gire; S Glanowski; K Glasser; A Glodek; M Gorokhov; K Graham; B Gropman; M Harris; J Heil; S Henderson; J Hoover; D Jennings; C Jordan; J Jordan; J Kasha; L Kagan; C Kraft; A Levitsky; M Lewis; X Liu; J Lopez; D Ma; W Majoros; J McDaniel; S Murphy; M Newman; T Nguyen; N Nguyen; M Nodell; S Pan; J Peck; M Peterson; W Rowe; R Sanders; J Scott; M Simpson; T Smith; A Sprague; T Stockwell; R Turner; E Venter; M Wang; M Wen; D Wu; M Wu; A Xia; A Zandieh; X Zhu
Journal:  Science       Date:  2001-02-16       Impact factor: 47.728

View more
  19 in total

1.  The Gene Ontology (GO) database and informatics resource.

Authors:  M A Harris; J Clark; A Ireland; J Lomax; M Ashburner; R Foulger; K Eilbeck; S Lewis; B Marshall; C Mungall; J Richter; G M Rubin; J A Blake; C Bult; M Dolan; H Drabkin; J T Eppig; D P Hill; L Ni; M Ringwald; R Balakrishnan; J M Cherry; K R Christie; M C Costanzo; S S Dwight; S Engel; D G Fisk; J E Hirschman; E L Hong; R S Nash; A Sethuraman; C L Theesfeld; D Botstein; K Dolinski; B Feierbach; T Berardini; S Mundodi; S Y Rhee; R Apweiler; D Barrell; E Camon; E Dimmer; V Lee; R Chisholm; P Gaudet; W Kibbe; R Kishore; E M Schwarz; P Sternberg; M Gwinn; L Hannick; J Wortman; M Berriman; V Wood; N de la Cruz; P Tonellato; P Jaiswal; T Seigfried; R White
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  PANTHER: a library of protein families and subfamilies indexed by function.

Authors:  Paul D Thomas; Michael J Campbell; Anish Kejariwal; Huaiyu Mi; Brian Karlak; Robin Daverman; Karen Diemer; Anushya Muruganujan; Apurva Narechania
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

3.  The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes.

Authors:  Andreas Ruepp; Alfred Zollner; Dieter Maier; Kaj Albermann; Jean Hani; Martin Mokrejs; Igor Tetko; Ulrich Güldener; Gertrud Mannhaupt; Martin Münsterkötter; H Werner Mewes
Journal:  Nucleic Acids Res       Date:  2004-10-14       Impact factor: 16.971

4.  It's all GO for plant scientists.

Authors:  Jennifer I Clark; Cath Brooksbank; Jane Lomax
Journal:  Plant Physiol       Date:  2005-07       Impact factor: 8.340

Review 5.  FINDSITE: a combined evolution/structure-based approach to protein function prediction.

Authors:  Jeffrey Skolnick; Michal Brylinski
Journal:  Brief Bioinform       Date:  2009-03-26       Impact factor: 11.622

6.  High-efficiency transformation of Plasmodium falciparum by the lepidopteran transposable element piggyBac.

Authors:  Bharath Balu; Douglas A Shoue; Malcolm J Fraser; John H Adams
Journal:  Proc Natl Acad Sci U S A       Date:  2005-10-31       Impact factor: 11.205

Review 7.  PANTHER: Making genome-scale phylogenetics accessible to all.

Authors:  Paul D Thomas; Dustin Ebert; Anushya Muruganujan; Tremayne Mushayahama; Laurent-Philippe Albou; Huaiyu Mi
Journal:  Protein Sci       Date:  2021-11-25       Impact factor: 6.725

8.  Large-scale gene function analysis with the PANTHER classification system.

Authors:  Huaiyu Mi; Anushya Muruganujan; John T Casagrande; Paul D Thomas
Journal:  Nat Protoc       Date:  2013-07-18       Impact factor: 13.491

9.  PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification.

Authors:  Paul D Thomas; Anish Kejariwal; Michael J Campbell; Huaiyu Mi; Karen Diemer; Nan Guo; Istvan Ladunga; Betty Ulitsky-Lazareva; Anushya Muruganujan; Steven Rabkin; Jody A Vandergriff; Olivier Doremieux
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

10.  A threading-based method for the prediction of DNA-binding proteins with application to the human genome.

Authors:  Mu Gao; Jeffrey Skolnick
Journal:  PLoS Comput Biol       Date:  2009-11-13       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.