Literature DB >> 18040051

Distinguishing protein-coding and noncoding genes in the human genome.

Michele Clamp1, Ben Fry, Mike Kamal, Xiaohui Xie, James Cuff, Michael F Lin, Manolis Kellis, Kerstin Lindblad-Toh, Eric S Lander.   

Abstract

Although the Human Genome Project was completed 4 years ago, the catalog of human protein-coding genes remains a matter of controversy. Current catalogs list a total of approximately 24,500 putative protein-coding genes. It is broadly suspected that a large fraction of these entries are functionally meaningless ORFs present by chance in RNA transcripts, because they show no evidence of evolutionary conservation with mouse or dog. However, there is currently no scientific justification for excluding ORFs simply because they fail to show evolutionary conservation: the alternative hypothesis is that most of these ORFs are actually valid human genes that reflect gene innovation in the primate lineage or gene loss in the other lineages. Here, we reject this hypothesis by carefully analyzing the nonconserved ORFs-specifically, their properties in other primates. We show that the vast majority of these ORFs are random occurrences. The analysis yields, as a by-product, a major revision of the current human catalogs, cutting the number of protein-coding genes to approximately 20,500. Specifically, it suggests that nonconserved ORFs should be added to the human gene catalog only if there is clear evidence of an encoded protein. It also provides a principled methodology for evaluating future proposed additions to the human gene catalog. Finally, the results indicate that there has been relatively little true innovation in mammalian protein-coding genes.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18040051      PMCID: PMC2148306          DOI: 10.1073/pnas.0709013104

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  15 in total

Review 1.  Recent duplication, domain accretion and the dynamic mutation of the human genome.

Authors:  E E Eichler
Journal:  Trends Genet       Date:  2001-11       Impact factor: 11.639

2.  The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes.

Authors:  Helen Skaletsky; Tomoko Kuroda-Kawaguchi; Patrick J Minx; Holland S Cordum; LaDeana Hillier; Laura G Brown; Sjoerd Repping; Tatyana Pyntikova; Johar Ali; Tamberlyn Bieri; Asif Chinwalla; Andrew Delehaunty; Kim Delehaunty; Hui Du; Ginger Fewell; Lucinda Fulton; Robert Fulton; Tina Graves; Shun-Fang Hou; Philip Latrielle; Shawn Leonard; Elaine Mardis; Rachel Maupin; John McPherson; Tracie Miner; William Nash; Christine Nguyen; Philip Ozersky; Kymberlie Pepin; Susan Rock; Tracy Rohlfing; Kelsi Scott; Brian Schultz; Cindy Strong; Aye Tin-Wollam; Shiaw-Pyng Yang; Robert H Waterston; Richard K Wilson; Steve Rozen; David C Page
Journal:  Nature       Date:  2003-06-19       Impact factor: 49.962

3.  Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution.

Authors:  Jill Cheng; Philipp Kapranov; Jorg Drenkow; Sujit Dike; Shane Brubaker; Sandeep Patel; Jeffrey Long; David Stern; Hari Tammana; Gregg Helt; Victor Sementchenko; Antonio Piccolboni; Stefan Bekiranov; Dione K Bailey; Madhavan Ganesh; Srinka Ghosh; Ian Bell; Daniela S Gerhard; Thomas R Gingeras
Journal:  Science       Date:  2005-03-24       Impact factor: 47.728

4.  Evolutionary fate of retroposed gene copies in the human genome.

Authors:  Nicolas Vinckenbosch; Isabelle Dupanloup; Henrik Kaessmann
Journal:  Proc Natl Acad Sci U S A       Date:  2006-02-21       Impact factor: 11.205

5.  Genome sequence, comparative analysis and haplotype structure of the domestic dog.

Authors:  Kerstin Lindblad-Toh; Claire M Wade; Tarjei S Mikkelsen; Elinor K Karlsson; David B Jaffe; Michael Kamal; Michele Clamp; Jean L Chang; Edward J Kulbokas; Michael C Zody; Evan Mauceli; Xiaohui Xie; Matthew Breen; Robert K Wayne; Elaine A Ostrander; Chris P Ponting; Francis Galibert; Douglas R Smith; Pieter J DeJong; Ewen Kirkness; Pablo Alvarez; Tara Biagi; William Brockman; Jonathan Butler; Chee-Wye Chin; April Cook; James Cuff; Mark J Daly; David DeCaprio; Sante Gnerre; Manfred Grabherr; Manolis Kellis; Michael Kleber; Carolyne Bardeleben; Leo Goodstadt; Andreas Heger; Christophe Hitte; Lisa Kim; Klaus-Peter Koepfli; Heidi G Parker; John P Pollinger; Stephen M J Searle; Nathan B Sutter; Rachael Thomas; Caleb Webber; Jennifer Baldwin; Adal Abebe; Amr Abouelleil; Lynne Aftuck; Mostafa Ait-Zahra; Tyler Aldredge; Nicole Allen; Peter An; Scott Anderson; Claudel Antoine; Harindra Arachchi; Ali Aslam; Laura Ayotte; Pasang Bachantsang; Andrew Barry; Tashi Bayul; Mostafa Benamara; Aaron Berlin; Daniel Bessette; Berta Blitshteyn; Toby Bloom; Jason Blye; Leonid Boguslavskiy; Claude Bonnet; Boris Boukhgalter; Adam Brown; Patrick Cahill; Nadia Calixte; Jody Camarata; Yama Cheshatsang; Jeffrey Chu; Mieke Citroen; Alville Collymore; Patrick Cooke; Tenzin Dawoe; Riza Daza; Karin Decktor; Stuart DeGray; Norbu Dhargay; Kimberly Dooley; Kathleen Dooley; Passang Dorje; Kunsang Dorjee; Lester Dorris; Noah Duffey; Alan Dupes; Osebhajajeme Egbiremolen; Richard Elong; Jill Falk; Abderrahim Farina; Susan Faro; Diallo Ferguson; Patricia Ferreira; Sheila Fisher; Mike FitzGerald; Karen Foley; Chelsea Foley; Alicia Franke; Dennis Friedrich; Diane Gage; Manuel Garber; Gary Gearin; Georgia Giannoukos; Tina Goode; Audra Goyette; Joseph Graham; Edward Grandbois; Kunsang Gyaltsen; Nabil Hafez; Daniel Hagopian; Birhane Hagos; Jennifer Hall; Claire Healy; Ryan Hegarty; Tracey Honan; Andrea Horn; Nathan Houde; Leanne Hughes; Leigh Hunnicutt; M Husby; Benjamin Jester; Charlien Jones; Asha Kamat; Ben Kanga; Cristyn Kells; Dmitry Khazanovich; Alix Chinh Kieu; Peter Kisner; Mayank Kumar; Krista Lance; Thomas Landers; Marcia Lara; William Lee; Jean-Pierre Leger; Niall Lennon; Lisa Leuper; Sarah LeVine; Jinlei Liu; Xiaohong Liu; Yeshi Lokyitsang; Tashi Lokyitsang; Annie Lui; Jan Macdonald; John Major; Richard Marabella; Kebede Maru; Charles Matthews; Susan McDonough; Teena Mehta; James Meldrim; Alexandre Melnikov; Louis Meneus; Atanas Mihalev; Tanya Mihova; Karen Miller; Rachel Mittelman; Valentine Mlenga; Leonidas Mulrain; Glen Munson; Adam Navidi; Jerome Naylor; Tuyen Nguyen; Nga Nguyen; Cindy Nguyen; Thu Nguyen; Robert Nicol; Nyima Norbu; Choe Norbu; Nathaniel Novod; Tenchoe Nyima; Peter Olandt; Barry O'Neill; Keith O'Neill; Sahal Osman; Lucien Oyono; Christopher Patti; Danielle Perrin; Pema Phunkhang; Fritz Pierre; Margaret Priest; Anthony Rachupka; Sujaa Raghuraman; Rayale Rameau; Verneda Ray; Christina Raymond; Filip Rege; Cecil Rise; Julie Rogers; Peter Rogov; Julie Sahalie; Sampath Settipalli; Theodore Sharpe; Terrance Shea; Mechele Sheehan; Ngawang Sherpa; Jianying Shi; Diana Shih; Jessie Sloan; Cherylyn Smith; Todd Sparrow; John Stalker; Nicole Stange-Thomann; Sharon Stavropoulos; Catherine Stone; Sabrina Stone; Sean Sykes; Pierre Tchuinga; Pema Tenzing; Senait Tesfaye; Dawa Thoulutsang; Yama Thoulutsang; Kerri Topham; Ira Topping; Tsamla Tsamla; Helen Vassiliev; Vijay Venkataraman; Andy Vo; Tsering Wangchuk; Tsering Wangdi; Michael Weiand; Jane Wilkinson; Adam Wilson; Shailendra Yadav; Shuli Yang; Xiaoping Yang; Geneva Young; Qing Yu; Joanne Zainoun; Lisa Zembek; Andrew Zimmer; Eric S Lander
Journal:  Nature       Date:  2005-12-08       Impact factor: 49.962

6.  Sequence and organization of the human mitochondrial genome.

Authors:  S Anderson; A T Bankier; B G Barrell; M H de Bruijn; A R Coulson; J Drouin; I C Eperon; D P Nierlich; B A Roe; F Sanger; P H Schreier; A J Smith; R Staden; I G Young
Journal:  Nature       Date:  1981-04-09       Impact factor: 49.962

7.  Revisiting the protein-coding gene catalog of Drosophila melanogaster using 12 fly genomes.

Authors:  Michael F Lin; Joseph W Carlson; Madeline A Crosby; Beverley B Matthews; Charles Yu; Soo Park; Kenneth H Wan; Andrew J Schroeder; L Sian Gramates; Susan E St Pierre; Margaret Roark; Kenneth L Wiley; Rob J Kulathinal; Peili Zhang; Kyl V Myrick; Jerry V Antone; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

8.  The transcriptional landscape of the mammalian genome.

Authors:  P Carninci; T Kasukawa; S Katayama; J Gough; M C Frith; N Maeda; R Oyama; T Ravasi; B Lenhard; C Wells; R Kodzius; K Shimokawa; V B Bajic; S E Brenner; S Batalov; A R R Forrest; M Zavolan; M J Davis; L G Wilming; V Aidinis; J E Allen; A Ambesi-Impiombato; R Apweiler; R N Aturaliya; T L Bailey; M Bansal; L Baxter; K W Beisel; T Bersano; H Bono; A M Chalk; K P Chiu; V Choudhary; A Christoffels; D R Clutterbuck; M L Crowe; E Dalla; B P Dalrymple; B de Bono; G Della Gatta; D di Bernardo; T Down; P Engstrom; M Fagiolini; G Faulkner; C F Fletcher; T Fukushima; M Furuno; S Futaki; M Gariboldi; P Georgii-Hemming; T R Gingeras; T Gojobori; R E Green; S Gustincich; M Harbers; Y Hayashi; T K Hensch; N Hirokawa; D Hill; L Huminiecki; M Iacono; K Ikeo; A Iwama; T Ishikawa; M Jakt; A Kanapin; M Katoh; Y Kawasawa; J Kelso; H Kitamura; H Kitano; G Kollias; S P T Krishnan; A Kruger; S K Kummerfeld; I V Kurochkin; L F Lareau; D Lazarevic; L Lipovich; J Liu; S Liuni; S McWilliam; M Madan Babu; M Madera; L Marchionni; H Matsuda; S Matsuzawa; H Miki; F Mignone; S Miyake; K Morris; S Mottagui-Tabar; N Mulder; N Nakano; H Nakauchi; P Ng; R Nilsson; S Nishiguchi; S Nishikawa; F Nori; O Ohara; Y Okazaki; V Orlando; K C Pang; W J Pavan; G Pavesi; G Pesole; N Petrovsky; S Piazza; J Reed; J F Reid; B Z Ring; M Ringwald; B Rost; Y Ruan; S L Salzberg; A Sandelin; C Schneider; C Schönbach; K Sekiguchi; C A M Semple; S Seno; L Sessa; Y Sheng; Y Shibata; H Shimada; K Shimada; D Silva; B Sinclair; S Sperling; E Stupka; K Sugiura; R Sultana; Y Takenaka; K Taki; K Tammoja; S L Tan; S Tang; M S Taylor; J Tegner; S A Teichmann; H R Ueda; E van Nimwegen; R Verardo; C L Wei; K Yagi; H Yamanishi; E Zabarovsky; S Zhu; A Zimmer; W Hide; C Bult; S M Grimmond; R D Teasdale; E T Liu; V Brusic; J Quackenbush; C Wahlestedt; J S Mattick; D A Hume; C Kai; D Sasaki; Y Tomaru; S Fukuda; M Kanamori-Katayama; M Suzuki; J Aoki; T Arakawa; J Iida; K Imamura; M Itoh; T Kato; H Kawaji; N Kawagashira; T Kawashima; M Kojima; S Kondo; H Konno; K Nakano; N Ninomiya; T Nishio; M Okada; C Plessy; K Shibata; T Shiraki; S Suzuki; M Tagami; K Waki; A Watahiki; Y Okamura-Oho; H Suzuki; J Kawai; Y Hayashizaki
Journal:  Science       Date:  2005-09-02       Impact factor: 47.728

9.  Pfam: clans, web tools and services.

Authors:  Robert D Finn; Jaina Mistry; Benjamin Schuster-Böckler; Sam Griffiths-Jones; Volker Hollich; Timo Lassmann; Simon Moxon; Mhairi Marshall; Ajay Khanna; Richard Durbin; Sean R Eddy; Erik L L Sonnhammer; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  The Vertebrate Genome Annotation (Vega) database.

Authors:  J L Ashurst; C-K Chen; J G R Gilbert; K Jekosch; S Keenan; P Meidl; S M Searle; J Stalker; R Storey; S Trevanion; L Wilming; T Hubbard
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  210 in total

Review 1.  Early progress in epigenetic regulation of endothelin pathway genes.

Authors:  A K Welch; M E Jacobs; C S Wingo; B D Cain
Journal:  Br J Pharmacol       Date:  2013-01       Impact factor: 8.739

2.  Mass spectrometry in high-throughput proteomics: ready for the big time.

Authors:  Tommy Nilsson; Matthias Mann; Ruedi Aebersold; John R Yates; Amos Bairoch; John J M Bergeron
Journal:  Nat Methods       Date:  2010-09       Impact factor: 28.547

Review 3.  MicroRNAs in skin and wound healing.

Authors:  Jaideep Banerjee; Yuk Cheung Chan; Chandan K Sen
Journal:  Physiol Genomics       Date:  2010-10-19       Impact factor: 3.107

Review 4.  Annotating non-coding regions of the genome.

Authors:  Roger P Alexander; Gang Fang; Joel Rozowsky; Michael Snyder; Mark B Gerstein
Journal:  Nat Rev Genet       Date:  2010-07-13       Impact factor: 53.242

5.  Nonspecific binding limits the number of proteins in a cell and shapes their interaction networks.

Authors:  Margaret E Johnson; Gerhard Hummer
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-27       Impact factor: 11.205

6.  Gene inactivation and its implications for annotation in the era of personal genomics.

Authors:  Suganthi Balasubramanian; Lukas Habegger; Adam Frankish; Daniel G MacArthur; Rachel Harte; Chris Tyler-Smith; Jennifer Harrow; Mark Gerstein
Journal:  Genes Dev       Date:  2011-01-01       Impact factor: 11.361

7.  Targeted capture and next-generation sequencing identifies C9orf75, encoding taperin, as the mutated gene in nonsyndromic deafness DFNB79.

Authors:  Atteeq Ur Rehman; Robert J Morell; Inna A Belyantseva; Shahid Y Khan; Erich T Boger; Mohsin Shahzad; Zubair M Ahmed; Saima Riazuddin; Shaheen N Khan; Sheikh Riazuddin; Thomas B Friedman
Journal:  Am J Hum Genet       Date:  2010-02-18       Impact factor: 11.025

Review 8.  Systems approaches to molecular cancer diagnostics.

Authors:  Shuyi Ma; Cory C Funk; Nathan D Price
Journal:  Discov Med       Date:  2010-12       Impact factor: 2.970

9.  Proteomic landscape of bronchoalveolar lavage fluid in human immunodeficiency virus infection.

Authors:  Elizabeth V Nguyen; Sina A Gharib; Kristina Crothers; Yu-Hua Chow; David R Park; David R Goodlett; Lynn M Schnapp
Journal:  Am J Physiol Lung Cell Mol Physiol       Date:  2013-11-08       Impact factor: 5.464

10.  The other side of comparative genomics: genes with no orthologs between the cow and other mammalian species.

Authors:  Raffaele Mazza; Francesco Strozzi; Andrea Caprera; Paolo Ajmone-Marsan; John L Williams
Journal:  BMC Genomics       Date:  2009-12-14       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.