Literature DB >> 20653929

The evolution of gene regulation, the RNA universe, and the vexed questions of artefact and noise.

Miranda Robertson.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20653929      PMCID: PMC2905359          DOI: 10.1186/1741-7007-8-97

Source DB:  PubMed          Journal:  BMC Biol        ISSN: 1741-7007            Impact factor:   7.431


× No keyword cloud information.

Editorial

In the ten years since the first sequencing of the human genome, much has been made of the need to look to gene regulation, and not gene number or DNA sequence, for the evolution of organismal diversity and complexity - an issue that rose to prominence, with the realization first, that the number of human genes is about the same as the number required to specify a nematode worm; and second, that the DNA of H. sapiens is roughly 96% identical to that of the chimpanzee. But the realization that the secret of evolution lies in changes in gene regulation considerably predates the revelations of genomics. Allan Wilson and colleagues, in a paper published in 1974 [1], drew attention to the simple and striking fact that morphologically homogeneous frog species also have relatively homogeneous karyotypes, whereas mammalian species, which are markedly diverse morphologically, also show major differences in chromosome number and organization; changes in proteins, by contrast, are much the same for both groups. They concluded that genome organization, and by implication gene regulation, is more important for metazoan evolution than protein sequence (and cite earlier publications of EB Ford and Susumu Ohno for the same insight). The following year, Mary-Claire King and Wilson published a more detailed examination of the chromosomal distinctions between human and chimpanzee [2], arguing compellingly, without benefit of high-throughput anything, that changes in the organization of the genome, and not changes in protein-coding sequence, must account for the crucial differences between the two primates. In those pre-genomic days, the protein data were in large part immunological and electrophoretic; the analysis of genome reorganization depended on chromosome banding patterns (Giemsa banding, not FISH); and almost nothing was known of the mechanism of gene regulation in eukaryotes. The ground between then and now is covered in a recent review by Sean Carroll [3], who acknowledges Emile Zuckerkandl and Eric Davidson as early proponents of the importance of gene regulation in morphological evolution and charts the remarkable history of the development of ideas consequent on the discovery of the homeobox genes, with a strong emphasis on the evolution of cis-regulatory elements - that is to say, DNA binding sites for gene regulatory proteins - as the basis for morphological change. The argument is that DNA regulatory elements and the proteins that bind to them, often combinatorially, constitute regulatory networks that can evolve rapidly through changes to the regulatory elements, which are often modular, different modules binding different proteins characteristic of distinct differentiated states of a cell. The gene regulatory proteins can also change, of course, but are generally more highly conserved than their binding sites. Tuch et al. [4] have published a short and pellucid overview of the essential points and principles of this schema, in the context of recent evidence on how such regulatory circuits can become rewired in yeast. In our video Q&A published today [5], John Mattick gives a personal account of his arguments for the view that the regulatory potential of proteins and their binding sites is not sufficient to account for the evolution of complex higher organisms, and explains his case for invoking a largely uncharted universe of regulatory RNAs.

Beyond regulatory proteins

He puts his points much more eloquently and persuasively than I could, and I will not rehearse them here: so for an elaboration of the argument, and for how the structural properties of RNA lend themselves to exploitation in the regulation of gene expression, or how its functional versatility may contribute to the evolution of cognition, I refer the reader to the interview (which is available as text as well as video). But a significant part of the basis for his ideas lies in reports over the past several years that most of the genome is transcribed (see especially [6]). Since less than 2% of the human genome, in particular, encodes proteins, this would appear to mean an RNA world on a scale well beyond that of the known world of proteins, and the possibility of a hitherto undreamed of regulatory resource. The alternative view is that most of the non-coding RNA can be accounted for as technical artefact or transcriptional noise (see [7]).

Vexed questions

Technical artefact is an issue because much of the evidence for wholesale transcription of the genome derives from tiling array technology, in which labelled cDNAs representing the transcriptome are hybridized to arrays representing the entire genome, and which is susceptible to false positives due to hybridization with imperfectly matched probes. With the more recent development of techniques for high-throughput sequencing of cDNAs (RNA-seq), it has become possible tackle the transcriptome by direct sequencing, eliminating the problem of cross-hybridization and leading to much lower estimates of the proportion of the genome that is transcribed. The functional significance of the transcripts has been called into question on several grounds: for example, many are rare, or rapidly degraded; and they are generally ill conserved. But these arguments can be reasonably easily turned on their head, and precisely the same properties construed as consistent with, if not indicative of, a regulatory role. The detailed arguments and counter-arguments can be found in reviews by Mattick and colleagues, and by Timothy Hughes and Harm van Bakel from the opposing viewpoint, published last year in Briefings in Functional Genomics and Proteomics [7,8]. More recently, Hughes and colleagues have published a paper [9] directly addressing the question of artefact by comparing the results of tiling array experiments and RNA-seq on a range of human and mouse tissues and cell lines, and pursuing the issue of function through an analysis of those transcripts that emerge as valid in the RNA-seq results. They conclude, first, that the great majority of the non-coding transcripts identified in tiling arrays are cross-hybridization artefacts, leaving 12% that are also identified by RNA-seq; and second, that of those, the great majority can be accounted for as unannotated exons of known genes, or introns of known genes, or transcriptional noise due to overrunning polymerases, leaving 2% as non-coding RNA of unknown function. This second point, on the nature and the functional significance or otherwise of the transcripts, is a matter of interpretation, and can no doubt be debated. Nor is the first point exempt from challenge: RNA-seq analysis, like any other genomic analysis, may give different results depending upon how it is done, and rare transcripts, for example, may be missed: a news report [10] on the van Bakel et al. paper [9] quotes Philipp Kapranov, whose RNA-seq analysis apparently delivers much higher estimates of non-coding transcription. But there have been other indications of false positives from tiling arrays, and it is difficult to escape the conclusion that the non-coding RNA universe may turn out to be substantially smaller than earlier analyses suggest.

The evolution of complexity

Where does this leave the issue of how to account for the complexity of higher organisms? (Let us put aside the question of how exactly complexity is defined, on the grounds that we can probably all agree that on any relevant criteria a human being is more complex than a nematode worm.) One implication of the van Bakel et al. paper is that there are more exons in the genome than we know about, which would imply more complexity than has yet been tallied in the protein universe. Nor has it been demonstrated by any rigorous computation that combinatorial control of gene expression by protein complexes is insufficient to support the regulatory complexity required to make a human (to which alternative splicing of coding RNAs is likely to make a significant contribution - see for example[11]). However it is clear that even if alternatively spliced and combinatorially interacting proteins were in principle adequate to the task, in practice that is not the sole regulatory resource, and there do indeed exist regulatory RNAs, some quite well understood, others much less well (see [5]). Regulatory RNAs of course also exist in bacteria, where they have been known for 30 years and have a considerable diversity of functions that are much better understood than the more recently discovered eukaryotic ones, and indeed richly illustrate the regulatory modes to which RNA lends itself [12] - a fact that Mattick does not mention in his Q&A for BMC Biology but has acknowledged clearly in other publications (see for example [8]). However there is already known to be quantitatively more regulatory RNA in mammals, even without the unexplained non-coding transcripts that have emerged from transcriptomics.

In biology, the answer is (almost) always yes

The magnitude of the contribution of technical artefact, unannotated coding sequence and transcriptional noise to the reported non-coding transcriptome may not yet be settled, but it would be astonishing if they didn't all contribute. As for whether the evolution of complexity depends on regulatory proteins or regulatory RNAs, the answer is certain to be yes to both. There is much still to be learned about gene regulatory circuits operated by proteins, which will no doubt turn out to include RNA components; and even more to be learned about regulatory RNA. It is the allure and promise of this unexplored territory that Mattick clearly finds irresistible.
  11 in total

Review 1.  Evolution at two levels in humans and chimpanzees.

Authors:  M C King; A C Wilson
Journal:  Science       Date:  1975-04-11       Impact factor: 47.728

2.  Evolution of eukaryotic transcription circuits.

Authors:  Brian B Tuch; Hao Li; Alexander D Johnson
Journal:  Science       Date:  2008-03-28       Impact factor: 47.728

3.  Pervasive transcription of the eukaryotic genome: functional indices and conceptual implications.

Authors:  Marcel E Dinger; Paulo P Amaral; Timothy R Mercer; John S Mattick
Journal:  Brief Funct Genomic Proteomic       Date:  2009-09-21

Review 4.  Establishing legitimacy and function in the new transcriptome.

Authors:  Harm van Bakel; Timothy R Hughes
Journal:  Brief Funct Genomic Proteomic       Date:  2009-11

5.  The importance of gene rearrangement in evolution: evidence from studies on rates of chromosomal, protein, and anatomical evolution.

Authors:  A C Wilson; V M Sarich; L R Maxson
Journal:  Proc Natl Acad Sci U S A       Date:  1974-08       Impact factor: 11.205

Review 6.  Expansion of the eukaryotic proteome by alternative splicing.

Authors:  Timothy W Nilsen; Brenton R Graveley
Journal:  Nature       Date:  2010-01-28       Impact factor: 49.962

Review 7.  Regulatory RNAs in bacteria.

Authors:  Lauren S Waters; Gisela Storz
Journal:  Cell       Date:  2009-02-20       Impact factor: 41.582

8.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.

Authors:  Ewan Birney; John A Stamatoyannopoulos; Anindya Dutta; Roderic Guigó; Thomas R Gingeras; Elliott H Margulies; Zhiping Weng; Michael Snyder; Emmanouil T Dermitzakis; Robert E Thurman; Michael S Kuehn; Christopher M Taylor; Shane Neph; Christoph M Koch; Saurabh Asthana; Ankit Malhotra; Ivan Adzhubei; Jason A Greenbaum; Robert M Andrews; Paul Flicek; Patrick J Boyle; Hua Cao; Nigel P Carter; Gayle K Clelland; Sean Davis; Nathan Day; Pawandeep Dhami; Shane C Dillon; Michael O Dorschner; Heike Fiegler; Paul G Giresi; Jeff Goldy; Michael Hawrylycz; Andrew Haydock; Richard Humbert; Keith D James; Brett E Johnson; Ericka M Johnson; Tristan T Frum; Elizabeth R Rosenzweig; Neerja Karnani; Kirsten Lee; Gregory C Lefebvre; Patrick A Navas; Fidencio Neri; Stephen C J Parker; Peter J Sabo; Richard Sandstrom; Anthony Shafer; David Vetrie; Molly Weaver; Sarah Wilcox; Man Yu; Francis S Collins; Job Dekker; Jason D Lieb; Thomas D Tullius; Gregory E Crawford; Shamil Sunyaev; William S Noble; Ian Dunham; France Denoeud; Alexandre Reymond; Philipp Kapranov; Joel Rozowsky; Deyou Zheng; Robert Castelo; Adam Frankish; Jennifer Harrow; Srinka Ghosh; Albin Sandelin; Ivo L Hofacker; Robert Baertsch; Damian Keefe; Sujit Dike; Jill Cheng; Heather A Hirsch; Edward A Sekinger; Julien Lagarde; Josep F Abril; Atif Shahab; Christoph Flamm; Claudia Fried; Jörg Hackermüller; Jana Hertel; Manja Lindemeyer; Kristin Missal; Andrea Tanzer; Stefan Washietl; Jan Korbel; Olof Emanuelsson; Jakob S Pedersen; Nancy Holroyd; Ruth Taylor; David Swarbreck; Nicholas Matthews; Mark C Dickson; Daryl J Thomas; Matthew T Weirauch; James Gilbert; Jorg Drenkow; Ian Bell; XiaoDong Zhao; K G Srinivasan; Wing-Kin Sung; Hong Sain Ooi; Kuo Ping Chiu; Sylvain Foissac; Tyler Alioto; Michael Brent; Lior Pachter; Michael L Tress; Alfonso Valencia; Siew Woh Choo; Chiou Yu Choo; Catherine Ucla; Caroline Manzano; Carine Wyss; Evelyn Cheung; Taane G Clark; James B Brown; Madhavan Ganesh; Sandeep Patel; Hari Tammana; Jacqueline Chrast; Charlotte N Henrichsen; Chikatoshi Kai; Jun Kawai; Ugrappa Nagalakshmi; Jiaqian Wu; Zheng Lian; Jin Lian; Peter Newburger; Xueqing Zhang; Peter Bickel; John S Mattick; Piero Carninci; Yoshihide Hayashizaki; Sherman Weissman; Tim Hubbard; Richard M Myers; Jane Rogers; Peter F Stadler; Todd M Lowe; Chia-Lin Wei; Yijun Ruan; Kevin Struhl; Mark Gerstein; Stylianos E Antonarakis; Yutao Fu; Eric D Green; Ulaş Karaöz; Adam Siepel; James Taylor; Laura A Liefer; Kris A Wetterstrand; Peter J Good; Elise A Feingold; Mark S Guyer; Gregory M Cooper; George Asimenos; Colin N Dewey; Minmei Hou; Sergey Nikolaev; Juan I Montoya-Burgos; Ari Löytynoja; Simon Whelan; Fabio Pardi; Tim Massingham; Haiyan Huang; Nancy R Zhang; Ian Holmes; James C Mullikin; Abel Ureta-Vidal; Benedict Paten; Michael Seringhaus; Deanna Church; Kate Rosenbloom; W James Kent; Eric A Stone; Serafim Batzoglou; Nick Goldman; Ross C Hardison; David Haussler; Webb Miller; Arend Sidow; Nathan D Trinklein; Zhengdong D Zhang; Leah Barrera; Rhona Stuart; David C King; Adam Ameur; Stefan Enroth; Mark C Bieda; Jonghwan Kim; Akshay A Bhinge; Nan Jiang; Jun Liu; Fei Yao; Vinsensius B Vega; Charlie W H Lee; Patrick Ng; Atif Shahab; Annie Yang; Zarmik Moqtaderi; Zhou Zhu; Xiaoqin Xu; Sharon Squazzo; Matthew J Oberley; David Inman; Michael A Singer; Todd A Richmond; Kyle J Munn; Alvaro Rada-Iglesias; Ola Wallerman; Jan Komorowski; Joanna C Fowler; Phillippe Couttet; Alexander W Bruce; Oliver M Dovey; Peter D Ellis; Cordelia F Langford; David A Nix; Ghia Euskirchen; Stephen Hartman; Alexander E Urban; Peter Kraus; Sara Van Calcar; Nate Heintzman; Tae Hoon Kim; Kun Wang; Chunxu Qu; Gary Hon; Rosa Luna; Christopher K Glass; M Geoff Rosenfeld; Shelley Force Aldred; Sara J Cooper; Anason Halees; Jane M Lin; Hennady P Shulha; Xiaoling Zhang; Mousheng Xu; Jaafar N S Haidar; Yong Yu; Yijun Ruan; Vishwanath R Iyer; Roland D Green; Claes Wadelius; Peggy J Farnham; Bing Ren; Rachel A Harte; Angie S Hinrichs; Heather Trumbower; Hiram Clawson; Jennifer Hillman-Jackson; Ann S Zweig; Kayla Smith; Archana Thakkapallayil; Galt Barber; Robert M Kuhn; Donna Karolchik; Lluis Armengol; Christine P Bird; Paul I W de Bakker; Andrew D Kern; Nuria Lopez-Bigas; Joel D Martin; Barbara E Stranger; Abigail Woodroffe; Eugene Davydov; Antigone Dimas; Eduardo Eyras; Ingileif B Hallgrímsdóttir; Julian Huppert; Michael C Zody; Gonçalo R Abecasis; Xavier Estivill; Gerard G Bouffard; Xiaobin Guan; Nancy F Hansen; Jacquelyn R Idol; Valerie V B Maduro; Baishali Maskeri; Jennifer C McDowell; Morgan Park; Pamela J Thomas; Alice C Young; Robert W Blakesley; Donna M Muzny; Erica Sodergren; David A Wheeler; Kim C Worley; Huaiyang Jiang; George M Weinstock; Richard A Gibbs; Tina Graves; Robert Fulton; Elaine R Mardis; Richard K Wilson; Michele Clamp; James Cuff; Sante Gnerre; David B Jaffe; Jean L Chang; Kerstin Lindblad-Toh; Eric S Lander; Maxim Koriabine; Mikhail Nefedov; Kazutoyo Osoegawa; Yuko Yoshinaga; Baoli Zhu; Pieter J de Jong
Journal:  Nature       Date:  2007-06-14       Impact factor: 49.962

9.  Most "dark matter" transcripts are associated with known genes.

Authors:  Harm van Bakel; Corey Nislow; Benjamin J Blencowe; Timothy R Hughes
Journal:  PLoS Biol       Date:  2010-05-18       Impact factor: 8.029

10.  Video Q&A: Non-coding RNAs and eukaryotic evolution - a personal view.

Authors:  John Mattick
Journal:  BMC Biol       Date:  2010-07-16       Impact factor: 7.431

View more
  8 in total

Review 1.  Pervasive transcription: illuminating the dark matter of bacterial transcriptomes.

Authors:  Joseph T Wade; David C Grainger
Journal:  Nat Rev Microbiol       Date:  2014-07-28       Impact factor: 60.633

2.  One hundred million adenosine-to-inosine RNA editing sites: hearing through the noise.

Authors:  Randi J Ulbricht; Ronald B Emeson
Journal:  Bioessays       Date:  2014-05-30       Impact factor: 4.345

3.  The reality of pervasive transcription.

Authors:  Michael B Clark; Paulo P Amaral; Felix J Schlesinger; Marcel E Dinger; Ryan J Taft; John L Rinn; Chris P Ponting; Peter F Stadler; Kevin V Morris; Antonin Morillon; Joel S Rozowsky; Mark B Gerstein; Claes Wahlestedt; Yoshihide Hayashizaki; Piero Carninci; Thomas R Gingeras; John S Mattick
Journal:  PLoS Biol       Date:  2011-07-12       Impact factor: 8.029

4.  The noncoding universe.

Authors:  Kester Jarvis; Miranda Robertson
Journal:  BMC Biol       Date:  2011-07-28       Impact factor: 7.431

5.  Extensive evolutionary changes in regulatory element activity during human origins are associated with altered gene expression and positive selection.

Authors:  Yoichiro Shibata; Nathan C Sheffield; Olivier Fedrigo; Courtney C Babbitt; Matthew Wortham; Alok K Tewari; Darin London; Lingyun Song; Bum-Kyu Lee; Vishwanath R Iyer; Stephen C J Parker; Elliott H Margulies; Gregory A Wray; Terrence S Furey; Gregory E Crawford
Journal:  PLoS Genet       Date:  2012-06-28       Impact factor: 5.917

6.  Comparative transcriptomics across the prokaryotic tree of life.

Authors:  Ofir Cohen; Shany Doron; Omri Wurtzel; Daniel Dar; Sarit Edelheit; Iris Karunker; Eran Mick; Rotem Sorek
Journal:  Nucleic Acids Res       Date:  2016-05-06       Impact factor: 16.971

7.  Intronic RNAs constitute the major fraction of the non-coding RNA in mammalian cells.

Authors:  Georges St Laurent; Dmitry Shtokalo; Michael R Tackett; Zhaoqing Yang; Tatyana Eremina; Claes Wahlestedt; Silvio Urcuqui-Inchima; Bernd Seilheimer; Timothy A McCaffrey; Philipp Kapranov
Journal:  BMC Genomics       Date:  2012-09-24       Impact factor: 3.969

8.  Exome RNA sequencing reveals rare and novel alternative transcripts.

Authors:  Jonatan Halvardson; Ammar Zaghlool; Lars Feuk
Journal:  Nucleic Acids Res       Date:  2012-08-31       Impact factor: 16.971

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.