Literature DB >> 21551137

A cautionary note for retrocopy identification: DNA-based duplication of intron-containing genes significantly contributes to the origination of single exon genes.

Yong E Zhang1, Maria D Vibranovski, Benjamin H Krinsky, Manyuan Long.   

Abstract

MOTIVATION: Retrocopies are important genes in the genomes of almost all higher eukaryotes. However, the annotation of such genes is a non-trivial task. Intronless genes have often been considered to be retroposed copies of intron-containing paralogs. Such categorization relies on the implicit premise that alignable regions of the duplicates should be long enough to cover exon-exon junctions of the intron-containing genes, and thus intron loss events can be inferred. Here, we examined the alternative possibility that intronless genes could be generated by partial DNA-based duplication of intron-containing genes in the fruitfly genome.
RESULTS: By building pairwise protein-, transcript- and genome-level DNA alignments between intronless genes and their corresponding intron-containing paralogs, we found that alignments do not cover exon-exon junctions in 40% of cases and thus no intron loss could be inferred. For these cases, the candidate parental proteins tend to be partially duplicated, and intergenic sequences or neighboring genes are included in the intronless paralog. Moreover, we observed that it is significantly less likely for these paralogs to show inter-chromosomal duplication and testis-dominant transcription, compared to the remaining 60% of cases with evidence of clear intron loss (retrogenes). These lines of analysis reveal that DNA-based duplication contributes significantly to the 40% of cases of single exon gene duplication. Finally, we performed an analogous survey in the human genome and the result is similar, wherein 34% of the cases do not cover exon-exon junctions. Thus, genome annotation for retrogene identification should discard candidates without clear evidence of intron loss. CONTACT: mlong@uchicago.edu; zhangy@uchicago.edu

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21551137      PMCID: PMC3117337          DOI: 10.1093/bioinformatics/btr280

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  29 in total

1.  Many G-protein-coupled receptors are encoded by retrogenes.

Authors:  J Brosius
Journal:  Trends Genet       Date:  1999-08       Impact factor: 11.639

2.  The Bioperl toolkit: Perl modules for the life sciences.

Authors:  Jason E Stajich; David Block; Kris Boulez; Steven E Brenner; Stephen A Chervitz; Chris Dagdigian; Georg Fuellen; James G R Gilbert; Ian Korf; Hilmar Lapp; Heikki Lehväslaiho; Chad Matsalla; Chris J Mungall; Brian I Osborne; Matthew R Pocock; Peter Schattner; Martin Senger; Lincoln D Stein; Elia Stupka; Mark D Wilkinson; Ewan Birney
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

Review 3.  Origins, evolution, and phenotypic impact of new genes.

Authors:  Henrik Kaessmann
Journal:  Genome Res       Date:  2010-07-22       Impact factor: 9.043

4.  Not born equal: increased rate asymmetry in relocated and retrotransposed rodent gene duplicates.

Authors:  Brian P Cusack; Kenneth H Wolfe
Journal:  Mol Biol Evol       Date:  2006-12-18       Impact factor: 16.240

5.  Age-dependent chromosomal distribution of male-biased genes in Drosophila.

Authors:  Yong E Zhang; Maria D Vibranovski; Benjamin H Krinsky; Manyuan Long
Journal:  Genome Res       Date:  2010-08-26       Impact factor: 9.043

6.  Chromosomal redistribution of male-biased genes in mammalian evolution with two bursts of gene gain on the X chromosome.

Authors:  Yong E Zhang; Maria D Vibranovski; Patrick Landback; Gabriel A B Marais; Manyuan Long
Journal:  PLoS Biol       Date:  2010-10-05       Impact factor: 8.029

7.  PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments.

Authors:  Mikita Suyama; David Torrents; Peer Bork
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

8.  The UCSC genome browser database: update 2007.

Authors:  R M Kuhn; D Karolchik; A S Zweig; H Trumbower; D J Thomas; A Thakkapallayil; C W Sugnet; M Stanke; K E Smith; A Siepel; K R Rosenbloom; B Rhead; B J Raney; A Pohl; J S Pedersen; F Hsu; A S Hinrichs; R A Harte; M Diekhans; H Clawson; G Bejerano; G P Barber; R Baertsch; D Haussler; W J Kent
Journal:  Nucleic Acids Res       Date:  2006-11-16       Impact factor: 16.971

9.  NATsDB: Natural Antisense Transcripts DataBase.

Authors:  Yong Zhang; Jiongtang Li; Lei Kong; Ge Gao; Qing-Rong Liu; Liping Wei
Journal:  Nucleic Acids Res       Date:  2006-11-01       Impact factor: 16.971

10.  Positive selection for the male functionality of a co-retroposed gene in the hominoids.

Authors:  Yong Zhang; Shujuan Lu; Shuqi Zhao; Xiaofeng Zheng; Manyuan Long; Liping Wei
Journal:  BMC Evol Biol       Date:  2009-10-15       Impact factor: 3.260

View more
  13 in total

1.  The medicinal leech genome encodes 21 innexin genes: different combinations are expressed by identified central neurons.

Authors:  Brandon Kandarian; Jasmine Sethi; Allan Wu; Michael Baker; Neema Yazdani; Eunice Kym; Alejandro Sanchez; Lee Edsall; Terry Gaasterland; Eduardo Macagno
Journal:  Dev Genes Evol       Date:  2012-02-23       Impact factor: 0.900

2.  Evolution of gene structural complexity: an alternative-splicing-based model accounts for intron-containing retrogenes.

Authors:  Chengjun Zhang; Andrea R Gschwend; Yidan Ouyang; Manyuan Long
Journal:  Plant Physiol       Date:  2014-02-11       Impact factor: 8.340

3.  Origin of a novel protein-coding gene family with similar signal sequence in Schistosoma japonicum.

Authors:  Evaristus Chibunna Mbanefo; Yu Chuanxin; Mihoko Kikuchi; Mohammed Nasir Shuaibu; Daniel Boamah; Masashi Kirinoki; Naoko Hayashi; Yuichi Chigusa; Yoshio Osada; Shinjiro Hamano; Kenji Hirayama
Journal:  BMC Genomics       Date:  2012-06-20       Impact factor: 3.969

4.  SinEx DB: a database for single exon coding sequences in mammalian genomes.

Authors:  Roddy Jorquera; Rodrigo Ortiz; F Ossandon; Juan Pablo Cárdenas; Rene Sepúlveda; Carolina González; David S Holmes
Journal:  Database (Oxford)       Date:  2016-06-07       Impact factor: 3.451

5.  The Genomic Impact of Gene Retrocopies: What Have We Learned from Comparative Genomics, Population Genomics, and Transcriptomic Analyses?

Authors:  Claudio Casola; Esther Betrán
Journal:  Genome Biol Evol       Date:  2017-06-01       Impact factor: 3.416

6.  DNA transposons mediate duplications via transposition-independent and -dependent mechanisms in metazoans.

Authors:  Shengjun Tan; Huijing Ma; Jinbo Wang; Man Wang; Mengxia Wang; Haodong Yin; Yaqiong Zhang; Xinying Zhang; Jieyu Shen; Danyang Wang; Graham L Banes; Zhihua Zhang; Jianmin Wu; Xun Huang; Hua Chen; Siqin Ge; Chun-Long Chen; Yong E Zhang
Journal:  Nat Commun       Date:  2021-07-13       Impact factor: 14.919

7.  "Orphan" retrogenes in the human genome.

Authors:  Joanna Ciomborowska; Wojciech Rosikiewicz; Damian Szklarczyk; Wojciech Makałowski; Izabela Makałowska
Journal:  Mol Biol Evol       Date:  2012-10-12       Impact factor: 16.240

8.  Different patterns of gene structure divergence following gene duplication in Arabidopsis.

Authors:  Yupeng Wang; Xu Tan; Andrew H Paterson
Journal:  BMC Genomics       Date:  2013-09-24       Impact factor: 3.969

9.  Evolutionary fate and implications of retrocopies in the African coelacanth genome.

Authors:  Kang Du; Shunping He
Journal:  BMC Genomics       Date:  2015-11-10       Impact factor: 3.969

10.  Gene Duplication Analysis Reveals No Ancient Whole Genome Duplication but Extensive Small-Scale Duplications during Genome Evolution and Adaptation of Schistosoma mansoni.

Authors:  Shuai Wang; Xing-Quan Zhu; Xuepeng Cai
Journal:  Front Cell Infect Microbiol       Date:  2017-09-21       Impact factor: 5.293

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.