Literature DB >> 26886859

Pseudo-De Novo Assembly and Analysis of Unmapped Genome Sequence Reads in Wild Zebrafish Reveal Novel Gene Content.

Joshua J Faber-Hammond1, Kim H Brown1.   

Abstract

Zebrafish represents the third vertebrate with an officially completed genome, yet it remains incomplete with additions and corrections continuing with the current release, GRCz10, having 13% of zebrafish cDNA sequences unmapped. This disparity may result from population differences, given that the genome reference was generated from clonal individuals with limited genetic diversity. This is supported by the recent analysis of a single wild zebrafish, which identified over 5.2 million SNPs and 1.6 million in/dels in the previous genome build, zv9. Re-examination of this sequence data set indicated that 13.8% of quality sequence reads failed to align to GRCz10. Using a novel bioinformatics de novo assembly pipeline on these unmappable reads, we identified 1,514,491 novel contigs covering ∼224 Mb of genomic sequence. Among these, 1083 contigs were found to contain a potential gene coding sequence. RNA-seq data comparison confirmed that 362 contigs contained a transcribed DNA sequence, suggesting that a large amount of functional genomic sequence remains unannotated in the zebrafish reference genome. By utilizing the bioinformatics pipeline developed in this study, the zebrafish genome will be bolstered as a model for human disease research. Adaptation of the pipeline described here also offers a cost-efficient and effective method to identify and map novel genetic content across any genome and will ultimately aid in the completion of additional genomes for a broad range of species.

Entities:  

Mesh:

Year:  2016        PMID: 26886859      PMCID: PMC4799696          DOI: 10.1089/zeb.2015.1154

Source DB:  PubMed          Journal:  Zebrafish        ISSN: 1545-8547            Impact factor:   1.985


  32 in total

1.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

2.  Sherlock: detecting gene-disease associations by matching patterns of expression QTL and GWAS.

Authors:  Xin He; Chris K Fuller; Yi Song; Qingying Meng; Bin Zhang; Xia Yang; Hao Li
Journal:  Am J Hum Genet       Date:  2013-05-02       Impact factor: 11.025

3.  QTL analysis of behavioral and morphological differentiation between wild and laboratory zebrafish (Danio rerio).

Authors:  Dominic Wright; Reiichiro Nakamichi; Jens Krause; Roger K Butlin
Journal:  Behav Genet       Date:  2006-01-12       Impact factor: 2.805

4.  A sequence-based variation map of zebrafish.

Authors:  Ashok Patowary; Ramya Purkanti; Meghna Singh; Rajendra Chauhan; Angom Ramcharan Singh; Mohit Swarnkar; Naresh Singh; Vikas Pandey; Carlos Torroja; Matthew D Clark; Jean-Pierre Kocher; Karl J Clark; Derek L Stemple; Eric W Klee; Stephen C Ekker; Vinod Scaria; Sridhar Sivasubbu
Journal:  Zebrafish       Date:  2013-03       Impact factor: 1.985

5.  Simplifier: a web tool to eliminate redundant NGS contigs.

Authors:  Rommel Thiago Jucá Ramos; Adriana Ribeiro Carneiro; Vasco Azevedo; Maria Paula Schneider; Debmalya Barh; Artur Silva
Journal:  Bioinformation       Date:  2012-10-13

6.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences.

Authors:  Jeremy Goecks; Anton Nekrutenko; James Taylor
Journal:  Genome Biol       Date:  2010-08-25       Impact factor: 13.583

7.  An SNP-Based Linkage Map for Zebrafish Reveals Sex Determination Loci.

Authors:  Kevin M Bradley; Joan P Breyer; David B Melville; Karl W Broman; Ela W Knapik; Jeffrey R Smith
Journal:  G3 (Bethesda)       Date:  2011-06-01       Impact factor: 3.154

8.  Comparison of RefSeq protein-coding regions in human and vertebrate genomes.

Authors:  Jessica H Fong; Terence D Murphy; Kim D Pruitt
Journal:  BMC Genomics       Date:  2013-09-25       Impact factor: 3.969

9.  The rainbow trout genome provides novel insights into evolution after whole-genome duplication in vertebrates.

Authors:  Camille Berthelot; Frédéric Brunet; Domitille Chalopin; Amélie Juanchich; Maria Bernard; Benjamin Noël; Pascal Bento; Corinne Da Silva; Karine Labadie; Adriana Alberti; Jean-Marc Aury; Alexandra Louis; Patrice Dehais; Philippe Bardou; Jérôme Montfort; Christophe Klopp; Cédric Cabau; Christine Gaspin; Gary H Thorgaard; Mekki Boussaha; Edwige Quillet; René Guyomard; Delphine Galiana; Julien Bobe; Jean-Nicolas Volff; Carine Genêt; Patrick Wincker; Olivier Jaillon; Hugues Roest Crollius; Yann Guiguen
Journal:  Nat Commun       Date:  2014-04-22       Impact factor: 14.919

10.  Comparative analysis of CpG islands in four fish genomes.

Authors:  Leng Han; Zhongming Zhao
Journal:  Comp Funct Genomics       Date:  2008
View more
  7 in total

1.  An Interrogation of Shared and Unique Copy Number Variants Across Genetically Distinct Zebrafish Strains.

Authors:  Lindsay A Holden; Charles Wilson; Zachary Heineman; Kimberly P Dobrinski; Kim H Brown
Journal:  Zebrafish       Date:  2018-11-10       Impact factor: 1.985

2.  CAFU: a Galaxy framework for exploring unmapped RNA-Seq data.

Authors:  Siyuan Chen; Chengzhi Ren; Jingjing Zhai; Jiantao Yu; Xuyang Zhao; Zelong Li; Ting Zhang; Wenlong Ma; Zhaoxue Han; Chuang Ma
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

3.  Anchored pseudo-de novo assembly of human genomes identifies extensive sequence variation from unmapped sequence reads.

Authors:  Joshua J Faber-Hammond; Kim H Brown
Journal:  Hum Genet       Date:  2016-04-09       Impact factor: 4.132

4.  Isolation and characterization of hypoxia inducible gene connective tissue growth factor (CTGF) in Labeo rohita.

Authors:  Iliyas Rashid; Vishwamitra Singh Baisvar; Mahender Singh; Prachi Srivastava; Ravindra Kumar; Basdeo Kushwaha; Ajey Kumar Pathak
Journal:  Mol Biol Rep       Date:  2019-01-28       Impact factor: 2.316

5.  Changes in the gut microbiota during Asian particolored bat (Vespertilio sinensis) development.

Authors:  Zhongwei Yin; Keping Sun; Aoqiang Li; Deyi Sun; Zhongle Li; Guohong Xiao; Jiang Feng
Journal:  PeerJ       Date:  2020-05-12       Impact factor: 2.984

6.  Assembly and Analysis of Unmapped Genome Sequence Reads Reveal Novel Sequence and Variation in Dogs.

Authors:  Lindsay A Holden; Meharji Arumilli; Marjo K Hytönen; Sruthi Hundi; Jarkko Salojärvi; Kim H Brown; Hannes Lohi
Journal:  Sci Rep       Date:  2018-07-18       Impact factor: 4.379

7.  The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats.

Authors:  Robin H van der Weide; Marieke Simonis; Roel Hermsen; Pim Toonen; Edwin Cuppen; Joep de Ligt
Journal:  PLoS One       Date:  2016-08-08       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.