Literature DB >> 22678431

A post-assembly genome-improvement toolkit (PAGIT) to obtain annotated genomes from contigs.

Martin T Swain1, Isheng J Tsai, Samual A Assefa, Chris Newbold, Matthew Berriman, Thomas D Otto.   

Abstract

Genome projects now produce draft assemblies within weeks owing to advanced high-throughput sequencing technologies. For milestone projects such as Escherichia coli or Homo sapiens, teams of scientists were employed to manually curate and finish these genomes to a high standard. Nowadays, this is not feasible for most projects, and the quality of genomes is generally of a much lower standard. This protocol describes software (PAGIT) that is used to improve the quality of draft genomes. It offers flexible functionality to close gaps in scaffolds, correct base errors in the consensus sequence and exploit reference genomes (if available) in order to improve scaffolding and generating annotations. The protocol is most accessible for bacterial and small eukaryotic genomes (up to 300 Mb), such as pathogenic bacteria, malaria and parasitic worms. Applying PAGIT to an E. coli assembly takes ∼24 h: it doubles the average contig size and annotates over 4,300 gene models.

Entities:  

Mesh:

Year:  2012        PMID: 22678431      PMCID: PMC3648784          DOI: 10.1038/nprot.2012.068

Source DB:  PubMed          Journal:  Nat Protoc        ISSN: 1750-2799            Impact factor:   13.491


  50 in total

1.  A System for Automated Bacterial (genome) Integrated Annotation--SABIA.

Authors:  Luiz G P Almeida; Roger Paixão; Rangel C Souza; Gisele C da Costa; Frank J A Barrientos; M Trindade dos Santos; Darcy F de Almeida; Ana Tereza R Vasconcelos
Journal:  Bioinformatics       Date:  2004-04-15       Impact factor: 6.937

2.  Enhancements and modifications of primer design program Primer3.

Authors:  Triinu Koressaar; Maido Remm
Journal:  Bioinformatics       Date:  2007-03-22       Impact factor: 6.937

Review 3.  Steady progress and recent breakthroughs in the accuracy of automated genome annotation.

Authors:  Michael R Brent
Journal:  Nat Rev Genet       Date:  2008-01       Impact factor: 53.242

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

Review 5.  The impact of next-generation sequencing on genomics.

Authors:  Jun Zhang; Rod Chiodini; Ahmed Badr; Genfa Zhang
Journal:  J Genet Genomics       Date:  2011-03-15       Impact factor: 4.275

6.  The sequence and de novo assembly of the giant panda genome.

Authors:  Ruiqiang Li; Wei Fan; Geng Tian; Hongmei Zhu; Lin He; Jing Cai; Quanfei Huang; Qingle Cai; Bo Li; Yinqi Bai; Zhihe Zhang; Yaping Zhang; Wen Wang; Jun Li; Fuwen Wei; Heng Li; Min Jian; Jianwen Li; Zhaolei Zhang; Rasmus Nielsen; Dawei Li; Wanjun Gu; Zhentao Yang; Zhaoling Xuan; Oliver A Ryder; Frederick Chi-Ching Leung; Yan Zhou; Jianjun Cao; Xiao Sun; Yonggui Fu; Xiaodong Fang; Xiaosen Guo; Bo Wang; Rong Hou; Fujun Shen; Bo Mu; Peixiang Ni; Runmao Lin; Wubin Qian; Guodong Wang; Chang Yu; Wenhui Nie; Jinhuan Wang; Zhigang Wu; Huiqing Liang; Jiumeng Min; Qi Wu; Shifeng Cheng; Jue Ruan; Mingwei Wang; Zhongbin Shi; Ming Wen; Binghang Liu; Xiaoli Ren; Huisong Zheng; Dong Dong; Kathleen Cook; Gao Shan; Hao Zhang; Carolin Kosiol; Xueying Xie; Zuhong Lu; Hancheng Zheng; Yingrui Li; Cynthia C Steiner; Tommy Tsan-Yuk Lam; Siyuan Lin; Qinghui Zhang; Guoqing Li; Jing Tian; Timing Gong; Hongde Liu; Dejin Zhang; Lin Fang; Chen Ye; Juanbin Zhang; Wenbo Hu; Anlong Xu; Yuanyuan Ren; Guojie Zhang; Michael W Bruford; Qibin Li; Lijia Ma; Yiran Guo; Na An; Yujie Hu; Yang Zheng; Yongyong Shi; Zhiqiang Li; Qing Liu; Yanling Chen; Jing Zhao; Ning Qu; Shancen Zhao; Feng Tian; Xiaoling Wang; Haiyin Wang; Lizhi Xu; Xiao Liu; Tomas Vinar; Yajun Wang; Tak-Wah Lam; Siu-Ming Yiu; Shiping Liu; Hemin Zhang; Desheng Li; Yan Huang; Xia Wang; Guohua Yang; Zhi Jiang; Junyi Wang; Nan Qin; Li Li; Jingxiang Li; Lars Bolund; Karsten Kristiansen; Gane Ka-Shu Wong; Maynard Olson; Xiuqing Zhang; Songgang Li; Huanming Yang; Jian Wang; Jun Wang
Journal:  Nature       Date:  2009-12-13       Impact factor: 49.962

7.  Iterative Correction of Reference Nucleotides (iCORN) using second generation sequencing technology.

Authors:  Thomas D Otto; Mandy Sanders; Matthew Berriman; Chris Newbold
Journal:  Bioinformatics       Date:  2010-06-18       Impact factor: 6.937

8.  Genomic insights into the origin of parasitism in the emerging plant pathogen Bursaphelenchus xylophilus.

Authors:  Taisei Kikuchi; James A Cotton; Jonathan J Dalzell; Koichi Hasegawa; Natsumi Kanzaki; Paul McVeigh; Takuma Takanashi; Isheng J Tsai; Samuel A Assefa; Peter J A Cock; Thomas Dan Otto; Martin Hunt; Adam J Reid; Alejandro Sanchez-Flores; Kazuko Tsuchihara; Toshiro Yokoi; Mattias C Larsson; Johji Miwa; Aaron G Maule; Norio Sahashi; John T Jones; Matthew Berriman
Journal:  PLoS Pathog       Date:  2011-09-01       Impact factor: 6.823

9.  A systematically improved high quality genome and transcriptome of the human blood fluke Schistosoma mansoni.

Authors:  Anna V Protasio; Isheng J Tsai; Anne Babbage; Sarah Nichol; Martin Hunt; Martin A Aslett; Nishadi De Silva; Giles S Velarde; Tim J C Anderson; Richard C Clark; Claire Davidson; Gary P Dillon; Nancy E Holroyd; Philip T LoVerde; Christine Lloyd; Jacquelline McQuillan; Guilherme Oliveira; Thomas D Otto; Sophia J Parker-Manuel; Michael A Quail; R Alan Wilson; Adhemar Zerlotini; David W Dunne; Matthew Berriman
Journal:  PLoS Negl Trop Dis       Date:  2012-01-10

10.  r2cat: synteny plots and comparative assembly.

Authors:  Peter Husemann; Jens Stoye
Journal:  Bioinformatics       Date:  2009-12-16       Impact factor: 6.937

View more
  120 in total

1.  Consed: a graphical editor for next-generation sequencing.

Authors:  David Gordon; Phil Green
Journal:  Bioinformatics       Date:  2013-08-31       Impact factor: 6.937

2.  Investigation of intra-herd spread of Mycobacterium caprae in cattle by generation and use of a whole-genome sequence.

Authors:  S Broeckl; S Krebs; A Varadharajan; R K Straubinger; H Blum; M Buettner
Journal:  Vet Res Commun       Date:  2017-02-13       Impact factor: 2.459

3.  Novel genetic code and record-setting AT-richness in the highly reduced plastid genome of the holoparasitic plant Balanophora.

Authors:  Huei-Jiun Su; Todd J Barkman; Weilong Hao; Samuel S Jones; Julia Naumann; Elizabeth Skippington; Eric K Wafula; Jer-Ming Hu; Jeffrey D Palmer; Claude W dePamphilis
Journal:  Proc Natl Acad Sci U S A       Date:  2018-12-31       Impact factor: 11.205

4.  Isolation and characterization of a crude oil degrading bacteria from formation water: comparative genomic analysis of environmental Ochrobactrum intermedium isolate versus clinical strains.

Authors:  Lu-jun Chai; Xia-wei Jiang; Fan Zhang; Bei-wen Zheng; Fu-chang Shu; Zheng-liang Wang; Qing-feng Cui; Han-ping Dong; Zhong-zhi Zhang; Du-jie Hou; Yue-hui She
Journal:  J Zhejiang Univ Sci B       Date:  2015-10       Impact factor: 3.066

5.  A rural worker infected with a bovine-prevalent genotype of Campylobacter fetus subsp. fetus supports zoonotic transmission and inconsistency of MLST and whole-genome typing.

Authors:  G Iraola; L Betancor; L Calleros; P Gadea; G Algorta; S Galeano; P Muxi; G Greif; R Pérez
Journal:  Eur J Clin Microbiol Infect Dis       Date:  2015-04-29       Impact factor: 3.267

6.  Comparative genomics reveals diversified CRISPR-Cas systems of globally distributed Microcystis aeruginosa, a freshwater bloom-forming cyanobacterium.

Authors:  Chen Yang; Feibi Lin; Qi Li; Tao Li; Jindong Zhao
Journal:  Front Microbiol       Date:  2015-05-12       Impact factor: 5.640

7.  Gene Loss and Lineage-Specific Restriction-Modification Systems Associated with Niche Differentiation in the Campylobacter jejuni Sequence Type 403 Clonal Complex.

Authors:  Laura Morley; Alan McNally; Konrad Paszkiewicz; Jukka Corander; Guillaume Méric; Samuel K Sheppard; Jochen Blom; Georgina Manning
Journal:  Appl Environ Microbiol       Date:  2015-03-20       Impact factor: 4.792

8.  Complete genome sequence of Mycobacterium vaccae type strain ATCC 25954.

Authors:  Yung S Ho; Sabir A Adroub; Maram Abadi; Bader Al Alwan; Reham Alkhateeb; Ge Gao; Alaa Ragab; Shahjahan Ali; Dick van Soolingen; Wilbert Bitter; Arnab Pain; Abdallah M Abdallah
Journal:  J Bacteriol       Date:  2012-11       Impact factor: 3.490

9.  Complete genome sequence of Mycobacterium fortuitum subsp. fortuitum type strain DSM46621.

Authors:  Yung S Ho; Sabir A Adroub; Fajr Aleisa; Hanan Mahmood; Ghofran Othoum; Fahad Rashid; Manal Zaher; Shahjahan Ali; Wilbert Bitter; Arnab Pain; Abdallah M Abdallah
Journal:  J Bacteriol       Date:  2012-11       Impact factor: 3.490

10.  blaNDM-5 carried by an IncX3 plasmid in Escherichia coli sequence type 167.

Authors:  Ping Yang; Yi Xie; Ping Feng; Zhiyong Zong
Journal:  Antimicrob Agents Chemother       Date:  2014-09-22       Impact factor: 5.191

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.