Literature DB >> 27587810

High-Quality Genome Assembly and Annotation for Plasmodium coatneyi, Generated Using Single-Molecule Real-Time PacBio Technology.

Jung-Ting Chien1, Suman B Pakala2, Juliana A Geraldo3, Stacey A Lapp1, Jay C Humphrey2, John W Barnwell4, Jessica C Kissinger2, Mary R Galinski5.   

Abstract

Plasmodium coatneyi is a protozoan parasite species that causes simian malaria and is an excellent model for studying disease caused by the human malaria parasite, P. falciparum Here we report the complete (nontelomeric) genome sequence of P. coatneyi Hackeri generated by the application of only Pacific Biosciences RS II (PacBio RS II) single-molecule real-time (SMRT) high-resolution sequence technology and assembly using the Hierarchical Genome Assembly Process (HGAP). This is the first Plasmodium genome sequence reported to use only PacBio technology. This approach has proven to be superior to short-read only approaches for this species.
Copyright © 2016 Chien et al.

Entities:  

Year:  2016        PMID: 27587810      PMCID: PMC5009967          DOI: 10.1128/genomeA.00883-16

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

A Plasmodium genome sequence was published initially in 2002, for P. falciparum (1). Genome sequences for several other primate Plasmodium species have followed (1–6), but none has yet been generated using only PacBio technology (6). Plasmodium coatneyi, which infects Macaca mulatta (rhesus macaques) and serves as a model of P. falciparum (7, 8), was chosen as a test platform for PacBio sequencing. A preliminary draft of the P. coatneyi genome based on short-read (<500 bp) sequence technology is available in the NCBI database (PRJNA233970). Although the overall “big picture” can be gained from this genome assembly, there are over 500 sequence gaps distributed throughout the parasite’s estimated 14 nuclear chromosomes. Like P. falciparum, the P. coatneyi genome has numerous repetitive sequences and complex multi-gene families which present major difficulties that have prohibited nontelomeric genome assembly with closure using only short-read technologies. Gaps prevent reliable gene content analysis, genetics and reference-based gene expression analyses, all of which are critical for understanding Plasmodium and disease progression. We have implemented PacBio (RSSMRT) sequence technology to tackle these issues. Genomic DNA was extracted from ex vivo matured schizont-stage parasites with a Qiagen DNA blood midi kit. The gDNA was further purified with a PowerClean DNA cleanup kit (Mo Bio Laboratories). Five micrograms of gDNA were subsequently used for library preparation. SMRTbell DNA libraries (Pacific Biosciences) were constructed according to the PacBio standard protocol with the BluePippin size-selection system (Sage Science). Sequence was generated on a PacBio RSII instrument using P6-C4 chemistry. Following cleaning, the mean assembled subread length is 5,824 bp; the N50 is 7,257; the total number of bases is 1,792,197,364 and the total number of reads is 257,557. HGAP3 (9) de novo assembly was performed using the Amazon EC2 cloud SMRT portal. The error correction module was defined as minimum subread length of 100 bp, a minimum read quality of 0.80, and a minimum read length of 6,000 bp. Following host (M. mulatta) contig removal, 15 nuclear contigs, one mitochondrial contig, and one apicoplast contig remained (51.42× average coverage). Contig identity and synteny were evaluated via BLASTn and progressive MAUVE algorithms (10) using the P. knowlesi genome from GeneDB (3) as the reference. Two suspected interchromosomal rearrangements occurring within gene family sequences located on Chr4/Chr13 and Chr12/Chr14 could not be validated by PCR, suggesting these sequences may in fact be correct as presented here. De novo gene prediction was performed using SNAP (11) and Augustus (12) for gene calls in the MAKER2 (13) genome annotation tool. The P. vivax and P. knowlesi predicted proteomes were included as evidence. In total, 5,516 protein-encoding genes were predicted, including up to 112 SICAvar genes. The complete annotated mitochondrial and apicoplast genomes are also included in this report. The annotation was validated with P. coatneyi RNA-Seq data, Uniprot, KEGG and OrthoMCL Orthology, and InterProScan5 (14–18). 5,060 genes have strong evidence of synteny.

Accession number(s).

The fourteen chromosome sequences were deposited at NCBI (BioProject PRJNA315987) under accession numbers CP016239 to CP016252 and provided to PlasmoDB (19).
  19 in total

1.  Using native and syntenically mapped cDNA alignments to improve de novo gene finding.

Authors:  Mario Stanke; Mark Diekhans; Robert Baertsch; David Haussler
Journal:  Bioinformatics       Date:  2008-01-24       Impact factor: 6.937

2.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

Authors:  Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach
Journal:  Nat Methods       Date:  2013-05-05       Impact factor: 28.547

3.  A primate model for human cerebral malaria: Plasmodium coatneyi-infected rhesus monkeys.

Authors:  M Aikawa; A Brown; C D Smith; T Tegoshi; R J Howard; T H Hasler; Y Ito; G Perry; W E Collins; K Webster
Journal:  Am J Trop Med Hyg       Date:  1992-04       Impact factor: 2.345

4.  Plasmodium coatneyi in rhesus macaques replicates the multisystemic dysfunction of severe malaria in humans.

Authors:  Alberto Moreno; Monica Cabrera-Mora; Anapatricia Garcia; Jack Orkin; Elizabeth Strobert; John W Barnwell; Mary R Galinski
Journal:  Infect Immun       Date:  2013-03-18       Impact factor: 3.441

5.  Genome sequence of the human malaria parasite Plasmodium falciparum.

Authors:  Malcolm J Gardner; Neil Hall; Eula Fung; Owen White; Matthew Berriman; Richard W Hyman; Jane M Carlton; Arnab Pain; Karen E Nelson; Sharen Bowman; Ian T Paulsen; Keith James; Jonathan A Eisen; Kim Rutherford; Steven L Salzberg; Alister Craig; Sue Kyes; Man-Suen Chan; Vishvanath Nene; Shamira J Shallom; Bernard Suh; Jeremy Peterson; Sam Angiuoli; Mihaela Pertea; Jonathan Allen; Jeremy Selengut; Daniel Haft; Michael W Mather; Akhil B Vaidya; David M A Martin; Alan H Fairlamb; Martin J Fraunholz; David S Roos; Stuart A Ralph; Geoffrey I McFadden; Leda M Cummings; G Mani Subramanian; Chris Mungall; J Craig Venter; Daniel J Carucci; Stephen L Hoffman; Chris Newbold; Ronald W Davis; Claire M Fraser; Bart Barrell
Journal:  Nature       Date:  2002-10-03       Impact factor: 49.962

6.  OrthoMCL: identification of ortholog groups for eukaryotic genomes.

Authors:  Li Li; Christian J Stoeckert; David S Roos
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

7.  OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups.

Authors:  Feng Chen; Aaron J Mackey; Christian J Stoeckert; David S Roos
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

8.  InterProScan 5: genome-scale protein function classification.

Authors:  Philip Jones; David Binns; Hsin-Yu Chang; Matthew Fraser; Weizhong Li; Craig McAnulla; Hamish McWilliam; John Maslen; Alex Mitchell; Gift Nuka; Sebastien Pesseat; Antony F Quinn; Amaia Sangrador-Vegas; Maxim Scheremetjew; Siew-Yit Yong; Rodrigo Lopez; Sarah Hunter
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

9.  KEGG OC: a large-scale automatic construction of taxonomy-based ortholog clusters.

Authors:  Akihiro Nakaya; Toshiaki Katayama; Masumi Itoh; Kazushi Hiranuka; Shuichi Kawashima; Yuki Moriya; Shujiro Okuda; Michihiro Tanaka; Toshiaki Tokimatsu; Yoshihiro Yamanishi; Akiyasu C Yoshizawa; Minoru Kanehisa; Susumu Goto
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

10.  Gene finding in novel genomes.

Authors:  Ian Korf
Journal:  BMC Bioinformatics       Date:  2004-05-14       Impact factor: 3.169

View more
  23 in total

Review 1.  Ape Origins of Human Malaria.

Authors:  Paul M Sharp; Lindsey J Plenderleith; Beatrice H Hahn
Journal:  Annu Rev Microbiol       Date:  2020-09-08       Impact factor: 15.500

2.  Distinct amino acid and lipid perturbations characterize acute versus chronic malaria.

Authors:  Regina Joice Cordy; Rapatbhorn Patrapuvich; Loukia N Lili; Monica Cabrera-Mora; Jung-Ting Chien; Gregory K Tharp; Manoj Khadka; Esmeralda Vs Meyer; Stacey A Lapp; Chester J Joyner; AnaPatricia Garcia; Sophia Banton; ViLinh Tran; Viravarn Luvira; Siriwan Rungin; Teerawat Saeseu; Nattawan Rachaphaew; Suman B Pakala; Jeremy D DeBarry; Jessica C Kissinger; Eric A Ortlund; Steven E Bosinger; John W Barnwell; Dean P Jones; Karan Uppal; Shuzhao Li; Jetsumon Sattabongkot; Alberto Moreno; Mary R Galinski
Journal:  JCI Insight       Date:  2019-05-02

Review 3.  Systems biology of malaria explored with nonhuman primates.

Authors:  Mary R Galinski
Journal:  Malar J       Date:  2022-06-07       Impact factor: 3.469

4.  Plasmodium knowlesi Cytoadhesion Involves SICA Variant Proteins.

Authors:  Mariko S Peterson; Chester J Joyner; Stacey A Lapp; Jessica A Brady; Jennifer S Wood; Monica Cabrera-Mora; Celia L Saney; Luis L Fonseca; Wayne T Cheng; Jianlin Jiang; Stephanie R Soderberg; Mustafa V Nural; Allison Hankus; Deepa Machiah; Ebru Karpuzoglu; Jeremy D DeBarry; Rabindra Tirouvanziam; Jessica C Kissinger; Alberto Moreno; Sanjeev Gumber; Eberhard O Voit; Juan B Gutierrez; Regina Joice Cordy; Mary R Galinski
Journal:  Front Cell Infect Microbiol       Date:  2022-06-23       Impact factor: 6.073

Review 5.  Malaria in pregnancy: the relevance of animal models for vaccine development.

Authors:  Justin Doritchamou; Andrew Teo; Michal Fried; Patrick E Duffy
Journal:  Lab Anim (NY)       Date:  2017-10-06       Impact factor: 12.625

Review 6.  Plasmodium knowlesi: a superb in vivo nonhuman primate model of antigenic variation in malaria.

Authors:  M R Galinski; S A Lapp; M S Peterson; F Ay; C J Joyner; K G LE Roch; L L Fonseca; E O Voit
Journal:  Parasitology       Date:  2017-07-17       Impact factor: 3.234

7.  An improved Plasmodium cynomolgi genome assembly reveals an unexpected methyltransferase gene expansion.

Authors:  Erica M Pasini; Ulrike Böhme; Gavin G Rutledge; Annemarie Voorberg-Van der Wel; Mandy Sanders; Matt Berriman; Clemens Hm Kocken; Thomas Dan Otto
Journal:  Wellcome Open Res       Date:  2017-06-16

8.  New var reconstruction algorithm exposes high var sequence diversity in a single geographic location in Mali.

Authors:  Antoine Dara; Elliott F Drábek; Mark A Travassos; Kara A Moser; Arthur L Delcher; Qi Su; Timothy Hostelley; Drissa Coulibaly; Modibo Daou; Ahmadou Dembele; Issa Diarra; Abdoulaye K Kone; Bourema Kouriba; Matthew B Laurens; Amadou Niangaly; Karim Traore; Youssouf Tolo; Claire M Fraser; Mahamadou A Thera; Abdoulaye A Djimde; Ogobara K Doumbo; Christopher V Plowe; Joana C Silva
Journal:  Genome Med       Date:  2017-03-28       Impact factor: 11.117

Review 9.  Malaria in the 'Omics Era'.

Authors:  Mirko Pegoraro; Gareth D Weedall
Journal:  Genes (Basel)       Date:  2021-05-30       Impact factor: 4.096

10.  PacBio assembly of a Plasmodium knowlesi genome sequence with Hi-C correction and manual annotation of the SICAvar gene family.

Authors:  S A Lapp; J A Geraldo; J-T Chien; F Ay; S B Pakala; G Batugedara; J Humphrey; J D DeBARRY; K G Le Roch; M R Galinski; J C Kissinger
Journal:  Parasitology       Date:  2017-07-19       Impact factor: 3.234

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.