Literature DB >> 26021920

Draft genome sequences of 24 microbial strains assembled from direct sequencing from 4 stool samples.

Patricio Jeraldo, Álvaro Hernández1, Bryan A White, Daniel O'Brien2, David Ahlquist3, Lisa Boardman3, Nicholas Chia4.   

Abstract

The ability to assemble genomes from metagenomic sequencing avoids the need for culture and any associated culture biases. We assembled 24 essentially complete draft genomes from metagenomic pair-end and size-selected mate pair sequencing from 4 stool samples, 2 from subjects diagnosed with colorectal cancer and 2 from healthy controls.
Copyright © 2015 Jeraldo et al.

Entities:  

Year:  2015        PMID: 26021920      PMCID: PMC4447905          DOI: 10.1128/genomeA.00526-15

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Colorectal cancer is common, with 136,830 new cancers annually, and lethal, with 51,310 cancer deaths each year (1), and has plausible connections to several microbial agents. There are multiple ongoing efforts to identify microbes that cause cancer and find ways to quantify that risk by using a combination of modern computational and sequencing techniques in the hopes of eventually attenuating that risk by manipulating the gut microbiome using antibiotics, probiotics, or prebiotics. This would radically change the way colon cancer is treated. Subjects were consented under IRB number 10-006009, which was reviewed and approved by the Mayo Clinic Institutional Review Board. Written consent was provided by all subjects. Genomic DNA was extracted from 4 stool samples collected from subjects prior to colonoscopy. Two were from subjects who were diagnosed with colon cancer, and two were from subjects with a negative colonoscopy. Paired-end and Nextera mate pair DNA libraries from each sample were prepared with target fragment lengths of 500 bp for the paired-end library, and 4 to 6 kb, 8 to 10 kb and 15 to 20 kb for the mate pair libraries. A total of 9,195,789,522 reads were produced on Illumina HiSeq 2500 runs for 300 cycles. From this, for the reference-based binning and assembly, we iteratively mapped the reads onto the MetaHIT metagenomic species references (2) using BWA 0.7.10 (3), assembled each bin using SPAdes 3.5.0 (4), and removed extraneous contigs using BioBloom Tools 2.0.6 (5). For the de novo binning and assembly, we assembled the whole metagenome using Ray 2.3.1 (6), for a total of 2.1 Gbp assembled into 1,748,414 contigs, with an N50 of 40,874 bp. Contigs were binned using differential coverage, taxonomy, G+C content, and tetranucleotide frequency (7), unassembled reads were mapped onto these bins using BWA 0.7.10, and each bin was reassembled using SPAdes 3.5.0. From these 353 genomic bins, we filtered for essentially complete, high-fidelity genomes satisfying all the following criteria. Genomes must have at least 100 of a set of essential genes (7), have single copies of a set of 24 single-copy genes (8) allowing for two extra copies, having at least cone copy of the 16S rRNA gene longer than 1,300 bp, and having at most 12 copies of complete or partial 16S rRNA genes. Genome completeness was estimated using CheckM 0.9.7 (9) This resulted in 24 essentially complete genomes, listed in Table 1.
TABLE 1 

List of accession numbers for genomes submitted under this announcement

NomenclatureBioSampleAccession no.Total no. of basesN50Estimated completeness (%)
Ruminococcus sp. A254.MGS-108SAMN03449993LAQZ000000001,942,85761,18897.32
Clostridium sp. A254.MGS-251SAMN03449994LARA000000001,983,93278,17998.66
Burkholderia sp. K4410.MGS-135SAMN03449996LARC000000002,654,41549,325100
Clostridium sp. K4410.MGS-306SAMN03449997LARD000000002,054,369124,39289.74
Odoribacter sp. N15.MGS-14SAMN03449998LARE000000003,441,5644238698.83
Ruminococcus sp. N15.MGS-57SAMN03450004LARF000000002,359,76245,12297.95
Alistipes sp. N15.MGS-157SAMN03449999LARG000000002,171,493171,79899.52
Carnobacterium sp. N15.MGS-207SAMN03450000LARH000000002,235,05671,77799.98
Carnobacterium sp. N15.MGS-251SAMN03450001LARI000000001,977,841102,35898.66
Lactobacillus sp. N15.MGS-260SAMN03450002LARJ000000001,820,85858,04394.68
Heliobacterium sp. N15.MGS-306SAMN03450003LARK000000002,061,532121,89489.74
Odoribacter sp. N54.MGS-14SAMN03450006LARL000000003,456,95138,25298.57
Bacteroides sp. N54.MGS-20SAMN03450007LARM000000002,979,72574,27599.96
Acinetobacter sp. N54.MGS-139SAMN03450005LARN000000002,171,67641,165100
Lactobacillus sp. N54.MGS-719SAMN03450008LARO000000001,821,79036,46698.72
Akkermansia sp. UNK.MGS-1SAMN03456685LBCB000000002,770,805190,92492.12
Clostridium sp. UNK.MGS-6SAMN03456689LBCC000000002,411,49719,22687.94
Odoribacter sp. UNK.MGS-12SAMN03456680LBCD000000005,872,33891,53295.72
Bacteroides sp. UNK.MGS-14SAMN03456682LBCF000000005,132,637250,31498.75
Roseburia sp. UNK.MGS-15SAMN03456683LBCG000000004,890,836278,60493.1
Rhodospirillum sp. UNK.MGS-17SAMN03456684LBCH000000001,897,358219,44194.68
Eubacterium sp. UNK.MGS-25SAMN03456686LBCI000000004,343,513459,23797.96
Eubacterium sp. UNK.MGS-26SAMN03456687LBCJ000000004,451,1403,090,75897.96
Ruminococcus sp. UNK.MGS-30SAMN03456688LBCK000000002,959,500615,13887.25
List of accession numbers for genomes submitted under this announcement

Nucleotide sequence accession numbers.

These complete genome sequences have been deposited at DDBJ/EMBL/GenBank under the accession numbers provided in Table 1.
  9 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Colorectal cancer statistics, 2014.

Authors:  Rebecca Siegel; Carol Desantis; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2014-03-17       Impact factor: 508.702

3.  Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes.

Authors:  H Bjørn Nielsen; Mathieu Almeida; Agnieszka Sierakowska Juncker; Simon Rasmussen; Junhua Li; Shinichi Sunagawa; Damian R Plichta; Laurent Gautier; Anders G Pedersen; Emmanuelle Le Chatelier; Eric Pelletier; Ida Bonde; Trine Nielsen; Chaysavanh Manichanh; Manimozhiyan Arumugam; Jean-Michel Batto; Marcelo B Quintanilha Dos Santos; Nikolaj Blom; Natalia Borruel; Kristoffer S Burgdorf; Fouad Boumezbeur; Francesc Casellas; Joël Doré; Piotr Dworzynski; Francisco Guarner; Torben Hansen; Falk Hildebrand; Rolf S Kaas; Sean Kennedy; Karsten Kristiansen; Jens Roat Kultima; Pierre Léonard; Florence Levenez; Ole Lund; Bouziane Moumen; Denis Le Paslier; Nicolas Pons; Oluf Pedersen; Edi Prifti; Junjie Qin; Jeroen Raes; Søren Sørensen; Julien Tap; Sebastian Tims; David W Ussery; Takuji Yamada; Pierre Renault; Thomas Sicheritz-Ponten; Peer Bork; Jun Wang; Søren Brunak; S Dusko Ehrlich
Journal:  Nat Biotechnol       Date:  2014-07-06       Impact factor: 54.908

4.  Ray Meta: scalable de novo metagenome assembly and profiling.

Authors:  Sébastien Boisvert; Frédéric Raymond; Elénie Godzaridis; François Laviolette; Jacques Corbeil
Journal:  Genome Biol       Date:  2012-12-22       Impact factor: 13.583

5.  Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes.

Authors:  Mads Albertsen; Philip Hugenholtz; Adam Skarshewski; Kåre L Nielsen; Gene W Tyson; Per H Nielsen
Journal:  Nat Biotechnol       Date:  2013-05-26       Impact factor: 54.908

6.  CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

Authors:  Donovan H Parks; Michael Imelfort; Connor T Skennerton; Philip Hugenholtz; Gene W Tyson
Journal:  Genome Res       Date:  2015-05-14       Impact factor: 9.043

7.  BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters.

Authors:  Justin Chu; Sara Sadeghi; Anthony Raymond; Shaun D Jackman; Ka Ming Nip; Richard Mar; Hamid Mohamadi; Yaron S Butterfield; A Gordon Robertson; Inanç Birol
Journal:  Bioinformatics       Date:  2014-08-20       Impact factor: 6.937

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.

Authors:  Jenna Morgan Lang; Aaron E Darling; Jonathan A Eisen
Journal:  PLoS One       Date:  2013-04-25       Impact factor: 3.240

  9 in total
  3 in total

1.  Metabolic network-guided binning of metagenomic sequence fragments.

Authors:  Matthew B Biggs; Jason A Papin
Journal:  Bioinformatics       Date:  2015-11-14       Impact factor: 6.937

2.  Capturing One of the Human Gut Microbiome's Most Wanted: Reconstructing the Genome of a Novel Butyrate-Producing, Clostridial Scavenger from Metagenomic Sequence Data.

Authors:  Patricio Jeraldo; Alvaro Hernandez; Henrik B Nielsen; Xianfeng Chen; Bryan A White; Nigel Goldenfeld; Heidi Nelson; David Alhquist; Lisa Boardman; Nicholas Chia
Journal:  Front Microbiol       Date:  2016-05-26       Impact factor: 5.640

Review 3.  Metabolic modeling with Big Data and the gut microbiome.

Authors:  Jaeyun Sung; Vanessa Hale; Annette C Merkel; Pan-Jun Kim; Nicholas Chia
Journal:  Appl Transl Genom       Date:  2016-02-05
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.