Literature DB >> 29439051

Complete Genome Sequence of Industrial Dairy Strain Streptococcus thermophilus DGCC 7710.

E Anne Hatmaker1, Lauren A Riley1,2, Kaela B O'Dell1, Beth Papanek1, Brenton R Graveley3, Sandra C Garrett3, Yunzhou Wei4, Michael P Terns4,5,6, Adam M Guss7,2.   

Abstract

We report here the complete genome sequence of Streptococcus thermophilus DGCC 7710. S. thermophilus is widely used in industrial dairy production.
Copyright © 2018 Hatmaker et al.

Entities:  

Year:  2018        PMID: 29439051      PMCID: PMC5805889          DOI: 10.1128/genomeA.01587-17

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Streptococcus thermophilus is an economically important Gram-positive bacterium often used in starter cultures for common dairy products (1). The species has undergone extensive reductive evolution, including the loss of virulence genes and genes related to carbohydrate metabolism. Losing these genes allowed for adaptation to the milk and yogurt environment, for example, through the inclusion of exopolysaccharide (EPS) and lactose fermentation operons (2). Genomic comparisons are necessary to understand such adaptive evolution and to explore other features of interest, including defense mechanisms. The S. thermophilus strain DGCC 7710 is of particular interest due to its role in the discovery of clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems as a bacterial immune system (3, 4), yet to date, no complete genome sequence is available. S. thermophilus DGCC 7710 was grown in M17 medium with 10% lactose. High-molecular-weight genomic DNA was isolated using the Genomic-tip 100G kit (Qiagen, Valencia, CA, USA) and sequenced by the Department of Energy’s Joint Genome Institute using single-molecule real-time (SMRT) sequencing (5) on a PacBio RS II system (Pacific Biosciences, Menlo Park, CA). An additional short-read library was prepared using the TruSeq DNA PCR-free kit (Illumina, San Diego, CA, USA). The average size of the library DNA was determined using a TapeStation 2200 instrument (Agilent Technologies, Santa Clara, CA, USA). DNA concentration was measured with a Qubit 2.0 fluorometer (Invitrogen, Carlsbad, CA, USA) and adjusted according to Illumina specifications. Libraries were sequenced on an Illumina MiSeq platform with a 160-bp single-end protocol. In total, 749,014 RS II and 2,859,266 MiSeq reads were generated for S. thermophilus DGCC 7710. De novo assembly was completed using SPAdes (6), with the Illumina MiSeq single-end reads as the short-read input and the PacBio filtered subreads input as an additional library. Scaffolds under 500 bp with low coverage were removed. The coverage of the remaining six scaffolds was over 120×. We used the complete genome of S. thermophilus MN-BM-A02 (GenBank accession number CP010999) as the template for genome organization. Gaps between the five larger scaffolds were filled using the final scaffold containing the ribosomal operon, which is repeated several times throughout most microbial genomes. The complete S. thermophilus DGCC 7710 genome is 1,851,207 bp in length and consists of a single circular chromosome with 39% GC content. Annotation using the NCBI Prokaryotic Genome Annotation Pipeline (7) revealed 1,887 protein-coding genes and 75 RNA genes. The genome contains 56 tRNAs, 4 noncoding RNAs (ncRNAs), and 15 rRNA genes. Of the 1,887 protein-coding genes, 230 are annotated as pseudogenes, which is not unusual in S. thermophilus strains, such as ND07 (GenBank accession number CP016394), which has 236 pseudogenes, and ACA-DC 2 (GenBank accession number LT604076), which has 217 pseudogenes. Compared to other sequenced S. thermophilus strains, DGCC 7710 is of similar size and contains a comparable number of genes. Our complete assembly will allow researchers to explore the genomic diversity of S. thermophilus in more depth, impacting basic and applied research on a species relevant to the dairy industry.

Accession number(s).

The complete genome sequence of Streptococcus thermophilus DGCC 7710 has been deposited in GenBank under accession number CP025216.
  7 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  CRISPR provides acquired resistance against viruses in prokaryotes.

Authors:  Rodolphe Barrangou; Christophe Fremaux; Hélène Deveau; Melissa Richards; Patrick Boyaval; Sylvain Moineau; Dennis A Romero; Philippe Horvath
Journal:  Science       Date:  2007-03-23       Impact factor: 47.728

3.  Diversity, activity, and evolution of CRISPR loci in Streptococcus thermophilus.

Authors:  Philippe Horvath; Dennis A Romero; Anne-Claire Coûté-Monvoisin; Melissa Richards; Hélène Deveau; Sylvain Moineau; Patrick Boyaval; Christophe Fremaux; Rodolphe Barrangou
Journal:  J Bacteriol       Date:  2007-12-07       Impact factor: 3.490

4.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

5.  The complete genome sequence of the yogurt isolate Streptococcus thermophilus ACA-DC 2.

Authors:  Voula Alexandraki; Maria Kazou; Jochen Blom; Bruno Pot; Effie Tsakalidou; Konstantinos Papadimitriou
Journal:  Stand Genomic Sci       Date:  2017-01-31

6.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

7.  Complete sequence and comparative genome analysis of the dairy bacterium Streptococcus thermophilus.

Authors:  Alexander Bolotin; Benoît Quinquis; Pierre Renault; Alexei Sorokin; S Dusko Ehrlich; Saulius Kulakauskas; Alla Lapidus; Eugene Goltsman; Michael Mazur; Gordon D Pusch; Michael Fonstein; Ross Overbeek; Nikos Kyprides; Bénédicte Purnelle; Deborah Prozzi; Katrina Ngui; David Masuy; Frédéric Hancy; Sophie Burteau; Marc Boutry; Jean Delcour; André Goffeau; Pascal Hols
Journal:  Nat Biotechnol       Date:  2004-11-14       Impact factor: 54.908

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.