Literature DB >> 33503026

One is not enough: On the effects of reference genome for the mapping and subsequent analyses of short-reads.

Carlos Valiente-Mullor1, Beatriz Beamud1, Iván Ansari1, Carlos Francés-Cuesta1, Neris García-González1, Lorena Mejía1,2, Paula Ruiz-Hueso1, Fernando González-Candelas1,3.   

Abstract

Mapping of high-throughput sequencing (HTS) reads to a single arbitrary reference genome is a frequently used approach in microbial genomics. However, the choice of a reference may represent a source of errors that may affect subsequent analyses such as the detection of single nucleotide polymorphisms (SNPs) and phylogenetic inference. In this work, we evaluated the effect of reference choice on short-read sequence data from five clinically and epidemiologically relevant bacteria (Klebsiella pneumoniae, Legionella pneumophila, Neisseria gonorrhoeae, Pseudomonas aeruginosa and Serratia marcescens). Publicly available whole-genome assemblies encompassing the genomic diversity of these species were selected as reference sequences, and read alignment statistics, SNP calling, recombination rates, dN/dS ratios, and phylogenetic trees were evaluated depending on the mapping reference. The choice of different reference genomes proved to have an impact on almost all the parameters considered in the five species. In addition, these biases had potential epidemiological implications such as including/excluding isolates of particular clades and the estimation of genetic distances. These findings suggest that the single reference approach might introduce systematic errors during mapping that affect subsequent analyses, particularly for data sets with isolates from genetically diverse backgrounds. In any case, exploring the effects of different references on the final conclusions is highly recommended.

Entities:  

Mesh:

Year:  2021        PMID: 33503026      PMCID: PMC7870062          DOI: 10.1371/journal.pcbi.1008678

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


  89 in total

1.  PAML 4: phylogenetic analysis by maximum likelihood.

Authors:  Ziheng Yang
Journal:  Mol Biol Evol       Date:  2007-05-04       Impact factor: 16.240

Review 2.  Comparative genomics: the bacterial pan-genome.

Authors:  Hervé Tettelin; David Riley; Ciro Cattuto; Duccio Medini
Journal:  Curr Opin Microbiol       Date:  2008-10       Impact factor: 7.934

3.  progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement.

Authors:  Aaron E Darling; Bob Mau; Nicole T Perna
Journal:  PLoS One       Date:  2010-06-25       Impact factor: 3.240

4.  Proteinortho: detection of (co-)orthologs in large-scale analysis.

Authors:  Marcus Lechner; Sven Findeiss; Lydia Steiner; Manja Marz; Peter F Stadler; Sonja J Prohaska
Journal:  BMC Bioinformatics       Date:  2011-04-28       Impact factor: 3.169

5.  Whole-genome analysis of diverse Chlamydia trachomatis strains identifies phylogenetic relationships masked by current clinical typing.

Authors:  Simon R Harris; Ian N Clarke; Helena M B Seth-Smith; Anthony W Solomon; Lesley T Cutcliffe; Peter Marsh; Rachel J Skilton; Martin J Holland; David Mabey; Rosanna W Peeling; David A Lewis; Brian G Spratt; Magnus Unemo; Kenneth Persson; Carina Bjartling; Robert Brunham; Henry J C de Vries; Servaas A Morré; Arjen Speksnijder; Cécile M Bébéar; Maïté Clerc; Bertille de Barbeyrac; Julian Parkhill; Nicholas R Thomson
Journal:  Nat Genet       Date:  2012-03-11       Impact factor: 38.330

6.  Choice of reference sequence and assembler for alignment of Listeria monocytogenes short-read sequence data greatly influences rates of error in SNP analyses.

Authors:  Arthur W Pightling; Nicholas Petronella; Franco Pagotto
Journal:  PLoS One       Date:  2014-08-21       Impact factor: 3.240

Review 7.  Dissemination of Antimicrobial Resistance in Microbial Ecosystems through Horizontal Gene Transfer.

Authors:  Christian J H von Wintersdorff; John Penders; Julius M van Niekerk; Nathan D Mills; Snehali Majumder; Lieke B van Alphen; Paul H M Savelkoul; Petra F G Wolffs
Journal:  Front Microbiol       Date:  2016-02-19       Impact factor: 5.640

8.  Interactive Tree Of Life (iTOL) v4: recent updates and new developments.

Authors:  Ivica Letunic; Peer Bork
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

9.  Previously undetected super-spreading of Mycobacterium tuberculosis revealed by deep sequencing.

Authors:  Robyn S Lee; Jean-François Proulx; Fiona McIntosh; Marcel A Behr; William P Hanage
Journal:  Elife       Date:  2020-02-04       Impact factor: 8.140

10.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

View more
  7 in total

1.  Evaluation of within-host evolution of methicillin-resistant Staphylococcus aureus (MRSA) by comparing cgMLST and SNP analysis approaches.

Authors:  Amaya Campillay Lagos; Martin Sundqvist; Fredrik Dyrkell; Marc Stegger; Bo Söderquist; Paula Mölling
Journal:  Sci Rep       Date:  2022-06-22       Impact factor: 4.996

Review 2.  Techniques in bacterial strain typing: past, present, and future.

Authors:  Shelby R Simar; Blake M Hanson; Cesar A Arias
Journal:  Curr Opin Infect Dis       Date:  2021-08-01       Impact factor: 4.968

3.  Evolutionary Processes in the Emergence and Recent Spread of the Syphilis Agent, Treponema pallidum.

Authors:  Marta Pla-Díaz; Leonor Sánchez-Busó; Lorenzo Giacani; David Šmajs; Philipp P Bosshard; Homayoun C Bagheri; Verena J Schuenemann; Kay Nieselt; Natasha Arora; Fernando González-Candelas
Journal:  Mol Biol Evol       Date:  2022-01-07       Impact factor: 16.240

4.  Bacterial low-abundant taxa are key determinants of a healthy airway metagenome in the early years of human life.

Authors:  Marie-Madlen Pust; Burkhard Tümmler
Journal:  Comput Struct Biotechnol J       Date:  2021-12-15       Impact factor: 7.271

5.  Characterization of Expression and Epigenetic Features of Core Genes in Common Wheat.

Authors:  Dongyang Zheng; Wenli Zhang
Journal:  Genes (Basel)       Date:  2022-06-21       Impact factor: 4.141

6.  Chromosome-level assemblies from diverse clades reveal limited structural and gene content variation in the genome of Candida glabrata.

Authors:  Marina Marcet-Houben; María Alvarado; Ewa Ksiezopolska; Ester Saus; Piet W J de Groot; Toni Gabaldón
Journal:  BMC Biol       Date:  2022-10-08       Impact factor: 7.364

Review 7.  Bioinformatics tools used for whole-genome sequencing analysis of Neisseria gonorrhoeae: a literature review.

Authors:  Reema Singh; Anthony Kusalik; Jo-Anne R Dillon
Journal:  Brief Funct Genomics       Date:  2022-04-11       Impact factor: 4.840

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.