
Reference flow: reducing reference bias using multiple population genomes.

Nae-Chyun Chen, Brad Solomon, Taher Mun, Sheila Iyer, Ben Langmead.

Abstract

Most sequencing data analyses start by aligning sequencing reads to a linear reference genome, but failure to account for genetic variation leads to reference bias and confounding of results downstream. Other approaches replace the linear reference with structures like graphs that can include genetic variation, incurring major computational overhead. We propose the reference flow alignment method that uses multiple population reference genomes to improve alignment accuracy and reduce reference bias. Compared to the graph aligner vg, reference flow achieves a similar level of accuracy and bias avoidance but with 14% of the memory footprint and 5.5 times the speed.
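The idea in the abstract, aligning against several population reference genomes instead of one, can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the function names `best_hit` and `reference_flow`, the ungapped mismatch-count "aligner", the `max_mismatches` cutoff used to decide which reads get a second pass, and the premise that all references share coordinates (substitutions only, so no lift-over step is needed).

```python
from typing import Dict, Tuple


def best_hit(read: str, ref: str) -> Tuple[int, int]:
    """Return (mismatches, offset) for the best ungapped placement of read on ref."""
    best = (len(read) + 1, -1)
    for off in range(len(ref) - len(read) + 1):
        mm = sum(a != b for a, b in zip(read, ref[off:off + len(read)]))
        if mm < best[0]:  # strict '<' keeps the leftmost of equally good placements
            best = (mm, off)
    return best


def reference_flow(
    reads: Dict[str, str],
    first_ref: str,
    population_refs: Dict[str, str],
    max_mismatches: int = 0,  # hypothetical confidence cutoff for the first pass
) -> Dict[str, Tuple[str, int, int]]:
    """Toy two-pass alignment: keep confident first-pass hits, retry the rest.

    Returns read name -> (reference label, offset, mismatches).
    """
    result = {}
    for name, seq in reads.items():
        mm, off = best_hit(seq, first_ref)
        choice = ("first_pass", off, mm)
        if mm > max_mismatches:
            # Low-confidence read: try each population reference and keep
            # whichever placement has the fewest mismatches.
            for label, ref in population_refs.items():
                mm2, off2 = best_hit(seq, ref)
                if mm2 < choice[2]:
                    choice = (label, off2, mm2)
        result[name] = choice
    return result


# Made-up sequences: pop_a differs from the first-pass reference at one base.
first = "ACGTACGTTTGACCA"
pops = {"pop_a": "ACGTACGATTGACCA"}
reads = {"r1": "ACGTAC", "r2": "CGATTG"}  # r2 carries the pop_a allele
print(reference_flow(reads, first, pops))
```

In this sketch, a read that matches the first reference exactly never touches the population references, which is where the saving over aligning every read to every genome would come from. A real pipeline would use a proper read aligner and handle indels and coordinate conversion.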

Year:  2021        PMID: 33397413      PMCID: PMC7780692          DOI: 10.1186/s13059-020-02229-3

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


  40 in total

1.  Fast and accurate genomic analyses using genome graphs.

Authors:  Goran Rakocevic; Vladimir Semenyuk; Wan-Ping Lee; James Spencer; John Browning; Ivan J Johnson; Vladan Arsenijevic; Jelena Nadj; Kaushik Ghose; Maria C Suciu; Sun-Gou Ji; Gülfem Demir; Lizao Li; Berke Ç Toptaş; Alexey Dolgoborodov; Björn Pollex; Iosif Spulber; Irina Glotova; Péter Kómár; Andrew L Stachyra; Yilong Li; Milos Popovic; Morten Källberg; Amit Jain; Deniz Kural
Journal:  Nat Genet       Date:  2019-01-14       Impact factor: 38.330

2.  Convolutional Embedded Networks for Population Scale Clustering and Bio-Ancestry Inferencing.

Authors:  Md Rezaul Karim; Michael Cochez; Achille Zappa; Ratnesh Sahay; Dietrich Rebholz-Schuhmann; Oya Beyan; Stefan Decker
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2022-02-03       Impact factor: 3.710

3.  Choosing non-redundant representative subsets of protein sequence data sets using submodular optimization.

Authors:  Maxwell W Libbrecht; Jeffrey A Bilmes; William Stafford Noble
Journal:  Proteins       Date:  2018-02-01

4.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors:  Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal:  Mol Syst Biol       Date:  2011-10-11       Impact factor: 11.429

5.  The African Genome Variation Project shapes medical genetics in Africa.

Authors:  Deepti Gurdasani; Tommy Carstensen; Fasil Tekola-Ayele; Luca Pagani; Ioanna Tachmazidou; Konstantinos Hatzikotoulas; Savita Karthikeyan; Louise Iles; Martin O Pollard; Ananyo Choudhury; Graham R S Ritchie; Yali Xue; Jennifer Asimit; Rebecca N Nsubuga; Elizabeth H Young; Cristina Pomilla; Katja Kivinen; Kirk Rockett; Anatoli Kamali; Ayo P Doumatey; Gershim Asiki; Janet Seeley; Fatoumatta Sisay-Joof; Muminatou Jallow; Stephen Tollman; Ephrem Mekonnen; Rosemary Ekong; Tamiru Oljira; Neil Bradman; Kalifa Bojang; Michele Ramsay; Adebowale Adeyemo; Endashaw Bekele; Ayesha Motala; Shane A Norris; Fraser Pirie; Pontiano Kaleebu; Dominic Kwiatkowski; Chris Tyler-Smith; Charles Rotimi; Eleftheria Zeggini; Manjinder S Sandhu
Journal:  Nature       Date:  2014-12-03       Impact factor: 49.962

6.  FORGe: prioritizing variants for graph genomes.

Authors:  Jacob Pritt; Nae-Chyun Chen; Ben Langmead
Journal:  Genome Biol       Date:  2018-12-17       Impact factor: 13.583

7.  Elimination of Reference Mapping Bias Reveals Robust Immune Related Allele-Specific Expression in Crossbred Sheep.

Authors:  Mazdak Salavati; Stephen J Bush; Sergio Palma-Vera; Mary E B McCulloch; David A Hume; Emily L Clark
Journal:  Front Genet       Date:  2019-09-19       Impact factor: 4.599

8.  CHOP: haplotype-aware path indexing in population graphs.

Authors:  Tom Mokveld; Jasper Linthorst; Zaid Al-Ars; Henne Holstege; Marcel Reinders
Journal:  Genome Biol       Date:  2020-03-11       Impact factor: 13.583

9.  WASP: allele-specific software for robust molecular quantitative trait locus discovery.

Authors:  Bryce van de Geijn; Graham McVicker; Yoav Gilad; Jonathan K Pritchard
Journal:  Nat Methods       Date:  2015-09-14       Impact factor: 28.547

10.  Assembly of a pan-genome from deep sequencing of 910 humans of African descent.

Authors:  Rachel M Sherman; Juliet Forman; Valentin Antonescu; Daniela Puiu; Michelle Daya; Nicholas Rafaels; Meher Preethi Boorgula; Sameer Chavan; Candelaria Vergara; Victor E Ortega; Albert M Levin; Celeste Eng; Maria Yazdanbakhsh; James G Wilson; Javier Marrugo; Leslie A Lange; L Keoki Williams; Harold Watson; Lorraine B Ware; Christopher O Olopade; Olufunmilayo Olopade; Ricardo R Oliveira; Carole Ober; Dan L Nicolae; Deborah A Meyers; Alvaro Mayorga; Jennifer Knight-Madden; Tina Hartert; Nadia N Hansel; Marilyn G Foreman; Jean G Ford; Mezbah U Faruque; Georgia M Dunston; Luis Caraballo; Esteban G Burchard; Eugene R Bleecker; Maria I Araujo; Edwin F Herrera-Paz; Monica Campbell; Cassandra Foster; Margaret A Taub; Terri H Beaty; Ingo Ruczinski; Rasika A Mathias; Kathleen C Barnes; Steven L Salzberg
Journal:  Nat Genet       Date:  2018-11-19       Impact factor: 38.330

  8 in total

1.  Towards mouse genetic-specific RNA-sequencing read mapping.

Authors:  Nastassia Gobet; Maxime Jan; Paul Franken; Ioannis Xenarios
Journal:  PLoS Comput Biol       Date:  2022-09-26       Impact factor: 4.779

2.  Population-scale genotyping of structural variation in the era of long-read sequencing. (Review)

Authors:  Cheng Quan; Hao Lu; Yiming Lu; Gangqiao Zhou
Journal:  Comput Struct Biotechnol J       Date:  2022-05-27       Impact factor: 6.155

3.  One is not enough: On the effects of reference genome for the mapping and subsequent analyses of short-reads.

Authors:  Carlos Valiente-Mullor; Beatriz Beamud; Iván Ansari; Carlos Francés-Cuesta; Neris García-González; Lorena Mejía; Paula Ruiz-Hueso; Fernando González-Candelas
Journal:  PLoS Comput Biol       Date:  2021-01-27       Impact factor: 4.475

4.  A guide for the diagnosis of rare and undiagnosed disease: beyond the exome. (Review)

Authors:  Shruti Marwaha; Joshua W Knowles; Euan A Ashley
Journal:  Genome Med       Date:  2022-02-28       Impact factor: 15.266

5.  Pan-human consensus genome significantly improves the accuracy of RNA-seq analyses.

Authors:  Benjamin Kaminow; Sara Ballouz; Jesse Gillis; Alexander Dobin
Journal:  Genome Res       Date:  2022-03-07       Impact factor: 9.438

6.  Constructing small genome graphs via string compression.

Authors:  Yutong Qiu; Carl Kingsford
Journal:  Bioinformatics       Date:  2021-07-12       Impact factor: 6.937

7.  Introgression among North American wild grapes (Vitis) fuels biotic and abiotic adaptation.

Authors:  Abraham Morales-Cruz; Jonas A Aguirre-Liguori; Yongfeng Zhou; Andrea Minio; Summaira Riaz; Andrew M Walker; Dario Cantu; Brandon S Gaut
Journal:  Genome Biol       Date:  2021-09-03       Impact factor: 13.583

8.  LevioSAM: Fast lift-over of variant-aware reference alignments.

Authors:  Taher Mun; Nae-Chyun Chen; Ben Langmead
Journal:  Bioinformatics       Date:  2021-05-25       Impact factor: 6.931
