Investigating the impact of reference assembly choice on genomic analyses in a cattle breed.

Audald Lloret-Villas, Meenu Bhati, Naveen Kumar Kadri, Ruedi Fries, Hubert Pausch.

Abstract

BACKGROUND: Reference-guided read alignment and variant genotyping are prone to reference allele bias, particularly for samples that are greatly divergent from the reference genome. A Hereford-based assembly is the widely accepted bovine reference genome. Haplotype-resolved genomes that exceed the current bovine reference genome in quality and continuity have been assembled for different breeds of cattle. Using whole genome sequencing data of 161 Brown Swiss cattle, we compared the accuracy of read mapping and sequence variant genotyping as well as downstream genomic analyses between the bovine reference genome (ARS-UCD1.2) and a highly continuous Angus-based assembly (UOA_Angus_1).
RESULTS: Read mapping accuracy did not differ notably between the ARS-UCD1.2 and UOA_Angus_1 assemblies. We discovered 22,744,517 and 22,559,675 high-quality variants from ARS-UCD1.2 and UOA_Angus_1, respectively. The concordance between sequence- and array-called genotypes was high and the number of variants deviating from Hardy-Weinberg proportions was low at segregating sites for both assemblies. More artefactual INDELs were genotyped from UOA_Angus_1 than ARS-UCD1.2 alignments. Using the composite likelihood ratio test, we detected 40 and 33 signatures of selection from ARS-UCD1.2 and UOA_Angus_1, respectively, but the overlap between both assemblies was low. Using the 161 sequenced Brown Swiss cattle as a reference panel, we imputed sequence variant genotypes into a mapping cohort of 30,499 cattle that had microarray-derived genotypes using a two-step imputation approach. The accuracy of imputation (Beagle R²) was very high (0.87) for both assemblies. Genome-wide association studies between imputed sequence variant genotypes and six dairy traits as well as stature produced almost identical results from both assemblies.
CONCLUSIONS: The ARS-UCD1.2 and UOA_Angus_1 assemblies are suitable for reference-guided genome analyses in Brown Swiss cattle. Although differences in read mapping and genotyping accuracy between both assemblies are negligible, the choice of the reference genome has a large impact on detecting signatures of selection that already reached fixation using the composite likelihood ratio test. We developed a workflow that can be adapted and reused to compare the impact of reference genomes on genome analyses in various breeds, populations and species.
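
The two genotype quality checks mentioned in the results (concordance between sequence- and array-called genotypes, and deviation from Hardy-Weinberg proportions) can be illustrated with a minimal sketch. This is not the authors' workflow: the function names, the 0/1/2 genotype coding, and the use of a Pearson chi-square statistic instead of an exact test are all assumptions made for illustration.

```python
def genotype_concordance(seq_calls, array_calls):
    """Fraction of non-missing genotype pairs that agree.

    Genotypes are assumed to be coded as alternate-allele counts (0, 1, 2);
    None marks a missing call. (Hypothetical helper, not from the paper.)
    """
    pairs = [(s, a) for s, a in zip(seq_calls, array_calls)
             if s is not None and a is not None]
    if not pairs:
        raise ValueError("no overlapping non-missing genotypes")
    return sum(s == a for s, a in pairs) / len(pairs)


def hwe_chi_square(n_ref_hom, n_het, n_alt_hom):
    """Pearson chi-square statistic (1 d.f.) for deviation from
    Hardy-Weinberg proportions at one biallelic site.

    Assumption: a chi-square statistic is used here for simplicity;
    the study may have used a different (e.g. exact) test.
    """
    n = n_ref_hom + n_het + n_alt_hom
    if n == 0:
        raise ValueError("no genotypes")
    p = (2 * n_ref_hom + n_het) / (2 * n)  # reference allele frequency
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_ref_hom, n_het, n_alt_hom)
    # Skip classes with zero expectation (monomorphic sites)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
```

For example, `hwe_chi_square(25, 50, 25)` returns `0.0` because the observed genotype counts match the Hardy-Weinberg expectation exactly, whereas a site with no heterozygotes such as `hwe_chi_square(50, 0, 50)` yields a large statistic.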

Keywords:  Alignment quality; Bovine; Functional annotation; Genome-wide association study; Reference genome comparison; Sequence variants; Signatures of selection

Year: 2021; PMID: 34011274; DOI: 10.1186/s12864-021-07554-w

Source DB: PubMed; Journal: BMC Genomics; ISSN: 1471-2164; Impact factor: 3.969


