Literature DB >> 31598382

Dog10K: the International Consortium of Canine Genome Sequencing.

Guo-Dong Wang1,2, Greger Larson3, Jeffrey M Kidd4, Bridgett M vonHoldt5, Elaine A Ostrander6, Ya-Ping Zhang1,2.   

Abstract

Entities:  

Year:  2019        PMID: 31598382      PMCID: PMC6776106          DOI: 10.1093/nsr/nwz068

Source DB:  PubMed          Journal:  Natl Sci Rev        ISSN: 2053-714X            Impact factor:   17.275


× No keyword cloud information.
Dogs (Canis lupus familiaris) were the first species to enter into a domestic relationship with people [1] and are a source of fascination all over the world, not only due to their history of domestication and dispersal along with human beings [2], but also because of their diverse phenotypes and behaviors, driven by both artificial and natural selection [3]. Dogs and humans have often been subjected to similar selection pressures [4], and these shared evolutionary trajectories have led to the emergence of the same common disorders including heart disease, neurologic disorders, diabetes and cancer [5]. The worldwide canine genetics genomics communities recently formed the International Consortium of Canine Genome Sequencing, also called the Dog10K Consortium (http://www.dog10kgenomes.org), to address major research questions regarding the genetic underpinnings of domestication, breed formation, aging, behavior and morphologic variation, and to advance our understanding of human and canine health [6]. This collaboration was originally conceived by the Chinese Academy of Sciences in 2016 with an initial plan to organize sample selection and collection, sequencing technology, bioinformatics, analysis and logistics, toward the goal of generating and sharing whole-genome sequence (WGS) data from 10 000 dogs and wild canids to improve the utility of the canine model system. An initial meeting was held in the summer of 2016 in the Academic Summit of Dog Genomes, featuring 25 participants representing 9 countries and 18 institutions (Fig. 1a), and the second meeting was held on 26 May 2019 at the 10th International Conference on Canine and Feline Genetics and Genomics (Fig. 1b) with 15 participants representing 13 institutions.
Figure 1.

(a) Initial meeting of the Dog10K Consortium in Beijing, 2016 (Courtesy of Yunyun Zheng). (b) Second meeting of the Dog10k Consortium in Bern, 2019 (Courtesy of Guo-Dong Wang).

(a) Initial meeting of the Dog10K Consortium in Beijing, 2016 (Courtesy of Yunyun Zheng). (b) Second meeting of the Dog10k Consortium in Bern, 2019 (Courtesy of Guo-Dong Wang). The primary goals of this collaborative endeavor are to generate WGSs of 10 000 canine/canids within 5 years, refine the existing reference genome from a Boxer, create new reference genomes from additional canids and apply the data to a myriad of scientific questions. The resulting catalog will contain comprehensive high-density genomic data, including single nucleotide variants (SNVs), structural variants (SVs), which include copy number variations (CNVs), and mobile element insertions (MEIs). Using these data, the possibility now exists to expand our knowledge of wild and domestic canids, and enhance studies of biology, disease susceptibility and evolution. We aim to sequence each genome to at least 20× coverage, which provides optimal coverage with minimal error rates. De novo genome assembly will be undertaken for both breed dogs and wild canids using a variety of technologies, including Pacific Biosciences long-reads (PacBio) (100×), Bacterial Artificial Chromosome (BAC)-end sequencing, optical mapping (Bionano Saphyr), phased haplotypes (60x by the 10x genomics platform), chromosome conformation (Hi-C), etc. We will sequence mRNAs, long non-coding RNAs and circle RNAs in pituitary, liver, hippocampus, ovarian and prefrontal cortex samples, and a full-length transcriptome through PacBio using a mix of the above tissues for the annotation of the reference genome. To address research questions, the Consortium defines the roles of participants and its interaction with the larger scientific community.

Sample acquisition for phylogeny, domestication, and phenotypic and clinical traits

Integral to the success of Dog10K is the principle that sample collection will be extensive, covering all established breeds throughout the world, as well as village dogs, wild canids, and niche or non-registered dog populations that fall outside the formal definition of a recognized breed. The latter can be used as outgroups to improve detection of ancestral versus derived alleles. The wild canids and village dogs will help the demographic history and occurrence of admixture at the early stages of dog domestication to be deciphered. Niche population dogs will reveal the natural selection events that occurred when dogs traveled alongside humans during early human civilizations. Studies of established breeds will facilitate genetic studies of breed morphology, behavior and disease susceptibility. The Consortium is also sampling pedigrees spanning three generations for both dogs and wolves to generate deep, comprehensive genome sequence information, which will further enable an improved inference of evolutionary rates in canids.

Bioinformatic analysis and data distribution

The Consortium will build a standard pipeline for SNV calling based on the Genome Analysis Toolkit (GATK) best practices [7] and will produce analysis-ready BAM files for each sample. Variant discovery will proceed from the BAM files and produce variant calls, which involves the generation of G.VCF files by the HaplotypeCaller for each sample and the production of variant calls by GenotypeGVCFs, combining all individuals. The output will be in VCF format for the further filtering of variants, refinement of genotypes and annotation. We will also use a combination of methods to call the SVs, CNVs and MEIs. Basic population genetic analysis will be performed to infer nuclear diversity, haplotypes recombination maps and other parameters. Data sharing is a crucial component of the Consortium. The FASTQ, BAM and G.VCF files will be stored and shared through the Genome Sequence Archive (http://gsa.big.ac.cn/). FASTQ files will be stored and shared through the Sequence Read Archive (https://www.ncbi.nlm.nih.gov/sra/). At the same time, we will use iDog (http://bigd.big.ac.cn/idog/) to share omics data [8], [i.e. the data set of variations (SNVs, CNVs, SVs and MEIs), haplotype, gene expression, parameters of population genetics, etc.]. All data sets will be continuously updated throughout the duration of the project. Meanwhile, we will update the progress of the dog10K Consortium on the website (http://www.dog10kgenomes.org) and incorporate a variety of metadata, including information regarding the number of samples collected, gender, geography, progress to date and other sample-affiliated information. All cooperating organizations and related partners’ information will also be available on the website. For data generated by other means and incorporated into Dog10K, links will be established to the original websites where primary data and additional analyses are located.

Population genomic- and WGS-based case-controls analysis

Dogs adapted to novel environments as they dispersed across continents with people and often hybridized with local wild canids as they did so. We will assess the strength and timing of selection at the first stage of dog domestication, and will identify artificial selection that affected early breeding processes. Dog10K data will allow for signatures of selection to be associated with specific traits and local adaptation processes in wild canid populations to be identified, both of which will serve as a basis for investigating the potential role of polygenic selection in canines, none of which have been studied to date. We will also explore the complex patterns of admixture among wild and domestic canid species, and search for genomic regions depleted of admixed ancestry and adapted introgression. Full genomes from diverse populations will provide new data to test ideas on the distribution of selection, which are integral to evolutionary models of demography and adaptation. Strong artificial selection has resulted in ∼450 globally recognized breeds with distinct traits related to morphology, including body size, leg length, skull shape, etc. [9]. Dog10K will expand that applicability by developing an exhaustive catalog of genetic variation to enable the generation of an accurate imputation panel for dog genome-wide association studies, in much the same way that the 1000 Genomes Project has improved the study of low-frequency variants in human association studies [10,11]. Reference sequence data on all breeds will provide the scientific community with an ability to perform matched WGS-based case-controls analysis, either directly or through imputation, as has recently been successfully demonstrated for breed standard traits. The transition of the process to studies of disease will be permitted by this large data set. This aim is important, since the link between canine and human health has generated the most interest within the biomedical community. Advancing our understanding of common human diseases through studies of comparable diseases based on genomic approaches is key to the overall success of Dog10K.

International collaboration

Dog10K aims to integrate quickly with ongoing sequencing and analysis efforts. For instance, the data set under construction includes the sequencing and subsequent analysis of 722 canids [9] and rare breeds from isolated locations throughout the world [12]. Other ongoing efforts in the general dog community fall into three categories. First, several research groups are studying domestication, independent of Dog10K [13,14]. The integration of WGS data from such studies will rapidly expand the utility of the Dog10K data set, permitting greater understanding of the selection events that have led to specific behaviors in domestic dogs as well as domestication itself [15]. Second, there are scientific programs focused on studies of ancient dog DNA (summarized in [16]). Such studies will provide insights into the links between wild canids and domesticated dogs, and will therefore be integrated into Dog10K. It is hoped that, simultaneously, the Dog10K reference data set will prove useful for studies of ancient canines. Finally, there are several ongoing efforts to build de novo reference genomes based on at least 12 distinct dog breeds or wild canids [17]. The aim of Dog10K is to produce an integrated reference genome with the caliber of the human genome. By both aiding the community with its advanced bioinformatics as well as directly integrating data produced by various research groups, this is sure to happen.

CONCLUSION

The Dog10K Consortium will generate a comprehensive data set of unparalleled complexity and power. This effort will result in a resource that will prove invaluable to a wide range of fields, including comparative genomics, veterinary science, evolutionary biology, medical research and conservation genetics. Dog10K will thus elevate the dog to its rightful place as a premier genetic system for revealing fundamental insights into evolution, health and biology; therefore, Dog10K will be a valuable partner for the growing number of researchers who are investigating dogs as a viable model for studies of disease susceptibility, progression, treatment response and gene therapy outcomes [18,19].
  17 in total

1.  Genomic Analyses Reveal the Influence of Geographic Origin, Migration, and Hybridization on Modern Dog Breed Development.

Authors:  Heidi G Parker; Dayna L Dreger; Maud Rimbault; Brian W Davis; Alexandra B Mullen; Gretchen Carpintero-Ramirez; Elaine A Ostrander
Journal:  Cell Rep       Date:  2017-04-25       Impact factor: 9.423

Review 2.  Demographic history, selection and functional diversity of the canine genome.

Authors:  Elaine A Ostrander; Robert K Wayne; Adam H Freedman; Brian W Davis
Journal:  Nat Rev Genet       Date:  2017-09-25       Impact factor: 53.242

3.  The evolutionary history of dogs in the Americas.

Authors:  Máire Ní Leathlobhair; Angela R Perri; Evan K Irving-Pease; Kelsey E Witt; Anna Linderholm; James Haile; Ophelie Lebrasseur; Carly Ameen; Jeffrey Blick; Adam R Boyko; Selina Brace; Yahaira Nunes Cortes; Susan J Crockford; Alison Devault; Evangelos A Dimopoulos; Morley Eldridge; Jacob Enk; Shyam Gopalakrishnan; Kevin Gori; Vaughan Grimes; Eric Guiry; Anders J Hansen; Ardern Hulme-Beaman; John Johnson; Andrew Kitchen; Aleksei K Kasparov; Young-Mi Kwon; Pavel A Nikolskiy; Carlos Peraza Lope; Aurélie Manin; Terrance Martin; Michael Meyer; Kelsey Noack Myers; Mark Omura; Jean-Marie Rouillard; Elena Y Pavlova; Paul Sciulli; Mikkel-Holger S Sinding; Andrea Strakova; Varvara V Ivanova; Christopher Widga; Eske Willerslev; Vladimir V Pitulko; Ian Barnes; M Thomas P Gilbert; Keith M Dobney; Ripan S Malhi; Elizabeth P Murchison; Greger Larson; Laurent A F Frantz
Journal:  Science       Date:  2018-07-06       Impact factor: 47.728

4.  Dogs cloned from adult somatic cells.

Authors:  Byeong Chun Lee; Min Kyu Kim; Goo Jang; Hyun Ju Oh; Fibrianto Yuda; Hye Jin Kim; M Shamim Hossein; M Hossein Shamim; Jung Ju Kim; Sung Keun Kang; Gerald Schatten; Woo Suk Hwang
Journal:  Nature       Date:  2005-08-04       Impact factor: 49.962

5.  Red fox genome assembly identifies genomic regions associated with tame and aggressive behaviours.

Authors:  Anna V Kukekova; Jennifer L Johnson; Xueyan Xiang; Shaohong Feng; Shiping Liu; Halie M Rando; Anastasiya V Kharlamova; Yury Herbeck; Natalya A Serdyukova; Zijun Xiong; Violetta Beklemischeva; Klaus-Peter Koepfli; Rimma G Gulevich; Anastasiya V Vladimirova; Jessica P Hekman; Polina L Perelman; Aleksander S Graphodatsky; Stephen J O'Brien; Xu Wang; Andrew G Clark; Gregory M Acland; Lyudmila N Trut; Guojie Zhang
Journal:  Nat Ecol Evol       Date:  2018-08-06       Impact factor: 15.460

6.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

7.  An integrated map of structural variation in 2,504 human genomes.

Authors:  Peter H Sudmant; Tobias Rausch; Eugene J Gardner; Robert E Handsaker; Alexej Abyzov; John Huddleston; Yan Zhang; Kai Ye; Goo Jun; Markus Hsi-Yang Fritz; Miriam K Konkel; Ankit Malhotra; Adrian M Stütz; Xinghua Shi; Francesco Paolo Casale; Jieming Chen; Fereydoun Hormozdiari; Gargi Dayama; Ken Chen; Maika Malig; Mark J P Chaisson; Klaudia Walter; Sascha Meiers; Seva Kashin; Erik Garrison; Adam Auton; Hugo Y K Lam; Xinmeng Jasmine Mu; Can Alkan; Danny Antaki; Taejeong Bae; Eliza Cerveira; Peter Chines; Zechen Chong; Laura Clarke; Elif Dal; Li Ding; Sarah Emery; Xian Fan; Madhusudan Gujral; Fatma Kahveci; Jeffrey M Kidd; Yu Kong; Eric-Wubbo Lameijer; Shane McCarthy; Paul Flicek; Richard A Gibbs; Gabor Marth; Christopher E Mason; Androniki Menelaou; Donna M Muzny; Bradley J Nelson; Amina Noor; Nicholas F Parrish; Matthew Pendleton; Andrew Quitadamo; Benjamin Raeder; Eric E Schadt; Mallory Romanovitch; Andreas Schlattl; Robert Sebra; Andrey A Shabalin; Andreas Untergasser; Jerilyn A Walker; Min Wang; Fuli Yu; Chengsheng Zhang; Jing Zhang; Xiangqun Zheng-Bradley; Wanding Zhou; Thomas Zichner; Jonathan Sebat; Mark A Batzer; Steven A McCarroll; Ryan E Mills; Mark B Gerstein; Ali Bashir; Oliver Stegle; Scott E Devine; Charles Lee; Evan E Eichler; Jan O Korbel
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  Demographically-Based Evaluation of Genomic Regions under Selection in Domestic Dogs.

Authors:  Adam H Freedman; Rena M Schweizer; Diego Ortega-Del Vecchyo; Eunjung Han; Brian W Davis; Ilan Gronau; Pedro M Silva; Marco Galaverni; Zhenxin Fan; Peter Marx; Belen Lorente-Galdos; Oscar Ramirez; Farhad Hormozdiari; Can Alkan; Carles Vilà; Kevin Squire; Eli Geffen; Josip Kusak; Adam R Boyko; Heidi G Parker; Clarence Lee; Vasisht Tadigotla; Adam Siepel; Carlos D Bustamante; Timothy T Harkins; Stanley F Nelson; Tomas Marques-Bonet; Elaine A Ostrander; Robert K Wayne; John Novembre
Journal:  PLoS Genet       Date:  2016-03-04       Impact factor: 5.917

9.  Whole-genome sequence analysis shows that two endemic species of North American wolf are admixtures of the coyote and gray wolf.

Authors:  Bridgett M vonHoldt; James A Cahill; Zhenxin Fan; Ilan Gronau; Jacqueline Robinson; John P Pollinger; Beth Shapiro; Jeff Wall; Robert K Wayne
Journal:  Sci Adv       Date:  2016-07-27       Impact factor: 14.136

10.  iDog: an integrated resource for domestic dogs and wild canids.

Authors:  Bixia Tang; Qing Zhou; Lili Dong; Wulue Li; Xiangquan Zhang; Li Lan; Shuang Zhai; Jingfa Xiao; Zhang Zhang; Yiming Bao; Ya-Ping Zhang; Guo-Dong Wang; Wenming Zhao
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  5 in total

Review 1.  Dog-human translational genomics: state of the art and genomic resources.

Authors:  Stefano Pallotti; Ignazio S Piras; Andrea Marchegiani; Matteo Cerquetella; Valerio Napolioni
Journal:  J Appl Genet       Date:  2022-09-08       Impact factor: 2.653

2.  Fine human genetic map based on UK10K data set.

Authors:  Ziqian Hao; Pengyuan Du; Yi-Hsuan Pan; Haipeng Li
Journal:  Hum Genet       Date:  2022-01-20       Impact factor: 4.132

3.  Genomic regions under selection in the feralization of the dingoes.

Authors:  Shao-Jie Zhang; Guo-Dong Wang; Pengcheng Ma; Liang-Liang Zhang; Ting-Ting Yin; Yan-Hu Liu; Newton O Otecko; Meng Wang; Ya-Ping Ma; Lu Wang; Bingyu Mao; Peter Savolainen; Ya-Ping Zhang
Journal:  Nat Commun       Date:  2020-02-03       Impact factor: 14.919

Review 4.  COMMD1 Exemplifies the Power of Inbred Dogs to Dissect Genetic Causes of Rare Copper-Related Disorders.

Authors:  Ronald Jan Corbee; Louis C Penning
Journal:  Animals (Basel)       Date:  2021-02-25       Impact factor: 2.752

5.  Whole-Genome Sequencing Reveals Lactase Persistence Adaptation in European Dogs.

Authors:  Yan-Hu Liu; Lu Wang; Zhiguo Zhang; Newton O Otecko; Saber Khederzadeh; Yongqin Dai; Bin Liang; Guo-Dong Wang; Ya-Ping Zhang
Journal:  Mol Biol Evol       Date:  2021-10-27       Impact factor: 16.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.