Matt A Field1,2, Benjamin D Rosen3, Olga Dudchenko4,5,6, Eva K F Chan7,8, Andre E Minoche, Richard J Edwards9, Kirston Barton7,10, Ruth J Lyons7, Daniel Enosi Tuipulotu9, Vanessa M Hayes7,10,11, Arina D Omer4,5, Zane Colaric4,5, Jens Keilwagen12, Ksenia Skvortsova7, Ozren Bogdanovic7,9, Martin A Smith7,10, Erez Lieberman Aiden4,5,6,13,14, Timothy P L Smith15, Robert A Zammit16, J William O Ballard9. 1. Centre for Tropical Bioinformatics and Molecular Biology, Australian Institute of Tropical Health and Medicine, James Cook University, Smithfield Road, Cairns, QLD 4878, Australia. 2. John Curtin School of Medical Research, Australian National University, Garran Rd, Canberra, ACT 2600, Australia. 3. Animal Genomics and Improvement Laboratory, Agricultural Research Service USDA, Baltimore Ave, Beltsville, MD 20705, USA. 4. The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Baylor Plaza, Houston, TX 77030, USA. 5. Department of Computer Science, Rice University, Main St, Houston, TX 77005, USA. 6. Center for Theoretical and Biological Physics, Rice University, Main St, Houston, TX 77005, USA. 7. Garvan Institute of Medical Research, Victoria Street, Darlinghurst, NSW 2010, Australia. 8. St Vincent's Clinical School, University of New South Wales Sydney, Victoria Street, Darlinghurst NSW 2010, Australia. 9. School of Biotechnology and Biomolecular Sciences, UNSW Sydney, High St, Kensington, NSW 2052, Australia. 10. Faculty of Medicine, UNSW Sydney, High St, Kensington, NSW 2052, Australia. 11. Central Clinical School, University of Sydney, Parramatta Road, Camperdown, NSW 2050, Australia. 12. Julius Kühn-Institut, Erwin-Baur-Str. 27, 06484 Quedlinburg, Germany. 13. Broad Institute of MIT and Harvard, Main St, Cambridge, MA 02142, USA. 14. Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech University, ShanghaiTech University, Huaxia Middle Rd, Pudong 201210, China. 15. US Meat Animal Research Center, Agricultural Research Service USDA, Rd 313, Clay Center, NE 68933, USA. 16. Vineyard Veterinary Hospital, Windsor Rd, Vineyard, NSW 2765, Australia.
Abstract
BACKGROUND: The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. FINDINGS: Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam_GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. CONCLUSIONS: GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology.
BACKGROUND: The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. FINDINGS: Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam_GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. CONCLUSIONS: GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology.
Authors: Heidi G Parker; Dayna L Dreger; Maud Rimbault; Brian W Davis; Alexandra B Mullen; Gretchen Carpintero-Ramirez; Elaine A Ostrander Journal: Cell Rep Date: 2017-04-25 Impact factor: 9.423
Authors: Ernest T Lam; Alex Hastie; Chin Lin; Dean Ehrlich; Somes K Das; Michael D Austin; Paru Deshpande; Han Cao; Niranjan Nagarajan; Ming Xiao; Pui-Yan Kwok Journal: Nat Biotechnol Date: 2012-08 Impact factor: 54.908
Authors: Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy Journal: Genome Res Date: 2017-03-15 Impact factor: 9.043
Authors: Alisa Mo; Eran A Mukamel; Fred P Davis; Chongyuan Luo; Gilbert L Henry; Serge Picard; Mark A Urich; Joseph R Nery; Terrence J Sejnowski; Ryan Lister; Sean R Eddy; Joseph R Ecker; Jeremy Nathans Journal: Neuron Date: 2015-06-17 Impact factor: 17.173
Authors: Alex R Hastie; Lingli Dong; Alexis Smith; Jeff Finklestein; Ernest T Lam; Naxin Huo; Han Cao; Pui-Yan Kwok; Karin R Deal; Jan Dvorak; Ming-Cheng Luo; Yong Gu; Ming Xiao Journal: PLoS One Date: 2013-02-06 Impact factor: 3.240
Authors: Kevin Batcher; Scarlett Varney; Daniel York; Matthew Blacksmith; Jeffrey M Kidd; Robert Rebhun; Peter Dickinson; Danika Bannasch Journal: Genome Res Date: 2022-08-12 Impact factor: 9.438
Authors: Matt A Field; Sonu Yadav; Olga Dudchenko; Meera Esvaran; Benjamin D Rosen; Ksenia Skvortsova; Richard J Edwards; Jens Keilwagen; Blake J Cochran; Bikash Manandhar; Sonia Bustamante; Jacob Agerbo Rasmussen; Richard G Melvin; Barry Chernoff; Arina Omer; Zane Colaric; Eva K F Chan; Andre E Minoche; Timothy P L Smith; M Thomas P Gilbert; Ozren Bogdanovic; Robert A Zammit; Torsten Thomas; Erez L Aiden; J William O Ballard Journal: Sci Adv Date: 2022-04-22 Impact factor: 14.957
Authors: Robert A Player; Ellen R Forsyth; Kathleen J Verratti; David W Mohr; Alan F Scott; Christopher E Bradburne Journal: Life Sci Alliance Date: 2021-01-29
Authors: Jennifer R S Meadows; Kerstin Lindblad-Toh; Chao Wang; Ola Wallerman; Maja-Louise Arendt; Elisabeth Sundström; Åsa Karlsson; Jessika Nordin; Suvi Mäkeläinen; Gerli Rosengren Pielberg; Jeanette Hanson; Åsa Ohlsson; Sara Saellström; Henrik Rönnberg; Ingrid Ljungvall; Jens Häggström; Tomas F Bergström; Åke Hedhammar Journal: Commun Biol Date: 2021-02-10
Authors: Richard J Edwards; Matt A Field; James M Ferguson; Olga Dudchenko; Jens Keilwagen; Benjamin D Rosen; Gary S Johnson; Edward S Rice; La Deanna Hillier; Jillian M Hammond; Samuel G Towarnicki; Arina Omer; Ruqayya Khan; Ksenia Skvortsova; Ozren Bogdanovic; Robert A Zammit; Erez Lieberman Aiden; Wesley C Warren; J William O Ballard Journal: BMC Genomics Date: 2021-03-16 Impact factor: 3.969
Authors: Julia V Halo; Amanda L Pendleton; Feichen Shen; Aurélien J Doucet; Thomas Derrien; Christophe Hitte; Laura E Kirby; Bridget Myers; Elzbieta Sliwerska; Sarah Emery; John V Moran; Adam R Boyko; Jeffrey M Kidd Journal: Proc Natl Acad Sci U S A Date: 2021-03-16 Impact factor: 11.205