| Literature DB >> 29230066 |
Daehwan Lee1, Dajeong Lim2, Daehong Kwon1, Juyeon Kim1, Jongin Lee1, Mikang Sim1, Bong-Hwan Choi2, Seog-Gyu Choi3, Jaebum Kim4.
Abstract
Rapid and cost effective production of large-scale genome data through next-generation sequencing has enabled population-level studies of various organisms to identify their genotypic differences and phenotypic consequences. This is also used to study indigenous animals with historical and economical values, although they are less studied than model organisms. The objective of this study was to perform functional and evolutionary analysis of Korean bob-tailed native dog Donggyeong with distinct tail and agility phenotype using whole-genome sequencing data by using population and comparative genomics approaches. Based on the uniqueness of non-synonymous single nucleotide polymorphisms obtained from next-generation sequencing data, Donggyeong dog-specific genes/proteins and their functions were identified by comparison with 12 other dog breeds and six other related species. These proteins were further divided into subpopulation-specific ones with different tail length and protein interaction-level signatures were investigated. Finally, the trajectory of shaping protein interactions of subpopulation-specific proteins during evolution was uncovered. This study expands our knowledge of Korean native dogs. Our results also provide a good example of using whole-genome sequencing data for population-level analysis in closely related species.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29230066 PMCID: PMC5725459 DOI: 10.1038/s41598-017-17817-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Classification of Donggyeong dog SNPs.
| Category | Long tail | Short tail | Non-tail | |
|---|---|---|---|---|
| Exon | Synonymous | 34,765 (0.40%) | 34,628 (0.40%) | 35,713 (0.40%) |
| Non-synonymous | 29,781 (0.34%) | 29,061 (0.34%) | 30,625 (0.34%) | |
| Non-Exon | Splice site | 6,938 (0.08%) | 6,909 (0.08%) | 7,174 (0.08%) |
| Intron | 2,377,868 (27.44%) | 2,367,464 (27.65%) | 2,449,979 (27.52%) | |
| UTR | 57,514 (0.66%) | 57,558 (0.67%) | 59,429 (0.67%) | |
| Intergenic | 6,166,978 (71.15%) | 6,073,164 (70.93%) | 6,326,879 (71.07%) |
The fraction to the total number of SNPs in Supplementary Table 1 is shown in parentheses.
Figure 1Examples of Donggyeong dog-specific non-synonymous SNPs and consequential amino acid variants. Top panel shows gene structure with the direction of transcription (blue arrow). Bottom panel indicates positions of non-synonymous SNPs and comparison of amino acids among different dog breeds and related species. Two different amino acids corresponding to two nucleotide variants in Donggyeong dog are shown together with a slash delimiter. The dash symbol represents a gap in multiple sequene alignment. DG: Donggyeong dog, DQ: Diquing village dog, KM: Kunming dog, YJ: Yingjiang village dog, GS: German shepherd, LJ: Lijiang village dog, TM: Tibetan mastiff. Other dog breeds not shown here contained the same variant as dog breeds with blue color shown in this figure.
Figure 2Protein-protein interactions of Donggyeong dog tail length-specific extended protein sets. (A) Entire protein-protein interaction network of Donggyeong dog tail length-specific protein sets. (B) Tail-specific (blue) and non-tail-specific (red) subnetworks with interactions absent in other related species (cat, horse, pig, and cow). Node colors correspond to the colors used in Supplementary Fig. 2 (blue: the tail-specific proteins not shared with the non-tail-specific and the common proteins, red: the non-tail-specific proteins not shared with the tail-specific and the common proteins, green: shared proteins by the tail-specific and the non-tail-specific proteins excluding the common proteins, and gray: the common proteins). The definition of the protein sets is found in Materials and Methods.
Figure 3Evolutionary changes in protein interactions among Donggyeong dog tail-specific proteins against a total of 2,218 protein pairs (A) and non-tail-specific proteins against a total of 2,539 protein pairs (B). Numbers on branches represent the number of newly appeared (in the case of +) or disappeared (in the case of −) protein interactions. Percentages in parentheses indicate fraction against the number of protein pairs without interaction (in the case of +) or the number of protein pairs with interaction (in the case of −). Two numbers next to names of ancestors or above names of descendant species mean the number of protein pairs with or without interaction, respectively. Divergence time among species was obtained from the TimeTree website (http://www.timetree.org/).