Literature DB >> 31841508

Selection signatures in goats reveal copy number variants underlying breed-defining coat color phenotypes.

Jan Henkel1,2, Rashid Saif1,3, Vidhya Jagannathan1,2, Corinne Schmocker1, Flurina Zeindler4, Erika Bangerter5, Ursula Herren5, Dimitris Posantzis6, Zafer Bulut7, Philippe Ammann8, Cord Drögemüller1,2, Christine Flury4, Tosso Leeb1,2.   

Abstract

Domestication and human selection have formed diverse goat breeds with characteristic phenotypes. This process correlated with the fixation of causative genetic variants controlling breed-specific traits within regions of reduced genetic diversity, so called selection signatures or selective sweeps. Using whole genome sequencing of DNA pools (pool-seq) from 20 genetically diverse modern goat breeds and bezoars, we identified 2,239 putative selection signatures. In two Pakistani goat breeds, Pak Angora and Barbari, we found selection signatures in a region harboring KIT, a gene involved in melanoblast development, migration, and survival. The search for candidate causative variants responsible for these selective sweeps revealed two different copy number variants (CNVs) downstream of KIT that were exclusively present in white Pak Angora and white-spotted Barbari goats. Several Swiss goat breeds selected for specific coat colors showed selection signatures at the ASIP locus encoding the agouti signaling protein. Analysis of these selective sweeps revealed four different CNVs associated with the white or tan (AWt), Swiss markings (Asm), badgerface (Ab), and the newly proposed peacock (Apc) allele. RNA-seq analyses on skin samples from goats with the different CNV alleles suggest that the identified structural variants lead to an altered expression of ASIP between eumelanistic and pheomelanistic body areas. Our study yields novel insights into the genetic control of pigmentation by identifying six functionally relevant CNVs. It illustrates how structural changes of the genome have contributed to phenotypic evolution in domestic goats.

Entities:  

Year:  2019        PMID: 31841508      PMCID: PMC6936872          DOI: 10.1371/journal.pgen.1008536

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


Introduction

Goat domestication started around 10,000 years ago in the fertile crescent and is believed to be one of the earliest domestication events of livestock animals [1, 2]. Bezoars, the wild ancestors of domestic goats are an extant species with a distribution in Western Asia from Turkey to Pakistan. Since domestication, goats followed the human migration [3] and played an economically important role for their owners by providing various products like milk, meat or fibers. These economical values were further increased by production-orientated breeding, which led to more than 600 diverse goat breeds at present time [4-6]. Artificial selection of domesticated goats not only resulted in specialized elite breeds for milk, meat or fibers, but also in breeds with unique coat color phenotypes [4, 7]. Due to their striking appearance, these goat breeds are of special value to their owners, selected for uniform coat color, and kept in closed populations. Coat color phenotypes are one of the most intensively studied traits in goats [8-12]. They include solid colored animals of different color, animals with symmetrical color patterns, and animals with white markings, white spotting phenotypes or completely white animals. White markings, white spotting and completely white phenotypes typically result from a lack of melanocytes in the skin and hair follicles. This group of phenotypes is also termed leucism or piebaldism and characterized by defects in melanoblast development or migration [13-17]. Very light coat colors resembling white are also seen in animals that have a normal set of melanocytes synthesizing a very pale pheomelanin [18]. Melanocytes produce two types of pigments, the brown to black eumelanin and the red to yellow pheomelanin. The so-called pigment type switching, an intensively studied signaling process, governs whether a given melanocyte produces eumelanin or pheomelanin [19]. Eumelanin is produced, if MC1R is activated by its ligand α-melanocyte stimulating hormone (α-MSH), while pheomelanin is produced if α-MSH is absent and/or outcompeted by binding of the competitive antagonist ASIP to MC1R [20-25]. Different alternative promoters of the ASIP gene enable spatially and temporally regulated ASIP expression, which results in characteristic patterns of eumelanin and pheomelanin synthesis [25-28]. Domestication and artificial selection correlated with the fixation of causative genetic variants controlling breed-specific traits within regions of reduced genetic diversity, so called selection signatures or selective sweeps [29-31]. A method detecting regions of low heterozygosity from sequence data of pooled individuals (pool-seq) was developed and used to identify loci under selection in chicken, pigs, rabbits and the Atlantic herring [32-35]. Recently, pooled heterozygosity scores were also applied to monitor loci under selection in goats [36]. In the present study, we aimed to gain a better understanding of the genetic variants determining breed-specific coat color phenotypes in goats. We therefore performed a comprehensive screen for selection signatures in bezoars and 20 breeds of domesticated goats.

Results

Selection signature analysis

For the present study, we selected 8–12 animals each from 20 phenotypically diverse domesticated goat breeds and their wild ancestor, the bezoar. We isolated genomic DNA from these animals and prepared equimolar DNA pools for sequencing (Table 1).
Table 1

Breeds collected for pool sequencing (pool-seq).

BreedAbbreviationBreed originAnimals per pool
Pak AngoraANGPakistan10
Appenzell goatAPZSwitzerland12
BarbariBARPakistan12
BeetalBEEPakistan12
Bezoar (Capra aegagrus)BEZwild ancestor8
Grisons Striped goatBSTSwitzerland12
Boer goatBURAfrica12
Capra GrigiaCAGSwitzerland12
Dera Din PanahDDPPakistan12
Chamois Colored goatGFGSwitzerland12
KamoriKAMPakistan12
NachiNACPakistan12
Nera VerzascaNERSwitzerland12
PahariPAHPakistan12
Peacock goatPFASwitzerland12
Saanen goatSANSwitzerland12
St. Gallen Booted goatSTGSwitzerland10
TeddyTEDPakistan12
Toggenburg goatTOGSwitzerland12
Valais Blackneck goatVAGSwitzerland12
African Dwarf goatZWZAfrica12
We obtained 2x150 bp paired-end sequence data corresponding to 30x genome coverage per pool, called high confidence SNVs and calculated pooled heterozygosity scores (−ZHp) in 150 kb sliding windows with 75 kb step size (S1 Fig; S1 and S2 Tables). The significance threshold was conservatively set at −ZHp ≥ 4, which identified 5,220 windows with extremely reduced heterozygosity (0.8% of all windows). Overlapping windows were further merged into 2,239 selection signatures (1.1% of total genomic length). This corresponded to 112 selection signatures per breed pool on average (median = 81; S3 Table). To evaluate the validity of the pool-seq approach, we compared the results from pool-seq data to individual whole genome sequence data of 120 goats from five different Swiss breeds (S1 Table). We called SNVs from the individual sequence data and calculated Hp and −ZHp scores respectively (S2 Table). The pool-seq dataset and the dataset with individual sequences yielded similar results (S2 Fig). As a validation of the significance threshold, we inspected our data for selection signatures near known causative variants for breed-defining coat color traits. The characteristic brown coat color of the Toggenburg goat is caused by a missense SNV in the TYRP1 gene, p.Gly496Asp [10]. Toggenburg goats showed the expected selection signature harboring the TYRP1 gene with a −ZHp value of 4.88 (Fig 1; S3 Table).
Fig 1

Manhattan plots showing −ZHp values from 20 diverse goat breeds.

The red horizontal line indicates the chosen significance threshold of −ZHp = 4. Each dot represents a 150 kb window. Each plot contains 29 autosomes and two unplaced scaffolds representing the X chromosome. Selection signatures co-localizing with known coat color genes are marked with arrows.

Manhattan plots showing −ZHp values from 20 diverse goat breeds.

The red horizontal line indicates the chosen significance threshold of −ZHp = 4. Each dot represents a 150 kb window. Each plot contains 29 autosomes and two unplaced scaffolds representing the X chromosome. Selection signatures co-localizing with known coat color genes are marked with arrows. In addition to the search for reduced heterozygosity, we calculated FST values for each breed pool in a pairwise comparison with bezoars. The FST analysis identified 847 selection signatures or 0.4% of the total genomic length (S1 and S3 Figs, S4 Table).

CNVs at the KIT locus in two Pakistani goat breeds

The completely white Pak Angora and the white spotted Barbari breeds showed strong selection signatures harboring the KIT gene on chromosome 6 with −ZHp values of 7.20 and 4.56 (Fig 1; S3 Table). We searched for candidate causative variants within the signatures, but did not detect any coding variants in the KIT gene. However, visual inspection of the short read alignments revealed two different copy number variants (CNVs) downstream of the KIT gene in the Pak Angora and Barbari breeds (Fig 2).
Fig 2

CNVs at the KIT locus.

The coverage plot of bezoars (BEZ) does not show any copy number variation and represents the wildtype allele. In the Pak Angora breed (ANG), the coverage plot shows a triplication of ~100 kb downstream of the KIT gene. In the Barbari breed (BAR), the same region is duplicated. The Barbari allele shows a complex rearrangement involving the insertion of a ~23 kb genome segment originating at 89.2 Mb into the duplicated sequence at ~70.9 Mb with the simultaneous deletion of ~16 kb of KIT sequence. Please note that the coverage at ~89.2 Mb corresponds to three times the average. One genome equivalent corresponds to the wildtype sequence at ~89.2 Mb. Read-pair information indicated that the other two genome equivalents are inserted into the duplicated sequence at ~70.9 Mb (S4 Fig). The dashed red line indicates the average coverage across the whole genome of each pool-seq dataset.

CNVs at the KIT locus.

The coverage plot of bezoars (BEZ) does not show any copy number variation and represents the wildtype allele. In the Pak Angora breed (ANG), the coverage plot shows a triplication of ~100 kb downstream of the KIT gene. In the Barbari breed (BAR), the same region is duplicated. The Barbari allele shows a complex rearrangement involving the insertion of a ~23 kb genome segment originating at 89.2 Mb into the duplicated sequence at ~70.9 Mb with the simultaneous deletion of ~16 kb of KIT sequence. Please note that the coverage at ~89.2 Mb corresponds to three times the average. One genome equivalent corresponds to the wildtype sequence at ~89.2 Mb. Read-pair information indicated that the other two genome equivalents are inserted into the duplicated sequence at ~70.9 Mb (S4 Fig). The dashed red line indicates the average coverage across the whole genome of each pool-seq dataset. Both CNVs started ~63 kb downstream of KIT and covered ~100 kb of the genome reference sequence without known coding DNA. The short read-aligments of read-pairs spanning the amplification breakpoints confirmed that the individual copies of the CNV were arranged in tandem in a head to tail orientation (S4 Fig). The Pak Angora allele consisted of a triplication of the 100 kb region. The Barbari allele represented a duplication of the same ~100 kb region with an additional 16,280 bp deletion in its central part. The read-pair information at the breakpoints indicated that the deleted part was replaced by a 22,702 bp genomic fragment from the 5’-flanking sequence of the RASSF6 gene, which is located 19 Mb further downstream on the same chromosome (S4 Fig; S5 Table). The shared breakpoints of the two different CNV alleles suggested a common origin of these alleles.

CNVs at the ASIP locus in Swiss goat breeds

Five Swiss goat breeds with different coat color patterns had a selection signature with −ZHp ≥ 4 in the region of the ASIP locus on chromosome 13 (Fig 1; S3 Table). We did not find any ASIP coding variation in the breeds with ASIP selection signatures. Based on the segregation of coat color patterns in a large breeding experiment, the existence of up to 11 different caprine ASIP alleles has been postulated [8]. The most dominant of this allelic series, termed “white or tan” (A), is responsible for white coat color in goats [8]. Furthermore, it has been shown that the white coat color in Saanen goats is caused by a triplication of the ASIP gene [9]. Inspection of the short-read alignments of our own sequence data from Saanen goats at the ASIP locus confirmed the previously reported triplication and revealed the exact boundaries of the triplication. It spans 154,677 bp of the reference genome sequence and comprises the entire coding sequence of the ASIP, AHCY and ITCH genes. The individual copies are arranged in tandem in a head to tail orientation. Appenzell goats, another white goat breed, had the same CNV allele as the Saanen goats (Fig 3, S4 Fig, S5 Table).
Fig 3

CNVs at the ASIP locus.

A Coverage plots of the ASIP locus in different goat breeds reveal four different CNVs. The bezoar (BEZ) coverage plot shows uniform coverage and is characteristic for the wildtype allele (A). Underneath, four different mutant ASIP alleles associated with different CNVs are illustrated. The line on top of each plot schematically indicates the most likely configuration of these mutant alleles derived from the available short-read sequence information (S4 Fig). The dashed red line indicates the average coverage across the whole genome of each breed. B Schematic drawings and C representative photographs illustrating the coat color phenotypes of the studied breeds. The photo of the bezoar was obtained during summer, when the dark stripes at the collar and the belly are much less pronounced than in the winter coat. Note that some of the patterns show an exactly inverse distribution of eumelanin and pheomelanin. For example, goats with the A allele have white (pheomelanistic) facial stripes and legs, while goats with the A or A alleles have black (eumelanistic) facial stripes and legs.

CNVs at the ASIP locus.

A Coverage plots of the ASIP locus in different goat breeds reveal four different CNVs. The bezoar (BEZ) coverage plot shows uniform coverage and is characteristic for the wildtype allele (A). Underneath, four different mutant ASIP alleles associated with different CNVs are illustrated. The line on top of each plot schematically indicates the most likely configuration of these mutant alleles derived from the available short-read sequence information (S4 Fig). The dashed red line indicates the average coverage across the whole genome of each breed. B Schematic drawings and C representative photographs illustrating the coat color phenotypes of the studied breeds. The photo of the bezoar was obtained during summer, when the dark stripes at the collar and the belly are much less pronounced than in the winter coat. Note that some of the patterns show an exactly inverse distribution of eumelanin and pheomelanin. For example, goats with the A allele have white (pheomelanistic) facial stripes and legs, while goats with the A or A alleles have black (eumelanistic) facial stripes and legs. We next investigated the coverage plots of Grisons Striped goats and Toggenburg goats. These two breeds show a characteristic color pattern, which has been postulated to be caused by an ASIP allele termed “Swiss markings” (A) [8]. They are fixed for an ASIP allele with 8 tandem copies of a 13,433 bp sequence from the 5’-flanking region of ASIP (Fig 3, S4 Fig, S5 Table). The Chamois Colored goat and the St. Gallen Booted goat are characterized by a color pattern and an ASIP allele termed “badgerface” (A) [8]. Our pool-seq data revealed a five-fold amplification of 45,680 bp located ~61 kb downstream of the A amplification (Fig 3, S4 Fig, S5 Table). The Peacock goat is a rare Swiss goat breed with a unique and striking coat color pattern that has not been investigated previously. Pool-seq data from Peacock goats indicated a selection signature at the ASIP locus. The ASIP allele in Peacock goats, which we propose to term “peacock” (A), has a quadruplication of the same ~45 kb region having five copies in the A allele. It is additionally flanked by triplicated segments of 27,996 bp and 41,807 bp on the left and right side of the quadruplicated sequence (Fig 3, S4 Fig, S5 Table). The central part of the A allele has exactly the same breakpoints as the A allele suggesting a common origin of A and A. The goat genome reference sequence is derived from a San Clemente goat, which has a similar coat color pattern as bezoars. The genome reference therefore supposedly represents the wildtype allele at the ASIP locus, termed “bezoar” (A) [8]. Bezoars and all other Swiss goat breeds did not show any CNVs at the ASIP locus. In the remaining goat breeds the A, A and A alleles were segregating at low frequencies. These breeds are either not specifically selected for coat color (e.g. African dwarf goat, Beetal) or the effect of ASIP is phenotypically not visible due to epistatic effects of other genes that lead to white spotting phenotypes (e.g. Boer goat, Pak Angora, Barbari).

Quantitative analysis of ASIP mRNA expression

The different CNVs in putative regulatory regions of the ASIP gene prompted us to hypothesize that quantitative differences in ASIP expression may cause the different color patterns. We therefore obtained whole skin samples from five goats carrying different CNV alleles. We isolated RNA from matched pairs of eumelanistic and pheomelanistic skin and performed an RNA-seq experiment to determine the expression level of ASIP mRNA expression. In Grisons Striped goats (A), Chamois Colored goats (A) and Peacock goats (A), the eumelanistic skin showed very low ASIP mRNA expression. The pheomelanistic skin regions in these three goats had at least 10-fold higher ASIP expression than the corresponding eumelanistic samples. The uniformly white (pheomelanistic) Saanen goat (A) had the highest ASIP mRNA expression. There was no obvious correlation between the quantitative ASIP mRNA expression and the intensity of the pheomelanistic pigmentation. The intensely red colored skin from the Chamois Colored goat had an intermediate ASIP mRNA expression compared to the pale white skin from e.g. the Saanen and Peacock goat. Visual inspection of the RNA-seq short-read alignments indicated the utilization of nine different 5’-untranslated exons in nine different transcript isoforms originating from different ASIP alleles (Fig 4; S1 File).
Fig 4

ASIP mRNA expression and identified transcripts in skin.

A Representative photographs of the five sampled goat breeds. The biopsy sites are numbered and indicated by red circles. B Trimmed mean of M (TMM) values of ASIP mRNA expression were determined from RNA-seq data for each sample. The colors of the bars correspond to the pigmentation of the skin samples. Please note that the Valais Blackneck goat (VAG) has a black base color that is independent of the ASIP gene. This goat has a white spotting phenotype and lacks melanocytes in its caudal half. The low ASIP expression in the unpigmented white skin sample of this goat underscores the difference to the pheomelanistic pale white pigmentation in other goats. C ASIP transcript isoforms in pheomelanistic skin samples from goats with different ASIP alleles. Transcript isoforms X1 and X2 correspond to the RefSeq accessions XM_018057735.1 and XM_018057736.1. CNV breakpoints of the A, A, and A alleles are indicated.

ASIP mRNA expression and identified transcripts in skin.

A Representative photographs of the five sampled goat breeds. The biopsy sites are numbered and indicated by red circles. B Trimmed mean of M (TMM) values of ASIP mRNA expression were determined from RNA-seq data for each sample. The colors of the bars correspond to the pigmentation of the skin samples. Please note that the Valais Blackneck goat (VAG) has a black base color that is independent of the ASIP gene. This goat has a white spotting phenotype and lacks melanocytes in its caudal half. The low ASIP expression in the unpigmented white skin sample of this goat underscores the difference to the pheomelanistic pale white pigmentation in other goats. C ASIP transcript isoforms in pheomelanistic skin samples from goats with different ASIP alleles. Transcript isoforms X1 and X2 correspond to the RefSeq accessions XM_018057735.1 and XM_018057736.1. CNV breakpoints of the A, A, and A alleles are indicated.

Discussion

In the present study, we discovered 2,239 loci under selection in 20 diverse goat breeds with various phenotypes and different geographical origins. Our methodology comprised the identification of regions with low heterozygosity from pool-seq data in combination with pairwise FST to bezoars, the wild ancestor of domesticated goats. The pool-seq approach was validated by repeating the analyses with virtually identical results from individually sequenced goats in five breeds. We have to caution that reduced heterozygosity and high FST values may not only result from selection, but may also be due to random demographic changes. The comprehensive catalogue of identified selection signatures can now be used as a starting point to identify causal genetic variants that control a wide variety of breed-defining traits. We particularly focused on selection signatures harboring known coat color genes and identified two CNVs in the 3’-flanking region of KIT in two Pakistani goat breeds, the completely white Pak Angora breed and the white spotted Barbari breed. An association between white coat color and the KIT locus has been reported before in Iranian Markhoz goats, which also represents an Angora type goat [37]. The KIT gene is flanked by several hundred kilobases of non-coding genomic DNA on either side, which are required for the precise regulation of its temporal and spatial expression. The KIT protein is a receptor tyrosine kinase mediating a survival signal for several different cell types including melanoblasts and melanocytes, but also e.g. hematopoetic stem cells, mast cells, interstitial cells of Cajal and spermatogonia [38,39]. KIT is a proto-oncogene and its overexpression may have detrimental consequences such as tumor development [40,41]. Insufficient expression of functional KIT protein in melanoblasts or melanocytes will lead to apoptosis of these cells and results in white spotting phenotypes [15,42-45]. Structural variants at the KIT locus cause several other breed defining coat color phenotypes in domestic animals, such as the dominant white and belt phenotypes in pigs [33,46,47], color-sided and lineback in cattle [48,49], and tobiano spotting in horses [50]. Lineback in cattle and tobiano in horses also involve structural variants in the 3’-flanking region of KIT [49,50]. All these phenotypes are characterized by striking alterations in pigmentation without any deleterious consequences on the other KIT dependent cell types, which would be expected to result in potentially serious health problems. The comparative data from other species strongly suggest that the newly detected caprine KIT CNVs in goats cause the complete lack of skin and hair pigmentation in Pak Angora and the white spotted phenotype in Barbari goats due to altered expression of KIT during fetal development of melanoblasts. The ASIP gene codes for the agouti signaling protein, the competitive inhibitor of melanocortin 1 receptor expressed on melanocytes [20]. Variation in the quantitative amount of ASIP mRNA expression from different promoters is the central mechanism regulating the so-called pigment-type switching [19,21,28]. The regulatory elements of the ASIP gene are most likely contained in its large 5’-flanking region. Differing from the KIT locus, ASIP does not contain a very large 3’-flanking region. Spatially and temporally regulated synthesis of eumelanin and pheomelanin enables mammals to express a wide variety of coat color patterns that are essential for e.g. camouflage or mate recognition in many wild species. ASIP variants cause a wide variety of breed defining coat color phenotypes in domestic animals. ASIP loss of function variants, typically in the coding sequence, are responsible for recessive black in e.g. dogs [51], horses [52], and rabbits [53]. Gain of function variants in ASIP, such as an ectopic overexpression lead to dominant red phenotypes [54]. There are only very few examples of non-coding regulatory variants at the ASIP locus that have been fully characterized at the molecular level. One important example is the mouse black-and tan allele (a), which is caused by a ~6 kb retroviral-like insertion in the region of the hair cycle-specific promoter [25]. In black and tan at mice, hairs are no longer banded and show a uniformly yellow or uniformly black pigmentation. Amplification of the entire ASIP gene has previously been shown to cause the white coat color in many sheep breeds [18] and the white or tan allele (A) in Saanen goats [9]. Our study confirmed the previous results and defined the exact breakpoints of the caprine A triplication that actually comprises not only the ASIP gene, but also the flanking AHCY and ITCH genes. Unexpectedly, we did not observe selection signatures at the window harboring the ASIP gene in the Saanen and Appenzell breeds. Both of these breeds are strictly selected for uniform white (pheomelanistic) coat color and have a very high frequency of the A allele. We think that the lack of a significant −ZHp score at ASIP in these two breeds is caused by at least three factors. For the calculation of the −ZHp score, we considered only SNVs with a maximum coverage of 50x in order to suppress artifacts caused by non-specific mapping of highly repetitive sequences. As the ASIP locus is triplicated in Saanen and Appenzell goats their pools had >50x average coverage at ASIP and almost no SNVs were called in the region. Furthermore, when we inspected the whole genome sequencing data from 24 individual Appenzell goats, we found that only 22 of them were A/A as expected. The remaining two animals had the genotype A/A. As A is the most dominant allele in the series, it apparently has still not reached absolute fixation and other ASIP alleles are segregating at least in the Appenzell goat breed. Finally, the three copies of the triplication had several sequence differences, which were called as variable positons with a 2:1 ratio of the alleles. This provides a biological explanation why the selection signature might be weaker than expected. We observed significant selection signatures at the ASIP gene in five Swiss goat breeds. In these breeds, we identified three additional non-coding CNVs in the 5’-region of ASIP that are likely to cause the Swiss markings (A), badgerface (A) and peacock (A) alleles in goats. We have to caution that the peacock phenotype has not been reported before and may be influenced by additional genes other than ASIP. The corresponding coat color phenotypes are very interesting as they have characteristic patterns of eumelanistic and pheomelanistic pigmentation. Domestic goats with these alleles therefore represent a valuable resource for dissecting the precise function of individual regulatory elements in future studies. The A and A alleles result in almost exactly inverted distributions of eumelanin and pheomelanin. Our RNA-seq data confirm that the different pigmentation patterns are caused by different levels of ASIP mRNA transcription at different body locations. These data also revealed that the different caprine ASIP gene alleles give rise to a higher number of non-coding 5’-exons compared to other mammals [25,26]. Additional data, such as Cage-seq and full-length Iso-seq data will be required for a comprehensive annotation of all possible transcription start sites and splice isoforms in goats. Such data are expected to become available soon with the advances of the FAANG project [55]. In conclusion, we identified 2,239 selection signatures in 20 diverse goat breeds with various coat color phenotypes. These selection signatures revealed six different functionally relevant CNVs underlying breed-defining coat color phenotypes in goats. The results should help to advance our mechanistic understanding of temporal and spatial regulation of transcription.

Materials and methods

Ethics statement

All animal experiments were performed according the local regulations. All animals in this study were examined with the consent of their owners. Sample collection was approved by the “Cantonal Committee For Animal Experiments” (Canton of Bern; permit 75/16).

Animals

For this study, 244 female animals of 20 phenotypical diverse goat breeds and their wild ancestor, the bezoar, were sampled (S1 Table). Ten of the analyzed goat breeds originate from Switzerland, eight from Pakistan and two from Africa. Swiss and African breeds were sampled in Switzerland. Pakistani breeds were sampled in Pakistan. bezoar samples were from zoo animals. For the Swiss goat breeds, we selected representative animals of the breeds and excluded any first-degree relatives. For the other goat breeds, we did not have full pedigree information and used convenience samples. Genomic DNA was isolated from EDTA blood samples.

Whole genome sequencing of pools (pool-seq)

Breed pools were prepared by pooling equimolar amounts of 12 animals per breed (ANG: 10, BEZ: 8 and STG: 10). Illumina TruSeq PCR-free genomic DNA libraries with an insert size of 350 bp were prepared. Each breed pool was sequenced on one lane of an Illumina HiSeq 3000 instrument and on average 300 million 2x150 bp paired-end reads per breed pool were collected (S1 Table).

Mapping and variant calling

Adapter sequences, reads with too many Ns and low quality bases were trimmed or ultimately discarded, if the remaining read length was < 50 bp with fastq-mcf version 1.1.2 (settings: -l 50 -S -q 20). The cleaned reads were mapped to the goat reference genome ARS1 [56] with Burrows-Wheeler Aligner (BWA-MEM) algorithm version 0.7.13 [57] using the “-M” flag to mark shorter alignments as secondary. The resulting mapping files in SAM format were converted into BAM format and coordinate sorted using SAMtools version 1.3 [58]. A local indel realignment was performed using the Genome Analysis Toolkit version 3.7 [59] with default settings. Duplicated reads were marked, using Picard Tools version 2.2.1 (http://broadinstitute.github.io/picard) with default settings for patterned flow cell models. Single nucleotide variants were called using (i) Genome Analysis Toolkit UnifiedGenotyper version 3.7 [59] with the settings: -glm SNP, -stand_call_conf 20, -out_mode EMIT_VARIANTS_ONLY and –ploidy 16/20/24 and (ii) SAMtools mpileup [60] with the settings -q 15, -Q 20, -C 50 and -B. The variants resulting from UnifiedGenotyper were filtered for high quality variants with GATK’s VariantFiltration tool using the generic hard-filtering recommendations available from https://gatkforums.broadinstitute.org/gatk/discussion/6925/understanding-and-adapting-the-generic-hard-filtering-recommendations, while the mpileup files were streamed to the PoPoolation2 version 1.201 pipeline [61]. We used the scripts mpileup2sync.jar with settings --fastq-type sanger and --min-qual 20 and snp-frequency-diff.pl with the settings --min-coverage 15, --max-coverage 50 and --min-count 3. Both pipelines yielded similar numbers of SNV (S2 Table).

Sweep analysis of pool-seq data

A screen for selective sweeps was performed using the SNV file produced for each breed pool individually by the mpileup PoPoolation2 pipeline. At each identified SNV position in the files, we took the numbers of major (nMAJ) and minor (nMIN) allele counts observed in each breed and calculated pooled heterozygosity (Hp) [32] with an in-house written script. The script applies Hp = 2ΣnMAJΣnMIN/(ΣnMAJ+ΣnMIN)2 in a sliding 50% overlapping window approach. We evaluated the results with different window sizes (25 to 300 kb) and decided on 150 kb as the most appropriate size [33, 34]. The obtained Hp values for all 34,382 overlapping 150 kb windows across the whole genome were Z transformed, performing ZHp = (Hp-μHp/σHp). Windows with a −ZHp ≥ 4 were retained as selective windows and adjacent or overlapping selective windows were merged into selection signatures, individually per breed. We annotated the identified selection signatures along with NCBI’s Capra hircus Annotation Release 102 (S3 Table). In addition, to the Hp calculation, we calculated weighted population FST values for each SNV in a 150 kb sliding, 50% overlapping window approach. We applied the FST-sliding.pl script of the Popoolation2 pipeline. The script FST-sliding.pl was run with the settings --min-count 2 --min-coverage 4 --max-coverage 50 --window-size 150000 --step-size 75000 --suppress-noninformative and --pool-size 10:12:12:12:8:12:12:12:12:12:12:12:12:12:12:12:10:12:12:12:12. It used the previously obtained sync file of all pools combined as input and calculated weighted population pairwise Fst values using the standard equation as shown in Hartl and Clark [62]. This resulted in 210 pairwise comparisons, from which we selected the comparisons between the 20 domesticated goat breeds with the bezoar. The obtained FST values were Z transformed, performing ZFST = (FST-μFST/σFST).

Whole genome re-sequencing of individual goats and variant calling

In addition to the pool-seq experiment, we selected 120 goats from five Swiss breeds for individual whole genome re-sequencing. The 24 animals per breed included the 12 goats represented in the breed pools (S1 Table). Illumina TruSeq PCR-free DNA libraries with an insert size of 350 bp were prepared and sequenced on an Illumina NovaSeq 6000 instrument, yielding on average 240 million 2x150 bp paired-end reads per goat (S1 Table). Clean reads were produced by running fastp, version 0.12.5 [63], an ultra-fast all-in-one FASTQ preprocessor capable of trimming polyG tails, a known issue of NovaSeq reads. The cleaned reads were mapped to the ARS1 goat reference genome [56] with Burrows-Wheeler Aligner (BWA-MEM) algorithm using the “-M” flag to mark shorter alignments as secondary. The resulting SAM files were converted into BAM files and coordinate sorted using SAMtools. Duplicated reads were marked, using Picard Tools (http://broadinstitute.github.io/picard) with default settings for patterned flow cell models. The marked BAM files were streamed to GATK’s BaseRecalibrator tool, supported with known SNV provided by the VarGoats consortium (http://www.goatgenome.org/vargoats.html). Subsequently, GATK’s HaplotypeCaller with the settings --emitRefConfidence GVCF and -stand_call_conf 30 was used to call genome-wide variants [59]. The variant files were merged and GATK’s GenotypeGVCFs was used to call variants in the 120 goats combined. As a next step, the called variants were filtered for high quality variants with GATK’s VariantFiltration tool (version 3.8) using the generic hard-filtering recommendations available from https://gatkforums.broadinstitute.org/gatk/discussion/6925/understanding-and-adapting-the-generic-hard-filtering-recommendations. SnpEff [64] and NCBI’s Capra hircus Annotation Release 102 was used to annotate the variants.

Sweep analysis of individual goats

To calculate Hp scores of the individual goats, we selected biallelic, passed SNPs per breed using GATK’s SelectVariants tool (version 3.8), applying --restrictAllelesTo BIALLELIC --selectTypeToInclude SNP --sample_expressions '(APZ/BST/PFA/STG/VAG)' --maxNOCALLnumber 0 --excludeFiltered --excludeNonVariants. This yielded on average 14.9 million SNVs per combined Swiss goat breed, comprising each 24 individually sequenced animals (S2 Table). As a next step, the VCF files containing only biallelic, passed SNPs were transformed into table format using GATK’s VariantsToTable tool. This table contained only information regarding SNP position, reference allele and genotype of the 24 animals. With an in-house written Python script, we converted the table produced with GATK’s VariantsToTable to major and minor alleles and counted the number of observations. This output was then used for Hp calculation as described in sweep analysis of pool-seq data.

Manhattan plots

The −ZHp values were plotted using the function manhattan of the qqman package [65] with R [66]. Each data point represents a 150 kb window. A red horizontal line was drawn representing the chosen significance threshold of −ZHp ≥ 4 (corresponding to 0.8% of all windows).

CNV analyses

Coverage plots for regions of interest were created by calculating the coverage of each base in a defined region of interest using Samtools depth -b. Additionally coverage stats across the whole genomes, including the average coverage were calculated using goleft covstats (https://github.com/brentp/goleft). Taken both results together, we plotted the coverage using R plot type h version 3.4.1 and indicated the average coverage line. Potential CNVs were also visually evaluated by inspection of the short-read alignemnts (bam-files) in the Integrative Genome Viewer (IGV) [67].

Skin biopsies and total RNA extraction

Skin biopsies were taken from five slaughtered animals of different goat breeds (SAN, BST, GFG, PFA and VAG). Two 6 mm punch biopsies were taken from differentially pigmented body areas of each animal (S6 Table). The biopsies were immediately put in RNAlater (Qiagen) for at least 24 h and then frozen at –20°C. Prior to RNA extraction, the skin biopsies were homogenized mechanically with the TissueLyser II device from Qiagen. Total RNA was extracted from the homogenized tissue using the RNeasy Fibrous Tissue Mini Kit (Qiagen) according to the manufacturer’s instructions. RNA quality was assessed with a FragmentAnalyzer (Advanced Analytical) and the concentration was measured using a Qubit Fluorometer (ThermoFisher Scientific).

Whole transcriptome sequencing (RNA-seq)

From each sample, 1 μg of high quality total RNA (RIN >9) was used for library preparation with the Illumina TruSeq Stranded mRNA kit. The 10 libraries were pooled and sequenced on an S1 flow cell with 2x50 bp paired-end sequencing using an Illumina NovaSeq 6000 instrument. On average, 31.5 million paired-end reads per sample were collected (S6 Table). All reads that passed quality control were mapped to the ARS1 goat reference genome assembly using STAR aligner (version 2.6.0c) [68]. The read abundance was calculated using HTseq (version 0.9.1) [69] and a gff3 file obtained from NCBI’s Capra hircus Annotation Release 102. We used the EdgeR package [70] to read the HTseq count data and calculated the log fold changes using the exactTest function where the biological co-efficient of variation (BCV) was set to 0.1. Trimmed mean of M (TMM) values of ASIP mRNA expression were determined for each sample [71].

Distributions of Hp, −ZHp, FST and −ZFST.

(PDF) Click here for additional data file.

−ZHp scores Manhattan plots of pooled sequencing and single sequencing.

(PDF) Click here for additional data file.

Manhattan plots of −ZHp scores and ZFST scores.

(PDF) Click here for additional data file.

Details of the CNV alleles.

(PDF) Click here for additional data file.

FASTA sequences of nine different caprine ASIP transcripts.

(TXT) Click here for additional data file.

Read statistics and accessions of pool-seq and individual WGS data.

(XLSX) Click here for additional data file.

Pool-seq and individual WGS SNV statistics.

(XLSX) Click here for additional data file.

Hp selection signatures per breed pool.

(XLSX) Click here for additional data file.

FST selection signatures per breed pool.

(XLSX) Click here for additional data file.

Details of CNV alleles.

(XLSX) Click here for additional data file.

Descriptive statistics and accessions of RNA-seq datasets.

(XLSX) Click here for additional data file. 26 Sep 2019 Dear Dr Leeb, Thank you very much for submitting your Research Article entitled 'Selection signatures in goats reveal copy number variants underlying breed-defining coat color phenotypes' to PLOS Genetics. The manuscript was fully evaluated at the editorial level and by two peer reviewers. The reviewers appreciated the attention to an important problem, but raised some important concerns that should be feasible to address with additional analyses.  Based on the reviews, we will not be able to accept this version of the manuscript, but are interested in evaluating a revised version that addresses the reviewers' concerns. Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions. Yours sincerely, Gregory S. Barsh Editor-in-Chief PLOS Genetics Gregory Copenhaver Editor-in-Chief PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The authors of this study have performed pooled whole genome sequencing of 20 goat breeds and identified thousands of putative signals of selection based on reduced heterozygosity. Here they report that some of the detected putative selective sweeps involve copy number variants at KIT and ASIP associated with coat color variants. The frequent occurrence of such CNVs is interesting. Further characterization of these CNVs would make the paper even more interesting for the readers of this journal. Specific comments 1. It would be interesting to know to which extent the copy number varies between individuals within breeds. Since the authors report pooled sequencing data it is possible that the copy numbers here are just the average and that variation in copy number may contribute to heterogeneity in phenotypes. The authors have apparently some individual sequence data that can be directly used for this and it would be easy to design TaqMan assays to explore this. 2. It would also be interesting to confirm that these are tandem duplications located at the KIT and ASIP loci and not translocated copies in other places of the genome. This can be confirmed by characterizing the duplication borders either using available short reads or by PCR. This would also allow the identification of the exact duplication breakpoints which can be used to characterize how the initial structural change occurred. Furthermore, this would allow the authors to confirm that those cases where the authors assume that different alleles share a common origin and one allele has evolved further by accumulating additional changes on the same haplotype as suggested for the two KIT alleles (Fig. 2) and for A(pc). 3. Another analysis worth doing is to explore the sequence haplotypes present in the duplicated copies, the two or more copies may be identical in sequence represent different alleles depending on how the duplication event occurred and how much sequence homogenization has taken place between the two copies. For instance, the authors report that there was not any increased homozygosity in white Saanen goats. The explanation could be that the 150 kb triplication shows sequence heterogeneity between copies which are interpreted as heterozygous position in the SNP analysis. The authors write on page 10 that A(pc) is probably derived from the A(b) allele, this can be tested by comparing the sequence identities across the CNV region and by determining the exact duplication breakpoints. 4. As far as I understand the coat color of the Peacock goat has not been reported before. It appears very likely that the CNV reported here is contributing to the phenotype but I assume it is impossible to tell if the phenotype described as A(pc) is solely determined by an ASIP allele. It may be a combined effect of this variant in combination with one or more other loci. I recommend that the authors state that this is a preliminary description of the Apc phenotype. 5. Adalsteinsson et al described 8 ASIP alleles in goats, 5 alleles are characterized here, one of these is new, so the question is whether there are 4 more alleles not yet characterized or were these represented among the 20 different breeds but were not associated with a sweep signal and/or CNV? 6. I disagree with the statement at the bottom of page 14 that white spotting in Pak Angora and Barbari goats are likely due to insufficient expression of KIT. The lesson from the KIT literature in mouse and other species is that KIT loss-of-functions in the heterozygous state gives a mild white spotting and homozygotes are completely white and lethal or sublethal due to pleiotropic effects on hematopoiesis. In contrast, structural changes often cause dominant white spotting (e.g. patch and rump-white in mouse) most likely due to ectopic expression of KIT interfering with KIT normal function disrupting melanocyte migration/maturation. I think it is much more likely that the CNVs reported here alters the KIT expression pattern than leading to just insufficient expression. Minor comments 1. Page 4, middle. To the best of my knowledge the ligand for MC1R is named melanocyte stimulating hormone (MSH). The 1 in melanocortin refers to that it is 1 out of 5 melanocortin receptors. 2. Page 9. There is redundancy between the figure legend and the text. Reviewer #2: The authors used pooled sequencing and windowed selection scans to identify putative selective sweeps between 20 goat populations. This is a very good, comprehensive analysis using an efficient method for selective sweep detection. However, there are some aspects of the approach and presentation that should be addressed. The authors refer to a drop in pooled heterozygosity as a selective signature. This in itself is not, nor is strong FST. When combining the two, preferably coupled to a test for neutrality, one may consider a selective sweep. The authors should apply more formal tests for selection. 2,239 selection signatures, with over 100 per pool is quite high. How do the authors account for demographic factors driving divergence as opposed to positive selection, particularly since you have access to individual data from five breeds? Did they attempt other formal methods such as XP-EHH, XP-CLR, or iHS? It would be informative to include zFST with the zHp data so readers can assess if the authors’ claims hold true. I see them referenced in Figure S1 and S3, but these were not included in the submission received by this reviewer – only supplemental tables. How much total genomic length do the 2,239 signatures represent? Copy number changes distort selective signals since they are collapsed in the assembly, and typically contain interlocus variation. Thus, the presence of a CNV does not mean positive selection, particularly when variants are excluded based on coverage, which would distort Hp. The authors should compare these signals to true selective sweep signals driving non-copy related evolution. Do the copy number estimates hold true for individual sequences? Detecting sweeps is typically done using varying window sizes to test the robustness of the detected signal. Did the authors examine this, and choose 150kb based on the result? How did they determine this window size was appropriate for their data? Identifying CNV is a separate method from identifying selective sweeps. Was there an effort to identify CNV outside sweep regions? This should be included since the authors are trying tie CNV signals to those of selective sweeps. In sum, the discovery of these specific CNV, and their effect on ASIP are convincing and will constitute a significant contribution to the field. However, the conflation of selection and copy number is confusing, and should be clarified further. Without a comprehensive description of CNV and comparison with Hp, FST, and a more formal test for selection, it is difficult to conclude this is selection, rather than simply demography. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: None Reviewer #2: No: I was not provided supplemental figures in the PDF ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No 18 Oct 2019 Submitted filename: Reviewer_response_R1.pdf Click here for additional data file. 23 Nov 2019 Dear Dr Leeb, We are pleased to inform you that your manuscript entitled "Selection signatures in goats reveal copy number variants underlying breed-defining coat color phenotypes" has been editorially accepted for publication in PLOS Genetics. Congratulations! Please note that Reviewer #1 has some final small comments (see below) that you may want to consider as you prepare the final version of your manuscript for the production team (the editorial team will not need to reevaluate). Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional accept, but your manuscript will not be scheduled for publication until the required changes have been made. Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. If you have a press-related query, or would like to know about one way to make your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics! Yours sincerely, Gregory P. Copenhaver Editor-in-Chief PLOS Genetics www.plosgenetics.org Twitter: @PLOSGenetics ---------------------------------------------------- Comments from the reviewers (if applicable): Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The authors have responded adequately to my previous comments and I have only some minor comments that they may consider. 1. Abstract. The authors may change the text to “we identified 2,239 putative selection signatures” to acknowledge the comment from reviewer 2 that a fraction of these are likely to be false positives caused by for instance genetic drift during breed formation. (The same caution may be added to the first sentence of the Discussion). They can also change to “we found strong selection signatures in a region harboring KIT” indicating that this signal stands out among the 2,239 putative signals. 2. Page 4. The detection of loci under selection in Atlantic herring was not based on loss of heterozygosity but based on allele frequency differences between populations. However, some of the loci under selection showed clear sweep signals with loss of heterozygosity. 3. Page 6, bottom line. Correct typo for TYRP1 Reviewer #2: Though there are other analyses I contend SHOULD be done, I acknowledge that the findings are significant and warrants publication. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No ---------------------------------------------------- Data Deposition If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website. The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-19-01351R1 More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support. Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present. ---------------------------------------------------- Press Queries If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org. 10 Dec 2019 PGENETICS-D-19-01351R1 Selection signatures in goats reveal copy number variants underlying breed-defining coat color phenotypes Dear Dr Leeb, We are pleased to inform you that your manuscript entitled "Selection signatures in goats reveal copy number variants underlying breed-defining coat color phenotypes" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work! With kind regards, Nicholas White PLOS Genetics On behalf of: The PLOS Genetics Team Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom plosgenetics@plos.org | +44 (0) 1223-442823 plosgenetics.org | Twitter: @PLOSGenetics
  67 in total

1.  Molecular characterization of the mouse agouti locus.

Authors:  S J Bultman; E J Michaud; R P Woychik
Journal:  Cell       Date:  1992-12-24       Impact factor: 41.582

2.  Association of an Agouti allele with fawn or sable coat color in domestic dogs.

Authors:  Tom G Berryere; Julie A Kerns; Gregory S Barsh; Sheila M Schmutz
Journal:  Mamm Genome       Date:  2005-04       Impact factor: 2.957

3.  Functional Annotation of Animal Genomes (FAANG): Current Achievements and Roadmap.

Authors:  Elisabetta Giuffra; Christopher K Tuggle
Journal:  Annu Rev Anim Biosci       Date:  2018-11-14       Impact factor: 8.923

4.  The initial domestication of goats (Capra hircus) in the Zagros mountains 10,000 years ago.

Authors:  M A Zeder; B Hesse
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

5.  Whole-genome resequencing reveals loci under selection during chicken domestication.

Authors:  Carl-Johan Rubin; Michael C Zody; Jonas Eriksson; Jennifer R S Meadows; Ellen Sherwood; Matthew T Webster; Lin Jiang; Max Ingman; Ted Sharpe; Sojeong Ka; Finn Hallböök; Francois Besnier; Orjan Carlborg; Bertrand Bed'hom; Michèle Tixier-Boichard; Per Jensen; Paul Siegel; Kerstin Lindblad-Toh; Leif Andersson
Journal:  Nature       Date:  2010-03-10       Impact factor: 49.962

6.  Agouti protein inhibits the production of eumelanin and phaeomelanin in the presence and absence of alpha-melanocyte stimulating hormone.

Authors:  A Graham; K Wakamatsu; G Hunt; S Ito; A J Thody
Journal:  Pigment Cell Res       Date:  1997-10

7.  An iterative genetic and dynamical modelling approach identifies novel features of the gene regulatory network underlying melanocyte development.

Authors:  Emma R Greenhill; Andrea Rocco; Laura Vibert; Masataka Nikaido; Robert N Kelsh
Journal:  PLoS Genet       Date:  2011-09-01       Impact factor: 5.917

8.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

9.  fastp: an ultra-fast all-in-one FASTQ preprocessor.

Authors:  Shifu Chen; Yanqing Zhou; Yaru Chen; Jia Gu
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

10.  Genome-wide SNP profiling of worldwide goat populations reveals strong partitioning of diversity and highlights post-domestication migration routes.

Authors:  Licia Colli; Marco Milanesi; Andrea Talenti; Francesca Bertolini; Minhui Chen; Alessandra Crisà; Kevin Gerard Daly; Marcello Del Corvo; Bernt Guldbrandtsen; Johannes A Lenstra; Benjamin D Rosen; Elia Vajana; Gennaro Catillo; Stéphane Joost; Ezequiel Luis Nicolazzi; Estelle Rochat; Max F Rothschild; Bertrand Servin; Tad S Sonstegard; Roberto Steri; Curtis P Van Tassell; Paolo Ajmone-Marsan; Paola Crepaldi; Alessandra Stella
Journal:  Genet Sel Evol       Date:  2018-11-19       Impact factor: 4.297

View more
  15 in total

1.  Selection signatures and formation of the Samosir goat breed through the cultures of the Batak Toba Tribe in Samosir Island, Indonesia.

Authors:  Suhendra Pakpahan; Rini Widayanti; Wayan T Artama
Journal:  Vet World       Date:  2022-04-24

2.  Detection of whole genome selection signatures of Pakistani Teddy goat.

Authors:  Rashid Saif; Jan Henkel; Tania Mahmood; Aniqa Ejaz; Fraz Ahmad; Saeeda Zia
Journal:  Mol Biol Rep       Date:  2021-10-05       Impact factor: 2.316

3.  Atypical Genotypes for Canine Agouti Signaling Protein Suggest Novel Chromosomal Rearrangement.

Authors:  Dayna L Dreger; Heidi Anderson; Jonas Donner; Jessica A Clark; Arlene Dykstra; Angela M Hughes; Kari J Ekenstedt
Journal:  Genes (Basel)       Date:  2020-07-03       Impact factor: 4.096

4.  The LCORL Locus is under Selection in Large-Sized Pakistani Goat Breeds.

Authors:  Rashid Saif; Jan Henkel; Vidhya Jagannathan; Cord Drögemüller; Christine Flury; Tosso Leeb
Journal:  Genes (Basel)       Date:  2020-02-05       Impact factor: 4.096

5.  Variants at the ASIP locus contribute to coat color darkening in Nellore cattle.

Authors:  Beatriz B Trigo; Adam T H Utsunomiya; Alvaro A A D Fortunato; Marco Milanesi; Rafaela B P Torrecilha; Harrison Lamb; Loan Nguyen; Elizabeth M Ross; Ben Hayes; Rômulo C M Padula; Thayla S Sussai; Ludmilla B Zavarez; Rafael S Cipriano; Maria M T Caminhas; Flavia L Lopes; Cassiano Pelle; Tosso Leeb; Danika Bannasch; Derek Bickhart; Timothy P L Smith; Tad S Sonstegard; José F Garcia; Yuri T Utsunomiya
Journal:  Genet Sel Evol       Date:  2021-04-28       Impact factor: 4.297

6.  Runs of homozygosity in Swiss goats reveal genetic changes associated with domestication and modern selection.

Authors:  Heidi Signer-Hasler; Jan Henkel; Tosso Leeb; Christine Flury; Erika Bangerter; Zafer Bulut; Cord Drögemüller
Journal:  Genet Sel Evol       Date:  2022-01-24       Impact factor: 4.297

Review 7.  DNA copy number variation: Main characteristics, evolutionary significance, and pathological aspects.

Authors:  Ondrej Pös; Jan Radvanszky; Gergely Buglyó; Zuzana Pös; Diana Rusnakova; Bálint Nagy; Tomas Szemes
Journal:  Biomed J       Date:  2021-02-13       Impact factor: 4.910

8.  A genome-wide analysis of copy number variation in Murciano-Granadina goats.

Authors:  Dailu Guan; Amparo Martínez; Anna Castelló; Vincenzo Landi; María Gracia Luigi-Sierra; Javier Fernández-Álvarez; Betlem Cabrera; Juan Vicente Delgado; Xavier Such; Jordi Jordana; Marcel Amills
Journal:  Genet Sel Evol       Date:  2020-08-08       Impact factor: 4.297

9.  Genomic Analysis Revealed a Convergent Evolution of LINE-1 in Coat Color: A Case Study in Water Buffaloes (Bubalus bubalis).

Authors:  Dong Liang; Pengju Zhao; Jingfang Si; Lingzhao Fang; Erola Pairo-Castineira; Xiaoxiang Hu; Qing Xu; Yali Hou; Yu Gong; Zhengwen Liang; Bing Tian; Huaming Mao; Marnoch Yindee; Md Omar Faruque; Siton Kongvongxay; Souksamlane Khamphoumee; George E Liu; Dong-Dong Wu; James Stuart F Barker; Jianlin Han; Yi Zhang
Journal:  Mol Biol Evol       Date:  2021-03-09       Impact factor: 16.240

10.  Dog colour patterns explained by modular promoters of ancient canid origin.

Authors:  Danika L Bannasch; Christopher B Kaelin; Gregory S Barsh; Tosso Leeb; Anna Letko; Robert Loechel; Petra Hug; Vidhya Jagannathan; Jan Henkel; Petra Roosje; Marjo K Hytönen; Hannes Lohi; Meharji Arumilli; Katie M Minor; James R Mickelson; Cord Drögemüller
Journal:  Nat Ecol Evol       Date:  2021-08-12       Impact factor: 15.460

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.