| Literature DB >> 31105741 |
Manuel More1, Gustavo Gutiérrez1, Max Rothschild2, Francesca Bertolini3, F Abel Ponce de León4.
Abstract
Alpacas are one of four South American Camelid species living in the highlands of the Andes. Production of alpaca fiber contributes to the economy of the region and the livelihood of many rural families. Fiber quantity and quality are important and in need of a modern breeding program based on genomic selection to accelerate genetic gain. To achieve this is necessary to discover enough molecular markers, single nucleotide polymorphisms (SNPs) in particular, to provide genome coverage and facilitate genome wide association studies to fiber production characteristics. The aim of this study was to discover alpaca SNPs by genotyping forty alpaca DNA samples using the BovineHD Genotyping Beadchip. Data analysis was performed with GenomeStudio (Illumina) software. Because different filters and thresholds are reported in the literature we investigated the effects of no-call threshold (≥0.05, ≥0.15, and ≥0.25) and call frequency (≥0.9 and =1.0) in identifying positive SNPs. Average GC Scores, calculated as the average of the 10% and 50% GenCall scores for each SNP (≥0.70) and the GenTrain score ≥ 0.25 parameters were applied to all comparisons. SNPs with minor allele frequency (MAF) ≥ 0.05 or ≥ 0.01 were retained. Since detection of SNPs is based on the stable binding of oligonucleotide probes to the target DNA immediately adjacent to the variant nucleotide, all positive SNP flanking sequences showing perfect alignments between the bovine and alpaca genomes for the first 21 or 26 nucleotides flanking the variant nucleotide at either side were selected. Only SNPs localized in one scaffold were assumed unique. Unique SNPs identified in both reference genomes were kept and mapped on the Vicugna_pacos 2.0.2 genome. The effects of the no-call threshold ≥ 0.25, call frequency = 1 and average GC ≥ 0.7 were meaningful and identified 6756 SNPs of which 400 were unique and polymorphic (MAF ≥ 0.01). Assignment to alpaca chromosomes was possible for 292 SNPs. Likewise, 209 SNPs were localized in 202 alpaca gene loci and 29 of these share the same loci with the dromedary. Interestingly, 69 of 400 alpaca SNPs have 100% similarity with dromedary.Entities:
Keywords: SNP; alpaca; bovine; genotyping; polymorphic
Year: 2019 PMID: 31105741 PMCID: PMC6492526 DOI: 10.3389/fgene.2019.00361
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Single nucleotide polymorphisms (SNPs) from the BovineHD Genotyping Beadchip that cross-amplify in alpacas. Genotypes are called for each sample (dot) by their signal intensity (Norm R, Y-axis) and Allele Frequency (Norm Theta, X-axis) relative to canonical cluster positions (dark shading) for a given SNP marker (red = AA, purple = AB, blue = BB). Black points represent no call samples. Polymorphic SNP genotype cluster patterns (A–C) selected with Method VI are presented. Difficult to interpret genotype cluster patterns (D,E) that were not retained are also shown and a monomorphic genotype cluster pattern (F) present among the 6756 positive SNPs.
Parameter values used for each method of analysis.
| Parameter | Method I | Method II | Method III | Method IV | Method V | Method VI | Method VII |
|---|---|---|---|---|---|---|---|
| No-call threshold | ≥0.05 | ≥0.05 | ≥0.15 | ≥0.15 | ≥0.25 | ≥0.25 | ≥0.25 |
| Call frequency | ≥0.9 | 1 | ≥0.9 | 1 | ≥0.9 | 1 | 1 |
| Average GC | ≥0.7 | ≥0.7 | ≥0.7 | ≥0.7 | ≥0.7 | ≥0.7 | * |
| GenTrain score | ≥0.25 | ≥0.25 | ≥0.25 | ≥0.25 | ≥0.25 | ≥0.25 | ≥0.25 |
Number of positive SNPs by method.
| Parameter of analysis | Method I | Method II | Method III | Method IV | Method V | Method VI | Method VII |
|---|---|---|---|---|---|---|---|
| No-call threshold ≥0.05 | No-call threshold ≥0.15 | No-call threshold ≥0.25 | |||||
| Call freq ≥ 0.9 | Call freq = 1 | Call freq ≥ 0.9 | Call freq = 1 | Call freq ≥ 0.9 | Call freq = 1 | Call freq = 1 | |
| Call frequency | 530106 | 111471 | 368001 | 39279 | 262506 | 23429 | 23429 |
| Average GC (≥0.7) | 22437 | 11364 | 24979 | 8232 | 25609 | 6756 | ∗ |
| GenTrainScore (≥0.25) | 22437 | 11364 | 24979 | 8232 | 25609 | 6756 | 23429 |
| MAF (≥0.01) | 22435 | 11364 | 24962 | 8232 | 25563 | 6756 | 23427 |
| MAF (≥0.05) | 1970 | 898 | 1724 | 430 | 1467 | 274 | 2044 |
Number of positive bovine SNPs aligned to the alpaca reference genomes.
| Reference genome | Method I | Method II | Method III | Method IV | Method V | Method VI | Method VII | Method VI |
|---|---|---|---|---|---|---|---|---|
| MAF ≥ 0.05 | MAF ≥ 0.01 | |||||||
| Aligned to more than one scaffold | 10 | 5 | 7 | 3 | 7 | 1 | 9 | 33 |
| Unique SNPs | 94 | 45 | 72 | 23 | 67 | 18 | 154 | 467 |
| Aligned to more than one scaffold | 10 | 6 | 6 | 3 | 6 | 1 | 11 | 30 |
| Unique SNPs | 95 | 41 | 72 | 21 | 68 | 17 | 146 | 466 |
| 79 | 36 | 62 | 18 | 59 | 16 | 129 | 400 | |
| 57 | 29 | 49 | 15 | 45 | 13 | 98 | 292 |
FIGURE 2Distribution of positive and unique SNPs in alpaca chromosomes by predicted localization.