Literature DB >> 29187988

Landscape genomic approach to detect selection signatures in locally adapted Brazilian swine genetic groups.

Robson Jose Cesconeto1, Stéphane Joost2, Concepta Margaret McManus3, Samuel Rezende Paiva4, Jaime Araujo Cobuci1, Jose Braccini1.   

Abstract

Samples of 191 animals from 18 different Brazilian locally adapted swine genetic groups were genotyped using Illumina Porcine SNP60 BeadChip in order to identify selection signatures related to the monthly variation of Brazilian environmental variables. Using BayeScan software, 71 SNP markers were identified as FST outliers and 60 genotypes (58 markers) were found by Samβada software in 371 logistic models correlated with 112 environmental variables. Five markers were identified in both methods, with a Kappa value of 0.073 (95% CI: 0.011-0.134). The frequency of these markers indicated a clear north-south country division that reflects Brazilian environmental differences in temperature, solar radiation, and precipitation. Global spatial territory correlation for environmental variables corroborates this finding (average Moran's I = 0.89, range from 0.55 to 0.97). The distribution of alleles over the territory was not strongly correlated with the breed/genetic groups. These results are congruent with previous mtDNA studies and should be used to direct germplasm collection for the National gene bank.

Entities:  

Keywords:  Sus scrofa; animal genetic resources; conservation genetics; molecular markers; population structure

Year:  2017        PMID: 29187988      PMCID: PMC5696410          DOI: 10.1002/ece3.3323

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


INTRODUCTION

After the introduction to Brazil of Portuguese, Spanish, and Asian swine breeds in early 1500s, these animals spread throughout the Brazilian territory. Through equilibrium between evolutionary (crossbreeding, artificial selection, and mainly genetic drift) and local environment forces have originated several locally adapted swine breeds that one which have had sufficient time in the county for to be genetically adapted to the environment (Galal & Boyazoglu, 2001). Locally adapted swine breeds (e.g., Piau, Canastra) were used as an important source of meat and fat by farmers and the general population (Mariante, Castro, Albuquerque, Paiva, & Germano, 2003) until the 1970s, when changes in the market led to the introduction of North American (e.g., Duroc) and European (e.g., Landrace, Large White) breeds, specialized in meat production. At present, locally adapted breeds are present only on small farms, with low input levels. These local breeds, results of a sum of economic, social, historical, and cultural factors, are a reservoir of genetic variability (Giovambattista et al. 2001) and, principally, source of traits selected and fixed mainly by the influence of the environment (Hall & Ruane, 1993). Mirkena et al. (2010) discuss genetic influence on disease tolerance/resistance in small ruminants and, in addition to other factors, cite advantages in increased fitness of locally adapted breeds, while Osman et al. ( 2017) studied adaptability and suitability advantages of Egyptian local breeds and Traspov et al. (2016) highlighted the adaptation to local climate, feed, pathogens, and human preferences of Russian Belorussian, Kazakhstan, and Ukraine pig breeds. This adaptation to the environment can be evaluated by genomic analyses of areas of the genome that have been, or still are, under selection (Luikart, England, Tallmon, Jordan, & Taberlet, 2003; Storz, 2005; Vitalis, Gautier, Dawson, & Beaumont, 2014). These can be estimated using single nucleotide polymorphisms (SNPs) spread throughout the genome by theoretical populational FST outliers approach that are assumed to be signatures of natural selection (Lewontin & Krakauer 1973; Luikart et al., 2003; Joost et al., 2007; Lotterhos & Whitlock 2014). Signatures of selection were found by Ottoni et al. (2013) in pigs from archaeological sites, helping to understand some events of pig domestication in Western Eurasia, introgression of Asian genes in European pigs by human selection (Bosse, Megens, Frantz et al., 2014), and enables the identification of introgression among different breeds (Bosse, Megens, Madsen et al., 2014). These selection signatures can help us understand the complex relation between adapted swine genetic groups and the environment, as well as the process of adaptation of swine over the Brazilian territory and to overcome the challenges in swine management in a country with continental dimensions and different climatic conditions. In a constantly changing world, the identification of those signatures may be the key to promote more sustainable animal production, improving gains in productivity and welfare, as well as decreasing sanitary expenses with medication and management (Mirkena et al., 2010; Shabtay, 2015). They can also be used for branding of particular regional products (Herrero‐Medrano et al., 2013). In addition, these results might be an auxiliary tool to help the enrichment of National gene banks (Paiva, McManus, & Blackburn, 2016) and conservation programs as suggested by Nuijten et al. (2016) or Bosse et al. (2015), who show that management strategies to preserve the variation in managed populations can benefit by whole‐genome, high‐density, marker‐assisted methods. The hypothesis of this study is that monthly variation from Brazilian environment by the years, as was seen with Vietinamese (Pham et al., 2014), American village pigs (Burgos‐Paz et al., 2012), and Chinese sheep (Yuan et al., 2017), influenced successful adaptation of swine in the Brazilian territory and left detectable signatures of natural selection. Understanding the influence of the environment on the process of allele selection can be useful to improve gains on small farms, preserve genetic variation from herds, and adaptation to world climatic changes. To test this hypothesis, a medium SNP chip array of locally adapted swine breeds population, with animals sampled from over the main Brazilian regions, was used to identify selection signatures through FST Outliers approach.

MATERIALS AND METHODS

Sampling

The Brazilian territory is divided into five regions (each further divided into states) based on natural, cultural, social, and economic features. Despite the high mobility of swine, free movement of animals between states and regions is restricted by legal and sanity factors (Classical Swine Fever, African Swine Fever, Foot‐and‐Mouth disease, Aujeszky's disease). So, to capture high spatial representation of the environment and genetic territorial dispersion of the swine breeds over the Brazilian territory, the sample selection (Table 1 and S1) was structured with at least one sample from each Political Region (Figure S1). A total of 191 samples of nonrelated animals from 18 different swine genetic groups (13 locally adapted Brazilian swine genetic groups, four commercial or global breeds, and one group formed by crossbred animals) were randomly selected. All samples used in this experiment are deposited in Embrapa's Gene Bank (http://aleloanimal.cenargen.embrapa.br) located at Embrapa Genetic Resources and Biotechnology Center, Brasilia, DF. The samples from locally adapted Brazilian swine genetic groups were classified in accordance with a phenotypic description suggested by Viana (1956), Germano, Albuquerque, and Castro ( 2002), and Mariante and Cavalcante (2006).
Table 1

Basic descriptive statistics of sampling (location and environment) of Brazilian locally adapted swine breeds

RegionStateSamplesBreedSamplesElevationTemperatureSolar RadiationAridityPETannualPrecipitation
%%Average SD Average SD Average SD Average SD Average SD Average SD
NPA2916.02SMA2916.023.170.89273.1432.8914.770.6315619.8318.681573.210.42204.78150.62
NEBA31.66SPI31.662600244.0346.6514.491.86444301690062.5834.05
PE6033.15SBA105.52387.9134.81240.7945.2514.671.354168.31284.791638123.6455.9448.63
SCR52.76550278.89236.8847.1214.61.555007.62562.541654.670.6367.7767.29
SCT84.42475.2584.33242.4246.3614.671.323983611.411692.6385.1756.0251.44
SCTA84.42616.25187.32234.6146.6614.631.454705.381975.411650.75107.8263.9163.75
SDL21.16149.5252.9932.4714.671.3512692.5313.251446.531.82161.3697.68
SME52.76261144.31243.3639.714.661.365699.82061.741543.8200.4970.8750.84
SMO84.42481.8877.98241.446.1514.671.314094.5562.621685.7584.9657.3652.32
SNI73.87511.5791.09239.6845.7914.671.324034.29613.291671.5772.3856.0453.19
SPI63.31275225.33242.7639.7914.671.346644.173691.361528.17133.181.9971.27
SUR10.558258.1429.9314.671.39130251395151.58104.73
PB21.1SCRI21.15570217.5839.314.71.25888201467010860.93
COGO2714.92SCB31.6611160208.547.4614.312.2410976015360140.67110.26
SCR31.6611160208.547.4614.312.2410976015360140.67110.26
SDL31.669590216.3148.8514.322.229133015810120.4296.74
SLW10.55586182.9257.1513.363.57130951399152.8325.35
SMO10.551116208.547.914.312.31109761536140.67113.56
SMT52.761103.890.73209.7747.414.312.2210227.6839.621540.628.64131.08102.11
SNI31.66895.33258.45221.3649.514.431.988112.333487.86160990.32107.1197.53
SPI63.311038.1750.71212.7247.7414.312.2110008.51177.421554.8326.83129.63103.03
SRP21.111160208.547.5714.312.2510976015360140.67111.06
MS189.94SMT189.9498.835.75258.4548.7414.12.557219.5693.151696.8311.77102.161.3
MT31.66SMT31.66122.672.89261.8651.0514.282.286932.6711.551785.670.58103.1768.62
SDMG31.66SPI10.55658205.6453.7213.962.8580791526102.4281.82
SPN21.16950202.9453.3313.962.798197015200103.7580.55
SRS147.73SCR31.66140.6730.02176.1756.4212.873.9110935.6785.4512715.2115.8319.53
SME10.55155175.5856.8812.874.02109531269115.9220.01
SMO42.21139.531176.2456.3612.873.8910927.7550.51271.254.5115.8319.48
SNI52.7683.89.09184.1152.5612.933.8311237.6216.231244.818.57116.5517.37
SPI10.5589185.2552.1212.953.96113511236116.9217.31
SC2212.15SCB10.55401181.5847.8113.273.66126481216128.1734.67
SDC42.215860182.9256.5513.363.4513095013990152.8324.53
SDL73.87433.86103.93184.5252.0913.33.4912064.29704.111324.7150.75133.9633.34
SLW21.15860182.9256.7513.363.4913095013990152.8324.79
SMO84.42436182.4185.0651.9313.323.4812811.5316.361305.5100139.4632.02
Brazil181100181100398.28352.05231.2555.5614.252.39233.54201.141550.34156.12116.4197.42

N, number of samples; SD, standard deviation; PETannual, annual potential evapotranspiration.

Region: N, north; NE, northeast; CO, midwest, SD, southeast; S, South; State: PA, Pará; BA, Bahia; PE, Pernambuco; PB, Paraíba; GO, Goiás; MS, Mato Grosso do Sul; MT, Mato Grosso; MG, Minas Gerais; RS, Rio Grande do Sul; SC, Santa Catarina; Breed: SBA, Baé; SCB, Casco de Burro; SCR, Caruncho; SMEc, Crioulo; SCT, Canastra; SCTA, Canastrão; SDC, Duroc; SLD, Landrace; SLW, Large Withe; SMA, Marajó; SME, Mestiço; SMO, Moura; SMT, Monteiro; SNI, Nilo; SPI, Piau; SPN, Pietran; SRP, Rabo de Peixe.

Basic descriptive statistics of sampling (location and environment) of Brazilian locally adapted swine breeds N, number of samples; SD, standard deviation; PETannual, annual potential evapotranspiration. Region: N, north; NE, northeast; CO, midwest, SD, southeast; S, South; State: PA, Pará; BA, Bahia; PE, Pernambuco; PB, Paraíba; GO, Goiás; MS, Mato Grosso do Sul; MT, Mato Grosso; MG, Minas Gerais; RS, Rio Grande do Sul; SC, Santa Catarina; Breed: SBA, Baé; SCB, Casco de Burro; SCR, Caruncho; SMEc, Crioulo; SCT, Canastra; SCTA, Canastrão; SDC, Duroc; SLD, Landrace; SLW, Large Withe; SMA, Marajó; SME, Mestiço; SMO, Moura; SMT, Monteiro; SNI, Nilo; SPI, Piau; SPN, Pietran; SRP, Rabo de Peixe.

Genotyping and quality control

The DNA samples were genotyped with the Illumina Porcine SNP60 BeadChip v2. To eliminate SNPs with low‐quality identification, monomorphic markers, SNPs recently fixed in the populations and lower informative samples that could generate as false‐positive selection signatures as bias, quality control of raw data (191 samples and 61,565 SNP markers) was performed with SNP & Variation Suite v8.x (Golden Helix, Inc., Bozeman, MT, USA 2015). We chose parameter thresholds as reported in literature (Bosse, Megens, Madsen et al., 2014; Burgos‐Paz et al., 2012; Traspov et al., 2016) that eliminated low‐quality SNPs/Samples but preserve a maximum number of samples: minimal individual genotype call rate of 90% that excluded 11 samples; 95% call rate and 0.05% minor allele frequency (MAF) for the markers when 21,605 SNPs were excluded. Additional linkage disequilibrium (LD) pruning was performed using a window size = 50, window increment = 5, and r2 threshold = 0.05, which eliminated a further 11,646 SNPs. The final data had 28,860 SNP markers with an SNP density of 1/87,026 kb.

Environmental variables

The environmental variables from the Brazilian territory were obtained at the World ClimProject (http://www.worldclim.org/), GTOPO30 (https://lta.cr.usgs.gov/GTOPO30), and Harvest Choice (http://harvestchoice.org/) and 30‐s geographical information system (GIS) layer using Qgis v2.6 (QGIS Development Team 2009). Monthly maximum, average and minimum temperature, annual average and median for maximum, average and minimum temperature, seasonal averages and medians for maximum, average and minimum temperature, monthly solar radiation, annual average and median solar radiation, seasonal averages and medians solar radiation, monthly precipitation, annual average and median precipitation seasonal averages and medians precipitation, 19 bioclimatic variables (BIO1‐19), elevation, PETannual (annual potential evapotranspiration), and Aridity (ratio of precipitation to PET) were obtained for each sample from this layer (Table S2). As monthly and season variation in climatic condition has interfere in reproductive performance (De Rensis, Ziecik, & Kirkwood, 2017; Petrocelli, Batista, & Gosálvez, 2015; Prunier, Quesnel, de Bragança, & Kermabon, 1996), pulmonary disease (Eze et al., 2015; Gao, Xiao, Qin, Cao, & Wang, 2016) piglet early survey (Berger et al., 2007; Iida & Koketsu, 2014) in all stages of life (Ross et al., 2015; Wildt, Riegle, & Dukelow, 1975), including intrauterine development (Johnson et al., 2013, 2015a), we used the environmental variables in an exploratory approach, to identify the influence of each explanatory variable on the allele frequencies.

Relationship between samples

The individual and populational levels of expected heterozygosity (He) and observed heterozygosity (Ho) were computed by the SNP & Variation Suite v8.x (Golden Helix, Inc., Bozeman, MT, USA 2015) and used to calculate the inbreeding coefficient (F IS), to identify sources of genetic variance by means of analysis of molecular variance (AMOVA) between and within the geopolitical groups (samples within region), as well as between native and commercial breeds (Tables S3 and S4), performed in Arlequin V 3.5.2.2 (Excoffier, Laval, & Stefan, 2005). Population structure analysis was performed in STRUCTURE (Pritchard, Stephens, & Donnelly, 2000), and a number of groups identified by second‐order rate of change of the likelihood (∆K) (Evanno, Regnaut, & Goudet, 2005). The discontinuity of genetic composition was evaluated using a Mantel test between the Euclidian geographical distance and the genetic distance F ST/(1−F ST) was calculated in PASSaGE (Rosenberg & Anderson, 2011).

Signatures of selection and outlier detection

Loci with high or low allelic differentiation in relation to the expected neutrality, from the 28,860 SNPs in final data, were used as an indication of selection (Hoffmann & Willi, 2008) and were tested by two different methodologies of outlier identification. BayeScan software V 2.1 (Foll & Gaggiotti, 2008) used a Bayesian approach via Markov Chain Monte Carlo (MCMC), assuming a prior Dirichlet distribution of alleles within populations and a hierarchical Bayesian model. The program calculates posterior odds, from the posterior probability of the models, with and without selection on a locus, using the proportion of loci with a strong increase in F ST relative to other loci among the MCMC outputs of its simulations (Beaumont & Balding, 2004). The software was set up with 5,000 burn‐in interactions, followed by 10,000 interactions with thinning interval of 10. Convergence was verified using CODA package for R (Plummer, Best, Cowles, & Vines, 2006) with critical values of −1.96 > z > +1.96. A second analysis was performed using the software Samβada (Joost et al., 2007; Stucki et al., 2014) that used logistic regression models to determine the probability of allele presence/absence in a specific environment. The models were considered significant when the G Score and Wald Score were significant at α = 0.01 threshold with a Bonferroni correction. The G Score can be defined as the ratio between maximum log likelihood of model with the presence of the independent variable and the maximum log likelihood of model without independent variable, or as the independent variable affects in the log likelihood model. The Wald Score tests if goodness of fit is affected when the independent variable is removed from the model. Using the FREQ procedure (Proc FREQ) of SAS v9.3 (SAS Institute Inc. 2011), the agreement between the two methods was evaluated through the Kappa index. The Kappa index is a measure of interrater agreement, between two or more methods: When the observed agreement exceeds chance agreement, kappa is positive, with its magnitude reflecting the strength of agreement. Gene annotations within candidate regions were obtained using the data provided by Ensembl (Cunningham et al., 2015) and NCBI (http://www.ncbi.nlm.nih.gov). To explore the linkage disequilibrium (LD) of selection signatures detected with other FST outliers and with nearby genes, we calculated the LD from these markers using Plink software (Purcell, 2014). To measure the degree of spatial association for marker signaled as FST outliers by both methods, the Global spatial autocorrelation (Moran's I) was calculated. Moran's I describes the autocorrelation between the values of a variable in a certain location with the values of this same variable in a neighboring location (Druck, Carvalho, Câmara, & Monteiro, 2004), with null hypothesis being that there is no spatial clustering.

RESULTS

Molecular variance analysis among states grouped into regions (Table S3) showed 93.35% of the genetic variance was contained within states and only 0.87% among regions. The genetic variance among a group of animals from commercial breeds and a group of locally adapted genetic groups (Table S4) showed individual variance (81.85%) was larger than variance between groups (3.66%) or from individuals within groups (8.27%). The F IS (Table S5) varied from −0051 to 0.642 (within breeds) showing that inbreeding levels of the naturalized breeds vary considerably within subpopulations, and was consistent with populational structure found in STRUCTURE analyses. Among locally adapted breeds, the Monteiro breed showed the highest average values of F IS (μ = 0.289), which indicates low genetic differentiation between individuals within the herds of this breed. Within regions, the populations from the State of Mato Grosso do Sul showed higher inbreeding levels (μ = 0.296), while the animals from the State of Paraiba revealed a tendency toward excess of heterozygotes (μ = −0.001). The theoretical number of actual populations using the genetic frequencies of the loci to infer the influence of the genetic groups on the composition and number of populations (Figure S2) was determined. The individuals were adequately allocated inside their original population by means of a K equal to the number of sampled breeds (K = 18). But, using the second‐order rate of change of the likelihood (ΔK), the number of groups was reduced to seven (k = 7). BayeScan software identified 71 SNP markers as FST outliers, while Samβada software identified 60 genotypes (from 58 SNP markers) in 371 univariate logistic models, using 112 environmental variables. No multivariate model was significant at α=0.05. The markers MARC0021990; ASGA0033717; MARC0007678 were responsible for 42% of all models generated by Samβada (Figure S3). Five markers, associated with different environmental conditions (Table 2), were found using both methods (ALGA0032795; ALGA0054315; ASGA0026250; ASGA0029202; BGIS0004952) with Kappa 0.073 (95% CI: 0.011–0.134) located in regions of the genome with the presence of several assumed FST outliers (Figure 1). The linkage disequilibrium (Figure 2) of markers identified as signatures of selection suggested they could be associated with nearby genes (Table 3) responsible for intracellular transport, immune response, cell respiration, and related to the circulatory system, probably as a physiological response to cellular stress (Table 3). The marker ASGA0029202 was associated with precipitation and thermal amplitude and is near (±0.2 Mb) the CDH2 gene that has an effect on the formation of blood vessels, while the marker MARC0021990 (responsible for 20% of the models) was close to (±0.27 Mb) the gene CYP7B1 which is involved, among other functions, with cofactor HEME, suggesting indirect evidence of importance of circulatory system on genetic adaptation to fluctuation on temperature, Bio18 (precipitation of warmest quarter), and solar radiation in the Brazilian territory (Table 3).
Table 2

Samβada output to environmental association to markers detected as signatures of selection in both methods

MarkerEnv_1LoglikelihoodGscoreWaldScoreBeta_0Beta_1
ALGA0032795TMinoutMedinan−89.9350.1038.443.85−0.03
ALGA0032795TMINMai−89.9350.1038.443.85−0.03
ALGA0054315TMAXAbr−88.3450.3738.8311.20−0.04
ASGA0026250TMinoutMedinan−88.4651.6039.193.92−0.03
ASGA0026250TMINMai−88.4651.6039.193.92−0.03
ASGA0026250TMinoutMed−88.6251.2838.714.17−0.03
ASGA0026250TMINAbr−89.1550.2238.575.26−0.03
ASGA0029202Bio18−99.3249.8439.36−2.410.01
BGIS0004952Bio18−94.8360.5845.77−2.870.01
BGIS0004952RadSolPrimMed−95.6458.9641.26−36.922.30
BGIS0004952RadSolPrimMediana−95.7558.7542.09−30.261.87
BGIS0004952RadSolNov−95.8758.5141.34−30.741.90
BGIS0004952RadSolJAn−96.2357.8041.04−25.051.54
BGIS0004952RadSolDez−96.5757.1140.59−20.931.29

Env_1, environment; TMinoutMedinan, median minimal temperature in autumn; TMINMai, minimal temperature in May; TMAXAbr, maximum temperature in April; TMinoutMed, average minimal temperature in autumn; TMINAbr, minimum temperature in April; Bio18, precipitation of warmest quarter; RadSolPrimMed, average of solar radiation to spring; RadSolPrimMediana, median of solar radiation to spring; RadSolNov, solar radiation in November; RadSolJAn, solar radiation January; RadSolDez, solar radiation December.

Figure 1

Genome position of Outliers detected by SAmβada (1), BayeScan (2), and through both methods (3)

Figure 2

Linkage disequilibrium decay and linkage disequilibrium up to 1,000 Mega bases (Mb) around the markers identified as selection signatures by Samβada/BayeScan

Table 3

Position in genome, distance from nearest genes, and biological function of these genes of markers identified as selection signatures in Brazilian locally adapted swine breeds

MarkerCrhMost severe consequenceGeneMarkers is between (in reverse strand)
Gene (Distance from SNP)FunctionGene (Distance from SNP)Function
ALGA00327955IntergenicENSSSCG00000024523 (±0.003 Mb)Transmembrane transporter activityENSSSCG00000000783 (±0.3 Mb)Glucose transmembrane transporter activity
ALGA00543159Upstream geneENSSSCG00000015405Cell surface receptor signaling pathway/immune response/
IntergenicENSSSCG00000015405 (±0.8 Mb)Cell surface receptor signaling pathwaySource: InterProENSSSCG00000015406 (±0.08 Mb)Immune response
ASGA00262505IntergenicENSSSCG00000024523 (±0.05 Mb)Glucose transmembrane transporter activityENSSSCG00000000783 (±0.1 Mb)Glucose transmembrane transporter activity/sugar:proton symporter activity.
ASGA00292026IntergenicENSSSCG00000003720 (±0.6 Mb)Transporter activityENSSSCG00000003722 (±0.2 Mb)Calcium ion binding (blood vasal morphogenesis….)
BGIS00049528IntergenicCOMMD8
MARC00219904IntergenicENSSSCG00000019505 (±0.1 Mb)RNA genesENSSSCG00000019140 (±0.0075 Mb)RNA genes
ENSSSCG00000022092/ENSSSCG00000006222 (±0.2 Mb)Iron (heme axial ligand)/RNA polymerase II transcription factor activity, sequence‐specific DNA binding
ASGA00337177IntronENSSSCG00000001771Rho guanyl‐nucleotide exchange factor activity
MARC00076783IntergenicENSSSCG00000008605 (±0.4 Mb)Uncharacterized proteinENSSSCG00000008606 (±0.0001 Mb)metal ion binding

Source:www.ensembl.org and http://www.ncbi.nlm.nih.gov/

Mb, mega bases.

Samβada output to environmental association to markers detected as signatures of selection in both methods Env_1, environment; TMinoutMedinan, median minimal temperature in autumn; TMINMai, minimal temperature in May; TMAXAbr, maximum temperature in April; TMinoutMed, average minimal temperature in autumn; TMINAbr, minimum temperature in April; Bio18, precipitation of warmest quarter; RadSolPrimMed, average of solar radiation to spring; RadSolPrimMediana, median of solar radiation to spring; RadSolNov, solar radiation in November; RadSolJAn, solar radiation January; RadSolDez, solar radiation December. Genome position of Outliers detected by SAmβada (1), BayeScan (2), and through both methods (3) Linkage disequilibrium decay and linkage disequilibrium up to 1,000 Mega bases (Mb) around the markers identified as selection signatures by Samβada/BayeScan Position in genome, distance from nearest genes, and biological function of these genes of markers identified as selection signatures in Brazilian locally adapted swine breeds Source:www.ensembl.org and http://www.ncbi.nlm.nih.gov/ Mb, mega bases. Global spatial correlation for environmental variables was high, with 5 (five) neighbor windows (average Moran's I = 0.89, from 0.55 to 0.97), and reaching close to zero with 15 (fifteen) neighbor windows. The highest value for Moran′s I was associated with solar radiation in the summer months. The selection signal markers have had a high global spatial correlation between 5 and 10 neighbors and present a rapid decrease to zero with 35 neighbors (Figure 3). With five neighbors, the maximum local I was 0.7072 from marker CASI0001257 and the smallest was −0.04346 from marker ASGA0002592 (Figure 3).
Figure 3

Moran′s I correlogram from genotypes of the markers identified as selection signatures in Brazilian locally adapted swine breeds by BayeScan and Samβada. Maximum, minimum, and average from all markers

Moran′s I correlogram from genotypes of the markers identified as selection signatures in Brazilian locally adapted swine breeds by BayeScan and Samβada. Maximum, minimum, and average from all markers For these five markers, considered selection signatures in BayeScan and Samβada, we found a nucleus of homozygotes in the neighborhood, but only with up to 30 neighbors. A regionalization of these markers was observed around a nucleus of climatic variation (Figure 4), with a loss of influence when geographical distance between samples was increased, or when distancing from the climatic influence center was decreased.
Figure 4

Maps of sampling points and genotype distribution maps from markers identified as selection signatures by Samβada/BayeScan. (a) Marker Asga0029202 in Bio18 layer (precipitation of Warmest Quarter). (b) Marker Alga0054315 in Minimum Temperature in May layer and (c) Marker Alga0054315 in Maximum Temperature in May Layer

Maps of sampling points and genotype distribution maps from markers identified as selection signatures by Samβada/BayeScan. (a) Marker Asga0029202 in Bio18 layer (precipitation of Warmest Quarter). (b) Marker Alga0054315 in Minimum Temperature in May layer and (c) Marker Alga0054315 in Maximum Temperature in May Layer The Mantel test between individual pairwise genetic distances Fst/(1‐Fst) and individual geographical distance had correlation coefficient of r = 0.02 (t = 0.58, p(r) > 1e−05, with 99,999 replications).

DISCUSSION

The evolution and adaptation of pigs are subject to environmental influences, as has been observed in humans (Storz 2010), humans and cattle (Beja‐Pereira et al., 2003), fish (Nielsen et al., 2009), and other species (Manthey & Moyle, 2015). The genetic variability and population structure found were similar to other populations (Boitard et al., 2010; Burgos‐Paz et al., 2012) and other approaches (Sollero et al., 2009). A history of geographical isolation from the other breeds was in agreement with highest FIS values presented by the Monteiro breed: a breed raised only in the states of Mato Grosso do Sul and Mato Grosso. The small number of groups suggested by STRUCTURE, similar to that found by Sollero et al. (2009) working with Brazilian pigs and microsatellites, reveals that the breeds share alleles, possibly caused by interracial mating, including commercial breeds as found by Traspov et al. (2016), a common behavior carried out by small Brazilian farmers. Brazilian pigs have a high genetic variability, similar to that among locally adapted breeds found for Vietnamese (Pham et al., 2014), Indian (De et al., 2013), and Colombian pigs (Burgos‐Paz et al., 2012). This genetic diversity could be connected with the environment through years of selection, leaving marks on the swine genome. There are many different methodologies for the detection of genetic markers or genomic regions under influence of natural selection, and one of these approaches is the identification of populational theoretical FST outliers. The use of georeferenced environmental data associated with FST outliers helps in the understanding of the evolutionary process and the influence of the environment on this process. For this work, we used two methods to detect Outliers in F ST. According to Pérez‐Figueroa, García‐Pereira, Saura, Rolán‐Alvarez, and Caballero (2010), BayeScan's algorithm under neutral hypothesis admits less than 1% of false discoveries, when we assume the Direchlet distribution and that the population has a neutral structure. Those presuppositions on the distribution and structure may become biased due to the existence of more than one sample within the population, or when individuals share a common ancestor in the recent past (Lotterhos & Whitlock 2014). Feng, Jiang, and Fan (2015) argue that some BayeScan configurations can affect the proportion and the direction of the markers in selection. This kind of bias does not occur with Samβada, because it translates samples in alleles frequencies associated with ambient data and uses these outliers to calculate logistic regression, which explains allele presence in a specific environment (Stucki et al., 2014). As the Samβada algorithm is based on individual and local levels, taking into consideration the p‐value after Bonferroni correction to determinate the significance of the models, the probability of mistakenly considering significant an association between marker and environmental variables decreases (Stucki, 2014; Stucki et al., 2014). The rates of spatial autocorrelation (Figure 3) showed that the 5 to 10 closest neighbors tend to have high spatial autocorrelation among each other. This behavior was possibly motivated by the habit of farmers interchanging sires and dams, trying to maintain inbreeding at low levels (Favero & de Figueiredo, 2009; Gama et al., 2013). The probability of genetic similarity at a distance higher than 10 neighbors decreases, and this might be related to the limited dispersion due to sanitary legislation within the country for swine species, as well the market organization. The pattern of spatial distribution of the genotypes, identified as selection markers (Figure 4), associated with environmental conditions such as temperature, solar radiation, and BIO18‐ precipitation of the warmest quarter (Table 2), during some periods in the year, shows adaptive selection linked to seasonality. The genotypic frequency of these signatures of selection divides the territory into two regions (Table 4), one in the north where we have predominantly the occurrence of one of the genotypes and the other to the south where the alternative genotype occurs. According to Nimer (1979), these two regions are identified by different climates: the north shows “equatorial,” “tropical,” and “northeast occidental tropical” climates; the south shows “temperate” and “central Brazil tropical” climates.
Table 4

Frequencies of genotypes from markers detected as selection signature according to Brazilian political regions

MarkerALGA0032795ALGA0054315ASGA0026250ASGA0029202BGIS0004952
Marker GenotypeGG (%)GA (%)AA (%)TT (%)CT (%)CC (%)TT (%)CT (%)CC (%)AA (%)AC (%)CC (%)TT (%)CT (%)CC (%)
Region
North2143521726351322205043521
Northeast203942264930424019274342203376
Midwest2832202419342031293920341173
Southeast820305018520520
South4214445154413422814630130
Genotype frequency Total100100100100100100100100100100100100100100100
Frequencies of genotypes from markers detected as selection signature according to Brazilian political regions Although the markers MARC0021990; ASGA0033717; MARC0007678 were responsible for a high number of significant models identified in Samβada, we did not find any significant multivariate model (Figure S3). When one marker is linked to some environmental variables, this infers that many evolutionary steps within the environment, throughout the year, influence the presence of markers. Despite only univariate models being found, there were associations between these alleles and the variation of temperature throughout the year, but not among the seasons as discussed by Martyn Plummer et al. (2006). The environmental temperature is closely linked to welfare (Lee & Phillips, 1948) and animal productivity (Collier & Gebremedhin, 2015), affecting pigs in all stages of life (Ross et al., 2015; Wildt et al., 1975), including intrauterine development, with consequences in the postnatal development of animals (Johnson et al., 2013, 2015a). The significant models found by Samβada for mean diurnal range (BIO2) associated with the marker ALGA0012967 in an intronic region of the LGR4 gene, which directly influences the testicular development and spermatogenesis, were in accordance with Petrocelli et al. (2015) who reported seasonal variation of seminal quality parameters affecting the reproductive performance of females. Once the survival and adaptation of the species in the environment are limited by reproductive success from individuals, and knowing when environmental conditions such as temperature and humidity are outside thermal comfort limits, we can see physiological alterations leading to reproductive failure in females (Nteeba et al. 2015) and males (Flowers, 2015; Wettemann & Bazer, 1985). Ai, Huang, and Ren (2013), working with Chinese pigs in Tibet, and Burgos‐Paz et al. (2012) with American pigs, found selection signatures correlated with the extremes of environmental conditions (high‐altitude adaptation), linked to altitude and circulatory system, respectively. Different from these authors, we found selection signatures for variation in temperature, radiation solar, and BIO18 (Table 2; Figure 4). The identification of selection signatures helps us to understand the relationship between climate and adaptive genetic variation, informing the conservation of both putatively neutral and adaptive components of genetic diversity (Bradbury, Smithson, & Krauss, 2013) across a dynamic and heterogeneous unpredictable landscape. Selection signatures from autochthone breeds may be a tool to improve livestock production through changes in the frequencies of these alleles in commercial herds, improving the adaptation in different environments. This is important in a world marked by environmental change that acts by altering the composition of the community and shifting range boundaries, phenology, genetic diversity, and genetic structure of organisms (Manel et al., 2012), probably imposing strong selection pressures on traits important for fitness (Gienapp, Teplitsky, Alho, Mills, & Merilä, 2008).

CONCLUSION

Allele frequency of markers from Brazilian locally adapted swine breeds was seen to be under the influence of environmental conditions showing evidence of footprints of divergent selection in at least 8 (eight) SNP markers, associated with temperature, solar radiation, and BIO18 linked with intracellular activity and circulatory system and were considered important for species adaptation. The distribution of SNP alleles over the Brazilian territory demonstrates a clear north–south orientation, dividing the country into two distinct regions according to climatic conditions, drier and sunnier in the North and wetter and colder in the South. This information on selection signature distribution across Brazilian territory could be included in programs of assisted selectin using genetic markers, helping farmer through easier management of animals selected for adaptive characteristics. In the same way, the markers could be used to direct animals for more suitable regions according to their genotype in both traditional husbandry situations as well as genetic resource conservation programs.

CONFLICT OF INTEREST

None declared.

DATA ACCESSIBILITY

Genetic data deposited at Animal GRIN/Alelo Animal. It is available in http://nrrc.ars.usda.gov/A-GRIN/genomic_account_page?record_source=US.

AUTHORS CONTRIBUTION

All authors enrolled have had equal contribution to the work presented in this paper, since the acquisition of data, until final approval of the version to be published. Click here for additional data file. Click here for additional data file.
  48 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Indigenous domestic breeds as reservoirs of genetic diversity: the Argentinean Creole cattle.

Authors:  G Giovambattista; M V Ripoli; P Peral-Garcia; J L Bouzat
Journal:  Anim Genet       Date:  2001-10       Impact factor: 3.169

Review 3.  The power and promise of population genomics: from genotyping to genome typing.

Authors:  Gordon Luikart; Phillip R England; David Tallmon; Steve Jordan; Pierre Taberlet
Journal:  Nat Rev Genet       Date:  2003-12       Impact factor: 53.242

4.  Identifying adaptive genetic divergence among populations from genome scans.

Authors:  Mark A Beaumont; David J Balding
Journal:  Mol Ecol       Date:  2004-04       Impact factor: 6.185

Review 5.  Using genome scans of DNA polymorphism to infer adaptive population divergence.

Authors:  Jay F Storz
Journal:  Mol Ecol       Date:  2005-03       Impact factor: 6.185

6.  Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study.

Authors:  G Evanno; S Regnaut; J Goudet
Journal:  Mol Ecol       Date:  2005-07       Impact factor: 6.185

Review 7.  Climate change and evolution: disentangling environmental and genetic responses.

Authors:  P Gienapp; C Teplitsky; J S Alho; J A Mills; J Merilä
Journal:  Mol Ecol       Date:  2008-01       Impact factor: 6.185

8.  A spatial analysis method (SAM) to detect candidate loci for selection: towards a landscape genomics approach to adaptation.

Authors:  S Joost; A Bonin; M W Bruford; L Després; C Conord; G Erhardt; P Taberlet
Journal:  Mol Ecol       Date:  2007-09       Impact factor: 6.185

9.  Physiological temperature response and embryonic mortality in stressed swine.

Authors:  D E Wildt; G D Riegle; W R Dukelow
Journal:  Am J Physiol       Date:  1975-12

Review 10.  Gene-culture coevolution between cattle milk protein genes and human lactase genes.

Authors:  Albano Beja-Pereira; Gordon Luikart; Phillip R England; Daniel G Bradley; Oliver C Jann; Giorgio Bertorelle; Andrew T Chamberlain; Telmo P Nunes; Stoitcho Metodiev; Nuno Ferrand; Georg Erhardt
Journal:  Nat Genet       Date:  2003-11-23       Impact factor: 38.330

View more
  3 in total

1.  Multiple Selection Signatures in Farmed Atlantic Salmon Adapted to Different Environments Across Hemispheres.

Authors:  María Eugenia López; Tyler Linderoth; Ashie Norris; Jean Paul Lhorente; Roberto Neira; José Manuel Yáñez
Journal:  Front Genet       Date:  2019-10-01       Impact factor: 4.599

2.  Rapid identification and interpretation of gene-environment associations using the new R.SamBada landscape genomics pipeline.

Authors:  Solange Duruz; Natalia Sevane; Oliver Selmoni; Elia Vajana; Kevin Leempoel; Sylvie Stucki; Pablo Orozco-terWengel; Estelle Rochat; Susana Dunner; Michael W Bruford; Stéphane Joost
Journal:  Mol Ecol Resour       Date:  2019-06-24       Impact factor: 7.090

3.  Genomic Association between SNP Markers and Diseases in the "Curraleiro Pé-Duro" Cattle.

Authors:  Thais Miranda Silva Freitas; Juliana Moraes Dias; Luanna Kim Pires Guimarães; Sáudio Vieira Peixoto; Rayanne Henrique Santana da Silva; Kareem Rady Badr; Maria Ivete Moura; Adriana Santana do Carmo; Vincenzo Landi; Maria Clorinda Soares Fioravanti
Journal:  Genes (Basel)       Date:  2021-05-25       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.