Literature DB >> 23638110

A further look at porcine chromosome 7 reveals VRTN variants associated with vertebral number in Chinese and Western pigs.

Yin Fan1, Yuyun Xing, Zhiyan Zhang, Huashui Ai, Zixuan Ouyang, Jing Ouyang, Ming Yang, Pinghua Li, Yijie Chen, Jun Gao, Lin Li, Lusheng Huang, Jun Ren.   

Abstract

The number of vertebrae is an economically important trait that affects carcass length and meat production in pigs. A major quantitative trait locus (QTL) for thoracic vertebral number has been repeatedly identified on pig chromosome (SSC) 7. To dissect the genetic basis of the major locus, we herein genotyped a large sample of animals from 3 experimental populations of Chinese and Western origins using 60K DNA chips. Genome-wide association studies consistently identified the locus across the 3 populations and mapped the locus to a 947-Kb region on SSC7. An identical-by-descent sharing assay refined the locus to a 100-Kb segment that harbors only two genes including VRTN and SYNDIG1L. Of them, VRNT has been proposed as a strong candidate of the major locus in Western modern breeds. Further, we resequenced the VRTN gene using DNA samples of 35 parental animals with known QTL genotypes by progeny testing. Concordance tests revealed 4 candidate causal variants as their genotypes showed the perfect segregation with QTL genotypes of the tested animals. An integrative analysis of evolutional constraints and functional elements supported two VRTN variants in a complete linkage disequilibrium phase as the most likely causal mutations. The promising variants significantly affect the number of thoracic vertebrae (one vertebra) in large scale outbred animals, and are segregating at rather high frequencies in Western pigs and at relatively low frequencies in a number of Chinese breeds. Altogether, we show that VRTN variants are significantly associated with the number of thoracic vertebrae in both Chinese and Western pigs. The finding advances our understanding of the genetic architecture of the vertebral number in pigs. Furthermore, our finding is of economical importance as it provides a robust breeding tool for the improvement of vertebral number and meat production in both Chinese indigenous pigs and Western present-day commercial pigs.

Entities:  

Mesh:

Year:  2013        PMID: 23638110      PMCID: PMC3634791          DOI: 10.1371/journal.pone.0062534

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The number of vertebrate is a phenotypic trait with high heritability estimates of 0.60 to 0.62 in domestic pigs [1], [2]. The porcine vertebrae consist of morphologically differentiated formulae: cervical, thoracic, lumbar, sacral and caudal vertebrae. The numbers of cervical and sacral vertebrae are always fixed at 7 and 4 except for rare cases in pigs [3], [4] as in other mammals, showing an evidence of evolutionary constraint [5]. In contrast, the number of thoracic and lumbar vertebrae varies considerably in pigs. Wild boars, the ancestors of domestic pigs, have uniformly 19 thoracic and lumbar vertebrae. Most Chinese indigenous breeds show 19 or 20 thoracolumbar vertebrae [6]. Comparatively, Western modern breeds, such as Duroc, Landrace and Large White, have more (n = 21–23) thoracic-lumbar vertebral formulae [1]. The increased vertebral number is likely caused by the long-term intense selection on enlarged size in these breeds. The number of vertebrae is an economically important trait in pigs as it is associated with body size and carcass length. It has been estimated that one extra vertebra can expand the carcass length of about 80 mm [3]. Therefore, deciphering the genetic basis of swine vertebral number variation will not only improve our understanding of vertebral developmental biology in mammals but also benefit the selective breeding for larger body size in the pig industry. Several studies have reported quantitative trait loci (QTL) for vertebral number in pigs by genome scans on the basis of microsatellite markers. Two genome-wide significant QTL were first detected on pig chromosomes (SSC) 1 and 2 in a Meishan × Göttingen cross [7]. Subsequently, the QTL on SSC1 has been repeatedly evidenced in different studies, explaining the variation in one vertebra [8]–[10]. Mikawa et al [11] further show that NR6A1c.134 G>A is a promising candidate causal variant underlying the major QTL. Besides, another genome-wide significant QTL for the number of thoracic vertebrae has been consistently identified on SSC7 by different investigators [9], [10], [12], [13]. More recently, the VRTN gene was proposed to be a strong candidate of the SSC7 QTL in Western commercial breeds [14]. However, the causality of VRTN needs more supporting evidence. It is worth noting that the QTL has also been detected in Chinese breeds [10]. Therefore, additional efforts are required to address if VRTN is also the responsible gene for the QTL detected in Chinese breeds. Moreover, the causative variant of VRTN has not yet been defined and thus needs further investigation. In our previous report [10], we detected the major QTL for the number of thoracic vertebrae on SSC7 using a genome scan with 194 microsatellite markers in a White Duroc × Erhualian F2 resource population. We found that the QTL affects the thoracic vertebral number in both Western and Chinese pigs. The aim of this study was to characterize the responsible gene and the most likely causal variant for the SSC7 QTL by a series of genetic analyses including genome-wide association studies (GWAS), identical-by-descent (IBD) sharing mapping, genetic concordance test, evolutional constraint assay and association analysis in a large scale outbred samples.

Materials and Methods

Ethics Statement

All the procedures involving animals are in compliance with the care and use guidelines of experimental animals established by the Ministry of Agriculture of China. The ethics committee of Jiangxi Agricultural University specifically approved this study.

Animals and Phenotype Recording

In this study, experimental animals were from four pig populations including the White Duroc × Erhualian F2 intercross, an Erhualian × Tongcheng F2 resource population, a Chinese Sutai purebred population and a Western three-way hybrid (Duroc × Landrance × Large White, DLL) commercial population. The White Duroc × Erhualian cross was developed and managed as described by Guo et al. (2009) [15]. In brief, two White Duroc sires and 17 Erhualian dams were mated to produce F1 animals. A total of 1912 F2 animals were generated in six batches by intercrossing 9 F1 boars with 59 F1 sows avoiding full-sib mating. Of the 1912 animals, 918 individuals were slaughtered for phenotype recording at the age of 240±3 days. In the Erhualian × Tongcheng F2 pedigree, one Chinese Erhualian boar was mated to one Chinese Tongcheng sow, and 2 F1 boars and 7 F1 sows were then intercrossed to produce 61 F2 individuals that were slaughtered at 60 days of age. Sutai is a Chinese synthetic breed that was originally produced by crossing Chinese Taihu and Western Duroc (50% Taihu and 50% Duroc) [16]. In the current study, 4 Sutai boars were mated with 55 Sutai sows to produce 461 offspring, of which 435 were slaughtered at 240±3 days of age. Animals of the three experimental populations were all raised in the research farm at Jiangxi Agricultural University in Nanchang, China. In addition, 1403 pig samples were collected from 9 three-way hybrid (DLL) populations in a commercial slaughterhouse in Nanchang. After slaughter, all animals were recorded for the number of thoracic and lumbar vertebrae.

SNP Chip and Genotyping

Genomic DNA was extracted from ear tissue of each animal using a standard phenol/chloroform method. DNA quality was determined by a Nanodrop-100 spectrophotometer (Thermo Fisher, USA). All eligible DNA samples were diluted to a final concentration of 50 ng/µl. The samples were genotyped for 62163 SNPs on the Porcine SNP 60K Beadchips (Illumina, USA) according to the supplier’s protocol. The same quality control criteria were applied for the SNP data of each population by the check.marker function of GenABEL [17]. Animals with SNP call rates ≥95% and familial Mendelian error rates ≤0.1, and SNPs with call rates ≥95%, minor allele frequencies (MAF) ≥0.1 and significance levels of deviation from Hardy-Weinberg equilibrium ≤10−6 were included for further statistical analysis.

GWAS Mapping

A mixed model-based single-locus regression analysis was performed for GWAS mapping using GenABEL, an R library for whole genome association analysis [17]. The analysis adjusted population stratification by modeling similarities between individuals on the basis of genome-wide SNP data. For the meta-analysis of GWAS, the statistical χ2 values at each locus in each experimental population were summed to calculate new χ2 values with a freedom degree of 3. Bonferroni corrected P-values were adopted for the genome-wide significance threshold that was set as 0.05/N, where N is the number of informative SNPs in the data set.

Haplotype Sharing Analysis

The QTL genotypes of F1 sires in two F2 populations and progenitor boars in Sutai pigs were first determined by the marker-assisted segregation analysis as described in Nezer et al [18]. Briefly, the likelihood ratio of sire’s QTL genotypes for homozygotes (QQ or qq) or heterozygotes (Qq) was calculated using offspring’s phenotypic data by grouping the “Left” chromosome or “Right” chromosome separately. A Z-score log10 statistics was then obtained by comparing the likelihood of heterozygotes to the likelihood of homozygotes. Sires were considered to be Qq when Z>2, QQ or qq when Z<−2, and unknown QTL genotypes when Z in the interval of −2 to 2. Further, the QTL genotypes of F0 sires in the F2 populations were judged by the Mendelian inheritance and multiple comparison tests against the deduced Q-chromosomes of F1 sires using a classical t-test. Haplotypes of sires with successfully deduced QTL genotypes were reconstructed with SimWalk2 v2.91 [19] under default setting based on the pedigree information. All Q-bearing chromosomes were examined for their shared haplotypes to narrow down the QTL position. To improve haplotype-sharing resolution, high density markers in addition to the 60K SNP data were characterized in the targeted region by comparative sequencing of 35 parental samples with known QTL genotypes. Genomic DNA was amplified with primers listed in at optimal annealing temperatures. PCR products were directly sequenced with PCR primers on a 3130XL Genetic Analyzer (Applied Biosystem, USA). Variants were then recorded by the alignment of sequence reads.

Resequencing of the VRTN Gene

To resequence the porcine VRTN gene, a set of primers () were used to amplify genomic DNA of the 35 parental pigs. Amplification was performed in a routine way with 1.5 mM of MgCl2 and optimal annealing temperatures. The resulting PCR products covered a ∼30-kb region harboring the VRTN gene. All amplicons were bi-directionally sequenced with original PCR primers on the 3130XL Genetic Analyzer (Applied Biosystem, USA). Variants were excluded as the causal mutation if their genotypes were not concordant with the QTL genotypes in the sire samples.

Analyses of Evolutional Constraints and Functional Significance of VRTN Candidate Causal Variants

Genomic sequences of human, mouse, cattle, dog and horse VRTN genes were retrieved from the NCBI nucleotide database (http://www.ncbi.nlm.nih.gov/gene/?term=VRTN). The MultiPipMaker software [20] was implemented to analyze evolutionary constraints of VRTN candidate causal variants. Further, Genomic Evolutionary Rate Profiling (GREP) alignment [21] based on 30 eutherian mammals were determined for the most likely causal variants via the Ensembl Genome Browser (www.ensembl.org/). Functional elements were predicted for the regions harboring candidate causal variants by an integrative genomic analysis using the UCSC Genome Browser at http://genome.ucsc.edu/.

Association and Genetic Variability of the Most Likely Causal Variants in Outbred Populations

Two most likely causal variants, g.19034A>C and g.20311_20312ins291 (GenBank accession no. AB554652.1), were genotyped on 435 Sutai pigs and 192 Western purebred pigs. The genotypes of g.19034A>C were detected by a TaqMan SNP assay (Primers F5/R5 and the corresponding probe, ) on a 7900HT Fast Real-Time PCR System (Applied Biosystem, USA), For g.20311_20312ins291, the genotypes were judged using a PCR-based test (Primers F6/R6, ). PCR products were separated by 2% agrose gel electrophoresis and the genotypes were visually recorded according to the length of amplicon. The ins allele was represented by amplicons of 411 bp and the wild-type allele by amplicons of 120 bp (). Moreover, 1403 Western three-way hybrid pigs and 1371 purebred pigs from 19 diverse breeds and wild boars were genotyped for g.20311_20312ins291. Association of the variants with the number of thoracic vertebrae was performed on the Sutai and Western hybrid animals by a classical t-test. Frequencies of the ins (Q) allele increasing vertebral number were calculated for each purebred population.

Results

GWAS Mapped the Major QTL to a 947-Kb Region on SSC7

In the present study, we conducted GWAS for vertebral number in 3 distinct populations. A total of 39448, 43760 and 20509 informative SNPs were filtered for GWAS on the White Duroc × Erhualian F2 intercross, Sutai pigs and the Erhualian × Tongcheng F2 intercross, respectively. All experimental animals were successfully genotyped for the 60K chips. Consistently, we detected the most significant SNPs on SSC7 across the three populations ( ). To determine whether the locus affects numeric variation in thoracic vertebrae or lumbar vertebrae, we performed GWAS for the two phenotypic traits separately in these populations. We found that the locus specifically affects the number of thoracic vertebrae in contrast to its negligible effect on lumbar vertebral formula (), which is consistent with our previous QTL mapping result [10]. The top SNPs had a strong additive effect of approximate 0.5 thoracic vertebrae per allele (). In the White Duroc × Erhualian intercross ( ), the strongest SNP was H3GA0022664 at 103.91 Mb on SSC7 (Sscrofa10.2), which is located in intron 4 of the PROX2 gene. In the Sutai population ( ), GWAS identified the most significant SNP at 102.46 Mb on SSC7. The SNP MARC0113727 resides in the NUMB gene. The most prominent SNP in the Erhualian × Tongcheng F2 population was CASI0006750 at 115.51 Mb that is located in an interval region flanked by the FLRT2 and GALC genes ( ). The consistent GWAS results strongly indicate that a common variant underlies the QTL detected in these populations. We also performed a meta-analysis of GWAS for vertebral number across the three populations. The most significant finding was H3GA0022664 at 103.91 Mb, the top SNP evidenced in the White Duroc × Erhualian intercross. The SNP showed much higher significance level (LOD = 41) in the meta-analysis with combined data ( ). Two second significant SNPs were ASGA0035500 at 103.57 Mb and MARC0030523 at 101.86 Mb with LOD values of 38 and 37, respectively. Further, we defined the 95% confidence intervals (CI) of the major locus by LOD dropoff 2 from the strongest SNP. According to this criterion, the critical regions were 101.86–105.75, 102.46–104.31 and 103.37–117.04 Mb on SSC7 in the White Duroc × Erhualian intercross, Sutai pigs and Erhualian × Tongcheng intercross populations, respectively. The ∼947-Kb (103.37–104.31 Mb) region of overlap between the 95% CIs is of great interest as it defines the most likely QTL position harboring the responsible gene. For subsequent analysis, we focused on the ∼947-Kb segment.
Figure 1

GWAS map the major QTL for the number of thoracic vertebrae to SSC7.

GWAS were performed on the White Duroc × Erhualian F2 intercross (A), Sutai pigs (B), Erhualian × Tongcheng F2 intercross (C) and meta-analysis (D). Negative log10 P-values of all SNPs are plotted against position on each pig chromosome in the y-axis. Chromosomes are shown in different colors for clarity in the x-axis. Log (1/P) values of more than 5 are genome-wide significant.

GWAS map the major QTL for the number of thoracic vertebrae to SSC7.

GWAS were performed on the White Duroc × Erhualian F2 intercross (A), Sutai pigs (B), Erhualian × Tongcheng F2 intercross (C) and meta-analysis (D). Negative log10 P-values of all SNPs are plotted against position on each pig chromosome in the y-axis. Chromosomes are shown in different colors for clarity in the x-axis. Log (1/P) values of more than 5 are genome-wide significant.

IBD Analysis Pinpointed VRTN as the Responsible Gene

To perform IBD sharing analysis for fine mapping of the SSC7 QTL, we first determined the QTL genotypes of 12 F1 sires in the two F2 populations and 4 Sutai progenitor boars by marker-assisted segregation ( ). Further, we judged the QTL status of 19 founder sires and dams in the F2 populations by multiple comparisons of targeted chromosomes with the reference Q (number-increase) or q (wild-type) chromosomes of F1 sires. In total, we identified 22 Q alleles and 44 q alleles ( , ). Haplotype reconstruction allowed us to identify the chromosomal region shared by all Q chromosomes. In the 947-Kb critical region, all 22 Q-bearing chromosomes across the three populations shared only one region of 128-Kb defined by two SNPs: INRA0027623 and ASGA0035500 ( ). To exclude the possibility of false positive IBD, we developed high density markers around the putative IBD region by comparatively sequencing 13 amplicons of the 35 individuals with known QTL genotypes. A final set of 35 informative markers was identified around the region. Haplotype reconstruction analysis clearly showed that all Q-bearing chromosomes share a unique haplotype of 100-Kb corresponding to 15 variants. The IBD region harbors only two annotated genes: VRTN and SYNDIG1L ( ). Of them, SYNDIG1L seem to be implicated in Huntington diseases in rodent models [22], while VRTN has been proposed as a strong candidate of the QTL for vertebral number [14]. Hence, our finding supports the assumption that VRTN is the responsible gene for the SSC7 QTL.
Table 1

Determination of QTL genotypes of founder animals by the t-test or marker assisted segregation analysis in three experimental populations.

Average no. of thoracic vertebraeb (no. of individuals)
Populationa GenerationAnimalLeft chromosomeRight chromosomet-test Z-testc QTL genotype
WEF1 315.33±0.50 (79)14.50±0.60 (78)13.46Qq
1715.37±0.62 (57)14.61±0.49 (49)8.07Qq
2315.30±0.48 (10)15.18±0.60 (11)−4.04QQ
2915.34±0.60 (47)15.27±0.49 (52)−2.08QQ
3515.33±0.66 (39)14.49±0.51 (39)6.34Qq
4115.28±0.58 (87)14.64±0.53 (83)10.42Qq
4715.35±0.63 (26)15.40±0.50 (20)−2.08QQ
4915.22±0.64 (45)14.51±0.56 (37)4.73Qq
7515.39±0.62 (56)14.67±0.52 (45)6.95Qq
F0 03814.50±0.40 (30)14.40±0.27 (10)<0.0001qq
05214.63±0.30 (523)14.38±0.27 (8)<0.0001qq
05414.63±0.30 (523)ns0.20q_
05814.63±0.30 (523)14.63±0.27 (60)<0.0001qq
06815.29±0.33 (1055)14.54±0.31 (72)<0.0001Qq
07315.29±0.33 (1055)15.29±0.33 (1055)QQ
07415.29±0.33 (1055)14.54±0.31 (72)<0.0001Qq
07515.29±0.33 (1055)15.29±0.33 (1055)QQ
09014.63±0.30 (523)14.63±0.30 (523)qq
09414.63±0.30 (523)nsq_
12414.63±0.30 (523)14.63±0.27 (60)<0.0001qq
12614.63±0.30 (523)14.63±0.27 (60)<0.0001qq
14614.63±0.27 (60)114.38±0.27 (8)<0.0001qq
17414.63±0.30 (523)14.89±0.36 (9)<0.0001qq
19614.63±0.30 (523)ns<0.53q_
20215.29±0.33 (1055)14.63±0.30 (523)Qq
29214.63±0.30 (523)14.5±0.4 (30)<0.0001qq
119014.63±0.30 (523)nsq_
SUF0 562113.94±0.29 (52)13.98±0.27 (49)−3.87qq
567513.92±0.19 (51)13.90±0.20 (39)−4.96qq
631314.15±0.22 (47)14.24±0.24 (38)−3.05qq
653714.31±0.27 (42)14.62±0.28 (50)2.05Qq
ETF0 C1414.14±0.13 (14)14.70±0.22 (40)0.001Qq
F1 C202714.28±0.21 (29)14.67±0.23 (27)0.003Qq
C204214.22±0.19 (9)14.77±0.19 (13)0.009Qq
C204614.14±0.13 (14)14.20±0.18 (10)0.73Qq

WE, White Duroc × Erhualian F2 intercross; SU, Sutai pigs; ET, Erhualian × Tongcheng F2 intercross.

Left or right chromosomes are defined arbitrarily for each sire. For left and right chromosomes of F0 animals in the two F2 intercross populations, q chromosomes are judged by the Mendelian inheritance and their significant difference from the average value of deduced Q chromosomes in F1 sires, or vice versa. Information is not available (ns) for some right chromosomes as no or few F2 animals inherit these chromosomes. Some F0 animals have identical haplotypes corresponding to Q or q chromosomes of F1 sires. Average values are shown for these chromosomes.

The Z-test on 9 F1 sires in the White Duroc × Erhualian F2 intercross has been reported in our previous study [10].

Figure 2

All Q-bearing chromosomes for increased vertebral number share a 100-Kb region harboring the VRTN gene.

The IBD critical regions defined by the 60 K SNP data (panel A) and the characterized SNPs (panel B) are indicated by shaded boxes. Two annotated genes, VRTN and SYNDIG1L, are located in the IBD region. SNP alleles are shown by 1 and 2 for the major and minor alleles, respectively. The characterized SNPs shown in panel B are numbered according to their locations in the current pig genome assembly (Sscrofa10.2). Identities of animals carrying the Q-chromosome are given in the left axis. Animals C14, C2027 and C2042 are from the Erhualian × Tongcheng F2 intercross, and the others are from the White Duroc × Erhualian F2 intercross.

All Q-bearing chromosomes for increased vertebral number share a 100-Kb region harboring the VRTN gene.

The IBD critical regions defined by the 60 K SNP data (panel A) and the characterized SNPs (panel B) are indicated by shaded boxes. Two annotated genes, VRTN and SYNDIG1L, are located in the IBD region. SNP alleles are shown by 1 and 2 for the major and minor alleles, respectively. The characterized SNPs shown in panel B are numbered according to their locations in the current pig genome assembly (Sscrofa10.2). Identities of animals carrying the Q-chromosome are given in the left axis. Animals C14, C2027 and C2042 are from the Erhualian × Tongcheng F2 intercross, and the others are from the White Duroc × Erhualian F2 intercross. WE, White Duroc × Erhualian F2 intercross; SU, Sutai pigs; ET, Erhualian × Tongcheng F2 intercross. Left or right chromosomes are defined arbitrarily for each sire. For left and right chromosomes of F0 animals in the two F2 intercross populations, q chromosomes are judged by the Mendelian inheritance and their significant difference from the average value of deduced Q chromosomes in F1 sires, or vice versa. Information is not available (ns) for some right chromosomes as no or few F2 animals inherit these chromosomes. Some F0 animals have identical haplotypes corresponding to Q or q chromosomes of F1 sires. Average values are shown for these chromosomes. The Z-test on 9 F1 sires in the White Duroc × Erhualian F2 intercross has been reported in our previous study [10].

An Integrative Analysis Revealed the Most Likely Causal Variants of VRTN

To identify the causal variant of VRTN, we resequenced a 30-Kb segment covering the VRTN gene using the above-mentioned 35 DNA samples. Variants were excluded as causal mutations if their genotypes were discordant with QTL genotypes of the tested animals. Mikawa et al. (2011) [11] have previously reported 9 VRTN polymorphisms significantly associated with the number of thoracic vertebrae in European breeds. Of the 9 polymorphic sites, 5 are apparently not causal variants because of the discordance between QTL and mutation genotypes (). The other 4 variants including g.8063G>A, g.13066C>T, g.19034A>C and g.20311_20312ins291 appear to be strong candidate causal variant as their genotypes showed the complete concordance with QTL genotypes across all tested samples (). We did not find any other variants cosegregating with the QTL genotypes in the VRTN region. To test evolutionary constraints on the 4 candidate causal variants, we performed the MultiPipMaker alignment of 6 orthologous VRTN genes in mammals. Of them, g.8063G>A and g.13066C>T are present in multiple species ( ) and are less likely causal mutations. In contrast, g.19034A>C and g.20311_20312ins291 occur exclusively in domestic pigs, standing out as the promising causal variants. Through the Ensembl Genome Browser, we confirmed the unique occurrence of the two variants in domestic pigs out of 30 eutherian mammals (). Further, we inferred the functional features of the genomic region encompassing the two variants using the integrative genomic data set on the UCSC Genome Browser. We found that the variants reside in an active promoter. The promoter has typical open chromatin signatures of H3K4me proximal to peaks of H3K27AC that are often found near active regulatory elements. The variants correspond to two transcription factor binding sites with high GERP conservation scores. Moreover, g.19034A>C is located in a DNaseI hypersensitive site that are often associated with transcriptional activity ( ). Altogether, these findings form the hypothesis that g.19034A>C and g.20311_20312ins291 are functional variants altering the expression of the porcine VRTN gene, presumably leading to the QTL effect on vertebral number.
Figure 3

Visualization of evolutionary constraints and functional significance of candidate causal variants.

Four variants showing 100% concordance with the QTL genotypes of parental pigs are shown in the genomic structure of the VRTN gene. Exons are indicated by boxes and non-coding regions by thin lines (medium panel). The sequences for the alignment were taken from the following accessions: NC_010449.4 (Sus scrofa), AC_000167.1 (Bos taurus), NC_006590.3 (Canis lupus familiaris), NC_000014.8 (Homo sapiens) and NC_000078.6 (Mus musculus). The positions of variants are indicated by shaded boxes and the missing sequences are marked by the dash (upper panel). An integrative genomic analysis was performed on a 4-kb region harboring the two most likely causal variants via the UCSC Genome Browser (lower panel). GERP, GERP scores for mammalian alignments; Digital DNaseI, digital DNaseI hypersensitivity clusters; H3K27AC, histone 3 lysine 27 acetylation; H3K4me3, histone 3 lysine 4 trimethylation; H1-hESC, chromatin segmentation by Hidden Markov Model predicted from the ENCODE consortium; TF, transcription binding sites.

Visualization of evolutionary constraints and functional significance of candidate causal variants.

Four variants showing 100% concordance with the QTL genotypes of parental pigs are shown in the genomic structure of the VRTN gene. Exons are indicated by boxes and non-coding regions by thin lines (medium panel). The sequences for the alignment were taken from the following accessions: NC_010449.4 (Sus scrofa), AC_000167.1 (Bos taurus), NC_006590.3 (Canis lupus familiaris), NC_000014.8 (Homo sapiens) and NC_000078.6 (Mus musculus). The positions of variants are indicated by shaded boxes and the missing sequences are marked by the dash (upper panel). An integrative genomic analysis was performed on a 4-kb region harboring the two most likely causal variants via the UCSC Genome Browser (lower panel). GERP, GERP scores for mammalian alignments; Digital DNaseI, digital DNaseI hypersensitivity clusters; H3K27AC, histone 3 lysine 27 acetylation; H3K4me3, histone 3 lysine 4 trimethylation; H1-hESC, chromatin segmentation by Hidden Markov Model predicted from the ENCODE consortium; TF, transcription binding sites.

Association Analysis in a Large Scale Outbred Samples Support Two VRTN Variants as the Causal Mutation(s)

We genotyped g.19034A>C and g.20311_20312ins291 on 435 Sutai pigs and 192 unrelated animals from Duroc, Landrace and Large White. The two variants showed the complete linkage disequilibrium with identical genotypes on all tested animals (data not shown). For subsequent analyses, we focused on g.20311_20312ins291 due to its cost-effective genotyping. To confirm the effect of VRTN variants on vertebral number, we further genotyped g.20311_20312ins291 on Sutai pigs (n = 435) and a large scale samples of Western commercial hybrid pigs (n = 1403). Association analysis revealed that this polymorphic site had strong effect on the number of thoracic vertebrae but not the number of lumbar vertebrae ( ). Ins/ins (QQ) individuals have approximate one additional thoracic segment compared with wide-type homozygotes in both Chinese Sutai and Western hybrid pigs ( ). This is in agreement with the observed QTL effect, reinforcing the assumption that the VRTN variant(s) cause the QTL effect on vertebral number on SSC7.
Table 2

Association of VRTN most likely causal variants with vertebral number in Sutai and Western commercial hybrid pig populationsa.

PopulationTraitGenotype (no. of individuals)Log(1/P) F-value
ins/ins ins/−−/−
Sutaithoracic vertebral number15.07±0.13a (30)14.35±0.24b (192)13.88±0.19c (213)>16115.80
lumbar vertebral number5.27±0.20a (30)5.45±0.25b (192)5.49±0.25b (213)0.072.59
DLLb thoracic vertebral number15.63±0.41a (527)15.11±0.44b (662)14.71±0.52c (214)>16183.90
lumbar vertebral number5.36±0.23a (527)5.47±0.25b (662)5.47±0.25b (214)1.143.23

Phenotypic values are shown in mean ± standard deviation. Values with different superscripts in the same line are significantly different.

Western three-way hybrid pigs from Duroc × (Landrace × Large White).

Phenotypic values are shown in mean ± standard deviation. Values with different superscripts in the same line are significantly different. Western three-way hybrid pigs from Duroc × (Landrace × Large White).

Both Western and Chinese Pigs are Segregating for the VRTN Causal Mutation(s)

To investigate the distribution of the VRTN causal mutation in a broad panel of breeds, we genotyped g.20311_20312ins291 on 1371 unrelated pigs representing 20 diverse breeds and wild boars ( ). As expected, wild boars are all homozygous for the wild-type allele as they uniformly have 19 thoracolumbar vertebrae. Most of Chinese indigenous breeds are also fixed at the wild-type allele, which is consistent with the fidelity of the thoracic vertebral number in these breeds. However, the mutation are segregating in several Chinese breeds including Bamaxiang, Erhualian, Hang, Laiwu Black, Tibetan and Tongcheng pigs. Especially, the derived allele for increased vertebral number is present in Hang (29%) and Tongcheng (28%) pigs at rather high frequencies ( ). This finding, together with the absence of the mutant allele in Chinese wild boars, would suggest that the mutation occurred in China after domestication. The mutation in Western present-day breeds is very common. Interestingly, the highest frequency (71%) of the mutation is found in Landrace that is known for long body length.
Table 3

Frequencies of the derived (Q) allele for increased vertebral number in Chinese and Western pig breeds.

BreedOriginNo.Allele frequency
Chinese breeds
Bama XiangGuangxi310.15
ErhualianJiangsu2050.06
HangJiangxi310.29
Hetao Big-EarInner Mongolia300.00
JiangquhaiJiangsu320.00
JinhuaZhejiang320.00
Laiwu BlackShandong320.14
LantangGuangdong300.00
MeishanJiangsu300.00
MinzhuHeilongjiang300.00
NingxiangHunan320.00
RongchangChongqing320.00
TibetanTibet300.08
TongchengHubei320.28
Wuzhishan pigHainan300.00
Yushan BlackJiangxi310.00
Chinese wild boarJiangxi310.00
Western breeds
DurocUSA1030.54
LandraceDenmark1750.71
Large WhiteUK, Canada, France3920.66

Discussion

The QTL for Vertebral Number on SSC7 in Western and Chinese Pigs is Caused by a Common Variant

Western commercial pigs and a number of Chinese indigenous pigs show considerable variation in the thoracic-lumbar vertebral number. To address if common or distinct variants cause numeric variation in vertebral number, we herein performed GWAS mapping of the loci for the phenotypic trait using both Western × Chinese (White Duroc × Erhualian F2 cross and Sutai pigs) and Chinese × Chinese (Erhualian × Tongcheng F2 intercross) hybrid populations. As a result, we consistently detected genome-wide significant loci for the number of thoracic vertebrae on SSC7 across the three experimental populations, where the most significant SNPs were located in a proximal region ( ). This allows us to hypothesize that a common variant causes the major QTL effect in both Western and Chinese pigs. With this assumption, we defined the most likely region of 947-Kb harboring the responsible gene by determining the overlapping segment between the 95% CIs in the 3 tested populations. In the critical region, all Q-chromosomes of parental pigs for increased vertebral number share a 100-Kb haplotype containing 15 polymorphic sites. These Q chromosomes were originated from three Erhualian founder sows and two White Duroc founder boars in the White Duroc × Erhualian intercross as well as one Tongcheng founder sow in the Erhualian × Tongcheng cross. The shared Q-segment across diverse chromosomes of different origins strengths our assumption that the number-increasing allele at the SSC7 QTL in Chinese and Western pigs is derived from a common ancestor. After identifying the most likely causal mutation(s) for the QTL, we performed a comprehensive survey of the genetic variability at the locus in diverse breeds. We found that the mutation is segregating in a number of Chinese breeds ( ), thus supporting the hypothesis that the QTL is likely of Chinese origin. According to historical records [23], [24], Chinese indigenous pigs were intercrossed into European pigs approximately 200 years ago, contributing to the formation of Western present-day commercial breeds. This has been confirmed by the recently completed analyses of pig genome that showed a strong signal (37% fraction) of admixture from Chinese breeds into European breeds [25]. It thus raises the possibility that the number-increasing allele was introduced from Chinese pigs into Western pigs at that time. Recombination events in the past two centuries have gradually eroded the introduced Chinese haplotype, leading to the current ‘compound’ (mosaic) haplotype from European and Chinese descent in Western pigs, like White Duroc boars in this study ( ). During the past decades, Western modern breeds have experienced intensive selection on carcass length and lean production. It, we believe, dramatically increase the frequency of the causal mutation in these breeds. It should be noted that we also detected a genome-wide significant locus for vertebral number on SSC1 ( ). The locus perfectly corresponds to the NR6A1 region that has been associated with vertebral number [11]. In contrast to the SSC7 locus, the number-increase allele at the NR6A1 locus is of European origin. Moreover, the NR6A1 region shows strong signatures of selection and the beneficial allele has been fixed in European breeds [26].

Our Findings Support that VRTN is the Responsible Gene for the QTL on SSC7

In a previous report, Mikawa et al. (2011) [14] identified a strong candidate gene of the QTL on SSC7 using QTL and linkage disequilibrium mapping studies on European commercial pigs. The previously uncharacterized gene, namely VRTN, showed significant association with the number of thoracic vertebrae in Western pigs. We herein used genome-wide 60 K SNPs to conduct GWAS mapping of the QTL on large scale samples from three different populations. By the GWAS and IBD sharing analyses, we define the QTL within a critical region of 100 Kb that harbors only two annotated genes, i.e. VRTN and SYNDIG1L. SYNDIG1 is a candidate for Huntington disease in rodent models [22]. It is thus reasonable to deduce that VRTN is the responsible gene for the variation in vertebral number at the SSC7 QTL. One paralogue of the VRTN gene is located at 103.2 Mb, adjacent to the 100-Kb critical region. The paralogous gene is not likely the underlying gene as it falls outside the IBD region. The orthologs of VRTN appear to be a conserved protein in a wide range of organisms from fishes to mammals on the UCSC Genome Browser. VRTN is therefore likely to be an essential regulator for somitogenesis in the development of embryo in a wide range of organisms. Nevertheless, the detailed underlying mechanism of VRTN in the development of swine vertebral formula remains largely unknown at present and needs further investigation.

VRTN g.19034A>C and g.20311_20312ins291 are the Most Likely Causal Variants underlying the QTL on SSC7

By resequencing a 41-Kb region around the VRTN gene, Mikawa et al. (2011) identified 9 variants that show concordance with the QTL genotypes of family patriarchs and are significantly associated with vertebral number in Western pigs. These variants are in a complete linkage disequilibrium phase. Therefore, the causative polymorphic site could not be defined in the previous study [14]. To resolve the problem, we herein resequenced the VRTN gene and applied the concordance test for all polymorphisms including the 9 variants on 35 individuals with deduced QTL genotypes by progeny testing. Notably, four VRTN polymorphisms are in complete concordance with the QTL genotypes of the tested animals. All 12 individuals heterozygous for the QTL are also heterozygous for the polymorphisms, and all 23 pigs homozygous for the QTL are also homozygous for the polymorphisms. The finding apparently indicates that these polymorphisms are strong candidate causal variants underlying the QTL. The number of thoracolumbar vertebrae is always fixed at 19 in mammals [27]. In domestic animals, the changes in thoracolumbar vertebral number are common in pigs and sheep but are very rare in other species [28]. The developmental constraint supports the hypothesis that the underlying mutation for vertebral number in this study is lineage-specific in pigs or sheep among mammals. Of the 4 candidate mutations, g.19034A>C and g.20311_20312ins291 show evolutionary constraints as they exist exclusively in domestic pigs in 30 eutherian mammals. Moreover, the two variants reside in functional elements of an active promoter and likely affect the expression of VRTN, as corroborated by diverse transcriptomic and epigenomic data. Therefore, the two variants are more promising causal mutations compared with the other two non-conservative ones. Given the complete linkage disequilibrium between the two polymorphisms, we can not genetically judge if they act collaboratively or one of them is the underlying mutation contributing to the phenotypic variation. Further functional assays are required to address the question. We genotyped g.20311_20312ins291 on all animals across the two intercross pedigrees and a large sample of pigs from two outbred populations, i.e. Chinese Sutai pigs and Western three-way hybrid pigs. The variant absorbed totally the QTL effect in the intercross populations as all significant SNPs vanished when it was included as a fixed effect in the GWAS statistic model (data not shown). Moreover, the variant affects one additional thoracic vertebra in both two outbred populations, which is in good agreement with the observed QTL effect. The finding again favors the assumption of the variant as the causal mutation. Our robust result based on large scale samples is important for the pig industry as it provides an accurate and efficient diagnostic tool for the genetic improvement of vertebral number in the swine breeding programs.

Conclusion

By GWAS and IBD mapping studies on three distinct populations, we gave additional supporting evidence for the assumption that VRTN is the responsible gene underlying the QTL for the number of thoracic vertebrae on SSC7. We further showed that a common variant determine the QTL effect in both Western and Chinese pigs and the derived allele for increased vertebral number is likely of Chinese origin. Using an integrative analysis of concordance test, evolutionary constraint and deleteriousness prediction, we identified the most likely causal variants in the VRTN gene. We confirmed the major effect of the variants on thoracic vertebrae formula in two large scale outbred populations, and revealed the genetic variability at the mutation site in a wide panel of pig breeds. Our findings advance the understanding of the developmental biology of vertebrae formula and establish a reliable breeding tool for thoracic vertebral number in both Chinese and Western pigs. Further studies are warranted to elucidate the underlying molecular mechanism of the VRTN causal mutation(s). Electrophoresis patterns of the g.20311_20312 ins291 marker. A direct PCR was performed to diagnose the indel marker as described in Materials and Methods. Lane 1, 3 and 4, −/−; lane 2, ins/−; lane 5, ins/ins. M is 50 bp DNA ladder. (TIF) Click here for additional data file. GWAS for the number of lumbar vertebrae. GWAS were performed on the White Duroc × Erhualian F2 intercross (A), Sutai pigs (B) and Erhualian × Tongcheng F2 intercross (C). Negative log10 P-values of all SNPs are plotted against position on each pig chromosome in the y-axis. Chromosomes are shown in different colors for clarity in the x-axis. Log (1/P) values of more than 5 are genome-wide significant. (TIF) Click here for additional data file. Conservation of the chromosomal region encompassing the most likely causal variant (g.19034A>C) in 30 eutherian mammals. For clarity, the sole occurrence of g.20311_20312 ins291 in these mammals is not shown in this figure. (TIF) Click here for additional data file. PCR primers for the characterization of polymorphisms in the IBD region and the resequencing of the porcine gene. (DOC) Click here for additional data file. The effect of the most significant SNPs on the number of thoracic vertebrae in three experimental populations. (DOC) Click here for additional data file. Concordance of candidate causal variants and QTL genotypes of parental pigs in three experimental populations. (DOC) Click here for additional data file.
  22 in total

1.  MultiPipMaker: a comparative alignment server for multiple DNA sequences.

Authors:  Laura Elnitski; Richard Burhans; Cathy Riemer; Ross Hardison; Webb Miller
Journal:  Curr Protoc Bioinformatics       Date:  2010-06

2.  GenABEL: an R library for genome-wide association analysis.

Authors:  Yurii S Aulchenko; Stephan Ripke; Aaron Isaacs; Cornelia M van Duijn
Journal:  Bioinformatics       Date:  2007-03-23       Impact factor: 6.937

3.  A linkage map of the porcine genome from a large-scale White Duroc x Erhualian resource population and evaluation of factors affecting recombination rates.

Authors:  Y Guo; H Mao; J Ren; X Yan; Y Duan; G Yang; D Ren; Z Zhang; B Yang; J Ouyang; B Brenig; C Haley; L Huang
Journal:  Anim Genet       Date:  2009-02       Impact factor: 3.169

4.  Beneficial genotype of swine FUT1 gene governing resistance to E. coli F18 is associated with important economic traits.

Authors:  Wen-Bin Bao; Lan Ye; Zhang-Yuan Pan; Jin Zhu; Guo-Qiang Zhu; Xue-Gen Huang; Sheng-Long Wu
Journal:  J Genet       Date:  2011-08       Impact factor: 1.166

5.  Capucin: a novel striatal marker down-regulated in rodent models of Huntington disease.

Authors:  M de Chaldée; C Brochier; A Van de Vel; N Caudy; R Luthi-Carter; M C Gaillard; J M Elalouf
Journal:  Genomics       Date:  2005-12-15       Impact factor: 5.736

6.  Mapping and fine mapping of quantitative trait loci for the number of vertebrae in a White Duroc × Chinese Erhualian intercross resource population.

Authors:  D R Ren; J Ren; G F Ruan; Y M Guo; L H Wu; G C Yang; L H Zhou; L Li; Z Y Zhang; L S Huang
Journal:  Anim Genet       Date:  2012-02-09       Impact factor: 3.169

7.  Quantitative trait loci analysis on Sus scrofa chromosome 7 for meat production, meat quality, and carcass traits within a Duroc purebred population.

Authors:  Y Uemoto; Y Nagamine; E Kobayashi; S Sato; T Tayama; Y Suda; T Shibata; K Suzuki
Journal:  J Anim Sci       Date:  2008-06-20       Impact factor: 3.159

8.  Identification of a second gene associated with variation in vertebral number in domestic pigs.

Authors:  Satoshi Mikawa; Shuji Sato; Masahiro Nii; Takeya Morozumi; Gou Yoshioka; Noriaki Imaeda; Tsuneko Yamaguchi; Takeshi Hayashi; Takashi Awata
Journal:  BMC Genet       Date:  2011-01-14       Impact factor: 2.797

9.  Strong signatures of selection in the domestic pig genome.

Authors:  Carl-Johan Rubin; Hendrik-Jan Megens; Alvaro Martinez Barrio; Khurram Maqbool; Shumaila Sayyab; Doreen Schwochow; Chao Wang; Örjan Carlborg; Patric Jern; Claus B Jørgensen; Alan L Archibald; Merete Fredholm; Martien A M Groenen; Leif Andersson
Journal:  Proc Natl Acad Sci U S A       Date:  2012-11-14       Impact factor: 11.205

10.  Analyses of pig genomes provide insight into porcine demography and evolution.

Authors:  Martien A M Groenen; Alan L Archibald; Hirohide Uenishi; Christopher K Tuggle; Yasuhiro Takeuchi; Max F Rothschild; Claire Rogel-Gaillard; Chankyu Park; Denis Milan; Hendrik-Jan Megens; Shengting Li; Denis M Larkin; Heebal Kim; Laurent A F Frantz; Mario Caccamo; Hyeonju Ahn; Bronwen L Aken; Anna Anselmo; Christian Anthon; Loretta Auvil; Bouabid Badaoui; Craig W Beattie; Christian Bendixen; Daniel Berman; Frank Blecha; Jonas Blomberg; Lars Bolund; Mirte Bosse; Sara Botti; Zhan Bujie; Megan Bystrom; Boris Capitanu; Denise Carvalho-Silva; Patrick Chardon; Celine Chen; Ryan Cheng; Sang-Haeng Choi; William Chow; Richard C Clark; Christopher Clee; Richard P M A Crooijmans; Harry D Dawson; Patrice Dehais; Fioravante De Sapio; Bert Dibbits; Nizar Drou; Zhi-Qiang Du; Kellye Eversole; João Fadista; Susan Fairley; Thomas Faraut; Geoffrey J Faulkner; Katie E Fowler; Merete Fredholm; Eric Fritz; James G R Gilbert; Elisabetta Giuffra; Jan Gorodkin; Darren K Griffin; Jennifer L Harrow; Alexander Hayward; Kerstin Howe; Zhi-Liang Hu; Sean J Humphray; Toby Hunt; Henrik Hornshøj; Jin-Tae Jeon; Patric Jern; Matthew Jones; Jerzy Jurka; Hiroyuki Kanamori; Ronan Kapetanovic; Jaebum Kim; Jae-Hwan Kim; Kyu-Won Kim; Tae-Hun Kim; Greger Larson; Kyooyeol Lee; Kyung-Tai Lee; Richard Leggett; Harris A Lewin; Yingrui Li; Wansheng Liu; Jane E Loveland; Yao Lu; Joan K Lunney; Jian Ma; Ole Madsen; Katherine Mann; Lucy Matthews; Stuart McLaren; Takeya Morozumi; Michael P Murtaugh; Jitendra Narayan; Dinh Truong Nguyen; Peixiang Ni; Song-Jung Oh; Suneel Onteru; Frank Panitz; Eung-Woo Park; Hong-Seog Park; Geraldine Pascal; Yogesh Paudel; Miguel Perez-Enciso; Ricardo Ramirez-Gonzalez; James M Reecy; Sandra Rodriguez-Zas; Gary A Rohrer; Lauretta Rund; Yongming Sang; Kyle Schachtschneider; Joshua G Schraiber; John Schwartz; Linda Scobie; Carol Scott; Stephen Searle; Bertrand Servin; Bruce R Southey; Goran Sperber; Peter Stadler; Jonathan V Sweedler; Hakim Tafer; Bo Thomsen; Rashmi Wali; Jian Wang; Jun Wang; Simon White; Xun Xu; Martine Yerle; Guojie Zhang; Jianguo Zhang; Jie Zhang; Shuhong Zhao; Jane Rogers; Carol Churcher; Lawrence B Schook
Journal:  Nature       Date:  2012-11-15       Impact factor: 49.962

View more
  25 in total

1.  Genome-wide study refines the quantitative trait locus for number of ribs in a Large White × Minzhu intercross pig population and reveals a new candidate gene.

Authors:  Long-Chao Zhang; Jing-Wei Yue; Lei Pu; Li-Gang Wang; Xin Liu; Jing Liang; Hua Yan; Ke-Bin Zhao; Na Li; Hui-Bi Shi; Yue-Bo Zhang; Li-Xian Wang
Journal:  Mol Genet Genomics       Date:  2016-06-15       Impact factor: 3.291

2.  Genome-wide association study and genomic predictions for exterior traits in Yorkshire pigs1.

Authors:  Jungjae Lee; SeokHyun Lee; Jong-Eun Park; Sung-Ho Moon; Sung-Woon Choi; Gwang-Woong Go; Dajeong Lim; Jun-Mo Kim
Journal:  J Anim Sci       Date:  2019-07-02       Impact factor: 3.159

3.  An imputed whole-genome sequence-based GWAS approach pinpoints causal mutations for complex traits in a specific swine population.

Authors:  Guorong Yan; Xianxian Liu; Shijun Xiao; Wenshui Xin; Wenwu Xu; Yiping Li; Tao Huang; Jiangtao Qin; Lei Xie; Junwu Ma; Zhiyan Zhang; Lusheng Huang
Journal:  Sci China Life Sci       Date:  2021-08-11       Impact factor: 6.038

Review 4.  Mining the pig genome to investigate the domestication process.

Authors:  S E Ramos-Onsins; W Burgos-Paz; A Manunza; M Amills
Journal:  Heredity (Edinb)       Date:  2014-07-30       Impact factor: 3.821

5.  Genome wide association study for the identification of genes associated with tail fat deposition in Chinese sheep breeds.

Authors:  Caiye Zhu; Na Li; Heping Cheng; Youji Ma
Journal:  Biol Open       Date:  2021-05-04       Impact factor: 2.422

6.  A genome-wide association study reveals dominance effects on number of teats in pigs.

Authors:  Marcos S Lopes; John W M Bastiaansen; Barbara Harlizius; Egbert F Knol; Henk Bovenhuis
Journal:  PLoS One       Date:  2014-08-26       Impact factor: 3.240

7.  Possible introgression of the VRTN mutation increasing vertebral number, carcass length and teat number from Chinese pigs into European pigs.

Authors:  Jie Yang; Lusheng Huang; Ming Yang; Yin Fan; Lin Li; Shaoming Fang; Wenjiang Deng; Leilei Cui; Zhen Zhang; Huashui Ai; Zhenfang Wu; Jun Gao; Jun Ren
Journal:  Sci Rep       Date:  2016-01-19       Impact factor: 4.379

8.  A study of vertebra number in pigs confirms the association of vertnin and reveals additional QTL.

Authors:  Gary A Rohrer; Dan J Nonneman; Ralph T Wiedmann; James F Schneider
Journal:  BMC Genet       Date:  2015-10-30       Impact factor: 2.797

9.  Rapid screening for phenotype-genotype associations by linear transformations of genomic evaluations.

Authors:  Jose L Gualdrón Duarte; Rodolfo J C Cantet; Ronald O Bates; Catherine W Ernst; Nancy E Raney; Juan P Steibel
Journal:  BMC Bioinformatics       Date:  2014-07-19       Impact factor: 3.169

10.  SNP- and haplotype-based genome-wide association studies for growth, carcass, and meat quality traits in a Duroc multigenerational population.

Authors:  Shuji Sato; Yoshinobu Uemoto; Takashi Kikuchi; Sachiko Egawa; Kimiko Kohira; Tomomi Saito; Hironori Sakuma; Satoshi Miyashita; Shinji Arata; Takatoshi Kojima; Keiichi Suzuki
Journal:  BMC Genet       Date:  2016-04-19       Impact factor: 2.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.