Literature DB >> 23151582

Analyses of pig genomes provide insight into porcine demography and evolution.

Martien A M Groenen1, Alan L Archibald, Hirohide Uenishi, Christopher K Tuggle, Yasuhiro Takeuchi, Max F Rothschild, Claire Rogel-Gaillard, Chankyu Park, Denis Milan, Hendrik-Jan Megens, Shengting Li, Denis M Larkin, Heebal Kim, Laurent A F Frantz, Mario Caccamo, Hyeonju Ahn, Bronwen L Aken, Anna Anselmo, Christian Anthon, Loretta Auvil, Bouabid Badaoui, Craig W Beattie, Christian Bendixen, Daniel Berman, Frank Blecha, Jonas Blomberg, Lars Bolund, Mirte Bosse, Sara Botti, Zhan Bujie, Megan Bystrom, Boris Capitanu, Denise Carvalho-Silva, Patrick Chardon, Celine Chen, Ryan Cheng, Sang-Haeng Choi, William Chow, Richard C Clark, Christopher Clee, Richard P M A Crooijmans, Harry D Dawson, Patrice Dehais, Fioravante De Sapio, Bert Dibbits, Nizar Drou, Zhi-Qiang Du, Kellye Eversole, João Fadista, Susan Fairley, Thomas Faraut, Geoffrey J Faulkner, Katie E Fowler, Merete Fredholm, Eric Fritz, James G R Gilbert, Elisabetta Giuffra, Jan Gorodkin, Darren K Griffin, Jennifer L Harrow, Alexander Hayward, Kerstin Howe, Zhi-Liang Hu, Sean J Humphray, Toby Hunt, Henrik Hornshøj, Jin-Tae Jeon, Patric Jern, Matthew Jones, Jerzy Jurka, Hiroyuki Kanamori, Ronan Kapetanovic, Jaebum Kim, Jae-Hwan Kim, Kyu-Won Kim, Tae-Hun Kim, Greger Larson, Kyooyeol Lee, Kyung-Tai Lee, Richard Leggett, Harris A Lewin, Yingrui Li, Wansheng Liu, Jane E Loveland, Yao Lu, Joan K Lunney, Jian Ma, Ole Madsen, Katherine Mann, Lucy Matthews, Stuart McLaren, Takeya Morozumi, Michael P Murtaugh, Jitendra Narayan, Dinh Truong Nguyen, Peixiang Ni, Song-Jung Oh, Suneel Onteru, Frank Panitz, Eung-Woo Park, Hong-Seog Park, Geraldine Pascal, Yogesh Paudel, Miguel Perez-Enciso, Ricardo Ramirez-Gonzalez, James M Reecy, Sandra Rodriguez-Zas, Gary A Rohrer, Lauretta Rund, Yongming Sang, Kyle Schachtschneider, Joshua G Schraiber, John Schwartz, Linda Scobie, Carol Scott, Stephen Searle, Bertrand Servin, Bruce R Southey, Goran Sperber, Peter Stadler, Jonathan V Sweedler, Hakim Tafer, Bo Thomsen, Rashmi Wali, Jian Wang, Jun Wang, Simon White, Xun Xu, Martine Yerle, Guojie Zhang, Jianguo Zhang, Jie Zhang, Shuhong Zhao, Jane Rogers, Carol Churcher, Lawrence B Schook.   

Abstract

For 10,000 years pigs and humans have shared a close and complex relationship. From domestication to modern breeding practices, humans have shaped the genomes of domestic pigs. Here we present the assembly and analysis of the genome sequence of a female domestic Duroc pig (Sus scrofa) and a comparison with the genomes of wild and domestic pigs from Europe and Asia. Wild pigs emerged in South East Asia and subsequently spread across Eurasia. Our results reveal a deep phylogenetic split between European and Asian wild boars ∼1 million years ago, and a selective sweep analysis indicates selection on genes involved in RNA processing and regulation. Genes associated with immune response and olfaction exhibit fast evolution. Pigs have the largest repertoire of functional olfactory receptor genes, reflecting the importance of smell in this scavenging animal. The pig genome sequence provides an important resource for further improvements of this important livestock species, and our identification of many putative disease-causing variants extends the potential of the pig as a biomedical model.

Entities:  

Mesh:

Year:  2012        PMID: 23151582      PMCID: PMC3566564          DOI: 10.1038/nature11622

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


Main

The domestic pig (Sus scrofa) is a eutherian mammal and a member of the Cetartiodactyla order, a clade distinct from rodent and primates, that last shared a common ancestor with humans between 79 and 97 million years (Myr) ago[1,2] (http://www.timetree.net). Molecular genetic evidence indicates that Sus scrofa emerged in South East Asia during the climatic fluctuations of the early Pliocene 5.3–3.5 Myr ago. Then, beginning ∼10,000 years ago, pigs were domesticated in multiple locations across Eurasia[3] (Frantz, L. A. F. et al., manuscript submitted). Here we provide a high-quality draft pig genome sequence developed under the auspices of the Swine Genome Sequencing Consortium[4,5], established using bacterial artificial chromosome (BAC)[6] and whole-genome shotgun (WGS) sequences (see Methods and Supplementary Information). The assembly (Sscrofa10.2) comprises 2.60 gigabases (Gb) assigned to chromosomes with a further 212 megabases (Mb) in unplaced scaffolds (Table 1 and Supplementary Tables 1–3).
Table 1

Assembly and annotation statistics

AssemblyPlacedUnplacedAnnotation*
Total length2,596,639,456211,869,92221,640 protein-coding genes
Ungapped length2,323,671,356195,490,322380 pseudogenes
Scaffolds5,3434,5622,965 ncRNAs†
Contigs73,524168,358197,675 gene exons
Scaffold N50637,33298,02226,487 gene transcripts
Contig N5080,7202,423

*Numbers refer to the annotation performed by Ensembl (release 67). Results of an independent annotation by the NCBI can be obtained from http://www.ncbi.nlm.nih.gov/mapview/stats/BuildStats.cgi?taxid=9823&build=4&ver=1.

†An improved ncRNA annotation with 3,601 ncRNAs and structured elements is available as a separate track in Ensembl version 70 and for download from http://rth.dk/resources/rnannotator/susscr102.

N50, 50% of the genome is in fragments of this length or longer.

Assembly and annotation statistics *Numbers refer to the annotation performed by Ensembl (release 67). Results of an independent annotation by the NCBI can be obtained from http://www.ncbi.nlm.nih.gov/mapview/stats/BuildStats.cgi?taxid=9823&build=4&ver=1. †An improved ncRNA annotation with 3,601 ncRNAs and structured elements is available as a separate track in Ensembl version 70 and for download from http://rth.dk/resources/rnannotator/susscr102. N50, 50% of the genome is in fragments of this length or longer.

Genome annotation

A de novo repeat discovery and annotation strategy (Supplementary Fig. 8) revealed a total of 95 novel repeat families, including: 5 long interspersed elements (LINEs), 6 short interspersed elements (SINEs), 8 satellites and 76 long terminal repeats (LTRs). The relative content of repetitive elements (∼40%, Supplementary Figs 9 and 10) is lower than reported for other mammalian genomes. The main repetitive element groups are the LINE1 and glutamic acid transfer RNA (tRNAGlu)-derived SINEs or PRE (porcine repetitive element). The expansion of PRE is specific to the porcine lineage. Phylogenetic analysis of LINE1 and PRE (Supplementary Figs 13 and 14) indicates that only a single lineage of each is currently active and that the main expansion of both LINE1 and PRE occurred in the first half of the Tertiary period. Smaller expansions, particularly in LINE1, have occurred since, but recent activity is very low (Supplementary Information). Annotation of genes, transcripts and predictions of orthologues and paralogues was performed using the Ensembl analysis pipeline[7] (Table 1 and Supplementary Figs 3–7). Further annotation for non-protein-coding RNAs (ncRNAs) was undertaken with another analysis pipeline (Supplementary Information and Supplementary Table 4).

Evolution of the porcine genome

Evolution of genes and gene families

To examine the mutation rate and type of protein-coding genes that show accelerated evolution in pigs, we identified ∼9,000 as 1:1 orthologues within a group of six mammals (human, mouse, dog, horse, cow and pig). This orthologous gene set was used to identify proteins that show accelerated evolution in each of these six mammalian lineages (Supplementary Information). The observed number of synonymous substitutions per synonymous site (dS) for the pig lineage (0.160) is similar to that of the other mammals (0.138–0.201) except for the mouse (0.458), indicating similar evolutionary rates in pigs and other mammals. The observed dN/dS ratio (ratio of the rate of non-synonymous substitutions to the rate of synonymous substitutions) of 0.144 is between those of humans (0.163) and mice (0.116), indicating an intermediate level of purifying selection pressure in the pig. Genes showing increased dN/dS ratios in each lineage were analysed using DAVID[8] to examine whether these rapidly evolving genes were enriched for specific biological processes. Most lineages show different fast-evolving pathways, but some pathways are shared (Fig. 1).
Figure 1

Phylogeny of the six mammals used in the dN/dS analysis.

KEGG pathways with genes that show accelerated evolution for each of the six mammals used in the dN/dS analysis. The bar charts show the individual dN/dS and dS values for each of the six mammals. The dN/dS and dS values refer to the time period of each of the six individual lineages. The number of proteins that show significantly accelerated dN/dS ratios in each lineage varies from 84 in the mouse to 311 in the pig lineage. Pathways significantly (P < 0.05) enriched within this group of genes are also shown with the number of genes shown in brackets. HPI, Helicobacter pylori infection.

PowerPoint slide

Phylogeny of the six mammals used in the dN/dS analysis.

KEGG pathways with genes that show accelerated evolution for each of the six mammals used in the dN/dS analysis. The bar charts show the individual dN/dS and dS values for each of the six mammals. The dN/dS and dS values refer to the time period of each of the six individual lineages. The number of proteins that show significantly accelerated dN/dS ratios in each lineage varies from 84 in the mouse to 311 in the pig lineage. Pathways significantly (P < 0.05) enriched within this group of genes are also shown with the number of genes shown in brackets. HPI, Helicobacter pylori infection. PowerPoint slide Immune genes are known to be actively evolving in mammals[9,10]. Because many immune genes were not included in the analysis of 1:1 orthologues, we examined a randomly selected subset of 158 immunity-related pig proteins for evidence of accelerated evolution (Supplementary information). Twenty-seven of these genes (17%) demonstrated accelerated evolution (Supplementary Table 8). A parallel analysis of 143 human and 145 bovine orthologues revealed very similar rates of evolution (18% in human and 12% in cattle, respectively). Using a branch-site analysis, we detected accelerated evolution of amino acids in PRSS12, CD1D and TRAF3 specific to pig (positive selection on pig branch), as well as amino acids in TREM1, IL1B and SCARA5 specific to pig and cow (positive selection on the cetartiodactyl branch). Further analysis of porcine immune genes (Supplementary Table 5) revealed evidence for specific gene duplications and gene-family expansions (Supplementary Tables 6 and 7). The analysis of this second cetartiodactyl genome indicates that some expansions are cetartiodactyl-specific (cathelicidin) whereas others are ruminant/bovine-specific (β-defensins, C-type lyzozymes) or potentially porcine-specific (type I interferon, δ subfamily). Pigs have at least 39 type I interferon (IFN) genes, which is twice the number identified in humans and significantly more than in mice. We also detected 16 pseudogenes in this family. Cattle have 51 type I IFNs (13 pseudogenes), indicating that both bovine and porcine type I IFN families have undergone expansion. This is particularly important for interferon subtypes δ (IFND), ω (IFNW) and τ (IFNT); pigs and cattle are evolving species-specific subtypes of IFND and IFNT, respectively. Both species are expanding the IFNW family and share many more IFNW isoforms than other species. Thus, expansion of interferon genes is not ruminant-specific as proposed earlier[10], although duplication within some specific sub-families seems to be either bovine- or porcine-specific. Within the immunity-related genes annotated, we found evidence for duplication of six immune-related genes: IL1B, CD36, CD68, CD163, CRP and IFIT1, and one non-immune gene, RDH16. The CD36 gene is also duplicated in the bovine genome, whereas the IL1B gene duplication, where evidence for a partial duplication was reported previously[11], is unique in mammals. Other key immune genes in the major histocompatibility complex, immunoglobulin, T-cell-receptor and natural killer cell receptor loci have been characterized in detail[12,13,14,15,16,17,18,19] (Supplementary Information). Another significant porcine genome expansion is the olfactory receptor gene family. We identified 1,301 porcine olfactory receptor genes and 343 partial olfactory receptor genes[20]. The fraction of pseudogenes within these olfactory receptor sequences (14%) is the lowest observed in any species so far. This large number of functional olfactory receptor genes most probably reflects the strong reliance of pigs on their sense of smell while scavenging for food.

Conservation of synteny and evolutionary breakpoints

Alignment of the porcine genome against seven other mammalian genomes (Supplementary Information) identified homologous synteny blocks (HSBs). Using porcine HSBs and stringent filtering criteria, 192 pig-specific evolutionary breakpoint regions (EBRs) were located. The number of porcine EBRs (146, Supplementary Table 11 and Supplementary Fig. 16) is comparable to the number of bovine-lineage-specific EBRs (100) reported earlier using a slightly lower resolution (500 kilobases (kb)), indicating that both lineages evolved with an average rate of ∼2.1 large-scale rearrangements per million years after the divergence from a common cetartiodactyl ancestor ∼60 Myr ago[2]. This rate compares to ∼1.9 rearrangements per million years within the primate lineage (Supplementary Table 11). A total of 20 and 18 cetartiodactyl EBRs (shared by pigs and cattle) were detected using the pig and human genomes as a reference, respectively. Pig-specific EBRs were enriched for LTR endogenous retrovirus 1 (LTR-ERV1) transposons and satellite repeats (Supplementary Table 12), indicating that these two families of repetitive sequences have contributed to chromosomal evolution in the pig lineage. Different families of transposable elements seem to have been active in the cetartiodactyl ancestor. The cetartiodactyl EBRs are enriched for LINE1 elements and tRNAGlu-derived SINEs. tRNAGlu-derived SINEs, previously found over-represented in cetartiodactyl EBRs defined in the bovine genome[10], originated in the common ancestor of cetartiodactyls[21]. Our observation that these elements are also enriched in porcine EBRs strongly supports the hypothesis that active transposable elements promote lineage-specific genomic rearrangements. A stringent set of porcine to human one-to-one orthologues using the MetaCore database revealed that porcine EBRs and adjacent intervals are enriched for genes involved in sensory perception of taste (P < 8.9 × 10−6; FDR <0.05), indicating that taste phenotypes may have been affected by events associated with genomic rearrangements. Pigs have a limited ability to taste NaCl[22]. SCNN1B, a gene encoding a sodium channel involved in the perception of salty tastes, is located in a porcine-specific EBR. Another gene, ITPR3, encoding a receptor for inositol triphosphate and a calcium channel involved in the perception of umami and sweet tastes, has been affected by the insertion of several porcine-specific SINE mobile elements into its 3′ untranslated region (3′ UTR), consistent with our observation of a higher density of transposable elements in EBRs. In addition to 8 bitter taste receptor genes annotated by Ensembl and which were used in the gene enrichment analysis, we identified 9 intact genes, to give a total number of 17 TAS2R receptors in the pig (Supplementary Table 13). This compares to 18 intact bitter taste receptors in cattle, 19 in horse, 15 in dog and 25 in humans[23,24]. Of the 14 bitter taste receptor genes that were mapped to a specific pig chromosome (SSC), 10 were found near 2 EBRs on SSC5 and SSC18 (Supplementary Tables 13 and 15). We also found that at least four taste receptors (TAS1R2, TAS2R1, TAS2R40 and TAS2R39) have been under relaxed selection (Supplementary Information). Pigs are not sensitive to bitter tastes and tolerate higher concentrations of bitter compounds than humans[22,25]. Thus, pigs can eat food that is unpalatable to humans. A review of the porcine taste transduction network (Supplementary Fig. 17) revealed additional genes affected by rearrangements that affect ‘apical and taste receptor cell’ processes. Together with the observed over-representation of genes related to ‘adrenergic receptor activity’ and ‘angiotensin and other binding’ categories in the pig EBRs (Supplementary Fig. 18), our data indicate that chromosomal rearrangements significantly contributed to adaptation in the suid lineage.

Population divergence and domestication

Divergence between Asian and European wild boar

We investigated the evolution within Sus scrofa in Eurasia by sequencing ten individual unrelated wild boars from different geographical areas. In total, 17,210,760 single nucleotide polymorphisms (SNPs) were identified among these ten wild boars. The number of SNPs segregating in the four Asian wild boars (11,472,192) was much higher than that observed in the six European wild boars (6,407,224) with only 2,212,288 shared SNPs. This higher nucleotide diversity was visible in the distribution of heterozygous sites of the Asian compared to the European wild boar genomes (Fig. 2). Phylogenomic analyses of complete genome sequences from these wild boars and six domestic pigs revealed distinct Asian and European lineages (Supplementary Fig. 23) that split during the mid-Pleistocene 1.6–0.8 Myr ago (Calabrian stage, Frantz, L. A. F. et al., manuscript submitted). Colder climates during the Calabrian glacial intervals probably triggered isolation of populations across Eurasia. Admixture analyses (Supplementary Information) within Eurasian Sus scrofa disclosed gene flow between the northern Chinese and European populations consistent with pig migration across Eurasia, between Europe and northern China, throughout the Pleistocene. Our demographic analysis on the whole-genome sequences of European and Asian wild boars (Fig. 3) revealed an increase in the European population after pigs arrived from China. During the Last Glacial Maximum (LGM; ∼20,000 years ago)[26], however, Asian and European populations both suffered population bottlenecks. The drop in population size was more pronounced in Europe than Asia (Fig. 3), suggesting a greater impact of the LGM in northern European regions and probably resulting in the observed lower genetic diversity in modern European wild boar.
Figure 2

Distribution of heterozygosity for individual pig genomes.

Shown is the distribution of the heterozygosity as the log2(SNPs) per 10k bin. a, Wild Sus scrofa: blue, south China; green, north China; orange, Italian; red, Dutch. b, Breeds: blue, Chinese breeds (Jiangquhai, Meishan, Xiang); red–yellow, European breeds (Hampshire, large white, landrace). Note that the Hampshire breed is a North American breed of European origin.

PowerPoint slide

Figure 3

Demographic history of wild boars.

Demographic history was inferred using a hidden Markov model (HMM) approach as implemented in pairwise sequentially Markovian coalescence (PSMC)[45]. In the absence of known mutation rates for pig, we used the default mutation rate for human (μ) of 2.5 × 10−8. For the generation time (g) we used an estimate of 5 years. The Last Glacial Maximum (LGM) is highlighted in grey. WBnl, wild boar Netherlands; WBit, wild boar Italy; WBNch, wild boar north China; WBSch, wild boar south China.

PowerPoint slide

Distribution of heterozygosity for individual pig genomes.

Shown is the distribution of the heterozygosity as the log2(SNPs) per 10k bin. a, Wild Sus scrofa: blue, south China; green, north China; orange, Italian; red, Dutch. b, Breeds: blue, Chinese breeds (Jiangquhai, Meishan, Xiang); red–yellow, European breeds (Hampshire, large white, landrace). Note that the Hampshire breed is a North American breed of European origin. PowerPoint slide

Demographic history of wild boars.

Demographic history was inferred using a hidden Markov model (HMM) approach as implemented in pairwise sequentially Markovian coalescence (PSMC)[45]. In the absence of known mutation rates for pig, we used the default mutation rate for human (μ) of 2.5 × 10−8. For the generation time (g) we used an estimate of 5 years. The Last Glacial Maximum (LGM) is highlighted in grey. WBnl, wild boar Netherlands; WBit, wild boar Italy; WBNch, wild boar north China; WBSch, wild boar south China. PowerPoint slide The deep phylogenetic split between European and Asian wild boars is further supported by our observation of 1,272,737 fixed differences between the six European and four Asian wild boars, 1,706 of which are non-synonymous mutations in 1,191 different genes. Genes involved in sensory perception, immunity and host defence were among the most rapidly evolving genes (Supplementary Table 28), further strengthening the conclusions from our analysis of immunity-related pig proteins. This conclusion is further supported by our observation that these genes are also enriched in porcine segmental duplications (Supplementary Information). To investigate further whether specific regions in the genome of European and Asian wild boar have been under positive selection, a selective sweep analysis was performed on the ten wild boar genome sequences using an approach similar to that recently described in the comparison of Neanderthal and Homo sapiens genomes[27]. Regions in the genome under strong positive selection after the divergence of these two populations are expected to share fewer derived alleles. Using stringent criteria (Supplementary Information), we identified a total of 251 putative selective sweep regions, with an average size of 111,269 base pairs (bp), together comprising around 1% of the genome and harbouring 365 annotated protein-coding genes (Supplementary Table 26). Many of these regions (112) do not contain any currently annotated protein-coding exons. In contrast, the 10 putative selective sweep regions located between positions 39–43 Mb on SSC3 together harbour 93 genes. This SSC3 region (Supplementary Fig. 25) is directly adjacent to the centromere and exhibits low recombination rates[28]. Low recombining regions have been shown to be more prone to selective sweeps and meiotic drive[29,30]. Although similar large putative selective sweep regions close to the centromere were only observed on SSC6 between positions 56.2–57.5 Mb, on most chromosomes selective sweep regions tended to cluster in the central part of chromosomes, thus exhibiting a clear correlation with regions of low recombination (Supplementary Fig. 27). As expected, regions with the highest nucleotide differentiation between European and Asian wild boars were observed in high recombination regions towards the end of the chromosomes on both metacentric and acrocentric chromosomes[28]. The putative selective sweep regions displayed significant over-representation of genes involved in RNA splicing and RNA processing, indicating possible changes in the regulation of genes at the level of RNA processing (Supplementary Table 27). Several of these genes (CELF1, CELF6, WDR83, RBM39, RBM6, HNRNPA1, HNRNPM) are involved in alternative splicing, and, small differences in expression might affect alternatively spliced transcripts of specific genes. Evolution of regulatory splicing factors such as heterogeneous ribonucleoprotein particle (hnRNP) proteins has been proposed as an evolutionary model for alternative splicing[31], and genetic variation in such factors can affect alternative splicing and result in different phenotypes or disease[32]. Our observation that specific genes involved in splicing show accelerated evolution in the pig lineage (Fig. 1) supports this hypothesis. Of particular interest is the selective sweep region observed at position 26 Mb on SSC3 around the ERI2 gene (Fig. 4), which encodes ERI1 exoribonuclease family member 2. Different gene variants have been fixed in European and Asian wild boar coding for proteins that differ at two amino acid positions: Cys52Arg and His358Leu encoded by exons 3 and 9 of the ERI2 gene, respectively. The precise function of ERI2 is unknown but the ERI1 exoribonuclease family members have been shown to be involved in mRNA decay[33] and in Caenorhabditis elegans ERI-1 has been shown to be involved in the degradation of microRNAs (miRNAs)[34].
Figure 4

Putative selective sweep region around the gene on SSC3.

The y axis shows the log-transformed value of the ratio for the observed/expected derived allele frequency using a sliding window at a bin size of 50,000 bp. The x axis shows the position on SSC3 in base pairs.

PowerPoint slide

Putative selective sweep region around the gene on SSC3.

The y axis shows the log-transformed value of the ratio for the observed/expected derived allele frequency using a sliding window at a bin size of 50,000 bp. The x axis shows the position on SSC3 in base pairs. PowerPoint slide

Independent domestication and admixture events in domestic breeds

A phylogenetic tree constructed using four European wild boar and domestic pigs and six East Asian wild boar and domestic pigs revealed a clear distinction between European and Asian breeds, thus substantiating the hypothesis that pigs were independently domesticated in western Eurasia and East Asia[3]. An admixture analysis revealed European influence in Asian breeds, and a ∼35% Asian fraction in European breeds (Supplementary Table 24). These results are consistent with the known exchange of genetic material between European and Asian pig breeds[35]. We also observed that European breeds form a paraphyletic clade, which cannot be solely explained by varying degrees of Asian admixture (Supplementary Information). Within each continent, our analysis revealed different degrees of relatedness between breeds and their respective wild relatives (Supplementary Table 20). During domestication, pigs were often allowed to roam in a semi-managed state and recurrent admixture between wild and domesticated individuals was not uncommon, especially in Europe[35]. Thus, the most likely explanation for the paraphyletic pattern seen in domestic individuals is a long history of genetic exchange between wild and domestic pigs.

The pig as a biomedical model

The pig is an important biomedical model and the ability to generate transgenics and knockouts in combination with somatic nuclear cloning procedures has resulted in a number of models for specific human diseases[36]. Naturally occurring mutations also offer opportunities to use pigs as biomedical models[37,38]. To explore the potential for natural models further, predicted porcine protein sequences were compared with their human orthologues. We observed 112 positions where the porcine protein has the same amino acid that is implicated in a human disease (Supplementary Table 29). Most of these changes in humans have been shown to increase risk in multifactorial traits such as obesity (ADRB3, SDC3) and diabetes (PPP1RA, SLC30A8, ZNF615) or shown to result in relatively mild phenotypes (for example, dyslexia: KIAA0319) or late-onset diseases such as Parkinson’s disease (LRRK2, SNCA) and Alzheimer’s disease (TUBD1, BLMH, CEP192, PLAU). These porcine variants are of interest, as they will allow detailed characterization in an experimental model organism whose physiology is very similar to that of human. Among 32,548 non-synonymous mutations identified by sequencing 48 individual pigs, representing 8 different European and Asian breeds and wild boars[39], 6 protein variants implicated in human disease were identified (Supplementary Table 30). In addition, another 157 nonsense mutations in 142 genes were identified, 11 of which have also been implicated in human disease (Supplementary Table 31). Most of these 11 variants were only observed in a heterozygous state and those for which homozygous individuals were observed probably result in either a mild phenotype (ASS1, mild form of citrullinaemia in humans) or in phenotypes unlikely to affect the fitness of wild boars (RBBP8, pancreatic carcinomas). Our estimate for the average number of nonsense mutations per individual (∼30) is smaller than that observed in humans[40] despite the observed threefold higher nucleotide diversity in pigs[39]. This is in agreement with the higher effective population size in the pig compared to that for the human population, which exhibited a strong bottleneck followed by an exponential increase in size during recent history[41]. When considering pig-to-human xenotransplantation, porcine endogenous retroviruses (PERVs) pose a risk of zoonotic infection. The pig genome contains fewer endogenous retroviruses than many vertebrates, including humans and mice, and most PERVs were characterized as defective. However, the potential risk posed by re-activation of rare replication-competent PERVs and defective PERVs by recombination remains, as shown for murine ERVs (XMRV)[42]. Most PERVs consist of γ and γ-like groups (68%), with β-retroviral ERVs comprising the second most abundant group (Supplementary Fig. 15). Our phylogenetic study shows a particularly close relationship between the most intact γ1 group of PERVs (γ1) and murine γ-ERVs, suggesting a potential recent instance of murine-to-porcine transmission of γ1 ERVs (Supplementary Fig. 15). We identified 20 almost intact PERV γ1 loci (Supplementary Table 10), none of which contained a complete set of gag, pol or env open reading frames, indicating that these proviruses are not replicable. We also identified four β-retroviral PERVs, each containing defects, primarily in env. These were distantly related to intracisternal type A particle (IAP) proviruses of mice and the mouse mammary tumour virus (MMTV)-like (HML) proviruses of humans. None of the above loci was shared in more than 120 pigs tested, indicating considerable PERV polymorphisms.

Conclusion

The draft pig genome sequence reported here has illuminated the evolution of Sus scrofa and confirmed its speciation in South East Asia and subsequent domestication at multiple regions across Eurasia. The high-quality annotated reference genome sequence has already proven to be a critical framework for comparing individual genomes[39,43,44] and its value is further illustrated in associated papers published elsewhere (http://www.biomedcentral.com/series/swine). The genome sequence also provides a valuable resource enabling effective uses of pigs both in agricultural production and in biomedical research.

Methods Summary

Assembly

We constructed a hybrid de novo assembly based primarily on sequences from BAC clones sequenced clone-by-clone and supplemented with Illumina whole-genome shotgun (WGS) reads. BAC clones were selected from the high-resolution physical (BAC contig) map[6] with CHORI-242 library clones prepared from DNA from a single Duroc sow (Duroc 2-14) chosen preferentially. The WGS sequence data were generated using DNA isolated from the same animal. The BAC-derived sequence data were assembled into sequence contigs using Phrap on a clone-by-clone basis and subsequently independently assembled WGS contigs (Supplementary Information) were used to extend BAC clone-derived sequence contigs and to close gaps between clone-derived contigs. Further details and other methods are described in Supplementary Information.
  41 in total

1.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

Review 2.  From evolutionary genetics to human immunology: how selection shapes host defence genes.

Authors:  Luis B Barreiro; Lluís Quintana-Murci
Journal:  Nat Rev Genet       Date:  2009-12-01       Impact factor: 53.242

3.  Impacts of the Cretaceous Terrestrial Revolution and KPg extinction on mammal diversification.

Authors:  Robert W Meredith; Jan E Janečka; John Gatesy; Oliver A Ryder; Colleen A Fisher; Emma C Teeling; Alisha Goodbla; Eduardo Eizirik; Taiz L L Simão; Tanja Stadler; Daniel L Rabosky; Rodney L Honeycutt; John J Flynn; Colleen M Ingram; Cynthia Steiner; Tiffani L Williams; Terence J Robinson; Angela Burk-Herrick; Michael Westerman; Nadia A Ayoub; Mark S Springer; William J Murphy
Journal:  Science       Date:  2011-09-22       Impact factor: 47.728

4.  Response of pigs to bitter-tasting compounds.

Authors:  S L Nelson; J D Sanregret
Journal:  Chem Senses       Date:  1997-04       Impact factor: 3.160

Review 5.  Cheaters sometimes prosper: distortion of mendelian segregation by meiotic drive.

Authors:  T W Lyttle
Journal:  Trends Genet       Date:  1993-06       Impact factor: 11.639

6.  A substitution of cysteine for arginine 614 in the ryanodine receptor is potentially causative of human malignant hyperthermia.

Authors:  E F Gillard; K Otsu; J Fujii; V K Khanna; S de Leon; J Derdemezi; B A Britt; C L Duff; R G Worton; D H MacLennan
Journal:  Genomics       Date:  1991-11       Impact factor: 5.736

7.  Alternative splicing in disease and therapy.

Authors:  Mariano A Garcia-Blanco; Andrew P Baraniak; Erika L Lasda
Journal:  Nat Biotechnol       Date:  2004-05       Impact factor: 54.908

8.  Identification of a single killer immunoglobulin-like receptor (KIR) gene in the porcine leukocyte receptor complex on chromosome 6q.

Authors:  Jennifer G Sambrook; Harminder Sehra; Penny Coggill; Sean Humphray; Sophie Palmer; Sarah Sims; Haru-Hisa Takamatsu; Thomas Wileman; Alan L Archibald; Stephan Beck
Journal:  Immunogenetics       Date:  2006-04-29       Impact factor: 2.846

9.  The genome sequence of taurine cattle: a window to ruminant biology and evolution.

Authors:  Christine G Elsik; Ross L Tellam; Kim C Worley; Richard A Gibbs; Donna M Muzny; George M Weinstock; David L Adelson; Evan E Eichler; Laura Elnitski; Roderic Guigó; Debora L Hamernik; Steve M Kappes; Harris A Lewin; David J Lynn; Frank W Nicholas; Alexandre Reymond; Monique Rijnkels; Loren C Skow; Evgeny M Zdobnov; Lawrence Schook; James Womack; Tyler Alioto; Stylianos E Antonarakis; Alex Astashyn; Charles E Chapple; Hsiu-Chuan Chen; Jacqueline Chrast; Francisco Câmara; Olga Ermolaeva; Charlotte N Henrichsen; Wratko Hlavina; Yuri Kapustin; Boris Kiryutin; Paul Kitts; Felix Kokocinski; Melissa Landrum; Donna Maglott; Kim Pruitt; Victor Sapojnikov; Stephen M Searle; Victor Solovyev; Alexandre Souvorov; Catherine Ucla; Carine Wyss; Juan M Anzola; Daniel Gerlach; Eran Elhaik; Dan Graur; Justin T Reese; Robert C Edgar; John C McEwan; Gemma M Payne; Joy M Raison; Thomas Junier; Evgenia V Kriventseva; Eduardo Eyras; Mireya Plass; Ravikiran Donthu; Denis M Larkin; James Reecy; Mary Q Yang; Lin Chen; Ze Cheng; Carol G Chitko-McKown; George E Liu; Lakshmi K Matukumalli; Jiuzhou Song; Bin Zhu; Daniel G Bradley; Fiona S L Brinkman; Lilian P L Lau; Matthew D Whiteside; Angela Walker; Thomas T Wheeler; Theresa Casey; J Bruce German; Danielle G Lemay; Nauman J Maqbool; Adrian J Molenaar; Seongwon Seo; Paul Stothard; Cynthia L Baldwin; Rebecca Baxter; Candice L Brinkmeyer-Langford; Wendy C Brown; Christopher P Childers; Timothy Connelley; Shirley A Ellis; Krista Fritz; Elizabeth J Glass; Carolyn T A Herzig; Antti Iivanainen; Kevin K Lahmers; Anna K Bennett; C Michael Dickens; James G R Gilbert; Darren E Hagen; Hanni Salih; Jan Aerts; Alexandre R Caetano; Brian Dalrymple; Jose Fernando Garcia; Clare A Gill; Stefan G Hiendleder; Erdogan Memili; Diane Spurlock; John L Williams; Lee Alexander; Michael J Brownstein; Leluo Guan; Robert A Holt; Steven J M Jones; Marco A Marra; Richard Moore; Stephen S Moore; Andy Roberts; Masaaki Taniguchi; Richard C Waterman; Joseph Chacko; Mimi M Chandrabose; Andy Cree; Marvin Diep Dao; Huyen H Dinh; Ramatu Ayiesha Gabisi; Sandra Hines; Jennifer Hume; Shalini N Jhangiani; Vandita Joshi; Christie L Kovar; Lora R Lewis; Yih-Shin Liu; John Lopez; Margaret B Morgan; Ngoc Bich Nguyen; Geoffrey O Okwuonu; San Juana Ruiz; Jireh Santibanez; Rita A Wright; Christian Buhay; Yan Ding; Shannon Dugan-Rocha; Judith Herdandez; Michael Holder; Aniko Sabo; Amy Egan; Jason Goodell; Katarzyna Wilczek-Boney; Gerald R Fowler; Matthew Edward Hitchens; Ryan J Lozado; Charles Moen; David Steffen; James T Warren; Jingkun Zhang; Readman Chiu; Jacqueline E Schein; K James Durbin; Paul Havlak; Huaiyang Jiang; Yue Liu; Xiang Qin; Yanru Ren; Yufeng Shen; Henry Song; Stephanie Nicole Bell; Clay Davis; Angela Jolivet Johnson; Sandra Lee; Lynne V Nazareth; Bella Mayurkumar Patel; Ling-Ling Pu; Selina Vattathil; Rex Lee Williams; Stacey Curry; Cerissa Hamilton; Erica Sodergren; David A Wheeler; Wes Barris; Gary L Bennett; André Eggen; Ronnie D Green; Gregory P Harhay; Matthew Hobbs; Oliver Jann; John W Keele; Matthew P Kent; Sigbjørn Lien; Stephanie D McKay; Sean McWilliam; Abhirami Ratnakumar; Robert D Schnabel; Timothy Smith; Warren M Snelling; Tad S Sonstegard; Roger T Stone; Yoshikazu Sugimoto; Akiko Takasuga; Jeremy F Taylor; Curtis P Van Tassell; Michael D Macneil; Antonio R R Abatepaulo; Colette A Abbey; Virpi Ahola; Iassudara G Almeida; Ariel F Amadio; Elen Anatriello; Suria M Bahadue; Fernando H Biase; Clayton R Boldt; Jeffery A Carroll; Wanessa A Carvalho; Eliane P Cervelatti; Elsa Chacko; Jennifer E Chapin; Ye Cheng; Jungwoo Choi; Adam J Colley; Tatiana A de Campos; Marcos De Donato; Isabel K F de Miranda Santos; Carlo J F de Oliveira; Heather Deobald; Eve Devinoy; Kaitlin E Donohue; Peter Dovc; Annett Eberlein; Carolyn J Fitzsimmons; Alessandra M Franzin; Gustavo R Garcia; Sem Genini; Cody J Gladney; Jason R Grant; Marion L Greaser; Jonathan A Green; Darryl L Hadsell; Hatam A Hakimov; Rob Halgren; Jennifer L Harrow; Elizabeth A Hart; Nicola Hastings; Marta Hernandez; Zhi-Liang Hu; Aaron Ingham; Terhi Iso-Touru; Catherine Jamis; Kirsty Jensen; Dimos Kapetis; Tovah Kerr; Sari S Khalil; Hasan Khatib; Davood Kolbehdari; Charu G Kumar; Dinesh Kumar; Richard Leach; Justin C-M Lee; Changxi Li; Krystin M Logan; Roberto Malinverni; Elisa Marques; William F Martin; Natalia F Martins; Sandra R Maruyama; Raffaele Mazza; Kim L McLean; Juan F Medrano; Barbara T Moreno; Daniela D Moré; Carl T Muntean; Hari P Nandakumar; Marcelo F G Nogueira; Ingrid Olsaker; Sameer D Pant; Francesca Panzitta; Rosemeire C P Pastor; Mario A Poli; Nathan Poslusny; Satyanarayana Rachagani; Shoba Ranganathan; Andrej Razpet; Penny K Riggs; Gonzalo Rincon; Nelida Rodriguez-Osorio; Sandra L Rodriguez-Zas; Natasha E Romero; Anne Rosenwald; Lillian Sando; Sheila M Schmutz; Libing Shen; Laura Sherman; Bruce R Southey; Ylva Strandberg Lutzow; Jonathan V Sweedler; Imke Tammen; Bhanu Prakash V L Telugu; Jennifer M Urbanski; Yuri T Utsunomiya; Chris P Verschoor; Ashley J Waardenberg; Zhiquan Wang; Robert Ward; Rosemarie Weikard; Thomas H Welsh; Stephen N White; Laurens G Wilming; Kris R Wunderlich; Jianqi Yang; Feng-Qi Zhao
Journal:  Science       Date:  2009-04-24       Impact factor: 47.728

10.  Completion of the swine genome will simplify the production of swine as a large animal biomedical model.

Authors:  Eric M Walters; Eckhard Wolf; Jeffery J Whyte; Jiude Mao; Simone Renner; Hiroshi Nagashima; Eiji Kobayashi; Jianguo Zhao; Kevin D Wells; John K Critser; Lela K Riley; Randall S Prather
Journal:  BMC Med Genomics       Date:  2012-11-15       Impact factor: 3.063

View more
  550 in total

1.  The combination of metformin and 2-deoxyglucose significantly inhibits cyst formation in miniature pigs with polycystic kidney disease.

Authors:  Xiaoying Lian; Xiaoyuan Wu; Zhongxin Li; Yingjie Zhang; Kangkang Song; Guangyan Cai; Qinggang Li; Shupeng Lin; Xiangmei Chen; Xue-Yuan Bai
Journal:  Br J Pharmacol       Date:  2019-01-10       Impact factor: 8.739

2.  On the importance of being structured: instantaneous coalescence rates and human evolution--lessons for ancestral population size inference?

Authors:  O Mazet; W Rodríguez; S Grusea; S Boitard; L Chikhi
Journal:  Heredity (Edinb)       Date:  2015-12-09       Impact factor: 3.821

3.  Comparative genomic analysis of eutherian tumor necrosis factor ligand genes.

Authors:  Marko Premzl
Journal:  Immunogenetics       Date:  2015-12-09       Impact factor: 2.846

4.  The devil is in the details: the effect of population structure on demographic inference.

Authors:  P Orozco-terWengel
Journal:  Heredity (Edinb)       Date:  2016-02-17       Impact factor: 3.821

5.  Reconstruction and evolutionary history of eutherian chromosomes.

Authors:  Jaebum Kim; Marta Farré; Loretta Auvil; Boris Capitanu; Denis M Larkin; Jian Ma; Harris A Lewin
Journal:  Proc Natl Acad Sci U S A       Date:  2017-06-19       Impact factor: 11.205

6.  No imprinted XIST expression in pigs: biallelic XIST expression in early embryos and random X inactivation in placentas.

Authors:  Huiying Zou; Dawei Yu; Xuguang Du; Jing Wang; Lei Chen; Yangyang Wang; Huitao Xu; Yunxuan Zhao; Shanjiang Zhao; Yunwei Pang; Yan Liu; Haisheng Hao; Xueming Zhao; Weihua Du; Yunping Dai; Ning Li; Sen Wu; Huabin Zhu
Journal:  Cell Mol Life Sci       Date:  2019-05-28       Impact factor: 9.261

7.  Expression variation of the porcine ADRB2 has a complex genetic background.

Authors:  Eduard Murani; Siriluck Ponsuksili; Henry Reyer; Dörte Wittenburg; Klaus Wimmers
Journal:  Mol Genet Genomics       Date:  2013-08-31       Impact factor: 3.291

8.  Haplotypic diversity of porcine LEP and LEPR genes involved in growth and fatness regulation.

Authors:  Dafne Pérez-Montarelo; M Carmen Rodríguez; Almudena Fernández; Rita Benítez; Fabián García; Luis Silió; Ana I Fernández
Journal:  J Appl Genet       Date:  2015-04-25       Impact factor: 3.240

9.  Porcine colonization of the Americas: a 60k SNP story.

Authors:  W Burgos-Paz; C A Souza; H J Megens; Y Ramayo-Caldas; M Melo; C Lemús-Flores; E Caal; H W Soto; R Martínez; L A Alvarez; L Aguirre; V Iñiguez; M A Revidatti; O R Martínez-López; S Llambi; A Esteve-Codina; M C Rodríguez; R P M A Crooijmans; S R Paiva; L B Schook; M A M Groenen; M Pérez-Enciso
Journal:  Heredity (Edinb)       Date:  2012-12-19       Impact factor: 3.821

10.  Dissecting structural and nucleotide genome-wide variation in inbred Iberian pigs.

Authors:  Anna Esteve-Codina; Yogesh Paudel; Luca Ferretti; Emanuele Raineri; Hendrik-Jan Megens; Luis Silió; María C Rodríguez; Martein A M Groenen; Sebastian E Ramos-Onsins; Miguel Pérez-Enciso
Journal:  BMC Genomics       Date:  2013-03-05       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.