Literature DB >> 31525233

Genetic diversity and population structure of four Chinese rabbit breeds.

Anyong Ren1, Kun Du1, Xianbo Jia1, Rui Yang2, Jie Wang1, Shi-Yi Chen1, Song-Jia Lai1.   

Abstract

There are a few well-known indigenous breeds of Chinese rabbits in Sichuan and Fujian provinces, for which the genetic diversity and population structure have been poorly investigated. In the present study, we successfully employed the restriction-site-associated DNA sequencing (RAD-seq) approach to comprehensively discover genome-wide SNPs of 104 rabbits from four Chinese indigenous breeds: 30 Sichuan White, 34 Tianfu Black, 32 Fujian Yellow and eight Fujian Black. A total of 7,055,440 SNPs were initially obtained, from which 113,973 high-confidence SNPs (read depth ≥ 3, calling rate = 100% and biallelic SNPs) were selected to study the genetic diversity and population structure. The mean polymorphism information content (PIC) and nucleotide diversity (π) of each breed slightly varied with ranging from 0.2000 to 0.2281 and from 0.2678 to 0.2902, respectively. On the whole, Fujian Yellow rabbits showed the highest genetic diversity, which was followed by Tianfu Black and Sichuan White rabbits. The principal component analysis (PCA) revealed that the four breeds were clearly distinguishable. Our results first reveal the genetic differences among these four rabbit breeds in the Sichuan and Fujian provinces and also provide a high-confidence set of genome-wide SNPs for Chinese indigenous rabbits that could be employed for gene linkage and association analyses in the future.

Entities:  

Year:  2019        PMID: 31525233      PMCID: PMC6746397          DOI: 10.1371/journal.pone.0222503

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Rabbits (Oryctolagus cuniculus) are one of the most recently domesticated animals with an estimated history of approximately 1,400 years [1, 2]. After the initial domestication in France, more than 200 modern breeds or populations have been recognized worldwide and all of them show a considerable phenotypic variation [3, 4]. In China, there are approximately 20 indigenous and recently imported rabbit breeds, which are widely kept for their meat, fur and wool [5]. Compared to the indigenous rabbit breeds, these imported breeds are more prevalent in the Chinese rabbit industry because of their better production performances on the important economic traits [6]. However, these indigenous breeds have superior disease resistance and environmental adaptation [7], and these characteristics make them important for the sustainable development of the rabbit industry in China. Unfortunately, the genetic diversity and population structure of Chinese indigenous rabbits have not been well studied yet especially at the genome-wide level. During the last decades, single nucleotide polymorphisms (SNPs) have become the most popular genetic markers for studying genetic diversity and population structure in wild and domestic animals. With rapid development of high-throughput sequencing techniques, restriction site-associated DNA sequencing (RAD-seq) provides a relatively cost-effective approach to obtain tens of thousands of genome-wide SNPs [8, 9]. The RAD-seq technique first employs one or more restriction enzyme(s) to randomly digest genome sequences into short fragments that are then subjected to massively parallel DNA sequencing [10]. Overall, the RAD-seq is a very prevalent approach in studies of population genetics because it has advantages for generating the relatively equally distributed SNPs suitable to reveal genetic diversity and population structure [11-13]. The objective of the present study was to discover the genome-wide SNPs by RAD-seq approach and then investigate genetic diversity and population structure of the four Chinese rabbit breeds. In addition to providing a high-confidence set of genome-wide SNP markers that could be employed for gene linkage and association analyses, the revealed inter-breed genetic differences will help us for better establishing the conservation strategies of genetic diversity and crossbreeding systems in rabbit industry.

Materials and methods

Ethics statement

All experimental protocols involved in this study were approved by the Institutional Animal Care and Use Committee of the College of Animal Science and Technology, Sichuan Agricultural University, Sichuan, China (No. DKYB20081003).

Blood sampling and DNA extraction

Blood samples were randomly collected from 104 unrelated individuals from four indigenous breeds of Chinese rabbits (Fig 1), including 30 Sichuan White (SW) and 34 Tianfu Black (TB) from Sichuan province, 32 Fujian Yellow (FY) and eight Fujian Black (FB) from Fujian province. All rabbits were raised in the experimental farms of the Sichuan Agricultural University and Sichuan Animal Science Academy, and none of them had genetic relationships during the previous three generations. Total genomic DNA was extracted using the standard procedure of the Animal Genomic DNA Kit (Tiangen, Beijing), and individual DNA quality was evaluated by NanoVue Plus (GE, USA).
Fig 1

Rabbit pictures from each of the four indigenous breeds in this study.

Library construction and Illumina sequencing

RAD-seq sequencing libraries were constructed according to the recommended pipeline [10]. Briefly, genomic DNA (~1 μg per sample) was first digested with EcoRI (NEB, Beijing), onto which the P1 adaptor was ligated. Subsequently, the samples were pooled, randomly sheared, and size-selected in sequential steps. After the second adapter (P2) was added, the DNA fragments of 300 to 500 bp in length were used to construct the sequencing libraries. Finally, the Illumina HiSeq2000 platform was employed to sequence the constructed libraries and generate 150 bp paired-end reads (BioMarker Co.Ltd., Beijing).

Quality control, read mapping and SNP calling

All the raw sequencing reads were first subjected to quality control by removing these low-quality reads, which were defined by any of three criteria: (i) reads containing low-quality bases (Qphred value < 5) more than 50% of the total length, (ii) reads containing adaptor sequences, and (iii) reads containing ambiguous bases more than 10% of its total length. This filtering step of reads was performed using the fastp tool (v0.19.5) [14], after which we obtained the clean reads that were subjected to SNP calling. All reads were mapped against the reference rabbit genome (OryCun2.0) using the BWA-MEM algorithm in BWA software (v0.7.17) [15] with default parameters. The generated SAM (Sequence Alignment/Map) files were manipulated with Picard tools (v1.134, http://broadinstitute.github.io/picard/), including the coordinate sorting and duplicate removing. Subsequently, the GATK software (v3.7) [16] was applied to SNP calling and individual genotyping according to recommendations of GATK Best Practices [17, 18]. Additionally, the local realignment around indels was conducted using GATK realignment algorithm. We further performed the hard filtering with expression of “QD < 2.0 || FS > 60.0 || MQ < 40.0 || MQRankSum < -12.5 || ReadPosRankSum < -8.0” for producing the clean SNPs. Finally, the high-confidence SNPs were finally retained for further analysis based on three criteria, including (i) coverage depth of reads ≥ 3 for every sample, (ii) calling rate of 100% (i.e., no any missing in the samples), and (iii) biallelic SNPs.

Data analyses

First, we investigated the overall read depth and chromosomal distribution for all SNPs using the VCFtools [19]. The nucleotide diversity (π), expected heterozygosity (He), observed heterozygosity (Ho), private allele number (Ap), frequency of the most frequent allele (P), fixation index (FST), and inbreeding coefficient (FIS) for each breeds were computed using the ‘population’ program in Stacks (v2.2) [20]. The PopSc toolkit [21] was utilized to calculate the polymorphism information content (PIC) for each breeds. FST and FIS values were computed to analyze pairwise population differences among the four breeds. To evaluate the genetic relationship among all four breeds, a principal component analysis (PCA) was conducted with GCTA (v1.26.0) [22] after converting the SNP data into PED format by PLINK (v1.07) [23]. All the related results were plotted using ggplot2 (v3.1.0) [24] from R package.

Results

We obtained 295 Gb of raw paired-end reads with an average of 2.83 Gb per sample, which ultimately produced 260 Gb of clean paired-end reads after the quality filtering (S1 Table). An average of 99.21% of clean reads were successfully mapped to the reference genome, by which we identified 7,955,814 raw SNPs and 7,055,440 clean SNPs, respectively. To avoid potential biases, we strictly selected a high-confidence set of 113,973 SNPs for the further analysis, among them 37,343 SNPs were located within these unplaced scaffolds. After splitting the 22 chromosomes into 750 bins of 3 Mb in size, there was an average of 102 SNPs per bin (Fig 2A). For all SNPs, we estimated an transition vs. transversion ratio of 2.23, including 78,732 transitions and 35,241 transversions (Fig 2B).
Fig 2

SNP distribution and nucleotide diversity.

(A) Distribution density of SNPs across 22 chromosomes, in which each block represents a split bin. (B) Substitution types of SNPs in all populations. (C) Distribution density of the nucleotide diversity in four rabbit breeds.

SNP distribution and nucleotide diversity.

(A) Distribution density of SNPs across 22 chromosomes, in which each block represents a split bin. (B) Substitution types of SNPs in all populations. (C) Distribution density of the nucleotide diversity in four rabbit breeds. We subsequently computed six indexes in relation to the intra-breed genetic diversity for every one of the four rabbit breeds (Table 1). There were 3,679 private alleles for FY, 1,089 for FB, 1,833 for SW and 4,506 for TB, respectively. The mean frequency of the most frequent allele ranged from 0.7833 (FY) to 0.8071 (FB), the nucleotide diversity from 0.2678 (FB) to 0.2902 (FY), and the polymorphism information content from 0.2000 (FB) to 0.2281 (FY). The FB breed had the lowest expected heterozygosity, whereas the highest observed heterozygosity was observed in FY breed. We further investigated the intra-breed overall distribution of nucleotide diversity for all SNPs (Fig 2C), which showed the FB breed had the highest variation.
Table 1

Values of genetic diversity in four rabbit breeds using SNP data.

BreedsApPπPICHeHo
FY3,6790.78330.29020.22810.28570.3418
FB1,0890.80710.26780.20000.25110.3072
SW1,8330.79890.26890.21090.26440.3158
TB4,5060.78670.28710.22640.28290.3115

AP, private allele number; P, frequency of the most frequent allele; π, nucleotide diversity; PIC, polymorphism information content; He, expected heterozygosity; Ho, observed heterozygosity.

AP, private allele number; P, frequency of the most frequent allele; π, nucleotide diversity; PIC, polymorphism information content; He, expected heterozygosity; Ho, observed heterozygosity. The pairwise comparisons of Wright’s FST values showed low to moderate levels of genetic differentiation among the four rabbit breeds (Fig 3A). Among them, the lowest and highest inter-breed differences were observed between FY and TB (FST = 0.0370) and between FB and SW (FST = 0.0504), respectively. The intra-population inbreeding coefficient of FIS ranged from -0.1109 (FY) to -0.0390 (TB). Furthermore, the PCA-based clustering first revealed that all the four breeds were clearly distinguishable (Fig 3B). In addition, the individuals from FY, FB and SW breeds were clustered together with each of these breeds. In contrast, the 34 Tianfu black rabbits (TB) were divided into two distinct subgroups.
Fig 3

Population structure of the four rabbit breeds.

(A) The plot shows the pairwise Wright’s FST values in the lower triangular area and FIS values in diagonal cells. (B) Plot of pairwise principal components (PC1 and PC2) of four rabbit breeds based on the SNP data.

Population structure of the four rabbit breeds.

(A) The plot shows the pairwise Wright’s FST values in the lower triangular area and FIS values in diagonal cells. (B) Plot of pairwise principal components (PC1 and PC2) of four rabbit breeds based on the SNP data.

Discussion

China has the largest volumes of consumption and production for rabbit meat, both comprising more than 60% of the world's totals [25]. Therefore, sustainable development of the Chinese rabbit industry significantly depends on a sufficient amount of genetic resources available, especially for these indigenous breeds. Although the genetic diversity and population structure of Chinese indigenous rabbits has been studied in a few sporadic reports on the basis of microsatellite markers [26, 27] and mitochondrial DNA [5], a genome-wide systematic investigation still remains to be addressed. In China, Sichuan and Fujian are the representative provinces of rabbit raising with a long history, both of them also have the well-known indigenous breeds, such as Sichuan White and Fujian Yellow rabbits. In the present study, we first discover the genome-wide SNPs comprehensively and then analyze genetic diversity and population structure of the four widely used indigenous rabbit breeds in Sichuan and Fujian provinces, which is expected to significantly facilitate the effective conservation and exploration of these genetic resources. Further, we anticipate that the SNP markers identified in the present study will be a valuable resource for conducting gene linkage and association analyses in other rabbit populations. Our results revealed that Fujian Yellow and Fujian Black rabbits have the highest and lowest genetic diversity, respectively; whereas only small differences of genetic diversity were observed among the four studied breeds on the whole. In addition, we should be cautious for the conclusion that Fujian Black rabbits have the lowest genetic diversity because only eight individuals were sampled in the present study. Based on 30 microsatellite markers, Xie and colleagues [26] previously reported that the polymorphism information content and expected heterozygosity of Fujian Yellow rabbits were 0.6766 and 0.7324, both of which are substantially higher than the corresponding values computed in the present study. Unfortunately, we are unable to compare the four breeds of Chinese indigenous rabbits with other Chinese rabbit breeds or with widely used European rabbit breeds because the allele frequency data of reference populations were unavailable. Interestingly, we also observed that the four Chinese rabbit breeds in the present study could be fully separated from each other based on the PCA-based clustering, which indicates that there were significant genetic differences among these populations. In conclusion, we comprehensively discover the genome-wide SNPs and systematically investigate the genetic diversity and population structure for four Chinese rabbit breeds. The results will help us to better conserve and explore these genetic resources, and also facilitate the future studies of gene linkage and association analyses in these and other rabbit populations.

Sequencing and quality filtering of reads.

(DOCX) Click here for additional data file.
  18 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  GCTA: a tool for genome-wide complex trait analysis.

Authors:  Jian Yang; S Hong Lee; Michael E Goddard; Peter M Visscher
Journal:  Am J Hum Genet       Date:  2010-12-17       Impact factor: 11.025

3.  Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers.

Authors:  Michael R Miller; Joseph P Dunham; Angel Amores; William A Cresko; Eric A Johnson
Journal:  Genome Res       Date:  2006-12-22       Impact factor: 9.043

4.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

Review 5.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing.

Authors:  John W Davey; Paul A Hohenlohe; Paul D Etter; Jason Q Boone; Julian M Catchen; Mark L Blaxter
Journal:  Nat Rev Genet       Date:  2011-06-17       Impact factor: 53.242

6.  Origin of rabbit (Oryctolagus cuniculus) in China: evidence from mitochondrial DNA control region sequence analysis.

Authors:  J-R Long; X-P Qiu; F-T Zeng; L-M Tang; Y-P Zhang
Journal:  Anim Genet       Date:  2003-04       Impact factor: 3.169

7.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

8.  The variant call format and VCFtools.

Authors:  Petr Danecek; Adam Auton; Goncalo Abecasis; Cornelis A Albers; Eric Banks; Mark A DePristo; Robert E Handsaker; Gerton Lunter; Gabor T Marth; Stephen T Sherry; Gilean McVean; Richard Durbin
Journal:  Bioinformatics       Date:  2011-06-07       Impact factor: 6.937

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  Rapid SNP discovery and genetic mapping using sequenced RAD markers.

Authors:  Nathan A Baird; Paul D Etter; Tressa S Atwood; Mark C Currey; Anthony L Shiver; Zachary A Lewis; Eric U Selker; William A Cresko; Eric A Johnson
Journal:  PLoS One       Date:  2008-10-13       Impact factor: 3.240

View more
  5 in total

1.  Using Linear Discriminant Analysis to Characterize Novel Single Nucleotide Polymorphisms and Expression Profile Changes in Genes of Three Breeds of Rabbit (Oryctolagus cuniculus).

Authors:  Ahmed I Ateya; Basma M Hendam; Hend A Radwan; Eman A Abo Elfadl; Mona M Al-Sharif
Journal:  Comp Med       Date:  2021-05-25       Impact factor: 0.982

2.  Quantitative trait loci for growth-related traits in Japanese quail (Coturnix japonica) using restriction-site associated DNA sequencing.

Authors:  Mohammad Ibrahim Haqani; Shigeru Nomura; Michiharu Nakano; Tatsuhiko Goto; Atsushi J Nagano; Atsushi Takenouchi; Yoshiaki Nakamura; Akira Ishikawa; Masaoki Tsudzuki
Journal:  Mol Genet Genomics       Date:  2021-07-12       Impact factor: 3.291

3.  Trait Analysis in Domestic Rabbits (Oryctolagus cuniculus f. domesticus) Using SNP Markers from Genotyping-by-Sequencing Data.

Authors:  Congyan Li; Yuying Li; Jie Zheng; Zhiqiang Guo; Xiuli Mei; Min Lei; Yongjun Ren; Xiangyu Zhang; Cuixia Zhang; Chao Yang; Li Tang; Yang Ji; Rui Yang; Jifeng Yu; Xiaohong Xie; Liangde Kuang
Journal:  Animals (Basel)       Date:  2022-08-11       Impact factor: 3.231

4.  Mapping of Quantitative Trait Loci Controlling Egg-Quality and -Production Traits in Japanese Quail (Coturnix japonica) Using Restriction-Site Associated DNA Sequencing.

Authors:  Mohammad Ibrahim Haqani; Shigeru Nomura; Michiharu Nakano; Tatsuhiko Goto; Atsushi J Nagano; Atsushi Takenouchi; Yoshiaki Nakamura; Akira Ishikawa; Masaoki Tsudzuki
Journal:  Genes (Basel)       Date:  2021-05-13       Impact factor: 4.096

5.  Exploring the genomic resources and analysing the genetic diversity and population structure of Chinese indigenous rabbit breeds by RAD-seq.

Authors:  Chenmiao Liu; Shuhui Wang; Xianggui Dong; Jiping Zhao; Xiangyang Ye; Ruiguang Gong; Zhanjun Ren
Journal:  BMC Genomics       Date:  2021-07-26       Impact factor: 3.969

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.