Literature DB >> 30071886

Assessment of genetic diversity in Coho salmon (Oncorhynchus kisutch) populations with no family records using ddRAD-seq.

Sho Hosoya¹, Kiyoshi Kikuchi², Hiroshi Nagashima³, Junichi Onodera³, Kouichi Sugimoto³, Kou Satoh³, Keisuke Matsuzaki³, Masaki Yasugi⁴, Atsushi J Nagano^4,5,6, Akira Kumagayi³, Kenichi Ueda³, Tadahide Kurokawa⁷.

Abstract

OBJECTIVE: Selective breeding for desirable traits is becoming popular in aquaculture. In Miyagi prefecture, Japan, a selectively bred population of Coho salmon (Oncorhynchus kisutch) has been established with the original, randomly breeding population maintained separately. Since they have been bred without family records, the genetic diversity within these populations remains unknown. In this study, we estimated the genetic diversity and key quantitative genetic parameters such as heritability and genomic breeding value for body size traits by means of genomic best linear unbiased prediction to assess the genetic health of these populations.
RESULTS: Ninety-nine and 83 females from the selective and random groups, respectively, were genotyped at 2350 putative SNPs by means of double digest restriction associated DNA sequencing. The genetic diversity in the selectively bred group was low, as were the estimated heritability and prediction accuracy for length and weight (h2 = 0.26-0.28; accuracy = 0.34), compared to the randomly bred group (h2 = 0.50-0.60; accuracy = 0.51-0.54). Although the tested sample size was small, these results suggest that further selection is difficult for the selectively bred population, while there is some potential for the randomly bred group, especially with the aid of genomic information.

Entities: Chemical Disease Species

Keywords: Breeding value; Coho salmon (Oncorhynchus kisutch); Genetic diversity; Genomic best linear unbiased prediction; Heritability; Prediction accuracy; SNPs; Selective breeding; ddRAD-seq

Mesh：

Year: 2018 PMID： 30071886 PMCID： PMC6071332 DOI： 10.1186/s13104-018-3663-4

Source DB: PubMed Journal: BMC Res Notes ISSN： 1756-0500

Introduction

The aquaculture production of Coho salmon (Oncorhynchus kisutch) in Japan started in the late 1970s using populations imported from North America. Since then, the species is important in Japanese fish production as ranked fourth among marine aquaculture fish in 2014, with a harvest of 12,800 tonnes [1]. This quantity can be increased significantly with selective breeding for growth rate. For this purpose, a selection program for body size was begun at Miyagi Prefectural Fisheries Research Station starting with 16 selected females and 13 randomly chosen males, with the original, randomly breeding population maintained separately. However, since there was no pedigree, the extent of genetic relatedness among the individuals was unknown for both populations. The future of the ongoing breeding programs depends on the existing genetic diversity in the given population [2]. Thus, it was necessary to assess the extent of genetic diversity to maintain the health of the populations, rendering the maintenance of accurate family records essential. Recent advances in genome-wide single nucleotide polymorphisms (SNPs) genotyping permit a fine-grain assessment of the current level of genetic diversity, even for the population without family records. In this study, we genotyped genome-wide SNPs collected by means of double digest restriction associated DNA sequencing (ddRAD-seq) [3] for the selectively bred (SB) and the randomly breeding (RB) populations to infer the genetic relatedness between individuals within each population. We then estimated the heritability and genomic breeding values for body weight (BW) and fork length (FL) at 47 months post fertilization to examine the possibility of selective breeding using genomic information (genomic selection) of these populations.

Main text

Methods

Samples

Both populations (SB and RB) are maintained at the Inland Fisheries Experimental Station, Miyagi Prefecture Fisheries Technology Center (Miyagi, Japan), with a largely unknown family history. The original population was introduced from Lower Kalama hatchery (WA, USA) to Japan in 1978. This population was maintained without individual or family identification until 2000 when the first phenotypic selection was done using 29 individuals, followed by a second selection in 2003 using 50 individuals. The population was then bred randomly two times, once in 2006 (198 individuals) and again in 2009 (94 individuals). The progeny produced in 2009 were used for the subsequent genetic and phenotypic analyses in this study. The RB population used in this study was also produced in 2009 by random crosses among individuals from the original population. The two populations were reared separately throughout the experiment. At 47 months post fertilization, 1181 and 558 individuals were sampled from SB and RB, respectively, and the fork length and weight were measured. “Jack” males, which mature at a very early age [4], and other males that also matured somewhat early (3 years) were excluded from the populations, potentially distorting the genetic diversity among the males. Therefore, we used only females in this study (n = 100/population).

Genotyping

Genomic DNA was extracted from the caudal fin using the FUJIFILM QuickGene-810 extraction platform (Fujifilm, Japan) following the manufacturer’s instructions. ddRAD-seq was done following Sakaguchi et al. [5]. BglII and EcoRI were used for genomic DNA digestion. Sequencing of 100 bp paired-end reads and the index sequence of the library was done using HiSeq2500 (Illumina) with TruSeq v3 chemistry on two lanes. Reads were trimmed using Trimmomatic-0.35 [6] with the following parameters: ILLUMINACLIP TruSeq3-PE-2.fa:2:30:10, LEADING:19, TRAILING:19, SLIDINGWINDOW:30:20, AVGQUAL:20, and MINLEN:101. After filtering, an average approximately 2 million reads per individual were obtained. Samples with less than 60,000 reads (17 samples from RB and one from SB) were excluded. The remaining reads at both ends were mapped to the Coho salmon reference genome (Okis_V1; GenBank assembly accession: GCA_002021735.1) using BWA-mem [7] with default settings. Reads of mapping quality (MAPQ) less than 4 were removed. SNP calling was done using Stacks (ver 1.45) [8]. All the ref_map.pl parameters were set to default except for the following: minimum depth of coverage (-m = 5). We set minimum depth of coverage to 5 following Dodd et al. [9] who suggested that the minimal sequencing depth is around 2–4 for relatedness between individuals and 5–10 for self-relatedness. The rxstacks program was applied for genotype calling in individual samples using log likelihood filtering (–lnl_lim = − 120) followed by the cstacks and sstacks programs, which yielded a total of 378,125 loci. After the RAD loci with more than 3 SNPs and 3 alleles were filtered out, 43028 RAD loci remained. The RAD loci were selected under following criteria: (1) SNPs that genotyped more than 50% of the individuals, and those that genotyped more than 90% of the individuals, for both families and (2) minor allele frequency (MAF) was larger than 0.05. For the RAD loci with two SNPs, one of the SNPs was randomly selected by Stacks population program. With the filtration threshold of MAPQ (≥ 4), MAF (≥ 0.05) and number of alleles (= 2), it is expected that most of SNPs from paralogs regions were removed. We did not filtered out SNPs not in the Hardy–Weinberg equilibrium, because such SNPs are expected in the selected population with small effective population size and not necessarily removed [8]. Finally, 2350 (50% genotyped) and 1064 (90% genotyped) putative SNP loci remained. These SNP sets are referred to as 1K-SNPs and 2K-SNPs, respectively. Missing genotype data of 1K- and 2K-SNPs were imputed using Beagle (v4.1) [10]. The genetic analyses were done using 1-K SNPs and estimation of heritability and GEBV were done using 2K-SNPs.

Genetic analysis

Kin relationships among individuals were inferred using KING [11]. First, second and third degree relationships within pairs were determined using kinship coefficient ranges of > 0.177, 0.0884–0.177 and 0.0442–0.0884, respectively [11]. We also estimated effective population size using the Linkage Disequilibrium method implemented in NeEstimatorV2.1 (the lowest allele frequency = 0.05) [12]. Heritability estimation and genomic prediction of FL and BW were done for SB and RB by means of genomic best linear unbiased prediction (GBLUP) implemented in the R package, rrBLUP [13]. The REML (restricted maximum likelihood) estimates of the variance components and BLUP solution for genomic breeding values (GEBV) were obtained using the kin.blup function. The narrow sense heritability was calculated as h = σa2/σp2, where σa2 is the additive genetic variance and σp2 is the total phenotypic variance. The prediction accuracy of GEBV was calculated using a fivefold cross validation design following Tsai et al. [14] with some modifications; the cross validation procedure was repeated ten times independently to obtain the mean and the standard error of the measure of accuracy. At first, each population was randomly divided into five subsets, one for validation and the remaining for training. The phenotypes of the validation set were masked and GEBV of these individuals were estimated from the training set using the kin.blup function of rrBLUP. This step was repeated five times in total while rotating the validation sets. Accuracy was calculated as the average of the correlation between the GEBV and the observed phenotypes of the validation set divided by the square root of the heritability estimated from all individuals. The whole procedure was repeated ten times independently to calculate the mean and the standard error of the measure of accuracy.

Results

It was confirmed by t-test that SB (n = 99) was significantly larger than RB (n = 83) in FL (P = 0.003) and BW (P = 0.000014). Estimation of traditional pedigree-based relatedness was not possible for either population since the family history had not been recorded. However, our genome-wide SNP data enabled us to infer the kin relationship among the individuals. These results revealed the genetic relatedness among the individuals of the selected (SB) population; 33.9% of the individual pairs had at least a third degree relationship (compared to 23.6% in the randomly breeding (RB) population) (Table 1, Additional file 1: Fig. S1). Reflecting the close genetic relatedness, the estimated effective population size for SB (Ne = 36.9) was smaller than for RB (Ne = 43.8) (Table 2).

Table 1

Percentage of individual pairs in kin relationships within and between populations

	1st degree (%)	2nd degree (%)	3rd degree (%)	Total pairs
Between populations	0.0	0.0	0.0	8217
Randomly bred (RB)	0.7	8.8	14.1	3403
Selectively bred (SB)	1.2	14.4	18.3	4851

Table 2

Effective population size estimated by means of linkage disequilibrium method

	Ne	95% confidence interval
	Ne	Lower	Upper
Randomly bred (RB)	43.8	43.4	44.3
Selectively bred (SB)	36.9	36.5	37.2

Percentage of individual pairs in kin relationships within and between populations Effective population size estimated by means of linkage disequilibrium method Heritability and prediction accuracy for FL and BW were estimated using 2-K SNPs (Table 3). For both of the traits, a drop in heritability was observed in SB (h = 0.26–0.28) compared to RB (h = 0.50–0.60). Similarly, the prediction accuracies were low for SB (accuracy = 0.33–0.34), while those for RB were relatively high (accuracy = 0.51–0.59), although a strong correlation between the predicted and the observed phenotypes was seen for both traits in both the populations.

Table 3

Mean body size, additive and residual variations, heritability and prediction accuracy (mean ± SE) for body size traits (FL and BW) of the two populations

Traits	Stats	Randomly bred (RB)	Selectively bred (SB)
FL	Mean ± SD (cm)	40.6 ± 2.6	42.0 ± 3.7
	Heritability	0.60	0.28
	Correlation^a	0.97	0.89
	Accuracy	0.54 ± 0.02	0.34 ± 0.04
BW	Mean ± SD (g)	858 ± 176	1003 ± 259
	Heritability	0.50	0.26
	Correlation^a	0.95	0.88
	Accuracy	0.51 ± 0.03	0.34 ± 0.05

aBetween the observed and the predicted phenotypes

Mean body size, additive and residual variations, heritability and prediction accuracy (mean ± SE) for body size traits (FL and BW) of the two populations aBetween the observed and the predicted phenotypes

Discussion

The 1K-SNPs data obtained by means of ddRAD-seq enabled us to infer kin relationship among individuals. High degree of genetic relatedness and decreased effective population size were clearly observed in the selectively bred (SB) population when compared to the original, randomly bred, (RB) population (Tables 1, 2). The small population size and the high genetic relatedness evidently resulted in reduced additive genetic variance (σa2) and therefore, heritability (Table 3), both of which can indicate excessive inbreeding [15]. The differences in heritability between the two populations seemed larger than in Ne. This will be partly because additive genetic variation was substantially reduced in SB as selection and inbreeding decreases heritabilities for polygenic traits including body size [15, 16], while the two rounds of random mating might increase Ne without increasing additive genetic variance in SB population. Low values of predictability in SB could also be the consequence of exhaustion of genetic diversity within a few generations because SB was established from a limited broodstock on the one hand, and with a high degree of genetic relatedness on the other. All those results suggest the difficulty of continuation of breeding program for this population without restoration of genetic diversity by introduction of new genetic material from other populations. In contrast, the genetic diversity in RB seemed to be high enough for a breeding value prediction and genomic selection for body length and weight, since estimated heritability and prediction accuracy were relatively high (h = 0.50–0.60; accuracy = 0.51–0.59). The estimated effective population size (Ne = 43.8), however, suggests genetic diversity will be exhausted within several rounds of selection. One possible approach to apply selective breeding for these populations is to use genomic selection to select individuals from RB for crossing with individuals from SB. This will permit some restoration of the genetic diversity in SB with the minimum loss in its growth performance, and maintenance of the breeding program in SB, simultaneously.

Limitation

Our results demonstrate that ddRAD-seq worked well for the assessment of the current level of genetic diversity of the two Coho salmon populations bred without family records. High prediction accuracies for fork length and weight were observed in the randomly breeding population. However, it should be noted that some of the difference between populations could be due to tank effects since each population was raised in a single tank. Moreover, these analyses were done with the limited numbers of samples and SNPs, and thus, the estimated statistics are expected to have high variation. Therefore, the success and failure of the genomic selection for these populations should also be tested using large sample/SNP sizes. Additional file 1: Fig. S1. Kin relationships among individuals. Small rectangles on the outer edge refer to individual fish. Pairs of individuals with first and second degree relationship are connected with dark and light lines, respectively.

11 in total

1. Genotype Imputation with Millions of Reference Samples.

Authors: Brian L Browning; Sharon R Browning
Journal: Am J Hum Genet Date: 2016-01-07 Impact factor: 11.025

2. NeEstimator v2: re-implementation of software for the estimation of contemporary effective population size (Ne ) from genetic data.

Authors: C Do; R S Waples; D Peel; G M Macbeth; B J Tillett; J R Ovenden
Journal: Mol Ecol Resour Date: 2013-08-31 Impact factor: 7.090

3. A test of quantitative genetic theory using Drosophila- effects of inbreeding and rate of inbreeding on heritabilities and variance components.

Authors: T N Kristensen; A C Sørensen; D Sorensen; K S Pedersen; J G Sørensen; V Loeschcke
Journal: J Evol Biol Date: 2005-07 Impact factor: 2.411

4. Stacks: an analysis tool set for population genomics.

Authors: Julian Catchen; Paul A Hohenlohe; Susan Bassham; Angel Amores; William A Cresko
Journal: Mol Ecol Date: 2013-05-24 Impact factor: 6.185

Review 5. Relatedness in the post-genomic era: is it still useful?

Authors: Doug Speed; David J Balding
Journal: Nat Rev Genet Date: 2014-11-18 Impact factor: 53.242

6. Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species.

Authors: Brant K Peterson; Jesse N Weber; Emily H Kay; Heidi S Fisher; Hopi E Hoekstra
Journal: PLoS One Date: 2012-05-31 Impact factor: 3.240

Review 7. What can livestock breeders learn from conservation genetics and vice versa?

Authors: Torsten N Kristensen; Ary A Hoffmann; Cino Pertoldi; Astrid V Stronen
Journal: Front Genet Date: 2015-02-10 Impact factor: 4.599

8. Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors: Heng Li; Richard Durbin
Journal: Bioinformatics Date: 2009-05-18 Impact factor: 6.937

9. Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors: Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal: Bioinformatics Date: 2014-04-01 Impact factor: 6.937

10. Genome wide association and genomic prediction for growth traits in juvenile farmed Atlantic salmon using a high density SNP array.

Authors: Hsin-Yuan Tsai; Alastair Hamilton; Alan E Tinch; Derrick R Guy; Karim Gharbi; Michael J Stear; Oswald Matika; Steve C Bishop; Ross D Houston
Journal: BMC Genomics Date: 2015-11-18 Impact factor: 3.969

8 in total

1. Population Structure and Genetic Diversity of Nile Tilapia (Oreochromis niloticus) Strains Cultured in Tanzania.

Authors: Redempta A Kajungiro; Christos Palaiokostas; Fernando A Lopes Pinto; Aviti J Mmochi; Marten Mtolera; Ross D Houston; Dirk Jan de Koning
Journal: Front Genet Date: 2019-12-20 Impact factor: 4.599

2. Genetic Dissection of a Precocious Phenotype in Male Tiger Pufferfish (Takifugu rubripes) using Genotyping by Random Amplicon Sequencing, Direct (GRAS-Di).

Authors: Sota Yoshikawa; Masaomi Hamasaki; Kazushi Kadomura; Toshiyuki Yamada; Hisashi Chuda; Kiyoshi Kikuchi; Sho Hosoya
Journal: Mar Biotechnol (NY) Date: 2021-02-18 Impact factor: 3.619

3. Genomic prediction for testes weight of the tiger pufferfish, Takifugu rubripes, using medium to low density SNPs.

Authors: Sho Hosoya; Sota Yoshikawa; Mana Sato; Kiyoshi Kikuchi
Journal: Sci Rep Date: 2021-10-13 Impact factor: 4.379

4. Estimates of Effective Population Size in Commercial and Hatchery Strains of Coho Salmon (Oncorhynchus kisutch (Walbaum, 1792)).

Authors: Victor Martinez; Phillip J Dettleff; Nicolás Galarce; Cristian Bravo; Jessica Dorner; Robert N Iwamoto; Kerry Naish
Journal: Animals (Basel) Date: 2022-03-03 Impact factor: 2.752

5. A genomic-based vision on the genetic diversity and key performance traits in selectively bred Arctic charr (Salvelinus alpinus).

Authors: Christos Palaiokostas; Anam Anjum; Henrik Jeuthe; Khrystyna Kurta; Fernando Lopes Pinto; Dirk Jan de Koning
Journal: Evol Appl Date: 2021-07-02 Impact factor: 4.929

6. Genomic selection for heterobothriosis resistance concurrent with body size in the tiger pufferfish, Takifugu rubripes.

Authors: Zijie Lin; Sho Hosoya; Mana Sato; Naoki Mizuno; Yuki Kobayashi; Takuya Itou; Kiyoshi Kikuchi
Journal: Sci Rep Date: 2020-11-17 Impact factor: 4.379

7. A high-density SNP-based genetic map and several economic traits-related loci in Pelteobagrus vachelli.

Authors: Guosong Zhang; Jie Li; Jiajia Zhang; Xia Liang; Tao Wang; Shaowu Yin
Journal: BMC Genomics Date: 2020-10-07 Impact factor: 3.969

8. Genotyping Strategies Using ddRAD Sequencing in Farmed Arctic Charr (Salvelinus alpinus).

Authors: Fotis Pappas; Christos Palaiokostas
Journal: Animals (Basel) Date: 2021-03-21 Impact factor: 2.752

8 in total