Literature DB >> 32306899

Construction of a high-density genetic map and mapping of growth related QTLs in the grass carp (Ctenopharyngodon idellus).

Xiaoli Huang1,2, Yanxin Jiang1,2, Wanting Zhang1, Yingyin Cheng1, Yaping Wang1,2, Xiaocui Ma1,2, You Duan1,2, Lei Xia1,2, Yaxin Chen1,2, Nan Wu1, Mijuan Shi3, Xiao-Qin Xia4,5.   

Abstract

BACKGROUND: Grass carp (Ctenopharyngodon idellus) are important species in Asian aquaculture. A draft genome for grass carp has already been published in 2015. However, there is still a requirement for a suitable genetic linkage map to arrange scaffolds on chromosomal frameworks. QTL analysis is a powerful tool to detect key locations for quantitative traits, especially in aquaculture. There no growth related QTLs of grass carp have been published yet. Even the growth trait is one of the focuses in grass carp culture.
RESULTS: In this study, a pair of distantly related parent grass carps and their 100 six-month-old full-sib offspring were used to construct a high-density genetic map with 6429 single nucleotide polymorphisms (SNPs) by 2b-RAD technology. The total length of the consensus map is 5553.43 cM with the average marker interval of 1.92 cM. The map has a good collinearity with both the grass carp draft genome and the zebrafish genome, and it assembled 89.91% of the draft genome to a chromosomal level. Additionally, according to the growth-related traits of progenies, 30 quantitative trait loci (QTLs), including 7 for body weight, 9 for body length, 5 for body height and 9 for total length, were identified in 16 locations on 5 linkage groups. The phenotypic variance explained for these QTLs varies from 13.4 to 21.6%. Finally, 17 genes located in these regions were considered to be growth-related because they either had functional mutations predicted from the resequencing data of the parents.
CONCLUSION: A high density genetic linkage map of grass carp was built and it assembled the draft genome to a chromosomal level. Thirty growth related QTLs were detected. After the cross analysis of Parents resequencing data, 17 candidate genes were obtained for further researches.

Entities:  

Keywords:  Grass carp; Growth-related trait; Linkage map; Quantitative trait loci; Single nucleotide polymorphism

Mesh:

Year:  2020        PMID: 32306899      PMCID: PMC7168995          DOI: 10.1186/s12864-020-6730-x

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

The grass carp (Ctenopharyngodon idellus) belongs to the Cyprinidae family and is the only species of the genus Ctenopharyngodon. As one of the most important freshwater-cultured fish, the global production of grass carp was approximately 5.8 million tons, accounting for 12.18% of global freshwater fish production in 2015 [1]. Currently most studies about grass carp focused on fish immunity [2-4], nutrition [5-7], and stress resistance [8]. A few growth-related studies in grass carp were focused on the impacts of the additives, dietary, or the growth hormones [9]. Therefore, the underlying genes associated with growth traits are still waiting to be revealed. Growth traits are typical quantitative traits, which are influenced by multiple genes, and perhaps no any single gene shows significant impact on such a trait. So it is very difficult to discover these genes through reverse genetics, especially for the grass carp, a fish specie usually breed once a year. Forward genetic techniques, such as quantitative trait locus (QTL) location, are more effective in parsing genes for complex traits. As early as in the first few years of the new century, QTL research was applied to investigate the body weight trait in rainbow trout (Oncorhynchus mykiss) [10], the first economic fish with genetic linkage mapping [11]. In the same year, a QTL for body length of tilapia (Oreochromis mossambicus and Oreochromis aureus) was published [12] . Since then, deeper researches on growth-related traits also have been undertaken in other teleost fishes, such as Atlantic salmon (Salmo salar) [13]. In recent years, QTL studies on growth-related traits were reported in some of the main farmed species in China, e.g., common carp (Cyprinus carpio) [14] and bighead carp (Hypophthalmichthys nobilis) [15]. However, there is no similar research in the grass carp as yet. As the current dominated molecular markers used in researches of grass carps [16-20], microsatellites (or Simple Sequence Repeats, SSRs), were not suitable for the high-throughput genotyping methods. With the technological advances, the high-throughput SNP genotyping methods, such as SNP array [21] and next-generation sequencing (NGS), have been widely applied in construction of genetic maps and location of QTLs in the teleost fishes [14, 15, 22]. The SNP calling and genotyping have become feasible in grass carp since the publication of the draft genome [23]. Earlier NGS methods for QTL analysis, including reduced representation sequencing, complexity reduction of polymorphic sequences (CRoPS), restriction site associated DNA sequencing (RAD-seq), and low coverage genotyping, have been discussed previously [24]. The prominent advantage of RAD-seq is the reduction of labor and cost, due to the pooled library. However, possible genotyping errors, caused by many factors [25], have been already revealed for RAD-seq. An improved method, 2b-RAD, avoids most of potential errors which may come out of size selection or sequencing depth. Furthermore, 2b-RAD is suitable for parallel genotyping for more samples, and can be more flexible with adjusting the marker density [26]. Comparatively, linkage maps have provided a framework for genomic and genetic studies, such as molecular marker-assisted selection (MAS) for quality [27] and quantitative traits (QTL), as well as chromosomal frameworks for genome scaffolds. The genetic maps for many aquaculture species have been published [14, 15, 28–31]. A high-density genetic linkage map can provide a more precise localization of the loci related to target traits and mount more genomic contigs. The first genetic linkage map of grass carp has a low density with 279 markers [32]. A map with a high density is urgently needed for the genome frameworks and the locations of trait-related loci and genes. In this study, a high-density genetic linkage map was constructed as a chromosome framework for the draft genome assembling and it mounted 89.91% of the genome sequences, much higher than the mounting rate (64%) of the first genetic map. Thirty QTL loci for growth-related traits were then located on the map and a candidate gene list for subsequent growth research was obtained.

Results

SNP marker filtration

The genomic high-throughput sequencing data from 2 parents and 100 progenies was screened by SOAP2 [33] and RADtyping software [34]. As a result, 5818 codominate markers (SNP) and 3531 dominate markers (InDel), belonging to 16,359 tags were preliminarily selected. After further filtration using the software bowtie and bowtie 2, a total of 8608 truly unique tags were obtained. After excluding makers with significant segregation distortion (χ2 test, p < 0.05, df = 2), 6658 markers on 6602 tags were obtained for constructing the genetic linkage map. The average interval between markers in the genome was 0.119 Mb. The markers were distributed over 610 supercontigs, which covered 93.47% of the grass carp draft genome. In order to construct genetic linkage maps quickly and accurately, the location markers that were identical were merged, which allowed for the absence of missing genotypes in offsprings, due to the limitations of joinmap 4.1 regarding the number of makers. Then the markers with the highest number of successful genotypes in offspring were used as representative markers for constructing the map. From all the markers, 3381 markers were divided into 767 groups, and 3099 markers were not consistent with any others. 122 markers were ambiguous because they could be divided into two or more different groups due to missing genotypes. Therefore, 3866 actual markers (a-markers) for the map were obtained after ambiguous markers were removed.

The construction of the linkage map

The 3866 a-markers and all related SNPs were divided into 24 groups, with LOD ≥ 5.0. The ML algorithm in Joinmap 4.1 and the Mergemap [35] were used to construct the linkage maps of both the parents (male or female) and consensus. The result showed that the male map were consisted of 3875 markers distributed in 1973 loci with a total length of 6301.59 cM and an average interval of 3.23 cM (Supplementary Table S1). The female map consisted of 3742 markers distributed in 1898 loci with a total length of 5680.51 cM and an average interval of 2.89 cM (Supplementary Table S1). The consensus map consisted of 6429 SNPs distributed in 3340 loci with a total length of 5553.43 cM and an average interval of 1.92 cM (Table 1). Since the ML distance was longer than the regression one, the length of linkage groups obtained were generally longer than that of other maps generated throughthe regression algorithm [14, 15, 36, 37], included the first grass carp genetic maps [32].
Table 1

Summary statistics of the sex-averaged linkage map of grass carp

LG nameNum of SNPsNum of lociLG length (cM)Average Dist (cM)
Group1286150274.3031.841
Group2219135232.4371.735
Group3277132229.3541.751
Group4232131226.0731.739
Group54682876712.346
Group623195149.5521.591
Group7261140216.8921.56
Group820495181.9161.935
Group9264123224.7411.842
Group1018788205.3082.36
Group11352171319.0131.877
Group12331175318.6081.831
Group13344168341.4672.045
Group14316157304.8761.954
Group15255163364.7712.252
Group16274146285.5441.969
Group17298146257.3761.775
Group18238137314.9032.315
Group19287127244.1691.938
Group20231123219.6881.801
Group21229118210.3481.798
Group22224117211.941.827
Group23208115220.1491.931
Group24213101149.4731.495

LG linkage group, cM centiMorgan

Summary statistics of the sex-averaged linkage map of grass carp LG linkage group, cM centiMorgan From the total number of markers in maps of female, male and consensus, only 1188 markers were found as the hereozygous loci in both parents (ab × ab). These markers accounting for 18.47% of all markers. Therefore, the total number of SNPs on the concensus map far exceeded male or female maps. This finding confirmed that the parents selected in our research were indeed very different and their offspings were suitable for constructing genetic maps (pseudo-testcross).

Genome scaffold anchoring and synteny analysis of zebrafish

All 6429 markers were distributed on 605 supercontigs with a total length of 0.81 Gb, 89.91% of the grass carp draft genome. The 99 supercontigs with more than 20 SNPs, were selected for scaffold anchoring, and they also showed a good linear relationship with the linkage groups (Fig. 1a). The 99 supercontigs, covering 642 Mb (74.39%) of the total length, were longer than the 573 Mb used previously [23]. In addition, 45 of this supercontigs were reversed.
Fig. 1

The concensus genetic map and growth-related QTLs of the grass carp. The outmoset circle was the concensus genetic map. The circles inside showed the LOD score of each markers to the four growth-related traits. The order was BW,BL,TL and BH inwardly. The QTLs were marked by dark red

The concensus genetic map and growth-related QTLs of the grass carp. The outmoset circle was the concensus genetic map. The circles inside showed the LOD score of each markers to the four growth-related traits. The order was BW,BL,TL and BH inwardly. The QTLs were marked by dark red A similar strategy for screening unique tags has been applied to map SNP tags to the zebrafish genome. Consequently, 511 unique tags were obtained. Among these unique tags, 506 fell on zebrafish chromosomes. These SNPs showed a good macro-collinearity between grass carp and zebrafish (Fig. 1b). The factor that LG13 was syntenic to ZF10 (NC_007121.6) and ZF22 (NC_007133.6) was consistent with previous results [32].

QTL mapping of growth-related traits

Pairwise comparisons were conducted among the four growth-related traits (TL, BL, BH, BW), using Pearson’s correlation coefficient. It was revealed that all of the traits showed a high correlation (p < 2.2e-16). The correlation coefficients of BW/BL, BW/TL and BW/BH were 0.95, 0.95 and 0.93, respectively. The highest coefficient was 0.97 between BL and TL, and the lowest was 0.89 between BL and BH (Table S2). BL and TL conformed to the normal distribution (p BL = 0.175, pTL = 0.550) and the logarithm of BW and BH also conformed to the normal distribution (plog(BW) = 0.274, plog(BH) = 0.096). Based on the above treated phenotype data, 30 growth-related QTLs were found on 10 genome regions (GRs), 16 genetic linkage regions (GLRs), or five LGs including LG2, LG10, LG14, LG16 and LG18 (Table 2, Fig. 2). These LGs and corresponding supercontigs were in the synteny (Fig. 1c).
Table 2

The statistics of QTLs

TraitsQTL nameLGPosition (cM)LODExp%St. stie on DraftEnd site on DraftNearest marker
supercontigssitesupercontigssite
BWqBW1a2214.344.3918.3CI01000095529,014CI01000095531,012ref-59,824
qBW1b2223.3595.120.9CI01000093649,736CI010000931,143,791ref-37,413
qBW1c2229.2684.7219.5CI01000093429,749CI01000093662,875ref-10,619
qBW214220.3284.6119.1CI010003004,435,910CI010003004,575,906ref-99,718
qBW31612.8923.5215.0CI01000025235,941CI01000025253,860ref-105,583
qBW4a18305.4784.4818.6CI01000142246,876CI01000153714,348ref-27,615
qBW4b18284.5344.2717.8CI01000123278,805CI01000123285,480ref-111,308
BLqBL5a2214.344.1617.4CI01000095529,014CI01000095531,012ref-59,824
qBL5b2222.3594.6419.2CI01000093649,736CI010000931,143,791ref-37,413
qBL5c2229.2684.5418.9CI01000093429,749CI01000093662,875ref-10,619
qBL6a1070.1033.1313.4CI010000514,515,076CI010000515,243,968ref-38,239
qBL6b1070.1613.1313.4CI010000514,515,076CI010000515,243,968ref-38,239
qBL714220.3284.1317.3CI010003004,435,910CI010003004,575,906ref-99,718
qBL81612.8923.3714.4CI01000025235,941CI01000025253,860ref-105,583
qBL9a18310.4784.5819.0CI01000142246,876CI01000153714,348ref-91,024
qBL9b18284.5343.715.7CI01000123278,805CI01000123285,480ref-111,308
TLqTL10a2214.344.5919.1CI01000095529,014CI01000095531,012ref-59,824
qTL10b2223.3595.2521.5CI01000093649,736CI010000931,143,791ref-37,413
qTL11a1070.1033.2313.8CI010000514,515,076CI010000515,243,968ref-38,239
qTL11b1070.1613.2313.8CI010000514,515,076CI010000515,243,968ref-38,239
qTL1214220.3284.1717.5CI010003004,435,910CI010003004,575,906ref-99,718
qTL13a1632.8883.4914.8CI01000165610,647CI01000166419,046ref-134,757
qTL13b1633.5513.4914.8CI01000165610,647CI01000166419,046ref-134,757
qTL14a18284.5344.6219.2CI01000123278,805CI01000123285,480ref-111,308
qTL14b18304.4785.2821.6CI01000142246,876CI01000153714,348ref-27,615
BHqBH15a2223.3594.1217.3CI01000093649,736CI010000931,143,791ref-37,413
qBH15b2229.2684.2617.8CI01000093429,749CI01000093662,875ref-10,619
qBH161081.0153.2313.8CI01000325439,637CI01000325468,239ref-178,556
qBH1714220.3284.2617.8CI010003004,435,910CI010003004,575,906ref-99,718
qBH1818300.4783.7916.0CI01000142246,876CI01000153714,348ref-27,615
Fig. 2

The colinearity of the genetic maps with the genome of grass carp and zebrafish. a The colinearity of the LGs to the 99 supercontigs in the genome of grass carp. b. The synteny analysis result of the LGs to zebrafish genome. c. The five LGs which were located growth-related QTLs

The statistics of QTLs The colinearity of the genetic maps with the genome of grass carp and zebrafish. a The colinearity of the LGs to the 99 supercontigs in the genome of grass carp. b. The synteny analysis result of the LGs to zebrafish genome. c. The five LGs which were located growth-related QTLs For BW, there were 7 QTL regions, among which QTL qBW1b was with the highest LOD 5.1. QTL qBW1b was located on LG2 (at 221.359–226.697 cM), which accounted for 20.9% of the phenotypic variance (PVE). 9 QTL regions were identified for BL, and QTL qBL5b showed the highest LOD 4.64. QTL qBL5b was located on the same place as QTL qBW1b, and it accounted for 19.2% of the PVE. For TL, 9 QTL regions were identified. The correlation between TL and BL traits was the highest, the highest LOD of QTL regions of TL and BL were slightly different. The QTL with the highest LOD value was qTL14b, which was located on LG18 (at 298.478–314.903 cM), accounting for 21.6% of the PVE. For BH, five QTL regions were found. The highest LOD (4.26) was found for QTL qBH15b and qBH 17, which explained 17.8% of the PVE.

Candidate gene identification for growth-related traits

In order to detect candidate genes more accurately, the parental whole genomes were resequenced with an average depth of 30X. A total of 2,415,558 SNPs were revealed as heterozygous for at least one parent. The main genotypes were ab × aa, aa × ab and ab × ab, which accounted for 39.19, 38.99 and 21.70% of all SNPs, respectively, while the rest SNPs (0.13%) havethree or more genotypes in parents (Table S3). In addition, 1,135,559 InDels were obtained. Since 30 QTLs located on 16 GLRs or 10 GRs (Table 2), the responsible regions were used to scan the candidate genes. The start and end sites of the GRs were makerd by the QTL adjacent SNPs. The endpoints of the eight GRs were located on the same supercontigs, such as qBW1a (Table 2). For these GRs, genes located within the interval were extracted. Whereas, for the other two GRs, the state and end sites were located on different short supercontigs, eg. qBW4a (Table 2). For these two GRs, all genes were extracted. As the result, 49 pre-candidate genes were discovered. The further filter criterion was retained the genes which had at least one functional SNPs/InDels and finally 17 candidate genes were selected (Table 3).
Table 3

The Statistic of Growth-related candidate genes with mutations

QTLGene symblGene IDTotal mutationsFunctional mutations
SNPsIndelsSNPsIndels
qBL6a,qTL11aflrt2CI01000051_04796878_04799272242320
qBL6b,qTL11bserpina1CI01000051_05048070_0504976872000
qBH16snx14CI01000325_00423661_00470758836142
qBW2,qBL7,qTL12,qBH17dlc1CI01000300_04581207_04605807757060
qBW3,qBL8prtgaCI01000025_00216623_0024242614311390
qTL13athsd4CI01000166_00408136_00415727741651
qTL13bcelf6CI01000165_00657050_00657275391910
qBW4b,qBL9b,qTL14aadamts20CI01000123_00321346_003768941849481
qBH18zmym4CI01000142_00345504_00369452543550
qTL14brpz4CI01000142_00459713_00463377241420
qBW4agbpCI01000142_00593244_00613363221970
qBL9alrp5CI01000153_00227118_00245589776580
qBW1a,qBL5a,qTL10abrca2CI01000095_00508378_0052508511639362
qBW1a,qBL5a,qTL10a, qBL5bmsi2CI01000093_00885214_00889462184201
lgals9CI01000093_01024056_01030224382160
qBW1b,qTL10b,qBH15anos2bCI01000093_00515565_00527271321280
qBW1c,qBL5c,qBH15bmrps23CI01000093_00530430_00535255522420
The Statistic of Growth-related candidate genes with mutations

Discussion

The colinearity of the genetic maps and the draft genome

The high-density linkage maps provide chromosomal frameworks for genome assembly validation. A total of 99 supercontigs with more than 20 SNPs were anchored onto the chromosomal framework. 45 of the anchored supercontigs were reversed in direction compared to other supercontigs. Most of the of supercontigs, were linear with the linkage group, but there were a few exceptions, such as the obvious scattering between LG12 ~ CI49 and CI50. This may be caused by the inaccuracies in the original sketch sequences, which need to be further refined to obtain more precise results. The LGs and supercontigs were not perfectly collinear if all markers are taken into analysis. The reason for this is hard to distinguish. This is because of the lack of parental linkage phase information, which can only be estimated by the offspring data with the introduction of some deviations. Therefore, the genetic linkage map is suitable for scaffold assembly and partial verification, and it is not suitable for the fragmented sequence assembly.

The length of LG with different mapping algorithm and different gender

Pearson’s correlation coefficient among loci number, LG length and the average interval of three species -- grass carp, common carp and bighead carp were calculated (Table S4). The male, female and consensus map of grass carp were constructed through the ML algorithm and the regression algorithm was used in other fish. In the results, loci number and LG length had a strong correlation (≥0.92) in the ML maps, and the coefficients were generally around 0.6 in the regression maps (0.60 in common carp, 0.69 in bighead carp). This phenomenon may have been affected by the different mapping algorithms and it is indicated that in the ML maps more loci synchronized with the longer length of LGs. Additionally, there were gender differences. In our ML maps, the loci of males were slightly less than that of females (1973 < 1989), but the total length of LGs was higher than that of females (6301.59 cM > 5680.51 cM). This was also found in the common carp with regression maps [14], which shows that this phenomenon was not caused by differences between mapping algorithms.

The odd length of LG5

Notably, the length of LG5 is almost 3 times more than other LGs (Fig. 1). The LOD of markers of LG5 were more than 11.0, thus the effect of grouping error was excluded. The missing genotype of offspring in the SNPs would bring adverse effects to the accuracy of map distances, so the means of the missing genotypes in the markers on each LGs were calculated. The number for LG5 was 1.33, which was below the overall mean of 1.56. This indicated that LG5 abnormalities were not caused by a missing genotype. In order to find out a reasonable explanation, all markers of LG5 were evaluated based on two versions of the grass carp genome: one was published online [23] and the other one was assembled by the PacBio sequences (unpublished data). The published genome, N90 (179,941 bp) [23] was used as the standard for division. Then the 164,368 supercontigs were divided into two groups: ‘N90 seqs’ or ‘Fragment seqs’. It is well known that it is difficult to assemble sequences (fragment seqs) generally because of the repeated sequences, DNA secondary structure and other factors. The number of fragment seqs located in each LGs of the concensus map were counted (Fig. 3).
Fig. 3

The number of abnormal tags in each of the LGs. The blue bar displayed the proportion of ‘fragment seq’ in each LGs. The red bar showed the proportion of repeat tags, which were detected through the PacBio contigs in each LGs

The number of abnormal tags in each of the LGs. The blue bar displayed the proportion of ‘fragment seq’ in each LGs. The red bar showed the proportion of repeat tags, which were detected through the PacBio contigs in each LGs Moreover, 44 Gb of PacBio RSII sequences data were used for the de novo genome assembling. The sequencing sample was an adult grass carp, which was a third gynogenetic generation fish that had a nearly homozygous genome. The long sequencing reads and the homozygous genome made it easier to conquer the problem of the repeated regions than the published genome version. Then the markers on the map were mapped to the new de novo assembling contigs, the ‘new’ non-unique tags were counted (Fig. 3). Judging from the statistical results, the percentage of SNP tags on either ‘fragments seqs’ or ‘repeated tags’ of LG5 were the highest. In summary, it was speculated that the length anomaly of LG5 was affected by more repeated tags. However, whether there were more recombination hot spots on LG5 that could not be determined.

The growth-related QTL in the grass carp

QTL mapping is one of the important applications of genetic mapping. This is a bridge towards functional genome research from structural genome research and has important application value for production practice. Growth-related traits are the most important economic traits in aquaculture animals, as multiple genes, environments and their interactions control them. The current research on growth-related traits of QTL mapping is mainly concentrated on the Atlantic salmon, rainbow trout, perch, common carp and tilapia in Asia, the growth of grass carp QTL positioning characters are rarely reported. In this study, 30 growth-related QTLs were identified by analyzing 4 growth-related phenotype data and high-density genetic linkage maps of grass carp. For these locations, we found most of their locations were overlapped on linkage groups (Table 2), For example, marker ref-27,615 was located on LG18 and corresponded to 4 QTL, including qBW4a, qBL9a, qTL14b and qBH18. The reason for this result might be the high correlation coefficients among the 4 traits in grass carp. This can be seen in Table S2, the lowest correlation coefficients were 0.89 between BL and BH and the highest correlation coefficients were 0.97 between BL and TL.

The candidate genes of the growth-related QTL

Due to differences in genomic structure, it is difficult to directly compare QTLs between species, while the comparison of homologous genes is feasible. However, we did not find any intersection between the candidate genes of grass carp and the genes of the growth-related QTLs reported in salmonid fishes [38]. A reasonable explanation is that a QTL analysis can only find a very limited set of genes and miss most growth-related genes, so it is unlikely that a gene will be repeatedly detected in different QTL studies. In addtion, we cannot rule out the possibility that different fish species (or even different parent of the same species) have their unique alleles leading to differential growth in offsprings. The candidate gene list does not cover those well-known genes related to growth, such as growth hormone (GH) gene. This fact is likely to indicate that these essential growth-related genes are functionally conservative and structural mutations on them are rare in natural enviroment. However, some of the candidate genes in our research have been shown to be directly or indirectly related with the growth trait. For example, the gene rapunzel 4 (rpz4) on qTL14b was identified as the most significant QTL for TL in our research, its heterozygous missense mutation could result in axial skeletal overgrowth [39]. Another example came from Nitric Oxide Synthase (NOS). As a multifunctional messenger molecule, Nitric Oxide (NO) could be involved in neurogenesis, cell migration, immunity and apoptosis [40]. In zebrafish, Nitric Oxide Synthase 2 (NOS2) has two isoforms, NOS2a and NOS2b [41]. NOS2a is an innate immune factor and has been studied in mammals [42, 43] and fish [44, 45]. However, NOS2b was not localized in the immune cells during the development of embryos, and the result of whole-mount in site hybridization showed that it may play a role in neuropypophysis and thyroid primordium [41]. NOS2b protein in fish has a myristoylation consensus site at the extremity of the N-terminal, and is similar to mammal NOS3, which catalyze NO and mediates vascular endothelial growth factor (VEGF)-induced angiogenesis in coronary vessels [46]. All of these study are consistent to our results, in which NOS2b was significantly related to BW and BH.

Conclusion

A high-density genetic linkage map of grass carp was built. The map’s correctness is supported by the good collinearity with both the grass carp draft genome and the zebrafish genome, and its effectiveness is demonstrated by the mounting rate which is much higher than the first map. A total of 30 growth related QTLs were detected, and 17 candidate genes were obtained from a cross analysis of the resequencing data from parent fishes, while the genes located on the QTLs without separable or effective SNPs were excluded.

Methods

The polymorphic SSRs genotyping of all parents

In order to select a suitable mapping population, the fin samples of 89 grass carp parents were captured from wild populations in the Yangtze River, Pearl River and Xiangjiang River. The samples were collected for genomic DNA extraction with the standard phenol-chloroform protocol [47]. Eighty-nine samples were genotyped by PCR with 11 SSR markers (Table S5). The PCR reaction for each SSR was performed in 10 μL volumes containing 1 μL (about 20 ng) of sample DNA, 5 μL 2xEs Taq masteMix (CWBIO, CHINA), 0.1 μmol forward primer and 0.1 μmol reverse primer, under the following conditions: 94 °C 3 min, 35 cycles of 30 s at 94 °C, 30 s at 53 °C, 30 s at 72 °C, and then prolonged extension for 5 min at 72 °C. The PCR products were genotyped through ABI3730 (ABI, USA) and the matrix of PCR band sizes of the 89 samples on all 11 SSRs were obtained. The observed heterozygosity (Ho), expected heterozygosity (He) and polymorphism information content (PIC) of each SSR was calculated through cervus (v3.0.7) [34]. The average PIC of these markers was 0.84 and the minimum PIC was 0.78, indicating that these markers are highly polymorphic (Table 4).
Table 4

The genetic diversity of SSR markers

LocusNO. of allelesHobsHExpPIC
G023100.640.8150.787
G5010120.7980.8450.823
G5012150.8650.9030.888
G5020140.8540.8990.885
G5023120.820.8770.859
G5034110.7640.8610.84
G5035140.8880.870.852
G5004160.8990.8840.867
G5024110.9330.8180.791
G5025140.910.9040.891
G5036110.7530.8010.777
The genetic diversity of SSR markers

Hierarchical cluster analysis of all parents

Hierarchical clustering between samples was completed using R script. The band size was treated as a factor, rather than a numerical value. Between any two samples, the amount of different bands on every SSR loci were calculated as scores, then the Euclidean distance was calculated to determine the genomic similarity and the tree map was obtained (Fig. 4). Among all samples, G1-G10 were closely related and accurately clustered into a single branch. This supported the reliability of this method. Due to these results, male M3 from the Yangtze River and female F8 from the Pearl River were selected as the parents, and then their 100 randomly extracted progenies were used to construct the CP population.
Fig. 4

The hierachical clustering tree of the 89 grass carp parents based on SSR polymorphism. The orange branches were closely related fishes, the red branch was the mother (F8) and the blue branch was the father (M3)

The hierachical clustering tree of the 89 grass carp parents based on SSR polymorphism. The orange branches were closely related fishes, the red branch was the mother (F8) and the blue branch was the father (M3)

Mapping population and phenotypic data

The F1 progenies were bred in May 2015. 100 individuals that were 6 months old were randomly sampled. Growth-related traits, including body length (BL), total length (TL), body weight (BW) and body height (BH), were measured. The caudal fin of all samples including 89 adults and 100 progenies were preserved in 95% ethanol and the Genomic DNA was extracted following the standard phenol-chloroform protocol [47]. After sampling, all fish were released.

2b-RAD sequencing and screening of SNP tags

Libraries for 2b-RAD with BsaXI of two parents and 100 progenies were prepared [26] and then sequenced on X-10 (Illumina Inc.). Quality control was used in order to remove low-quality and non-restriction site tags (Table S6), then the parents’ data was mapped to the grass carp draft genome using SOAP2 with default values for all parameters. The tags which are uniquely mapped on the genome were filtered further by RADtyping to exclude those with a too high or too low sequencing depth [48]. After this process, the remaining unique reads from parent were used as the new reference sequences, to which all the data from progenies were mapped for genotyping. Markers were screened out as the preliminary SNP tags from the progenies’ reads by two criteria: (1) the markers were genotyped successfully in at least 80% of the progenies; (2) the markers were heterozygous in both of the parents. In order to eliminate the errors which might be introduced into the uniqueness of markers by any single software, the preliminary SNP tags were mapped to the grass carp genome using bowtie (v1.2) [49] and bowtie 2 (v2.3.3) [50] respectively with default parameters, reads with more than 2 mismatches or Indels in any mapping were excluded. Finally, the alignment results were merged and significant segregation distortion markers were removed using χ2 test, and the final unique SNP tags were obtained. In addition, the χ2 test was done by R. Considering that the length of linkage group 5 (LG5) is remarkably long, and in order to validate it, all tags were mapped to an upgraded grass carp genome, which was assembled with long reads (44×) generated by PacBio RSII.

Construction of genetic maps and QTL mapping

To reduce calculation time and achieve the most accurate linkage results, SNPs with the same parental genotype, such as ‘ab x aa’, were used for determining whether they were completely linked or not. An in-house python script was written to complete this process, in which markers with missing genotypes in some of the fish offsprings are allowed. The markers were grouped with a LOD threshold of 5.0 into 24 LGs using Joinmap 4.1 [51] with default parameters. The male and female maps were also calculated by Joinmap 4.1 with Monte Carlo ML algorithm [51] and the consensus map was merged by MergeMap [52] . Pearson’s correlations among the four growth-related traits (BW, BH, BL and TL) were performed in all progenies. QTL mapping for growth traits used the Multiple QTL Mapping (MQM) method, with a LOD interval of 1 cM through MapQTL 6.0 [53]. The consensus map and QTL mapping were visualized using circos (v0.66) [54].

The synteny analysis of grass carp draft genome and zebrafish

After QTL mapping above, the correspondence of marker positions on the genetic map and draft genome were obtained. The supercontigs with more than 20 markers located on the genetic map were retained. The markers on these supercontigs were used for synteny analysis, and for visualizing the results of synteny analysis ggplot [55] was then used. For the sake of making the image aesthetically pleasing, the supercontigs were renamed. The corresponding supercontigs are listed in Table S1–7. During the process of the synteny analysis of zebrafish, markers located on the grass carp genetic map were mapped to the zebrafish genome (GRCz10) using bowtie and bowtie 2 with default parameters to exclude repeated tags which can be mapped on multiple locations. Remaining unique SNP tags were used for the synteny analysis and the visualization of collinearity was done by circos (version: 0.66). The zebrafish chromosomal names were also renamed for aesthetic needs, and the corresponding list is shown in Table S2–7.

Obtainment and analysis of parents resequencing data

After the DNA sequencing libraries were constructed with an insert size of 300 bp and paired-end sequenced on an Illumina Xten sequencer, the data of the parents was obtained. Then the filtered reads were mapped to the grass carp draft genome with BWA (version: 0.7.12) using the default parameters. Duplicated reads were filtered with Picard (version: 2.1.1). SNP and Indel calling was performed using the Genome Analysis Toolkit (GATK, version: 3.5) with the adjustment of parameter ‘-glm’. The homozygous SNPs and Indels in both the parents were excluded firstly, and the rest of them were annotated with the gene transfer format (GTF) file of grass carp [20] and the SnpEff software [56] by using the default parameters. As the result of annotation, every SNP was assigned a label named as ‘effect impact’, which valued in a set of four ratings (High, Moderate, Low and Modifier), and can be used for subsequent filtering process. The interval size of upstream and downstream for each gene was 5 K in our analysis. Then the total SNPs which located in the gene-related regions, included upstream, 5’UTR, exon, intron, 3’UTR and downstream, were counted for each gene. We removed the markers that may have little effect on gene function with the ‘Modifier’ tags and definited the remaining SNPs and indels as the functional mutations. The number of functional SNPs/Indels per gene were calculated finally. Additional file 1. Additional file 2.
  40 in total

1.  [Expression of grass carp growth hormone in the yeast Pichia pastoris].

Authors:  Wei Wang; Yong-Hua Sun; Ya-Ping Wang; Zuo-Yan Zhu
Journal:  Yi Chuan Xue Bao       Date:  2003-04

2.  2b-RAD: a simple and flexible method for genome-wide genotyping.

Authors:  Shi Wang; Eli Meyer; John K McKay; Mikhail V Matz
Journal:  Nat Methods       Date:  2012-05-20       Impact factor: 28.547

Review 3.  Nitric oxide and adult neurogenesis in health and disease.

Authors:  Carmen Estrada; Maribel Murillo-Carretero
Journal:  Neuroscientist       Date:  2005-08       Impact factor: 7.519

4.  The draft genome of the grass carp (Ctenopharyngodon idellus) provides insights into its evolution and vegetarian adaptation.

Authors:  Yaping Wang; Ying Lu; Yong Zhang; Zemin Ning; Yan Li; Qiang Zhao; Hengyun Lu; Rong Huang; Xiaoqin Xia; Qi Feng; Xufang Liang; Kunyan Liu; Lei Zhang; Tingting Lu; Tao Huang; Danlin Fan; Qijun Weng; Chuanrang Zhu; Yiqi Lu; Wenjun Li; Ziruo Wen; Congcong Zhou; Qilin Tian; Xiaojun Kang; Mijuan Shi; Wanting Zhang; Songhun Jang; Fukuan Du; Shan He; Lanjie Liao; Yongming Li; Bin Gui; Huihui He; Zhen Ning; Cheng Yang; Libo He; Lifei Luo; Rui Yang; Qiong Luo; Xiaochun Liu; Shuisheng Li; Wen Huang; Ling Xiao; Haoran Lin; Bin Han; Zuoyan Zhu
Journal:  Nat Genet       Date:  2015-05-04       Impact factor: 38.330

5.  A microsatellite linkage map of Barramundi, Lates calcarifer.

Authors:  Chun Ming Wang; Ze Yuan Zhu; Loong Chueng Lo; Felicia Feng; Grace Lin; Wen Tong Yang; Jian Li; Gen Hua Yue
Journal:  Genetics       Date:  2006-12-18       Impact factor: 4.562

Review 6.  Nitric oxide and the immune response.

Authors:  C Bogdan
Journal:  Nat Immunol       Date:  2001-10       Impact factor: 25.606

7.  A detailed linkage map of rainbow trout produced using doubled haploids.

Authors:  W P Young; P A Wheeler; V H Coryell; P Keim; G H Thorgaard
Journal:  Genetics       Date:  1998-02       Impact factor: 4.562

8.  A consensus linkage map of the grass carp (Ctenopharyngodon idella) based on microsatellites and SNPs.

Authors:  Jun Hong Xia; Feng Liu; Ze Yuan Zhu; Jianjun Fu; Jianbin Feng; Jiale Li; Gen Hua Yue
Journal:  BMC Genomics       Date:  2010-02-24       Impact factor: 3.969

9.  Expression of zebrafish nos2b surrounds oral cavity.

Authors:  Kar-Lai Poon; Michael Richardson; Vladimir Korzh
Journal:  Dev Dyn       Date:  2008-06       Impact factor: 3.780

10.  An ultra-high density linkage map and QTL mapping for sex and growth-related traits of common carp (Cyprinus carpio).

Authors:  Wenzhu Peng; Jian Xu; Yan Zhang; Jianxin Feng; Chuanju Dong; Likun Jiang; Jingyan Feng; Baohua Chen; Yiwen Gong; Lin Chen; Peng Xu
Journal:  Sci Rep       Date:  2016-05-26       Impact factor: 4.379

View more
  3 in total

1.  De novo screening of disease-resistant genes from the chromosome-level genome of rare minnow using CRISPR-cas9 random mutation.

Authors:  Rong Huang; Mijuan Shi; Lifei Luo; Cheng Yang; Mi Ou; Wanting Zhang; Lanjie Liao; Yongming Li; Xiao-Qin Xia; Zuoyan Zhu; Yaping Wang
Journal:  Gigascience       Date:  2021-11-19       Impact factor: 6.524

Review 2.  Animal board invited review: Widespread adoption of genetic technologies is key to sustainable expansion of global aquaculture.

Authors:  Ross D Houston; Christina Kriaridou; Diego Robledo
Journal:  Animal       Date:  2022-09-29       Impact factor: 3.730

3.  Dynamic Transcriptional Landscape of Grass Carp (Ctenopharyngodon idella) Reveals Key Transcriptional Features Involved in Fish Development.

Authors:  You Duan; Qiangxiang Zhang; Yanxin Jiang; Wanting Zhang; Yingyin Cheng; Mijuan Shi; Xiao-Qin Xia
Journal:  Int J Mol Sci       Date:  2022-09-30       Impact factor: 6.208

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.