Literature DB >> 35559043

Mutations in Growth-Related Genes Induced by EMS Treatment in Scallops.

Caihui Wang1, Bo Liu1, Min Chen2, Junhao Ning2, Xia Lu2, Chunde Wang1,2.   

Abstract

Background: The goal of genetic breeding is to select variants with mutations that are related to expected traits, such as fast growth. Artificial induction has been widely used to obtain strains with more mutations for further selection. Ethylmethylsulfone (EMS) is one of the most commonly used chemical mutagens in plant and microorganism breeding. However, the application of EMS mutagenesis in shellfish has not been reported. The aim of this study is to evaluate the potential use of EMS as a mutagen in scallop breeding, especially in characterization of mutations in growth-related genes.
Results: Our results indicated that hatching of about 50% of fertilized eggs was blocked by treatment with 20 mM EMS for 3 h and the resulted larvae developed normally into adult stages. We then evaluated the mutagenic effects of EMS by sequencing the genomes of 4 adult scallops from the control group and 12 from the treatment group at 8 months after fertilization. On average, after removing shared types of mutations, there were 1,151,380 ± 258,188 SNPs (Single Nucleotide Polymorphisms) and 229,256 ± 51,714 InDels (insertion-deletion) in each animal in the EMS treatment group, while there were only134841 ± 10,115 SNPs and 42,605 ± 5,136 InDels in the control group. The average mutation rate in the genome of the EMS treatment group (0.0137 ± 0.0013%) was about 9 times that of the control group (0.0015 ± 0.0002%). GO (Gene Ontology) annotation and KEGG (Kyoto Encyclopedia of Genes and Genomes) enrichment analyses revealed that mutations induced by EMS occurred evenly in most biological processes, cellular components and functions, as well in most pathways. However, significant lower percentage of mutations were found in the exonic region, in non-synonymous or Stopgain/Stoploss SNPs and in coding domains, suggesting apparent DNA repair or selection during grow-out stage. Analyses of the growth-related genes with mutations indicated that mutations in MFS (Major Facilitator Superfamily) and Tubulin were only found in the large-sized group (Five largest scallops: Treated-1, Treated-2, Treated-3, Treated-4, and Treated-5) and Homeobox and Socs (Suppressor of cytokine signaling) only in the small group (Two smallest scallops: Treated-11 and Treated-12). These results suggested that these genes may be involved in the regulation of growth in these animals, although further verification is certainly warranted.
Conclusion: Treatment of fertilized eggs with 20 mM EMS for 3 h induced 9 times more mutations in scallop genomes. We found that mutations in MFS and Tubulin may be related to fast growth in the large-sized group and those mutations in Homeobox and SOCs may be involved in the slow growth in the small-sized scallops. EMS can be used to accelerate selection of economically important traits in molluscs.
Copyright © 2022 Wang, Liu, Chen, Ning, Lu and Wang.

Entities:  

Keywords:  EMS; INDEL; SNP; gene; genome sequencing; mutagen; scallop

Year:  2022        PMID: 35559043      PMCID: PMC9086186          DOI: 10.3389/fgene.2022.879844

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.772


Background

The purpose of genetic breeding in aquaculture is to select variants with superior economically important traits, such as rapid growth, disease resistance and high survival. No matter which breeding method is used, successful selection depends on the availability of mutations in the base population. Mutations are the primary agents of long-term evolution. Genetic variation and naturally occurring mutations play an important role in evolution (Hou et al., 2019). Although naturally occurring mutations represent a major force driving evolution, the occurrence rate of spontaneous mutation is extremely low (Lian et al., 2020). Studies have shown that the natural mutation rate of specific loci in fish is generally lower than 1.0 × 10–6 (Kuroyanagi et al., 2013). The effective way to obtain a high mutation rate is by artificial induction. Artificially induced genetic variations represent important supplementary variation in plant breeding programs complementary to sources from natural origins (Lu et al., 2016). In recent years, artificial mutagenesis has been widely used in animal and plant breeding as well as in microbe breeding. For example, application of physical and chemical mutagenesis has led to great progress in the selection of algal strains with increased lipid production (Ornuma et al., 2018). Physical and chemical approaches are the most commonly used artificial induction methods in the construction of plant mutant library (Chen et al., 2018). Compared with physical mutagenesis, chemical mutagenesis is more cost-effective and does not require complicated equipment. Ethylmethylsulfone (EMS) is one of the most commonly used chemical mutagens in breeding of plants such as wheat and Maize (Tran et al., 2020; Xiong et al., 2020). By alkylation of guanine, EMS tends to induce more point mutations from G:C to T:A (Till et al., 2007; Sikora et al., 2011). EMS has been used in the selection of new strains with increased lipid productivity in Chlorella sp. (Ornuma et al., 2018) and also new strain with elevated tolerance to salt in wheats (Johanna et al., 2020). In recent years, EMS chemical mutagenesis has also begun to be applied in animals such as Drosophila (Lee et al., 2016) and nematodes (Song et al., 2014). However, EMS has not been used in genetic breeding of molluscs. The bay scallop Argopecten irradians is a commercially important bivalve widely cultured in northern China (Ning et al., 2018). The cultured population of the bay scallops originated from only a small number of brood stocks survived from the last attempt of 3 consecutive introductions in early 1980s (Zhang et al., 1986). Efforts have been devoted to improve the stocks through re-introduction of more brood stocks, hybridization and selective breeding. Selective breeding has been proven to be an effective approach for selection of variants with superior traits, mainly from populations with naturally occurring mutations (Wang et al., 2018). However, selective breeding based on naturally occurred mutations is limited by decreasing genetic diversity caused by continuous inbreeding and thus lack of selectable mutants (Zhao et al., 2019). Therefore, it is necessary to broaden the genetic diversity of the bay scallops by artificial induction. In the present study, we aimed to evaluate the feasibility of using EMS as a mutagen in the genetic breeding of scallops and characterize the mutations in growth-related genes.

Materials and Methods

Animals

The animals used in this study were ‘Bohai Red’ scallops, a new bay scallop strain selected from the hybrids between the bay scallop and the Peruvian scallop, which is widely cultured in northern China (Wang et al., 2017). The scallops were cultured in a scallop farm located in Yangma Island, Mouping, Shandong. They were conditioned to mature in the scallop hatchery of Yantai Spring-Sea AquaSeed Co., Ltd. located in Laizhou, Shandong.

Determination of 50% Lethal Concentration of EMS Treatment

When the scallops became mature, they were induced to spawn individually in beakers of 1 L by exposing them to air for 30 min followed by a temperature shock from 18 to 23°C. The spawning scallops were watched carefully to collect eggs and sperm separately. Eggs or sperm from different scallops were pooled together. Then sperm were mixed with eggs to obtain fertilized eggs for subsequent experiments. After fertilization, fertilized eggs were divided into six groups and a certain amount of EMS stock solution was added to the fertilized egg suspension so that the concentrations of EMS in the containers were 0, 10, 20, 30, 40, 50, and 60 mM. Three replicates were set for each group. EMS stock solution was purchased from Sigma (Ronkonkoma, United States). The fertilized eggs were exposed to EMS solutions of different concentrations for 3 h before they were filtered and washed with fresh seawater and kept in water bath for hatching at 23°C. The fertilized eggs were stirred every half hour. Twenty-eight hours after fertilization, three 1 ml samples were taken from each container and observed under a microscope. Hatching rates, the percentages of D-formed larvae in total fertilized eggs, were determined for each container. The 50% lethal concentration of EMS was determined based on the hatching rates of fertilized eggs exposed to different concentrations of EMS.

Animal Rearing and Sampling

The shell height and shell length of the larvae in each EMS concentration group were measured on Day 3, 6, 9, and 10 after fertilization to examine the effects of EMS on larval growth. Five 1 ml samples were randomly taken from each container for measurements. Ten days after fertilization, when over 50% of the larvae developed eyespots, the larvae were set on plastic collectors and allowed for metamorphosis. After metamorphosis, the set juveniles were taken first to the ponds and then to the open sea for nursery. When the juveniles reached 2 cm in shell height in late Aug., they were dispersed into lantern nets for grow-out. About 8 months after fertilization, about 20,000 scallops were harvested and screened for individuals with apparent phenotypic mutation. As large phenotypic variations existed in shell height, shell length and shell width, we used the sum of shell height, shell length and shell width to select the largest and smallest individuals. As a result, 5 largest (Treated-1, Treated-2, Treated-3, Treated-4, and Treated-5), 2 smallest scallops (Treated-11 and Treated-12) and 5 normal-sized (Treated-6, Treated-7, Treated-8, Treated-9, and Treated-10) were selected from the two ends and middle of the normal distribution of individual shell lengths for whole genomic resequencing. Four scallops from the untreated control group were also randomly selected for whole genome sequencing. The sizes of the sequenced scallops were given in Table 1.
TABLE 1

Sizes of the sequenced individuals in the control and treatment group.

GroupsSample IDLength (mm)Height (mm)Width (mm)Sum
Control groupControl_1666531162
Control_2656431160
Control_36563.531159.5
Control_4646330157
Large-sized treatment groupTreated_179.57731187.5
Treated_2776632175
Treated_3777632185
Treated_4777633186
Treated_5777731185
Normal-sized treatment groupTreated_6666431161
Treated_76565.529159.5
Treated_86451.529144.5
Treated_9635129143
Treated_10595828145
Small-sized treatment groupTreated_11514928128
Treated_125049.528.5128
Sizes of the sequenced individuals in the control and treatment group.

Whole Genome Sequencing

After harvest, the scallops were dissected to collect their adductor muscles. DNA was extracted from each adductor muscle sample. And the qualified DNA was randomly broken into 350bp fragments by Covaris crusher, and NDM607-01 was used to build the database. The whole library was prepared by terminal repair, ployA tail addition, sequencing connector addition, purification and PCR amplification. After the library construction was completed, Qubit2.0 was used for preliminary quantification, and the library was diluted to 1 ng/μl. Agilent 2100 was then used to test the insert size of the library, and QPCR was used to accurately quantify the effective concentration of the library (effective concentration of the library >2 nM). And sequenced using paired-end Illumina platform.

Evaluation of EMS-Induced Mutations

We aligned the raw data to the Bay scallop reference genome (BioSample accessions: SAMN08322131; Liu et al., 2020 using the mem algorithm in BWA (Burrows-Wheeler Alignment) (Li and Durbin, 2009). We then used Samtools to convert the file format and sort the subsequent alignment sequences (Li et al., 2009). Subsequently, GATK (The Genome Analysis Toolkit) was used to remove repeat sequences and to detect SNPs and InDels. SNPs and InDels specific to each group were obtained by removing shared mutations identified by comparing their mutation information (including CHROM, POS, REF, ALT, and GENO) in both the control and treatment group. Then we annotated each variation with annovar to obtain information for further identification of synonymous and nonsynonymous mutations. Finally, SNPs/InDels were annotated to the reference genome using the annovar package (Alexander et al., 2009; Vilella et al., 2009; Wang et al., 2010; Danecek et al., 2011; Yang et al., 2011; Schiffels and Durbin, 2014; Zhang et al., 2018). For GO and KEGG enrichment analysis, the final results were obtained by performing enrichment analysis using hypergeometric test, followed by FDR (False Discovery Rate) correction of the p-values.

Screening of Mutations in Growth-Related Genes

We compared the genes with nonsynonymous SNP sites in the large- and small-sized individuals against controls and normal-sized individuals to find the mutated genes that appeared in only large- or small-sized individuals. From these mutated genes, we then searched for genes that are potentially related to growth.

Statistics and Data Analyses

Statistical analyses of the data were performed using the SPSS (Statistical Product and Service Solutions) statistical package (SPSS Science, Chicago, Illinois). Briefly, the raw data on size parameters in different treated group and hatching rate were first tested for normality of distribution using the Kolmogorov-Smirnov and Shapiro-Wilk tests, and for homogeneity of the variance using the Levene tests. If the raw data failed these tests, square root or natural logarithm transformations were then performed. When both the normality and homogeneity were met, the differences in growth and hatching rate were analyzed with one-way analysis of variance (ANOVA) followed by multiple comparison tests (Tukey’s test) (Wang et al., 2010).

Results

Determination of the 50% Lethal Concentration of EMS Treatment

To figure out the appropriate dose for EMS treatment, we first determined the 50% lethal concentration of EMS by examining the hatching rates of fertilized eggs treated with different concentration of EMS. Our results showed that EMS inhibited hatching rate in a dose-dependent manner. In the treatment groups, some fertilized eggs either developed into abnormal embryos or ceased to develop at early stages (Figure 1). Compared with the control group, about 50% of the fertilized eggs was blocked from hatching by 20 mM EMS (Figure 2). Therefore, we chose 20 mM EMS in subsequent experiments.
FIGURE 1

Hatching rate of fertilized eggs treated with different concentrations of EMS.

FIGURE 2

D-formed larvae at 28 h after fertilization in the control (A) and the EMS treatment group (B).

Hatching rate of fertilized eggs treated with different concentrations of EMS. D-formed larvae at 28 h after fertilization in the control (A) and the EMS treatment group (B).

Effects of EMS Treatment on Larval Growth

EMS exhibited slightly inhibitory effects on larval growth. As can be seen in Figure 3, the growth curve of shell height and shell length of the treatment group (20 mM EMS) were significantly different from that of the control group during larval stage (p < 0.05, ANOVA). Shell height on day 6 and day 10 and shell length on Day 6, 9, and 10 in the control group were significantly larger than those of the EMS treatment group (p < 0.05, t-test).
FIGURE 3

Effects of EMS on growth in shell height (A) and shell width (B) of scallop larvae.

Effects of EMS on growth in shell height (A) and shell width (B) of scallop larvae.

Genomic Analyses of EMS-Induced Mutations

To identify the mutations induced by EMS that may have contributed to important traits such as growth, we re-sequenced the genomes of the largest and smallest individuals from the treatment group, as well as those of the control group. In total, we obtained 156.9 Gb clean data from these 16 animals. The Q30 values of all samples were higher than 80%, indicating that the data were of low error rate and high quality (Table 2). The average mapping rate to the reference genome sequence was 99.54% with an average sequencing depth of 11.19× (Table 3).
TABLE 2

Genome sequencing data.

Sample IDClean Reads pairsClean base (bp)LengthQ20 (%)Q30 (%)GC (%)
Control_172,397,38810,730,101,234148.34; 148.0896.8; 97.189.0; 89.635.0; 34.9
Control_286,746,90612,841,328,031148.15; 147.9296.4; 96.988.1; 89.035.5; 35.3
Control_378,575,28411,637,694,182148.29; 147.9296.8; 96.988.9; 89.235.1; 35.0
Control_480,070,25611,846,376,980148.14; 147.7696.5; 96.788.2; 88.636.1; 36.0
Treated_167,832,69010,041,975,926148.2; 147.8897.3; 97.690.2; 91.035.9; 35.9
Treated_264,159,1909,470,920,591148.25; 146.9896.7; 96.388.7; 87.335.4; 35.4
Treated_363,990,2249,454,811,079147.95; 147.5696.3; 96.587.3; 87.635.1; 35.1
Treated_465,044,5609,615,563,726148.02; 147.6496.4; 96.687.5; 87.734.9; 34.8
Treated_567,712,52810,006,244,700148.06; 147.4996.5; 96.487.8; 87.435.7; 35.7
Treated_665,341,5109,667,122,876148.06; 147.8496.5; 96.887.9; 88.335.7; 35.7
Treated_766,519,7289,826,753,951148.07; 147.3896.3; 96.387.3; 86.934.8; 34.7
Treated_866,587,0849,863,157,597148.17; 148.0896.7; 97.088.2; 89.134.9; 34.8
Treated_978,747,89611,665,293,124148.25; 148.0296.8; 97.189.0; 89.635.2; 35.1
Treated_1064,408,9829,530,201,612148.16; 147.7796.7; 96.988.8; 89.234.8; 34.7
Treated_1168,486,44810,122,154,287148.18; 147.4296.5; 96.388.2; 87.535.7; 35.7
Treated_1274,752,34811,073,164,954148.29; 147.9796.7; 96.988.7; 89.134.9; 34.7
TABLE 3

Summary of assembly results.

SampleAverage_sequencing_depth (x)Coverage (%)Mapped (%)Properly mapped (%)Singletons mapped (%)
Control_111.4984.2999.5991.160.15
Control_213.3684.7199.6190.770.15
Control_312.5884.5599.691.110.15
Control_412.5184.6999.5990.510.15
Treated_110.6583.8299.6792.190.13
Treated_210.1484.0099.2490.110.17
Treated_310.2282.5799.4990.620.17
Treated_410.5283.8799.5590.930.17
Treated_510.6684.2499.5590.810.16
Treated_610.2984.0399.6191.410.15
Treated_710.6682.8799.5490.680.18
Treated_810.8183.9699.5491.150.15
Treated_912.4884.6099.6190.470.14
Treated_1010.5282.9299.5690.600.17
Treated_1110.8584.5399.6190.800.15
Treated_1212.1284.4599.5290.970.15
Genome sequencing data. Summary of assembly results.

SNPs

The average number of SNPs in the control group (5,866,344 ± 95,080) is not significantly different from that of the EMS treatment group (5,626,416 ± 297,062) (Table 4). However, after removing the shared SNPs with the control group, significantly more SNPs were found in the EMS treatment group than the control group (p < 0.05, t-test). As can be seen from Table 4, the number of SNPs in the control group was 134,841 ± 10,115 while that in the EMS treatment group was 1,151,380 ± 258,188. The SNPs are mainly located in intergenic regions, followed by intronic regions, and the rest were located in upstream and exon region. The Ts/Tv (Transitions/Transversions) value in the EMS treatment group was significantly higher than that in the control group (p < 0.05) (Table 5). (Transitions means a transformation between nucleotides of the same type; Transversions means a transformation between nucleotides of the different type).
TABLE 4

SNP information in the control and EMS treatment group after calling.

Sample IDSNP numTransitionTransversionTs/Tv
Control_15,734,8702,994,8392,649,2341.13
Control_25,962,0103,114,4682,754,5711.13
Control_35,888,8223,075,8692,721,2731.13
Control_45,879,6753,075,3632,711,4451.13
Treated_15,595,9112,927,4782,576,9771.14
Treated_25,655,1612,958,4532,606,8121.13
Treated_35,174,4232,699,8032,377,1891.14
Treated_45,769,9203,016,8642,662,8031.13
Treated_55,658,2792,961,1102,606,8431.14
Treated_65,704,3522,985,5032,628,3241.14
Treated_75,404,2892,820,6292,488,6611.13
Treated_85,635,5512,942,4062,602,2251.13
Treated_96,012,3003,141,6942,778,3851.13
Treated_105,037,0142,622,7212,320,4831.13
Treated_115,913,4703,095,1252,728,0731.13
Treated_125,956,3253,110,3182,754,4781.13
TABLE 5

SNP number in the control and EMS treated group after removing shared types with controls.

Sample IDSNP NumberTransitionTransversionTs/TvExonicIntronicMutation rate (%)
Control_1144,15774,61269,5451.075,22155,3370.001343
Control_2133,64169,13664,5051.075,07647,0830.001041
Control_3140,42172,74467,6771.075,09352,1570.001207
Control_4121,14463,02058,1241.084,21745,9830.001023
Treated_1919,124486,397432,7271.1233,221343,9860.009153
Treated_2899,577474,911424,6661.1231,786338,1800.009498
Treated_31,259,663665,325594,3381.1244,543480,8910.013323
Treated_4836,603441,775394,8281.1228,400317,3730.008701
Treated_5884,410467,731416,6791.1231,431336,8620.008839
Treated_6844,165447,477396,6881.1331,064322,8160.008732
Treated_71,301,091686,338614,7531.1243,564499,7730.013240
Treated_81,283,904676,586607,3181.1143,240484,3510.013017
Treated_91,479,438779,953699,4851.1249,384557,0130.012682
Treated_101,188,795625,965562,8301.1140,539451,1100.012474
Treated_111,452,761768,189684,5721.1251,409553,6800.014352
Treated_121,467,033772,501694,5321.1148,801547,5070.013249
SNP information in the control and EMS treatment group after calling. SNP number in the control and EMS treated group after removing shared types with controls. The results also showed that the average number of non-synonymous SNP in the treatment group (17,020 ± 3,449) were much higher than those in the controls (2471 ± 275). The average number of Stopgain/Stoploss SNPs in the treatment group (358 ± 60) was also higher than that of the control group (64 ± 9) (Table 6).
TABLE 6

SNP mutation types in the control and EMS treatment groups.

Sample IDSynonymous SNPNon-synonymous SNPStopgainStoploss
Control_124532664627
Control_223432604686
Control_324612551529
Control_420572065494
Treated_118,23214,47728733
Treated_217,66113,66727831
Treated_324,73019,12036336
Treated_415,77312,17423524
Treated_517,55113,42028925
Treated_617,30813,31724827
Treated_724,46418,45136546
Treated_824,10018,55134443
Treated_927,65221,06236750
Treated_1022,71017,30031052
Treated_1129,03821,69636649
Treated_1227,13721,00437553
SNP mutation types in the control and EMS treatment groups. GO enrichment analysis for the genes with SNPs located in the upstream and exon regions revealed that genes were significantly enriched in the GO terms of protein binding, cell-cell adhesion via plasma-membrane adhesion molecules, homophilic-cell adhesion via plasma-membrane adhesion molecules, binding and cell-cell adhesion in the EMS treatment group, but the genes were only enriched in GO term of protein binding in the control group (Figure 4).
FIGURE 4

(A) GO annotation analysis of SNPs located in upstream and exon region in the control group. (B) GO annotation analysis of SNPs located in upstream and exon region in the EMS treatment group.

(A) GO annotation analysis of SNPs located in upstream and exon region in the control group. (B) GO annotation analysis of SNPs located in upstream and exon region in the EMS treatment group. GO and KEGG analyses of the genes with mutations showed that there were no distinct distribution for the GO terms that were classified into the biological processes, cellular components and molecular functions between the EMS-treated group and control group, as well as for the KEGG pathways (Figure 5).
FIGURE 5

(A) GO annotation analysis of nonsynonymous SNPs in the control group. (B) GO annotation analysis of nonsynonymous SNPs in the treatment group. (C) KEGG enrichment analysis of nonsynonymous SNPs in the control group. (D) KEGG enrichment analysis of nonsynonymous SNPs in the treatment group.

(A) GO annotation analysis of nonsynonymous SNPs in the control group. (B) GO annotation analysis of nonsynonymous SNPs in the treatment group. (C) KEGG enrichment analysis of nonsynonymous SNPs in the control group. (D) KEGG enrichment analysis of nonsynonymous SNPs in the treatment group.

InDels

The average numbers of InDels in the genome and in the CDs of the control group (1,548,571 ± 28,720 in the genome and 9,016 ± 310 in the CDs) was also not significantly different from that of the EMS treatment group (1,462,511 ± 83,433 in the genome and 8,518 ± 552 in the CDs) (Table 7). After removing shared InDels in the controls, the average numbers of InDels in the genome and in the CDs of the EMS treatment group (229,256 ± 51,714 in the genome and 1798 ± 302 in the CDs) were much higher than those in the control group (42,605 ± 5,136 in the genome and 640 ± 15 in the CDs) (Table 8).
TABLE 7

InDels in the control and EMS treatment group after calling.

SampleGenome totalGenome insertionGenome deletionCDS totalCDS insertionCDS deletion
Control_11,511,400681,992829,4088,6063,5155,091
Control_21,580,568713,062867,5069,2333,7735,460
Control_31,556,430701,814854,6168,9463,6585,288
Control_41,545,887696,134849,7539,2783,7865,492
Treated_11,424,749636,777787,9729,1133,6795,434
Treated_21,457,488657,042800,4468,4243,4394,985
Treated_31,341,891607,142734,7497,7873,1004,687
Treated_41,500,738676,211824,5278,2393,2684,971
Treated_51,465,430660,129805,3018,8083,5825,226
Treated_61,466,508659,425807,0838,7893,6275,162
Treated_71,412,924639,855773,0697,7943,1704,624
Treated_81,468,654661,746806,9088,4443,4035,041
Treated_91,589,485716,156873,3299,3873,7185,669
Treated_101,312,194593,108719,0867,6943,1444,550
Treated_111,535,662692,346843,3168,7853,5885,197
Treated_121,574,406709,673864,7338,9583,5995,359
TABLE 8

InDels in the control and EMS treated group after removing shared types with controls.

SampleGenome totalGenome insertionGenome deletionCDS totalCDS insertionCDS deletion
Control_147,08121,17825,903658263395
Control_238,75517,31521,440634236398
Control_337,59818,52419,074623225356
Control_446,98717,56829,419645261387
Treated_1167,84376,623101,42218677211,146
Treated_2169,39876,465103,6721,501589912
Treated_3175,975107,611142,50119136681,245
Treated_4178,04573,19096,2081,260432828
Treated_5180,137112,309147,8401,525589936
Treated_6238,23276,06299,9131,518599919
Treated_7250,11272,36095,48317716621,109
Treated_8253,471129,826168,34218476641,183
Treated_9260,149109,500143,97122888431,445
Treated_10284,295123,192161,10318376961,141
Treated_11295,252128,607166,64520967821,314
Treated_12298,168102,954135,27821517771,347
InDels in the control and EMS treatment group after calling. InDels in the control and EMS treated group after removing shared types with controls. Go and KEGG analysis of InDels showed that the number of genes enriched in the treated group was almost twice that of the control group. And mutations were evenly distributed across nearly all biological processes, cellular components and molecular functions, and nearly all pathways (Figure 6).
FIGURE 6

(A) GO annotation analysis of InDels in the control group. (B) GO annotation analysis of Indels in the treatment group. (C) KEGG enrichment analysis of Indels in the control group. (D) KEGG enrichment analysis of Indels in the treatment group.

(A) GO annotation analysis of InDels in the control group. (B) GO annotation analysis of Indels in the treatment group. (C) KEGG enrichment analysis of Indels in the control group. (D) KEGG enrichment analysis of Indels in the treatment group.

Mutation Rates Induced by EMS Treatment

In this study, statistics were performed on data after removing mutations shared between control and treated groups. we have found that the average mutation rate as revealed from SNPs of the control group was 0.00115 ± 0.00015%, while that of the EMS treatment group was 0.01144 ± 0.00222%. And the average mutation rate as revealed from InDels of the control group was 0.000365 ± 0.000064%, while that of the EMS treatment group was 0.002281 ± 0.000473%. Taken together, the average mutation rate in the genome of the EMS treatment group (0.013721 ± 0.001347%) was much higher than that in the control group (0.001518 ± 0.000215%) (p < 0.05). Our results showed that the average mutation rate of the treatment group was about 9 times that of the control group. We also found that the mutations in the treatment group tend to occur in non-coding or intronic regions, resulting in synonymous mutations or mutations with fewer Stopgain/Stoploss. The percentage of exonic SNPs in the total SNPs in the EMS treatment group (3.47 ± 0.12%) was significantly lower than that of the control group (3.63 ± 0.13%; p < 0.05) while the percentage of intronic SNPs in the total SNPs was not different between them (p > 0.05) (Table 5). The percentage of non-synonymous SNPs in the EMS treatment group (1.49 ± 0.06%) was significantly lower than that of the control group (1.83 ± 0.10%) while the percentage of synonymous SNPs in the EMS treatment group (1.93 ± 0.06%) was significantly higher than that of the control group (1.73 ± 0.03%) (p < 0.001 for both comparisons) (Table 6). Similarly, the percentage of Stopgain/Stoploss SNPs in the treatment group (0.032 ± 0.002%) was also significantly lower than that of the control group (0.048 ± 0.006%) (p < 0.001; Table 7). The percentage of InDels in the coding domains out of the total InDels in the whole genome in the treatment group (0.81 ± 0.16%) was much lower than that of the control group (1.52 ± 0.15%) (p < 0.001; Table 8).

Mutations in Growth-Related Genes

To find the potential mutations that may be involved in growth regulation, we attempted to screen the mutations in growth-related genes that existed in only the large-sized animals or small-sized animals but not in the control group that were not exposed to EMS or normal-sized animals that were treated with EMS. We made a matrix using the information of nonsynonymous SNPs by first filtering for genes that contained only nonsynonymous SNP sites in the treated group and then further filtering for genes that contained only nonsynonymous SNP sites in the large individual group as well as only in the small individual group. Our results showed that mutations in major facilitator superfamily (MFS) and Tubulin were found only in the large-sized group while mutations in Homeobox and Socs were found only in the small-sized group.

Discussion

Effects of EMS Treatment on Hatching and Larval Growth

In this study, we found that EMS had inhibitory effects on hatching and larval growth. Similar results have also been observed in previous studies (Hou et al., 2019). We speculate that the mutation induced by EMS may affect the transcription of mRNA and disorder the expression of protein, consistent with the principle of EMS mutagenesis.

Feasibility of Using EMS as a Mutagen in Scallops

As naturally occurring mutations are often limited for genetic breeding programs, artificial mutation induction has become a potent method for the development of novel germplasm (Penna et al., 2012). In this study, we examined the feasibility of using EMS as a mutagen in scallops breeding. We demonstrated that by treating fertilized eggs at 20 mM EMS for 3 h, we were able to induce 9 times more mutations than naturally occurring mutations. Of these mutations, about 3.47% occurred in the coding region and 1.48% were nonsynonymous mutations. These mutations may cause alteration in amino acid sequences in genes that are related to important traits and thus provide rich variations for subsequent selective breeding. Since only the mutations in the gonads may be inherited to offspring, screening of mutated individuals with remarkable phenotypes (or expected traits) in the next generation is thus needed for breeding. The mutated individuals may also provide new opportunities for the exploration of the genes related to the mutated traits. This is demonstrated by the subsequent analyses of growth-related genes utilizing the largest and smallest individuals in the treatment group, as discussed later.

Features of EMS-Induced Mutations in Scallops

In this study, EMS induced extensive variations in the genome, with a high mutation rate of 0.014 ± 0.001%. Mutations in the format of both SNPs and InDels were induced in the mutated individuals. SNPs accounted for the majority of mutations, indicating that point mutations were the main type of variation induced by EMS treatment. Similar results have been found in Oryzias latipes induced by ENU (N-ethyl-N-nitrosourea), another commonly used chemical mutagen (Yoshihito et al., 2006). After removing the shared SNPs of the controls, we found that the Ts/Tv values of both the treatment group and the control group were slightly larger than 1.0, indicating more transitions than transversions. The cause for more transitions than transversions may lie in that fact that 5-methylcytosine residues are the most prone mutation sites in the genome and can be easily deaminated to form thymine. The Ts/Tv values in both the control and the treatment group were significantly higher than the theoretical value of Ts/Tv, which is 0.5, a phenomenon named conversion deviation. It was believed that the formation of conversion deviation may be a way to reduce harmful mutations due to the selection in the long-term evolutionary process (Wakeley, 1996). Therefore, we speculate that the individuals with increased Ts/Tv in the treatment group may had a better chance of survival than those with lower Ts/Tv. Our results also showed that there were more nonsynonymous than synonymous SNPs in the control group. In the contrary, more synonymous than nonsynonymous SNPs were found in the EMS treatment group. Nonsynonymous mutations are thought to be largely deleterious due to their property of changing amino acids. For example, it has been found that the ratio of nonsynonymous mutation/synonymous mutation involved in cancer related genes was significantly lower than that of normal genes (Chu and Wei, 2019). It is thus possible that individuals with more nonsynonymous mutations eventually did not survive, resulting in a low nonsynonymous mutation/synonymous in the EMS treatment group. Previous studies have shown that Stopgain/Stoploss SNPs may alter protein functions (Boschiero et al., 2018). Our results also showed that the numbers of Stopgain/Stoploss SNPs in the individuals treated with EMS were significantly more than those in the control group, indicating that EMS treatment may also affect protein functions by these Stopgain/Stoploss SNPs. Besides the SNPs in coding regions, the importance of those SNPs located in the non-coding regions cannot be overlooked. In human, almost 90% of disease-related gene variants are found to be located in the non-coding region of the genome. The variations in non-coding regions can cause diseases through alterations in gene expression by changing important regulatory elements (Rojano et al., 2016). In fact, the abundance of SNPs in noncoding regions may serve as an important source of variations for breeding programs (Arabnejad et al., 2018).

Mutation Sites in the EMS Treatment Group

Our results showed that the mutations induced by EMS treatment distributed evenly within the genome, among all the biological processes, cellular components, molecular functions as well as pathways. While EMS seemed to induce mutations randomly in the genome, the mutations in the survivors of the treatment group tend to be less lethal to the animals. More mutations occurred in the intronic or non-coding domains, resulting in fewer non-synonymous mutations that may alter the amino acid sequences of functional proteins. The reasons for this observation may be caused by DNA repair after EMS induction and higher mortality in the individuals with lethal mutations in functional proteins. Recently, Monroe et al. (2022) indicated that mutations tend to occur at sites that are less critical to the survival of plants. The organisms seemed to have developed certain mechanisms to ensure that their important domains not affected by mutations. Our results apparently also support this hypothesis.

Mutations in the Growth-Related Genes

In this study, we found that mutations in MFS and Tubulin were only found in large-sized individuals, suggesting that these mutations in MFS and Tubulin may be the major cause for the rapid growth in these large-sized individual. Similarly, the mutations in Homeobox and Socs were only found in small-sized individuals, suggesting that these mutations may be related to the slow growth in these small-sized individuals. The major facilitator superfamily (MFS) is a ubiquitous group of proteins involved in the transport of a wide range of compounds (Elena et al., 2006). MFS gene is a meristem regulating gene in rice development and mutation in MFS may result in additional organ growth (Ren et al., 2017). Tubulin has been reported to be involved in cell elongation in flax (Gavazzi et al., 2017). The homeobox family is a large and diverse superclass of genes, many of which may act as transcription factors that play important roles in embryogenesis, tissue differentiation and in animals (Pearson et al., 2005; Di et al., 2015; Hench et al., 2015; Fu et al., 2021) or as master regulators for developmental genes (Ramanathan et al., 2018). Homeobox gene may also be involved in DNA repair during growth (Feltes, 2019). The SOCs are key negative regulators of cytokine and growth factor signaling (Wang et al., 2019) which may exert negative effects on cytokine signaling pathways involved in immunity, growth and development (Liu et al., 2016; Tian et al., 2020). Studies have shown that Socs gene are required for cytokine signal attenuation in mammary epithelial cells and act to limit proliferation through a negative feedback mechanism (Arun et al., 2015). In human cancer cells, activation of SOCs by epigenetic modulation through histone acetylation may induce apoptosis (Sanaei et al., 2020). Despite these lines of indirect evidence, the functions of these growth-related genes have not been well studied. Therefore, further studies are certainly needed to understand how mutations in these genes affect growth in the scallops.
  44 in total

1.  Ethyl methanesulfonate mutant library construction in Gossypium hirsutum L. for allotetraploid functional genomics and germplasm innovation.

Authors:  Xin Lian; Yan Liu; Huihui Guo; Yijie Fan; Jianfei Wu; Haixia Guo; Chengzhi Jiao; Zhengmin Tang; Li Zhang; Yupeng Fan; Zhongyuan Gou; Changyu Zhang; Tongtong Li; Fanchang Zeng
Journal:  Plant J       Date:  2020-04-28       Impact factor: 6.417

2.  Molecular characterization, expression pattern and evolution of nine suppressors of cytokine signaling (SOCS) gene in the swamp eel (Monopterus albus).

Authors:  Bo Tian; Dongdong Tang; Jinming Wu; Meng Liang; Du Hao; Qiwei Wei
Journal:  Fish Shellfish Immunol       Date:  2019-12-04       Impact factor: 4.581

3.  Transcriptomic Profiling Provides Insights into Inbreeding Depression in Yesso Scallop Patinopecten yessoensis.

Authors:  Liang Zhao; Yangping Li; Jiarun Lou; Zhihui Yang; Huan Liao; Qiang Fu; Zhenyi Guo; Shanshan Lian; Xiaoli Hu; Zhemin Bao
Journal:  Mar Biotechnol (NY)       Date:  2019-07-13       Impact factor: 3.619

4.  Mutagenesis as a tool in plant genetics, functional genomics, and breeding.

Authors:  Per Sikora; Aakash Chawade; Mikael Larsson; Johanna Olsson; Olof Olsson
Journal:  Int J Plant Genomics       Date:  2012-01-22

5.  Novel breeding approach for Japanese flounder using atmosphere and room temperature plasma mutagenesis tool.

Authors:  Hou Ji-Lun; Zhang Xiao-Yan; Wang Gui-Xing; Sun Zhao-Hui; Du Wei; Zhao Ya-Xian; Si Fei; Wang Li-Yan; Xing Xin-Hui; Wang Yu-Fen
Journal:  BMC Genomics       Date:  2019-04-29       Impact factor: 3.969

6.  Draft genomes of two Atlantic bay scallop subspecies Argopecten irradians irradians and A. i. concentricus.

Authors:  Xiao Liu; Chao Li; Min Chen; Bo Liu; Xiaojun Yan; Junhao Ning; Bin Ma; Guilong Liu; Zhaoshan Zhong; Yanglei Jia; Qiong Shi; Chunde Wang
Journal:  Sci Data       Date:  2020-03-23       Impact factor: 6.444

7.  Mutation bias reflects natural selection in Arabidopsis thaliana.

Authors:  J Grey Monroe; Thanvi Srikant; Pablo Carbonell-Bejerano; Claude Becker; Mariele Lensink; Moises Exposito-Alonso; Marie Klein; Julia Hildebrandt; Manuela Neumann; Daniel Kliebenstein; Mao-Lun Weng; Eric Imbert; Jon Ågren; Matthew T Rutter; Charles B Fenster; Detlef Weigel
Journal:  Nature       Date:  2022-01-12       Impact factor: 69.504

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  New approach for fish breeding by chemical mutagenesis: establishment of TILLING method in fugu (Takifugu rubripes) with ENU mutagenesis.

Authors:  Miwa Kuroyanagi; Takashi Katayama; Tadashi Imai; Yoshihisa Yamamoto; Shin-ichi Chisada; Yasutoshi Yoshiura; Tomokazu Ushijima; Tomonao Matsushita; Masashi Fujita; Aoi Nozawa; Yuzuru Suzuki; Kiyoshi Kikuchi; Hiroyuki Okamoto
Journal:  BMC Genomics       Date:  2013-11-13       Impact factor: 3.969

10.  Transcriptome sequencing reveals hotspot mutation regions and dwarfing mechanisms in wheat mutants induced by γ-ray irradiation and EMS.

Authors:  Hongchun Xiong; Chunyun Zhou; Huijun Guo; Yongdun Xie; Linshu Zhao; Jiayu Gu; Shirong Zhao; Yuping Ding; Luxiang Liu
Journal:  J Radiat Res       Date:  2020-01-23       Impact factor: 2.724

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.