Literature DB >> 31598078

Association between sequence variants in cadmium-related genes and the cadmium accumulation trait in thermo-sensitive genic male sterile rice.

Xiaohua Hao1,2, Canming Wu1, Rong Wang1, Lianfu Tian1, Taoyu Song1, Hang Tan1, Yangcheng Peng2, Meng Zeng1, Liangbi Chen1, Manzhong Liang1, Dongping Li1.   

Abstract

Although cultivation of hybrid rice varieties has been increasing, there are risks that high levels of cadmium (Cd) will accumulate in grain when such rice is grown in Cd-polluted environments. To produce Cd-safe hybrid rice, one practical approach is the generation of low Cd-accumulating parental lines. In two-line hybrid breeding, thermosensitive genic male sterile (TGMS) lines function as female parents to yield hybrid seeds. Recently, Cd accumulation-related genes have been identified; however, the effect of these genes on Cd accumulation in the grains of TGMS lines has yet to be reported. Here, 174 TGMS lines were selected for Cd accumulation phenotyping, and 30 TGMS lines, including 15 stable low-Cd and 15 high-Cd lines, were selected for single-nucleotide polymorphism (SNP) genotyping and association analysis. Association studies were conducted to identify the relationship between Cd accumulation and variable sites within seven candidate Cd-associated genes using logistic models. Nine sequence variant sites in four of the candidate genes were found to be significantly associated with Cd accumulation, two of which in OsNRAMP1 and OsNRAMP5 are low-Cd favorable variants, explaining 46.4% and 22.6% of the phenotypic variation, respectively. These loci could be developed as new molecular markers for identification of Cd accumulation characteristics and low-Cd marker-assisted breeding.
Copyright © 2019 by JAPANESE SOCIETY OF BREEDING.

Entities:  

Keywords:  Cd accumulation; association analysis; hybrid rice; molecular marker; variant loci

Year:  2019        PMID: 31598078      PMCID: PMC6776150          DOI: 10.1270/jsbbs.18191

Source DB:  PubMed          Journal:  Breed Sci        ISSN: 1344-7610            Impact factor:   2.086


Introduction

In recent years, cadmium (Cd) contaminated rice has become an issue of major concern in a number of rice-growing countries. For people living in China and East Asia, where rice is as staple food, this crop plant is the main source of ingested Cd. Cd over-accumulation in the human body can damage health, or even lead to illness, such as the well-known Itai-itai disease (Horiguchi , Takahashi ), and increase the risk of carcinogenesis in multiple human organs, including the pancreas (Buha ), breast (Van Maele-Fabry ), kidney (Huff , Il’yasova and Schwartz 2005, Song ), bladder (Feki-Tounsi and Hamza-Chaffai 2014), prostate gland (Rapisarda , Vinceti ), and testicle (Goyer ). In attempts to lower Cd accumulation in rice, a number of remedial measures have been taken, including physical, chemical, and microbiological methods, whereby Cd ions are fixed in soils or removed from fields (Brown , Chaney , Murakami , 2009). Nevertheless, it is generally very difficult to remediate Cd-polluted paddy soils to safe levels within a short period of time. Consequently, a more practical approach is to breed inbred rice varieties or hybrid combinations with a low-Cd accumulation trait. Over the past few decades, high-yielding hybrid rice varieties have been planted extensively to meet the needs of increasing populations. To ensure that hybrid rice produces Cd-safe grains, it is crucial to breed low-Cd trait parent lines. In two-line hybrid rice breeding, thermosensitive genic male sterile (TGMS) line rice is used as the female parent to bear hybrid seeds with heterosis. TGMS lines are a new type of male sterile line, in which the sterility varies with temperature. At the booting stage, when temperatures are lower than 23°C, TGMS lines bear normal pollen grains, which give rise to TGMS seeds. However, when temperatures exceed 23°C, these lines bear abortive pollen grains, and can be used as the maternal parents to produce hybrid combinations. The discovery of TGMS has opened up new avenues for the utilization of heterosis in rice and provided a germplasm basis for the transformation of hybrid rice from a “three-line method” to a “two-line method”. However, although the relative frequency of ideal hybrid rice varieties obtained is considerably higher when using the two-line method, information on natural variation in the major Cd-accumulation genes, including the reported link with the Cd-accumulation trait in the TGMS lines of brown rice, is relatively limited. In recent years, several genes that affect grain Cd accumulation in rice have been reported, including OsLCD (Shimo ), OsHMA2 (Takahashi ), OsHMA3 (Miyadate ), OsNRAMP1 (Takahashi ), OsNRAMP5 (Ishikawa , Tang ), OsMTP1 (Yuan ), OsLCT1 (Uraguchi ), OsABCG43 (Oda ), and OsCCX2 (Hao ). OsNRAMP5 is a root-expressed Mn transporter, and is a key factor involved in Cd uptake by roots (Sasaki ). Knockout of this gene inhibits Cd uptake, and is accordingly a pivotal target for low-Cd breeding (Sasaki , Takahashi , Yang ). Although OsNRAMP1 functions primarily as an iron transporter, it also plays a role in Cd uptake (Takahashi ). OsHMA3 and OsHMA2 are members of the ATPase transporter family in rice. OsHMA3 can sequester Cd in the vacuoles of root cells (Miyadate , Ueno , Wu ). Enhancing OsHMA3 expression decreases grain Cd content (Ueno ), whereas dysfunction of OsHMA3 has been found to lead to an increase in the Cd content of aboveground parts (Miyadate , Ueno , 2011, Uraguchi , Yan ). Almost all of the japonica rice cultivars contain the OsHMA3 gene and have low grain Cd contents, whereas most of the indica rice cultivars contain a mutated OsHMA3 gene and have higher grain Cd contents. OsHMA2 is expressed in the vascular system of both roots and nodes, and Cd contents in the aerial parts of rice plants are reduced when OsHMA2 is mutated (Satoh-Nagasawa , Takahashi , Yamaji ). However, although these Cd accumulation-related genes have been demonstrated to have certain effects on the Cd content in some materials, it remains to be clarified whether they have similar effects on Cd accumulation in the grains of TGMS lines. Recombinant inbred line-based (RIL) linkage mapping is an efficient method for quantitative trait locus analysis, which has the merit of simple genetic background (Huang and Han 2014). However, this method can only be used to explore sites with the strongest influence on the trait in question, and cannot be used to analyze phenotypes with low frequency in natural populations. In addition, when using this method, there are stringent requirements for the mapping population. Firstly, the population number must be sufficiently large, and, secondly, the genetic background of individual samples must be stable and homozygous, which can be achieved via selfing of the F2 population for several generations. Association analysis is an analytical method based on linkage disequilibrium (LD) that can be used to identify the relationship between a group of target traits and genetic markers (Gupta , Zondervan and Cardon 2004). It encompasses two analytical approaches, one of which is based on candidate gene sequences, and the other of which is based on whole-genome sequences (also referred to as genome-wide association study, GWAS) (Flint-Garcia , 2005, Zhu ), which are efficient methods for exploring effective variant loci (Liu ). Over the past few decades, scientists have mapped many pathogenic loci by applying the association analysis technique. Inspired by genetic research on human diseases (Cardon and Palmer 2003), association analysis has also been adopted by plant breeders for research on crop genetics and breeding (Flint-Garcia ). Using natural populations, researchers can analyze the association between candidate gene polymorphisms and target traits, making it possible to identify associations between polymorphisms of functional loci and phenotypes. Thornsberry and colleagues (2001), for example, performed association analysis for the first time in plants based on candidate gene strategies, whereby they examined the association between the polymorphic loci of the Dwarf 8 gene and the flowering time phenotype in Zea mays. Subsequently, Andersen verified the association of maize Dwarf8 polymorphism with flowering time and plant height in inbred lines under four different environmental conditions. Association analysis has also been shown to be an effect approach for gene mining and gene function verification, and has opened up new opportunities for quantitative trait research. In this study, as experimental material, we used 174 TGMS lines, which were collected from different rice ecological regions and show wide genetic diversity. On the basis of a population structure analysis, we used a logistic model to analyze the relationship between the single-nucleotide polymorphism (SNP) variations of seven candidate genes (OsLCD, OsNRAMP1, OsNRAMP5, OsHMA2, OsHMA3, OsMPT1, OsABCG43) and the Cd accumulation trait associated with each gene. The findings of this analysis will contribute to gaining a better understanding of the association between sequence variants in Cd-related genes and Cd accumulation in TGMS lines of brown rice. Moreover, this approach could enable us to identify novel molecular marker(s) associated with the low-Cd trait in TGMS lines, which could in turn be applied in novel low-Cd rice resource breeding, including low-Cd TGMS lines.

Materials and Methods

Plant materials and growth conditions

The 174 TGMS lines analyzed in this study were obtained from the Hunan Province Key Laboratory of Crop Sterile Germplasm Resource Innovation and Application. Analyses were conducted over two successive years (2016 and 2017). Rice seeds were soaked in water for 2 days, and then germinated for 2 days in an incubator at 30°C. Fifteen-day-old seedlings were transplanted to paddy soil (containing 3.9 mg Cd per kg soil) at the end of June in each of the study years. Two seedlings were planted in individual buckets (30 cm × 30 cm) containing the paddy soil, and three replicates were assessed for each line. At the booting stage, the plants were exposed to low temperature (19–23°C) for 15 days in an intelligent cryogenic pool. As mentioned in the introduction, low temperature treatment below 23°C can promote the recovery of pollen grain fertility in TGMS varieties, thereby facilitating viable seed production via self-crossing. In the present study, we examined pollen fertility to ensure that it was greater than 80%. After the low temperature treatment, all materials were placed in the same pool for uniform moisture management. During the latter stage of grain filling, water content was controlled to maintain an appropriate level of soil moisture, and to ensure that no water accumulated in the barrel until the seeds were ripe. When the seeds were mature, the grain Cd content of plants with a seed setting rate of 70% or more was determined. Because of the different growth cycles, the materials were subjected to low temperature treatment in batches, their pollen fertility was checked separately, and water control management was carried out from 20 days after anthesis to seed maturity.

Analysis of grain Cd concentrations

For grain Cd analysis, brown rice samples were placed in an oven at 105°C for 12 h and subsequently at 70°C until completely dehydrated. Each sample was then weighed and digested with 10 mL of HNO3–HClO4 (4:1 v/v) solution at 160–180°C. Cd concentrations were determined by atomic absorption spectrophotometry (AAS, GBC3000).

Simple sequence repeat (SSR) marker genotyping

Genomic DNA was extracted from leaf tissue according to a previously described method (Murray and Thompson 1980). The 48 pairs of SSR markers recommended by the Ministry of Agriculture of China for the identification of rice cultivars (NYT1433-2014) were used to genotype 30 selected TGMS lines. PCR was performed using 50-μL reaction mixtures containing 2 μL of 50 ng μL−1 template DNA, 1 μL of 10 pmol μL−1 forward primer, 1 μL of 10 pmol μL−1 reverse primer, 25 μL of 2× PCR Mix (TaKaRa), and 11 μL of ddH2O. PCR amplifications were conducted using the following cycle program: an initial denaturation at 95°C for 5 min, followed by 31 cycles of denaturation at 95°C for 30 s, annealing at approximately 55°C for 30 s (depending on the Tm values of primer pairs), and extension at 72°C for 30 s, and a final extension step at 72°C for 10 min. PAGE electrophoresis and silver staining procedures were performed as described by Liu .

Population structure analysis

A total of 48 SSR primer pairs were used to analyze the population structure. When a pair of SSR primers was used to detect single loci, each polymorphic band was regarded as a unique allele. Band analysis was based on the following criteria: if two bands had different intensities, the stronger band was read; if two bands had a similar intensity, they were both retained as common alleles. If neither of these two criteria could be applied, the data was considered as being missing. On the basis of the primer amplification results, samples with clear and polymorphic bands were selected for statistical analysis. According to the size of the amplified fragments, 1 and 0 databases were established. At the same migration location, if a band was present, it was labeled 1, whereas if no band was visible, it was labeled 0. Structure 2.3 software was used to analyze the population structure (Pritchard ). Ten replicates were run according to the following procedure: the population number was set from 1 to 9, burn-in was set as 100,000, run length was set as 100,000, and a model allowing for admixture and correlated allele frequencies was adopted. The most probable for value for K (for the number of clusters or subpopulations among individual samples) was calculated according to the methods described by Evanno . The analysis results were subsequently used to correct the deviation of association analysis.

Full-length sequencing of candidate genes

The primers used for PCR were designed based on the reference sequence of the ‘Nipponbare’ variety of japonica rice in the NCBI database (Supplemental Table 1). The sequences of the Cd-related genes were amplified from the genomic DNA of the TGMS rice by PCR. High fidelity Taq DNA ploymerase (AccuPrime™, Thermal Fisher) was used for the PCR amplifications. The PCR procedures were set as follows: an initial hold at 95°C for 4 min; 32 cycles of 95°C denaturation for 30 s, 55°C~60°C annealing 30 s depending on Tm values of the primers, and 68°C elongation for 1 min, and a final extension of 10 min at 68°C. The amplified PCR products were sent for direct sequencing. In order to avoid base mismatch and amplification failure caused by the long gene sequence, we divided the candidate genes into several segments, and designed the primers according to the principle of terminal overlap (Supplemental Fig. 1). The downstream section of primer A1 of the first segment was located in the upstream section of primer S2 of the second segment, with the overlapping region between the two fragments being approximately 100 bp. Subsequently, the overlapping regions were removed and the sequence fragments were joined to obtain the complete full-length gene sequence. The full length of the Cd-related genes are defined in Table 2.
Table 2

Polymorphism of 7 low cadmium-related genes

GeneFull lengthNo. of variationsNo. of SNPNo. of INDELΠ × 103


Coding regionNoncoding regionCoding seq. regionNoncoding seq. region
OsABCG4348902571262.28
OsMTP136211311112.49
OsHMA34057169612.97
OsNRAMP1484646431113.05
OsLCD1392225121123.62
OsNRAMP5728154142114.03
OsHMA2769015437852306.67
Total3630733360208362

Π: Nucleotide diversity.

Sequence polymorphism analysis

In order to ensure the accuracy and reliability of the sequences obtained, DNAMAN 7.0 software was used to compare the measured sequences with the reference sequence and to remove redundant sequences at the ends of primers. Clustal X software was used to compare all the target sequences obtained (Thompson ). The compared sequence was saved as a PHYLIP format file, which was used for subsequent association analysis. TASSEL 5.0 software was used to analyze sequence diversity. When the LD structure was analyzed, the measured values D’ (Standardized Disequilibrium Coefficients) and R2 (Squared Allele-frequency Correlations) were selected, and the Permutations value was set to 1000 (Balding 2006).

Association analysis

TASSEL 5.0 software was used to analyze the association between low-Cd candidate genes and the low-Cd trait (Bradbury ). The results of gene sequence alignment, population structure, and low-Cd data in TGMS lines were analyzed for association using a logistic regression model. For threshold selection, a P-value of <0.05 was considered significant and P < 0.01 considered extremely significant.

Phenotypic effect of alleles

On the basis of the association analysis sites, we used the null allele method proposed by Breseghello and Sorrells (2006) to determine the phenotypic effects of each allele. The method used to calculate the phenotypic effect of allele variation is as follow: where ai represents the phenotypic effect value of the i allele variation. Xij is the phenotypic value of the material character of the j material carrying the i allele variation, and Ni is the number of materials with the i allelic variation. Nk is the phenotypic value of the k material with invalid alleles, and nk is the number of materials with invalid alleles. If the ai value is positive, it is considered that the allelic variation is a synergistic allelic variation and vice versa.

Results

Phenotypic variation in the Cd content of grains of different TGMS lines

Plants of the TGMS lines were grown in Cd-polluted soil in two successive years (2016 and 2017). At the booting stage, they were subjected to low temperature treatment (19–23°C) to reverse pollen sterility. The content of Cd in grains was determined for plants with a seed-setting rate higher than 70%. The grains were harvested and the Cd contents were determined after they had ripened. The results obtained in 2016 and 2017 years were found to be consistent. The frequency distribution of Cd contents in the 174 lines is presented in Fig. 1. The Cd content was found to vary from 0.063 mg/kg DW (dry weight) to 1.466 mg/kg DW, including 15 extremely low (<0.1 mg/kg DW), and 15 extremely high (>0.4 mg/kg DW) TGMS lines (Table 1). The average value was 0.23 mg/kg DW. Approximately 47% of the 174 lines accumulated less Cd than the accepted standard level in grains (<0.2 mg/kg DW), whereas 8% of the lines showed an extremely high grain Cd concentration (>0.4 mg/kg DW), and 45% of the lines had accumulated Cd contents of between 0.2 and 0.4 mg/kg DW. Among the TGMS lines examined, the following with grain Cd contents lower than 0.1 mg/kg DW were identified as low-Cd varieties: B8S, SD34-1S, guanzhan63-4S, B14S, T91-22S, ZiseBLO2-1S, T95-5S, SD40S, B15S, Xiangling628S, 201507S, B76S, B216S, SD39S, and SD39-1S.
Fig. 1

Frequency distribution of brown rice Cd content.

Table 1

Grain Cd contents of 30 TGMS lines for cloning and sequencing

Varieties[Cd] in grain (mg/kg DW)Average

20162017
B8S0.074 ± 0.030.052 ± 0.030.063
SD34-1S0.068 ± 0.040.060 ± 0.040.064
Guangzhan63-4S0.087 ± 0.040.051 ± 0.030.069
B14S0.085 ± 0.030.073 ± 0.040.079
ZiBL02-1S0.061 ± 0.040.097 ± 0.040.079
T91-22S0.067 ± 0.020.113 ± 0.090.090
T95-5S0.110 ± 0.040.074 ± 0.050.092
SD-40S0.120 ± 0.050.066 ± 0.040.093
B15S0.084 ± 0.040.104 ± 0.090.094
Xiangling628S0.098 ± 0.030.094 ± 0.030.096
201507S0.110 ± 0.040.086 ± 0.050.098
B76S0.084 ± 0.040.112 ± 0.060.098
SD39S0.120 ± 0.050.078 ± 0.050.099
B216S0.140 ± 0.050.060 ± 0.040.100
SD39-1S0.094 ± 0.040.106 ± 0.110.100
B206S0.388 ± 0.120.406 ± 0.220.397
B152S0.250 ± 0.090.598 ± 0.190.424
C1S10S0.480 ± 0.130.412 ± 0.160.446
21504S0.210 ± 0.080.742 ± 0.210.476
HSQ9S0.610 ± 0.210.342 ± 0.110.476
B204S0.350 ± 0.110.636 ± 0.220.493
Zhu77S0.470 ± 0.230.536 ± 0.180.503
W6154S0.580 ± 0.120.430 ± 0.260.505
B161S0.410 ± 0.140.684 ± 0.230.547
640S0.680 ± 0.160.452 ± 0.220.566
zhu25S0.610 ± 0.210.584 ± 0.320.597
B63S0.740 ± 0.220.460 ± 0.120.600
SD44S0.650 ± 0.090.856 ± 0.230.753
B73S0.920 ± 0.231.170 ± 0.331.045
C1S7S1.200 ± 0.331.732 ± 0.651.466

Population structure analysis of the TGMS lines

On the basis of the results obtained for Cd accumulation in grains, 30 TGMS lines (15 stable low-Cd and 15 high-Cd lines) were selected for further analysis. In order to make the association analysis more accurate, we initially performed a population structure analysis of the 30 TGMS lines prior to their use for candidate gene cloning. Genotype analysis was based on the use of 48 SSR markers that are evenly distributed throughout the rice genome, and which were revised and re-published by the Ministry of Agriculture of China in 2014. These markers were screened from 35 representative rice varieties with different genetic background characteristics. They cover 12 chromosomes of rice and have high polymorphism. The genetic structure of the 30 selected lines was analyzed using Structure 2.2 software (Pritchard ). The results showed that the 48 pairs of SSR primers could be used to effectively divide the examined materials into different sub-populations, and K values with an obvious inflection point were obtained. The results showed that as the model parameter K increases, the log-likelihood also showed the same tendency. Furthermore, the ΔK value was considerably higher for the model parameter K = 4 than for any other value of K (Fig. 2A), suggesting that the population studied comprised a mixture that can be divided into four subpopulations (Fig. 2B).
Fig. 2

Population structure analysis of 30 TGMS rice accessions. (A) Changes in ΔK (B) Bar plot of the genetic composition of 30 TGMS lines based on admixture model.

Analysis of the genetic diversity of the 30 TGMS lines indicated that 46 SSR loci were polymorphic, which represents a polymorphic frequency of 95.8% among the 48 SSR loci examined. A total of 140 alleles were detected using the 46 SSR markers, ranging from two to five alleles per locus, with an average of 3.04 alleles at each SSR site. The value of Shannon’s index for the population ranged from 0.0853 to 0.6926, the gene diversity index ranged from 0.0331 to 0.4995, and the allele frequency ranged from 0.0168 to 1.0. These data revealed that the TGMS lines have rich genetic diversity and were suitable for further sequence polymorphism analysis. The full sequence information of seven low Cd-related genes were obtained from the web site https://www.ncbi.nlm.nih.gov/, and the full-length sequences were cloned by PCR. Polymorphism analysis revealed a total of 333 variant sites among all the sequences, comprising 268 SNPs and 65 INDELs (Insertion-Deletion sites), most of which were located in the non-coding regions of the gene sequences (Table 2). The OsHMA2 gene was found to contain the largest number variable sites (on average, one site per 50 bp), whereas OsMTP1 has the least number (on average, one site per 278 bp). We identified 62 variable sites in the coding regions of the candidate genes, with OsHMA2 again being found to harbor the largest number of variable sites (up to 39), whereas OsNRAMP5 contains just a single variable site in its coding regions. Compared with the ‘Nipponbare’ reference sequences, we identified 30 non-synonymous SNPs spread across the OsNRAMP1, OsABCG43, OsLCD1, OsHMA3, and OsHMA2 genes The measured nucleotide diversity (II) values varied from 2.28 × 10−3 (OsABCG43) to 6.67 × 10−3 (OsHMA2).

Association analysis between gene sequence polymorphisms and the Cd accumulation trait

Analysis of the association between Cd accumulation and polymorphisms of the seven selected gene sequences from the TGMS lines was carried out using the Logistic module in TASSEL 5.0 software (Bradbury ). General linear model (GLM) data revealed that a total of 42 variants, in four of the seven genes, were significantly or extremely significantly associated with Cd accumulation (Table 3). Furthermore, we found that most of the variant sites are located in non-coding regions, with only four sites being located in coding regions (one SNP at 3751 bp in the 10th exon of OsNRAMP5, and the other three SNPs located at 450, 513, and 568 bp in the 1st exon of OsHMA2).
Table 3

Marker-trait associations with P-values of less than 0.05 by GLM analysis, and the r2 (phenotypic variance explained, PVE)

GeneSiteP-valuer2
OsHMA38400.01860.159
OsNRAMP13 sites in LDa0.00140.279
1736, 5781b0.00370.239
25730.00030.464
OsNRAMP513060.02050.226
10 sites in LDc0.00390.235
OsHMA281, 109d0.01620.238
21 sites in LDe0.02340.155
6450.03670.134

a, 3 sites in complete LD: 124, 164, 1076; b, 2 sites in complete LD: 1736, 5781; c, 10 sites in complete LD: 2267–2279, 3530, 3543–3346, 3615, 3623–3624, 3627, 3751 (10 exon), 4016, 4263, 4710–4727; d, 2 sites in complete LD: 81, 109; e, 21 sites in complete LD: 141, 143–157, 177, 180, 182, 188–189, 203, 227, 242, 245–247, 250, 256, 258–280, 301, 316, 389, 417, 425–427, 450 (1 exon), 513 (1 exon), 568 (1 exon).

, were in complete LD.

We found that one SNP detected in the 5′UTR region of OsHMA3, at base pair position 840 (C-T), was significantly associated with Cd accumulation (P < 0.05), and could explain 15.89% of the phenotypic variation. Three polymorphic sites in OsNRAMP1 were significantly related to Cd accumulation (P < 0.01), whereas three SNPs in complete LD (124, 164, and 1076 bp), two SNPs in complete LD (1736 and 5781 bp), and one INDEL located at 2573 bp in OsNRAMP1 explained 27.9%, 23.9%, and 46.4% of the phenotypic variation, respectively. In OsNRAMP5, eight SNPs and three INDELs were significantly associated with Cd accumulation (P < 0.01). One SNP at position 1306 bp explained 22.6% of phenotypic variation, where the remaining seven SNPs (3530, 3615, 3623–3624, 3627, 3751, 4016, and 4263 bp) and the three INDELs (2267–2279, 3543–3346, 4710–4727 bp) in complete LD (Table 3) explained 23.5% of the phenotypic variation. Twenty SNPs and four INDELs in the OsHMA2 genes were shown to be significantly associated with Cd accumulation in the grains of rice plants subjected to Cd pollution (P < 0.05). Two SNPs in complete LD (81 and 109 bp), and 17 SNPs and four INDELs in complete LD explained 23.8% and 15.5% of phenotypic variation, respectively. The other SNP located at 645 bp was found to explain 13.4% of the phenotypic variation in Cd accumulation.

Combination of favorable alleles in TGMS lines

On the basis of the allelic effect on Cd accumulation in brown rice, each of the identified variants was classified as either favorable or unfavorable. The phenotypic allele effect was estimated through comparison between the average phenotypic value of accessions with the specific allele and that of accessions with a “null allele”. Of the nine significantly associated loci, seven showed positive association with Cd accumulation in grain and the remaining two loci were negatively correlated (Table 4). These latter two loci in OsNRAMP1 and OsNRAMP5 showed a close association with a low-Cd phenotype and can thus be considered favorable alleles with polymorphisms for low-Cd grain. The other seven loci are associated with the accumulation of Cd in grain and can be classed as high-Cd alleles in grains. The aggregation of favorable alleles can contribute to enhancing certain favorable traits. In this regard, among the 30 TGMS lines examined, we found that B8S and B14S have two favorable alleles that synchronously contribute to low Cd accumulation. Accordingly, these two lines could be considered as low-Cd TGMS lines and used as putative resources for low-Cd breeding in the future. In contrast, the lines C1S7S and B152S harbor seven alleles that can be considered unfavorable with respect to low Cd accumulation, indicating that these lines would not be suitable as TGMS lines for low-Cd hybrid breeding (Table 4). However, it should be emphasized that these polymorphic sites might not necessarily have a direct influence on phenotypic characters, and thus our findings may simply reflect an indirect effect of strong LD with trait-linked sequence variants.
Table 4

Estimates of the allelic effect of Cd-related genes for phenotypic variation of the TGMS lines

Gene SiteAlleleAllelic effectTypical carrier variety
OsHMA3
 840C0.0
T0.493C1S7S, W6154S, B152S
OsNRAMP1
 124aC0.0
G0.53C1S7S, B63S, B152S
 1736bC0
G0.634C1S7S, B152S
 25730.0
INS T−0.303B8S, B14S
OsNRAMP5
 1306T0
DEL−0.207B8S, SD34-1S, Guangzhan63-4S, B14S, ZiBL02-1S, T95-5S, B15S, Xianglin628S, B76S, 201507S, SD39-1S, B206S, 21504S, Zhu77S, Zhu25S
C0.39C1S7S, W6154S, B152S
 2267–2279c0.0
Indel 120.945C1S7S, B152S
OsHMA2
 81dA0.0
C0.156T91-22S, SD39-1S, B216S, SD39S, HSQ9S, C1S7S, SD44S, Zhu25S, 640S, Zhu77S, B152S, B206S
 141eA0.0
T0.197SD39-1S, B216S, SD39S, HSQ9S, C1S7S, SD44S, Zhu25S, B161S, 640S, Zhu77S, B152S, B206S
 645T0.0
DEL0.157T91-22S, SD39-1S, B216S, SD39S, HSQ9S, C1S7S, SD44S, Zhu25S, B161S, 640S, Zhu77S, B152S, B206S

a, 3 sites in complete LD: 124, 164, 1076; b, 2 sites in complete LD: 1736, 5781; c, 10 sites in complete LD: 2267–2279, 3530, 3543–3346, 3615, 3623–3624, 3627, 3751 (10 exon), 4016, 4263, 4710–4727; d, 2 sites in complete LD: 81, 109; e, 21 sites in complete LD: 141, 143–157, 177, 180, 182, 188–189, 203, 227, 242, 245–247, 250, 256, 258–280, 301, 316, 389, 417, 425–427, 450 (1 exon), 513 (1 exon), 568 (1 exon).

, were in complete LD. The varieties with low Cd were underlined.

Discussion

In this study, we examined the genetic diversity of 30 TGMS lines using 48 pairs of SSR primers. A total of 140 alleles were amplified using 46 SSR markers, with two to five alleles (average of 3.04 alleles) being detected at each SSR site. In addition, the values of Shannon’s index for the population ranged from 0.0853 to 0.6926, the gene diversity index ranged from 0.0331 to 0.4995, the average Nei’s gene diversity index was 0.28, and the allelic frequency ranged from 0.0168 to 1.0. On the basis of these data, it can be seen that the genetic diversity of the 30 TGMS lines is high, genetic relationships are relatively low, and the genetic background is diverse. Hence, it is suitable for further genotyping and association analysis though the population tested is not so big. Association analysis revealed 42 variable sites within the genes OsHMA2, OsHMA3, OsNRAMP1, and OsNRAMP5 that are highly associated with Cd accumulation, of which four are located in exonic regions, and the others are located in intronic and 5′UTR regions. However, no highly Cd-associated variant sites were detected in 3′UTR regions. The four polymorphic sites located in the coding regions of OsNRAMP5 and OsHMA2 were shown to be significantly associated with Cd accumulation, of which the two polymorphic sites in OsHMA2 are non-synonymous SNPs at 450 bp and 568 bp that result in the conversion of the amino acids Arg and Ser to Gly and Ala, respectively. In both cases, a polar hydrophilic amino acid is replaced with a non-polar hydrophobic amino acid. Furthermore, the Ser residue might be a putative phosphorylation site. Accordingly, the presence of these non-synonymous SNPs may contribute to a conformation change in the OsHMA2 protein, and thereby influence its function. The other two exonic polymorphic sites are synonymous SNPs that are not associated with amino acid conversion. Noteworthy are the SNPs and INDELs detected in the 5′UTR sequences. The three INDEL sites located in the 5′UTR region of OsHMA2 (two large INDEL regions of 15 bp and 22 bp, and a small INDEL of 4 bp) were significantly associated with Cd accumulation traits. Differences in the SNPs and INDELs in gene promoter regions might be involved in gene expression regulation. For example, the intron in the Gmubi gene promoter region regulates the expression of this gene in Glycine max (De La Torre and Finer 2015). Similarly, two SNPs in the promoter region of the GS5 gene show close association with the expression levels of this gene in developing young panicles of rice to regulate the size of grain width (Xu ). We also identified certain intronic variable sites with extremely significant associations. There are 13 variable sites located in the OsNRAMP1, OsNRAMP5, and OsHMA2 genes, including two large INDEL regions of 13 bp and 17 bp in OsNRAMP5. In the past, the importance of intron sequences in the genome went unrecognized, and for a long time, introns were considered simply as “garbage” DNA. However, more recently, introns have begun to attract increasing attention and the important roles played by these regions are gradually being recognized. Introns can have a significant influence on gene expression in plants and many other eukaryotes; for example, by providing auxiliary cis-acting elements for the regulation of gene expression, and playing roles in modifying gene transcription efficiency (Rose 2004, 2008, Rose ). Analysis has indicated that introns in the 5′UTR regions of two E3 ligase genes are essential for the expression of these genes in Arabidopsis (Aguilar-Hernández and Guzmán 2013). However, whether intronic variable loci play regulatory roles in Cd-associated gene function needs further study. In this regard, it is conceivable that new markers could be developed for intronic variants to facilitate the identification of Cd-accumulating rice varieties. By determining the positive and negative effects of nine loci that we identified to be significantly associated with Cd accumulation in grain, we found that only two (in OsNRAMP1 and OsNRAMP5) of the nine loci have a negative effect on Cd accumulation, and are thereby non-conducive to Cd accumulation and can thus be considered as low-Cd markers. In contrast, the other seven polymorphic loci appear to make a positive contribution to Cd accumulation in grain and can accordingly be considered as high-Cd markers. Among the 30 TGMS lines we examined, B8S and B14S contain two sites with a negative influence on Cd accumulation and can be considered as potential low Cd-accumulating varieties. These markers can provide a basis for Cd accumulation trait identification and molecular marker-assisted breeding of rice. Using a combination of molecular marker-assisted selection and progeny identification with B8S and B14S as female parents, we believe that it will be possible to develop new varieties characterized by low Cd accumulation and other excellent agronomic traits.
  57 in total

Review 1.  The complex interplay among factors that influence allelic association.

Authors:  Krina T Zondervan; Lon R Cardon
Journal:  Nat Rev Genet       Date:  2004-02       Impact factor: 53.242

2.  Hyperthermic effects of gold nanorods on tumor cells.

Authors:  Terry B Huff; Ling Tong; Yan Zhao; Matthew N Hansen; Ji-Xin Cheng; Alexander Wei
Journal:  Nanomedicine (Lond)       Date:  2007-02       Impact factor: 5.307

3.  Genome-wide association mapping of cadmium accumulation in different organs of barley.

Authors:  Dezhi Wu; Kazuhiro Sato; Jian Feng Ma
Journal:  New Phytol       Date:  2015-06-09       Impact factor: 10.151

Review 4.  Dietary exposure to cadmium and risk of breast cancer in postmenopausal women: A systematic review and meta-analysis.

Authors:  Geneviève Van Maele-Fabry; Noömi Lombaert; Dominique Lison
Journal:  Environ Int       Date:  2015-10-18       Impact factor: 9.621

5.  Intron sequences that stimulate gene expression in Arabidopsis.

Authors:  Alan B Rose; Amanda Carter; Ian Korf; Noah Kojima
Journal:  Plant Mol Biol       Date:  2016-08-05       Impact factor: 4.076

6.  Identification of C-terminal Regions in Arabidopsis thaliana Phytochelatin Synthase 1 Specifically Involved in Activation by Arsenite.

Authors:  Shimpei Uraguchi; Yuka Sone; Yumika Ohta; Naoko Ohkama-Ohtsu; Christian Hofmann; Natalia Hess; Ryosuke Nakamura; Yasukazu Takanezawa; Stephan Clemens; Masako Kiyono
Journal:  Plant Cell Physiol       Date:  2018-03-01       Impact factor: 4.927

Review 7.  Cadmium and cancer of prostate and testis.

Authors:  Robert A Goyer; Jie Liu; Michael P Waalkes
Journal:  Biometals       Date:  2004-10       Impact factor: 2.949

8.  The OsHMA2 transporter is involved in root-to-shoot translocation of Zn and Cd in rice.

Authors:  Ryuichi Takahashi; Yasuhiro Ishimaru; Hugo Shimo; Yuko Ogo; Takeshi Senoura; Naoko K Nishizawa; Hiromi Nakanishi
Journal:  Plant Cell Environ       Date:  2012-05-22       Impact factor: 7.228

9.  Gene limiting cadmium accumulation in rice.

Authors:  Daisei Ueno; Naoki Yamaji; Izumi Kono; Chao Feng Huang; Tsuyu Ando; Masahiro Yano; Jian Feng Ma
Journal:  Proc Natl Acad Sci U S A       Date:  2010-09-07       Impact factor: 11.205

10.  A loss-of-function allele of OsHMA3 associated with high cadmium accumulation in shoots and grain of Japonica rice cultivars.

Authors:  Jiali Yan; Peitong Wang; Peng Wang; Meng Yang; Xingming Lian; Zhong Tang; Chao-Feng Huang; David E Salt; Fang Jie Zhao
Journal:  Plant Cell Environ       Date:  2016-06-16       Impact factor: 7.228

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.