Literature DB >> 26840449

Genome-Wide Detection of SNP and SV Variations to Reveal Early Ripening-Related Genes in Grape.

Yanshuai Xu1,2, Zhihong Gao1, Jianmin Tao1, Weihua Jiang3, Shijie Zhang1, Qiunan Wang3, Shenchun Qu1.   

Abstract

Early ripening in grape (Vitis vinifera L.) is a crucial agronomic trait. The fruits of the grape line 'Summer Black' (SBBM), which contains a bud mutation, can be harvested approximately one week earlier than the 'Summer Black' (SBC)control. To investigate the molecular mechanism of the bud mutation related to early ripening, we detected genome-wide genetic variations based on re-sequencing. In total, 3,692,777 single nucleotide polymorphisms (SNPs) and 81,223 structure variations (SVs) in the SBC genome and 3,823,464 SNPs and 85,801 SVs in the SBBM genome were detected compared with the reference grape sequence. Of these, 635 SBC-specific genes and 665 SBBM-specific genes were screened. Ripening and colour-associated unigenes with non-synonymous mutations (NS), SVs or frame-shift mutations (F) were analysed. The results showed that 90 unigenes in SBC, 76 unigenes in SBBM and 13 genes that mapped to large fragment indels were filtered. The expression patterns of eight genes were confirmed using quantitative reverse transcription-polymerase chain reaction (qRT-PCR).The re-sequencing data showed that 635 SBC-specific genes and 665 SBBM-specific genes associated with early ripening were screened. Among these, NCED6 expression appears to be related to NCED1 and is involved in ABA biosynthesis in grape, which might play a role in the onset of anthocyanin accumulation. The SEP and ERF genes probably play roles in ethylene response.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26840449      PMCID: PMC4740429          DOI: 10.1371/journal.pone.0147749

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Grapevine (Vitis vinifera L.) is one of most important fruit crops worldwide, as it provides fruit used for both table grapes and wine. The genome size of grapevine is about 475–500 Mb [1] and consists of 19 chromosomes (chr) with a high degree of heterozygosity [2]. Early ripening is an attractive trait for producers, especially in areas when heavy rain occurs at the time of the year when the fruits are harvested. However, the mechanism of fruit ripening in grape is not clear. Fruit ripening is affected by many factors, such as natural environmental conditions, including temperature, water, sunlight, diseases, turgor, and sugar accumulation [3]. Furthermore, plant hormones, especially ethylene (ETH) and abscisicacid (ABA) play a decisive role in regulating the growth and development of fruits [4, 5]. Plant hormones such as ETH and ABA promote ripening, whereas auxins (IAAs) inhibit berry ripening and colouring in grape [6], and the role of cytokinins (CTKs) is unclear. Böttcher et al. found that the CTK level increased during the ripening of Vitis vinifera cv. Shiraz berries [7]. On the basis of the fairly stable and low respiration rates and the low levels of ethylene in berries, grape has been defined as a non-climacteric fruit [8]. Previous research suggested that the plant hormone ABA might be involved in the regulation of non-climacteric berry ripening [9, 10], whereas the function of ETH in the ripening of grape was not very clear. Brassinosteroids (BRs) are also involved in strawberry fruit ripening [11]. In addition, anthocyanins affect the appearance of fruit, and an increasing concentrate of anthocyanin contributes to fruit maturation [12, 13]. Other significant pathways, such as the accumulation of sugar, berry softening, the catabolism of organic acids, and flavour maturation at the onset of véraison in grape are associated with fruit ripening [14]. Sequence variation and structural alteration belong to genetic variation. Normally, sequence variation involves SNPs, indels, transposable elements and microsatellites, similar to the humansickle-cell anaemia and cystic fibrosis diseases, which are caused by one or three nucleotide changes in the haemoglobin beta gene, respectively [15]. Structural variation is generally described as large scale deletions, insertions, duplications, inversions and translocations. Whole genome amplicon re-sequencing has provided much information on genetic variation and gene function in biological and medical studies, as well as in the validation and assessment of SNPs and SVs involving indels and copy number variants (CNVs) in two grape lines by comparison with the reference genome. Furthermore, robust next generation sequencing (NGS) platforms for in silico SNP identification have supplied a very effective alternative to re-sequencing for SNPs studies in plants [16]. Abundant SNPs and indel markers have become a powerful tool for many applications, including marker assisted selection, association studies, diversity analysis, genetic mapping, and the map-based cloning of genes [17-19]. The ‘Summer Black’ grape is a seedless, triploid grape cultivar with a dark, purple-black pericarp, it is a European and American hybrid, derived from the seedless grape cultivars ‘Kyoho’ and ‘Thompson’ [20]. ‘Summer Black’ possesses many advantageous horticultural traits, including a strong resistance, high yield, very early maturation, easy colouring, strong growth, and the ability to withstand storage and transportation. The bud germination rate is 85%–90%, the graft rate is 95%, and the yield is 30,600–37,500 kg•hectare-1[21] and it can be planted throughout China. We have found a precocious bud mutation in the grapevines of the ‘Summer Black’ line (SBBM). Since the fruits mature a week earlier, the SBBM cultivar might potentially be economically valuable. Given the importance of this mutation, whole genome re-sequencing was performed and simple sequence repeat (SSR) molecular markers were used, in addition to measurements of the content of hormones and anthocyanins, to assess the differences between SBBM and control ‘Summer Black’ (SBC) at the molecular and physiological levels.

Methods

Ethical Statement

This study was carried out in a private vineyard, and permission of the vineyard owner was obtained. The plant material used was a common grapevine variety and was not endangered.

Treatment of plant materials

The two grapevines (SBC and SBBM) were planted under natural illumination in Changzhou City, Jiangsu Province, China. By observing the phenotypic agronomic traits, we found that SBBM colouring and ripening occurred approximately one week earlier than in SBC. We harvested young leaves of the two cultivated grape lines to perform genome re-sequencing, and determined the DNA level of SNPs and SVs, and assessed differences in gene expression. The young leaves were stored at -70°Cimmediately, and were used to extract DNA. The grapes used to determine the hormone content were harvested on 17 June, 8 July, and 29 July. The ‘Summer Black’ grapes were harvested five, eight and eleven weeks after flowering, respectively (Fig 1B).
Fig 1

The grapes materials used in this study.

Investigation of physiological indicators and agricultural traits

The control ‘Summer Black’ grape lines are represented by CGF, BMRF represents the red grapes of the bud mutation line and BMPF represents the purple grapes of the bud mutation line (Fig 1A and Fig 2). Physiological indicators were measured at the three different grape ripeness time points in Fig 1A. The vertical diameter and width of grapes were measured using a Verniercaliper. The content of soluble solids was determined using a hand saccharimeter, the cover plate was opened and a drop of juice from a crushed grape was placed on the cleaned detection prism. Three replicate measurements were performed.
Fig 2

The differences of physiological indicators at five weeks after flowering.

The organic acid content was measured using an acid–base titration method with some modifications. Firstly, 30 skinned grapes were squeezed to obtain juice, which was then centrifuged at 4°C and 8,000 rpm for 30 min and incubated at 75–80°C in a water bath for 30 min. Subsequently, 1 mL of the supernatant was added to 30 mL distilled water and two drops of 1% phenolphthalein in a 150mL flask. Then, using NaOH, the solution was titrated until the pink colour remained for 30 s without fading, and the volume of NaOH added was recorded for three replicates. The results were calculated using the following formula: Total acidity (%) = V2×N×conversion coefficient×100/V1, where V1 represents the volume of the sample titration solution (mL), V2 represents the volume of the standard solution of sodium hydroxide consumed (mL), and N represents the molar concentration of the sodium hydroxide standard solution (mol/L), with a conversion coefficient for tartaric acid of 0.075 [22]. For the anthocyanin assay, two grape lines were ground in liquid nitrogen (by adding 1 g of grape powder to achieve mixing uniformity) and the powder was resuspended in 10 mL 1% concentrated HCl at room temperature. Ultrasonic vibration was then provided for 30 min at 100 W and 40 KHz. The brei was centrifuged for 30 min in glass tubes at 8,000 rpm and 4°C and the supernatant was decanted and another 10 mL of the solvent was added to the residue. The brei was placed in the dark, the process was repeated, and the supernatant was diluted to a final volume of 25 mL. Next, 6 mL of the supernatant and 9 mL KCl/HCl buffersolution (pH 1.0) were incubated for 50 min and 6 mL of the supernatant and 9 mL citric acid/disodium hydrogen phosphate buffersolution (pH 5.0) were incubated for 80 min in the dark. The total anthocyanin content was then measured using the mixed solvent (5 mL) by determining the absorbance at 510 mm using a Pye Unicam recording spectrophotometer. Three replicate measurements were performed [23]. The concentration of hormones was determined using an enzyme-linked immunosorbent assay (icELISA) [24]. The grape samples were ground and centrifuged at 3,500 rpm for 8 min after 4 hours at 4°C. The supernatant was filtered through a C-18 solid phase extraction column. The results were calculated using the logit curve as follows: Logit (B/B0) = ln (B/B0)/(1-B/B0) = ln B/(B0-B), where B0 is the chromogenic value at 0 ng/mL and B is the chromogenic value of the other samples. The logarithm value corresponds to the value in the logit curve. The hormone content (ng/mL) was obtained by calculating the antilog value (ng/mL). All the above data were analysed by ANOVA using IBM SPSS Statistics 19.

Genomic variation identified by SSR

The DNA of the two grape lines was extracted using the CTAB method[25]. The concentration of each DNA sample was measured using a spectrophotometer at 260 and 280 nm and each DNA sample was also quantified on an agarose gel. The 48 pairs of SSR primers were designed using Primer Premier 5 software and PCR was performed in a final volume of 20 μL. The PCR mixture was as follows: 1.5 μL (100 ng/μL) DNA template, 1 μL each primer, 2 μL 10× PCR reaction buffer (without MgCl2), 1.6 mM dNTP mix, 1.2 μL MgCl2, 0.2 μL rTaq DNA polymerase and 11.5 μL ddH2O. The PCR was carried out in a thermocycler with the following protocol: 94°C for 5 min, 35 cycles with 94°C denaturation for 50 s, annealing for 55 s, and 73°C elongation for 1 min, followed by 4°C elongation for 10 min. All PCR products were validated by analysis on a polyacrylamide gel.

DNA variation identified by whole-genome re-sequencing

Whole-genome re-sequencing of the grape lines was performed on an Illumina HiSeq2000™ by Biomarker Technologies (Beijing, China). The procedure was performed in accordance with the standard Illumina protocol, including sample preparation and sequencing as follows: quantified DNA was extracted and DNA fragments were obtained using ultrasound, which was purified using the QIA quick PCR kit. End repair was performed with poly-A on the 3´ ends, then the adaptors were ligated, clusters were generated, agarose gel electrophoresis was used to select fragments, and PCR amplification was performed. Sequencing was performed by establishing a library with Illumina HiSeq2000™. The short reads were aligned using the Burrows–Wheeler transformation [26].

SNP and SV screening

According to the results of our data and comparison with the reference genome sequence, we used SAMtools [27] software (http://samtools.souRceforge.net/Samtools.shtml) to detect SNPs by filtering the bases of low depth and low quality (minimum depth of 2×, and that for a heterozygous SNP is 3×, the maximum depth did not exceed 3× of the average depth, and the genotype quality value was not less than 20 and was not located in simple sequence repeat regions). We obtained a high reliability mismatch (mismatch of base) loci array, which indicated single molecular polymorphism (SNP) sites. Pindel [28] and BreakDancer [29] software was applied for SNP and SV detection. Based on BAM and SAM (short DNA sequence read alignments in the SAM, BAM formats) data types, Pindel (https://trac.nbic.nl/pindel/) is a software package designed specifically for bwa (Public Library of Bioinformatics) software to detect structural variations in genes. BreakDancer (http://breakdancer.sourceforge.net/breakdancermax.html) can predict six types of SV sites, including INS (insertions), DEL (deletions), IDE (insertion inside deletions), INV (inversions), ITX (intra-chromosome transfers), and CTX (inter-chromosome transfers).

DNA level functional annotation categories

The mainly functional annotation of genes compared the differences in gene expression with various annotation databases, in which unigene annotation information was represented by annotated homologous genes. The NR, SwissProt, GO, COG, and KEGG annotation databases were used to compare homologous genes in our approach. The five linkage websites are listed in S1 Table.

The expression pattern of genes related to ripening and colour assessed by qRT-PCR

Expression differences in ripening and colour relevant genes were validated using qRT-PCR. Total RNA was extracted using the CTAB method. Agarose gel electrophoresis was used to test the products, and the concentration of total RNA was measured using a micro-spectrophotometer. Reverse transcription of total RNA into cDNA was performed using a transcription kit. In the next step of the qRT-PCR experiment, the cDNA was diluted to100 ng/μL. The specific primers were designed using Beacon designer software. The 18S gene was selected as the housekeeping gene. The qRT-PCR was performed in 96well plates with a total volume of 20 μL: 10 μL SYBR Green II Master, 8.6 μL ddH2O, 1 μL (100 ng/μL) of the cDNA template, and 0.2 μL each primer. The reaction was performed on a 7300 Real-Time PCR System (Applied Biosystems, Foster City, CA). All genes were assessed in triplicate and the reactions used for qRT-PCR are listed in Table 1. Gene expression is presented as relative expression levels calculated using the 2-ΔΔXT method.
Table 1

Primers pairs used for qRT-PCR.

GeneGene IDForward primerReverse primer
CHSYGSVIVT010329680015´-ATGGTGGTGGTTGAAGTA-3´5´-GCTTGGTGAGTTGATAGTC-3´
3MATGSVIVT010004610015´-GGTGAGGATGTTGGTGAA-3´5´-GACTTGGCTGCGACTATA-3
ERF5GSVIVT010139340015´-ACCTTCCAGATTGAGTCAT-3´5´-CCTAGATCCTCTCCGATTC-3´
ERF03GSVIVT010263340015´-ACCTTCCAGATTGAGTCAT-3´5´-CCTAGATCCTCTCCGATTC-3´
AI5L2GSVIVT010332160015´-GCCTTATCTTCTCCATTGATG-35´-TGTTCTCCAGTTCATTTGTG-3
SEP1GSVIVT010081390015´-TGGGAGATTGATGGGATTT-35´-GGCTACAAGGTTTGGAAAG-3´
PT106GSVIVT010207800015´-TATGCTTGCTGCTGTTATG-3´5´-GCTCATCAATGTAAGTGGA-3´
NCED6GSVIVT010290570015´-CATATACCTGGCGATAGCA-3´5´-CTTCTCCTTCTCCTCCTTC-3´

Note: Gene function annotation. CHSY: Chalcone synthase. 3MAT: Malonyl-coenzyme, A: anthocyanin3-O-glucoside-6-O-malonyltransferase. ERF5:Ethylene-responsive transcription factor. ERF03: Ethylene-responsive transcription factor. AI5L2: ABSCISIC ACID-INSENSITIVE 5-like protein. SEP1: Developmental protein SEPALLATA1. PT106: Probable sugar phosphate/phosphate translocator. NCED6:9-cis-epoxycarotenoid dioxygenase NCED6, chloroplastic.

Note: Gene function annotation. CHSY: Chalcone synthase. 3MAT: Malonyl-coenzyme, A: anthocyanin3-O-glucoside-6-O-malonyltransferase. ERF5:Ethylene-responsive transcription factor. ERF03: Ethylene-responsive transcription factor. AI5L2: ABSCISIC ACID-INSENSITIVE 5-like protein. SEP1: Developmental protein SEPALLATA1. PT106: Probable sugar phosphate/phosphate translocator. NCED6:9-cis-epoxycarotenoid dioxygenase NCED6, chloroplastic.

Results

Variation in phenotypic agronomic traits and plant hormones between SBBM and SBC

Phenotypic observation showed that the main difference between SBBM and SBC was the ripening date, with SBBM ripening approximately one week earlier than SBC (Fig 1). Physiological indicators showed no differences in the vertical diameter or cross diameter, which were about 1.8 cm and 1.9 cm, respectively. The SBBM line had a higher single fruit weight than SBC, of about 3.7g for SBBM and 3.3g for SBC. When the pericarp colour deepened, the content of soluble solids increased, and was 6.8 brix (BX) for CGF, 9.7 BX for BMRF and 11 BX for BMPF, and the content of organic acid decreased and was 3.3% for CGF, 1.81%for BMRF and 1.47% for BMPF. The content of anthocyanin increased: -0.003 mg/g for CGF, 0.643 mg/gfor BMRF and 0.343 mg/g for BMPF (Fig 2). Because hormones regulate plant developmental processes, we measured the content of gibberellin (GA), auxin (IAA),abscisic acid (ABA) and brassinosteroid (BR) by enzyme-linked immunosorbent assays (icELISA). At the three different timepoints, the content of GA in SBC gradually decreased, whereas the concentration of GA in SBBM increased, but the content was always very low, i.e., up to 20 ng•g-1 FW (fresh weight) during the ripening process (Fig 3). The IAA level in SBC decreased gradually, and the concentration of IAA in SBBM was overall lower, but there was no obvious regular pattern in SBBM.
Fig 3

The concentration of GA, IAA, ABA, BR at five, eight and 11 weeks after flowering.

Abscisic acid plays a vital role in the maturation of grape berries. As expected, our results also showed that the endogenous ABA concentration in SBC berries decreased from the onset of ripening, but increased in SBBM from véraison to maturation. The concentration of ABA was nearly 140 ng•g-1 FW, which was the highest concentration among the four hormones. The concentration of BR in SBC and SBBM decreased and then increased. The concentration of BR was lowest compared with that of GA, IAA and ABA, i.e., less than 15 ng•g-1 FW.

SSR molecular markers

We investigated SSR molecular markers to analyse the diversity of SBBM and SBC. After polyacrylamide gel electrophoresis, bands were visualised using silver staining. Multiple band patterns were produced and the clearest polymorphisms were detected in each sample. Forty-eight pairs of SSR primers (S2 Table) yielded on average one to eleven bands; three primers yielded the minimum number of bands (1.00). The results reveal that the most reliable polymorphic bands were the same in the two samples, whereas primer 18 (F: 5´-ATTAACGAGGATGTGTTTGG-3´. R: 5´-AAGGATCCATTTCACATACG-3´), primer 27 (F: 5´-ATCTGACAAAGGAAAGGAGAA-3´, R:5´-GTAACATACCGAGGAGGCA-3´), primer 33 (F: 5´-GGTACATCAGTACTTGAAATGGTTGC-3´, R:5´-TTCTCCGTAGAAGCGTAAACAGC-3´), primer 42 (F: 5´-TTATCTGCTTAGGGAAAACGTA-3´, R:5´-AACACACCTTGAGAAAATAGCA-3´) could be clearly distinguished (Fig 4). Compared with SBBM, a band appeared for SBC at the downmost bands which were amplified by primers 18, 27, 33 and 42.
Fig 4

48 pairs of SSR were used to verify the difference between ‘SBC’ and ‘SBBM’.

SNP and SV catalogue

We performed whole genome re-sequencing to evaluate the original reads (paired-end) and compared the reads with the reference genome. High reliability SNPs (3,692,777 in SBC and 3,823,464 SBBM) were identified with SAMtools software (Table 2). The transition percentage of SBC and SBBM was 69.66% and 69.53%, respectively. We also identified homozygous and heterozygous SNPs. The results showed that the homozygous and heterozygous ratios were above 27% and 71%, respectively (Fig 5). The distribution of SNPs, type, and zygosity of variants per chromosome in the two lines is shown in Table 3. The highest SNP number of 214,459 in SBC and 222,506 in SBBM was located on chr18 and the minimum number of 10,156 SNPs in SBC and 104,890 in SBBM on chr17. The percentage conversion to the same base type ranged from 68.6% to 71.6% and the percentage of conversion to a different type of base was between 28.4% and 31.75%.
Table 2

Total number of variants, type and zygosity of variants in each genotype.

GenotypeaSNP numberbTransition percentagecTransversionpercentagedHeterozygosity percentageeHomozygositypercentagef
SBC3,692,77769.66%30.34%71.69%28.31%
SBBM3,823,46469.53%30.47%72.74%27.26%

a Sample name.

b The total number of SNPs.

C Percentage conversion into same basetype.

d Percentage conversion into different types of bases.

e Heterozygous (two or more bases exist) percentage.

f Homozygous (only one base exists) percentage.

Fig 5

Different variation occurred in each chromosome.

Table 3

Distribution of SNPs, type and zygosity of variants per chromosome in the two grape strains.

ChraSNPsbTrscTrvdHeteHomf
GenotypeSBCSBBMSBCSBBMSBCSBBMSBCSBBMSBCSBBM
Chr119877320731468.6%68.64%31.35%31.36%76.71%77.84%23.29%22.16%
Chr212595413069969.5%69.57%30.42%30.43%77.2%77.88%22.8%22.12%
Chr316911317434970.4%70.19%29.53%29.81%68.33%69.43%31.69%30.57%
Chr416531717230969.9%69.79%30.04%30.21%74.92%75.97%25.08%24.03%
Chr522520023144269.7%69.69%30.21%30.31%74.65%75.59%26.35%24.41%
Chr618482619312668.9%68.87%31.04%31.13%71.56%72.48%28.44%27.52%
Chr716118316819969.1%69.05%30.82%30.95%72.03%73.12%27.97%26.88%
Chr819636120288768.7%68.73%31.25%31.27%64.95%65.71%35.05%34.29%
Chr919625820320569.8%69.76%30.16%30.24%70.58%71.86%29.42%28.14%
Chr1014949315440569.8%69.75%30.11%30.25%66.6%67.68%33.4%32.14%
Chr1114609215208170.4%70.14%29.58%29.86%75.55%76.4%24.45%23.6%
Chr1218573319098769.4%69.35%30.57%30.65%65.96%67.16%34.04%32.84%
Chr1321482622297669.2%69.09%30.73%30.91%71.5%73.03%28.5%26.97%
Chr1420194920892669.5%69.49%30.45%30.51%73.3%74.25%26.7%25.74%
Chr1514097014611870.8%70.69%29.16%29.31%78.53%79.51%21.47%20.49%
Chr1618620619286370.0%69.98%29.91%30.02%71.43%72.56%28.57%27.44%
Chr1710156510489071.6%71.25%28.4%28.75%64.37%64.98%35.63%35.02%
Chr1821445922250668.9%68.79%31.07%31.21%63.88%64.58%36.12%35.42%
Chr1918251619106469.2%69.17%30.75%30.83%85.96%86.92%14.06%13.08%
Total3346794334703669.6%69.58%30.29%30.42%72.00%73.00%28.00%27.00%

aChromosome.

bThe total number of SNPs.

cPercentage conversion into the same basetype.

dPercentage conversion into different types of bases.

eHeterozygous (two or more bases exist) percentage.

fHomozygous (only one base exists) percentage.

a Sample name. b The total number of SNPs. C Percentage conversion into same basetype. d Percentage conversion into different types of bases. e Heterozygous (two or more bases exist) percentage. f Homozygous (only one base exists) percentage. aChromosome. bThe total number of SNPs. cPercentage conversion into the same basetype. dPercentage conversion into different types of bases. eHeterozygous (two or more bases exist) percentage. fHomozygous (only one base exists) percentage. The changes to the sequences occurred in the intergenic, exon, and intronic regions. Table 4 shows that 61.02% and 62.13% of SNPs were in the intergenic region, 5.65% and 5.46% SNPs were in the exon region, and 33.33% and 32.41% SNPs were in the intron region in SBC and SBBM, respectively. The number of SNPs was 100times higher than the number of SVs in exons and the number of SNPs in the intergenic and intronic regions was50-fold and 30-fold higher in SBC and SBBM, respectively.
Table 4

Different component statistics of SNP, SV in the genomes.

SampleTypeIntergenicExonIntron
SBCfSNP2,253,3272084971,231,188
SV36,2392,12042,864
SBBMSNP2,375,628208,6921,239,392
SV40,7912,15742,853

Numbers of SNPs and SVslocated in exon, intron and intergenic regions.

Numbers of SNPs and SVslocated in exon, intron and intergenic regions. We considered insertions, deletions, inversions, intra-chromosome transfers and inter-chromosome transfers as SVs. The distribution of SVs and the type of variants per chromosome in the two genotypes (Table 5) shows that the proportion of insertions and deletions was greater than 95%, whereas other SVs represented less than 5%. The indel size distribution and frame shift mutation in the whole genome and coding region (CDS) is shown in Fig 6. The number of single-bp (base pair) indels, two-bp indels, three-bp indels, insertions and deletions that occurred in the SBC and SBBM whole genomes was extremely similar, and were about 7,500 and 12,500, 2,500 and 4,500, and 1,500 and 2,500, respectively and those in the CDS region were about 750 and 850, 240 and 110, and 200 and 150, respectively.
Table 5

Distribution of SV and the type of variants per chromosome in the two grape lines.

ChraSV NumberbINSc(%)DELd(%)IDELe(%)INVf(%)ITXg(%)CTXh(%)
genotypeSBCSBBMSBCSBBMSBCSBBMSBCSBBMSBCSBBMSBCSBBMSBCSBBM
Chr15,0435,21160.7258.4535.9538.980.480.350.180.130.380.442.31.65
Chr22,8313,01761.1458.6735.9638.780.530.630.040.030.570.71.771.19
Chr33,0873,27658.7956.2638.140.290.710.980.10.030.750.851.551.59
Chr44,1684,42558.7156.1439.0441.790.650.540.070.110.550.450.980.97
Chr55,2395,36361.7359.4636.1738.560.590.670.10.130.610.540.80.63
Chr64,7935,19461.5358.5936.7239.890.420.520.060.120.380.250.90.64
Chr74,0064,25459.9457.3637.9440.690.60.630.10.120.650.630.770.56
Chr85,4085,79561.5457.0137.1541.570.550.570.150.160.370.40.240.29
Chr93,6793,91062.2759.2335.6138.390.630.610.220.260.951.070.330.43
Chr103,3453,49059.5255.5935.6140.570.690.430.360.290.750.493.082.64
Chr113,6463,76258.7255.6936.9440.160.80.530.140.160.580.432.833.03
Chr124,3204,50957.5255.9837.5239.590.790.60.020.041.020.933.122.86
Chr134,7395,04357.6554.7138.3440.970.610.830.080.060.80.932.512.5
Chr145,0535,27758.2456.0737.9240.380.850.760.120.150.850.722.021.91
Chr152,9132,98058.1556.5436.6339.090.790.740.270.20.650.673.52.75
Chr163,4753,75161.2956.8134.4539.30.780.690.060.130.860.882.562.19
Chr172,6082,82456.2954.0740.8742.530.50.640.120.110.460.421.762.23
Chr185,7396,46458.1854.8138.8442.390.540.510.070.120.840.791.531.38
Chr193,4093,61461.8758.8334.8538.210.790.80.150.110.880.751.471.3
Total77,50182,15959.6756.8637.0840.110.650.630.130.130.680.651.791.62

aChromosome genotype.

bNumber of SVs.

cPercentage of insertion type.

dPercentage of deletion type.

ePercentage of the deletion that contained an insertion.

fPercentage of the inversion type.

gPercentage of intra-chromosome transfer.

hPercentage of inter-chromosome transfer.

Fig 6

Diferent InDel sizes distribution and frameshift mutation between the whole genome and CDS region.

aChromosome genotype. bNumber of SVs. cPercentage of insertion type. dPercentage of deletion type. ePercentage of the deletion that contained an insertion. fPercentage of the inversion type. gPercentage of intra-chromosome transfer. hPercentage of inter-chromosome transfer. The gene: SNP (non-synonymous) ratio in SBC and SBBM (Table 6) varied from21,251 to 118,778 and from 21,235 to 118,643, respectively, whereas the gene: SNP (≥10 non-synonymous) ratio was between 3,332 to 53,295 and 3,298 to 53,051, respectively. The gene: indel (with frame-shift) ratio was 630 to 661 and 626 to 650, respectively, whereas the gene: SV (with SVs) ratio was 1,922 to 2,120 and 1,931 to 2,157, respectively.
Table 6

Genotype-specific statistical differences between the various categories of gene mutation.

SampleGenes:SNPsa(with non-synonymous)Genes:SNPsb(with(> = 10) non-synonymous)Genes:InDelsc(with frame-shift)Genes:SVsd(withSVs)
SBC21,251:118,7783,332:53,295630:6611,922:2,120
SBBM21,235:118,6433,298:53,051626:6501,931:2,157

aNumber of genes with non-synonymous mutations: Number of the corresponding SNPs.

bNumber of SNPs with non-synonymous mutations where the number of SNPs is greater than 10: the number of the corresponding SNPs.

cThe number of genes containing frame shift mutations (InDels): the number of the corresponding InDels.

dThe number of genes containing variants of SVs: the corresponding number of SVs.

aNumber of genes with non-synonymous mutations: Number of the corresponding SNPs. bNumber of SNPs with non-synonymous mutations where the number of SNPs is greater than 10: the number of the corresponding SNPs. cThe number of genes containing frame shift mutations (InDels): the number of the corresponding InDels. dThe number of genes containing variants of SVs: the corresponding number of SVs. We consider between 20,000 and 25,000 bp per SNP to be ahigh-density SNP region. Compared with the reference genome, many high-density SNP regions were found on different chromosomes (Table 7). Chr17 (six regions) and chr2 (four regions) had more high-density SNP regions than the other chromosomes. Twelve high-density SNP regions and 15 low-density SNP regions were detected in SBC, whereas 10 high-density SNP regions and 14 low-density SNP regions were found in SBBM. In addition, SBC had two high-density SNP regions and one low-density SNP region that SBBM did not possess.
Table 7

Recognizable high- or low-density SNP regions compared between SBC and SBBM.

SBCSBBM
ChromosomeRegionaSNPnumberbHighorlowcChromosomeRegionaSNPnumberbHigh or lowc
chr27,280,0017,660,00016Hchr27,760,0018,560,00028H
chr29,120,00110,500,00059Hchr29,120,00110,080,00048H
chr211,120,00111,880,00036Lchr211,120,00111,880,00038L
chr214,200,00114,540,00017Lchr214,200,00114,540,00017L
chr32,820,0013,180,00017Lchr32,840,0013,180,00016L
chr410,120,00110,600,00020Hchr410,160,00110,620,00019H
chr71,500,0013,300,00089Lchr71,500,0013,300,00088L
chr718,460,00118,880,00019Hchr718,440,00118,880,00019H
chr914,560,00114,960,00020Lchr914,540,00114,960,00021L
chr1411,800,00112,660,00043LChr1411,800,00112,660,00043L
chr1419,020,00119,660,00029LChr1419,020,00119,640,00027L
chr1427,000,00127,400,00018Hchr1427,000,00127,400,00019H
chr156,140,0017,100,00034HChr156,140,0017,100,00039H
chr157,620,0017,960,00017LChr157,620,0017,960,00017L
chr1610,100,00110,540,00022HChr1610,180,00110,540,00018H
chr1615,060,00115,400,00017LChr1615,060,00115,400,00017L
chr1619,660,00120,500,00039LChr1619,660,00120,500,00034L
chr17200,001680,00024LChr17200,001680,00024L
chr174,900,0015,320,00021LChr174,920,0015,320,00020L
chr177,120,0017,520,00020LChr177,100,0017,520,00021L
chr1711,800,00112,760,00034HChr1712,020,00112,760,00027H
chr1713,040,00113,520,00024HChr1713,080,00113,520,00022H
chr1714,780,00115,340,00027HChr1714,780,00115,340,00027H
chr1914,060,00114,520,00022LChr1914,080,00114,560,00024L
chr513,340,00113,660,00016H
chr117,060,0017,400,00017L
chr1815,780,00116,100,00016H

aRegion on chromosome.

bSNP number on chromosome region.

cHigh or low SNP compared with the reference genome.

aRegion on chromosome. bSNP number on chromosome region. cHigh or low SNP compared with the reference genome.

Gene categories, functional annotation and differences in genes

Non-synonymous (NS) SNP mutations and frameshifts (F) caused by missing bases not in a multiple of three bases, as well as SVs caused by indels, transversions, and transitions might influence the expression of the relevant protein. All tested and functional annotated genes were classified into gene ontology (GO) categories (Fig 7), which mainly divided the genes into three categories: cellular components, molecular function, and biological process. The GO enrichment classification suggested that the genes from the cellular component, molecular function and biological process categories could be divided into 18, 18 and 26 groups, respectively.
Fig 7

GO classify for differential UniGenes between ‘SBC’ (up) and ‘SBBM’(down).

The genes annotated by BLASTx, with a threshold of 10-5, were recorded in four public databases including the NCBI non-redundant database (NR), the Swissprot protein database, the Clusters of Orthologous Groups of proteins database (COG) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) (S1 Table). A total of 23,405 genes in SBC and 23,431 genes in SBBM were functionally annotated (Fig 8). All the genes were recorded in the NR database, and the genes annotated in the four databases were quantitatively similar. 5,956 genes in SBC and 5,890 genes in SBBM were annotated by all four databases.
Fig 8

Homology analyze of ‘SBC’ (A) and ‘SBBM’ (B) relative to reference genome.

Compared with the reference genome, different homologous unigenes associated with ripening and colour was screened in the two cultivated lines. Thirteen proteins associated with ripening and colour were detected as S/NS, SV or F, these genes encoded MYB, bHLH or WD40 proteins or were involved in ETH, anthocyanidin, ABA, CTK, auxin, GA, BR, sugar, polyamine, and salicylic acid pathways. Ninety unigenes in SBC and 76 unigenes in SBBM were detected as S/NS, SV or F (S3 Table). Among these, Myb-related protein 3R-1 (GSVIVT01011986001) and Myb-related protein 306 (GSVIVT01008005001) occurred as S/NS only in SBBM. In total, 635 SBC-specific genes and 635 SBBM-specific genes were screened. Thirteen genes related to ripening, i.e., CHSY, MYBF, 3MAT, ERF5, ERF03, AI5L2, SEP1, STC, ERDL5, PT106, AX10A, AXX15, and NCED6 (Table 8) occurred with a large Indel fragment. Among these, differences in expression for eight genes were validated by qRT-PCR (Fig 9).
Table 8

Thirteen large-fragment InDels in candidate genes related to ripening.

Annotated proteinaGene & protein IDbGene ID &ChromosomecChange-SBCdChange-SBBMe
NSNS/Sv
Chalcone synthasesp|P51090|CHSY_VITVIGSVIVT01032968001 in chr1424,673,846 T AA24,673,846 T AA
halcone synthasesp|P51090|CHSY_VITVIGSVIVT01032968001 in chr1424,673,881 A CC24,673,881A CC
Chalcone synthasesp|P51090|CHSY_VITVIGSVIVT01032968001 in chr1424,673,978 C TT24,673,978 C TT
Chalcone synthasesp|P51090|CHSY_VITVIGSVIVT01032968001 in chr1424,674,524 T CC24,674,194 T CY
Chalcone synthasesp|P51090|CHSY_VITVIGSVIVT01032968001 in chr1424,674,524 T CC
Chalcone synthasesp|P51090|CHSY_VITVIGSVIVT01032968001 in chr1424,675,008 C TT
Chalcone synthasesp|P51090|CHSY_VITVIGSVIVT01032968001 in chr14DEL: 13,541 from 24,674,193 to 24,687,822
NS/Sv/FNS
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13838,415 C GG838,867 C GG
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13838,558 C AM838,900 T AW
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13838,867 C GG839,256 T CC
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13838,900 T AW839,647 T CY
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13839,256 T CC844,828 T CC
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13839,647 T CY844,891T CY
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13839,929 G AR
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13840,005 C TY
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13840,080 C AM
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13844,891T CY
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13DEL4: from 838,413 to 838,420
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13INS 2: from 84,0284to84,0286
Myb-family transcription factorsp|Q700D9|MYF _ARATHGSVIVT01032776001 in chr13INS 1: from 84,4790to 84,4799
NSNS/Sv
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,202,818 A TT8,202,818 A TT
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,202,844 T CY8,203,022 G AA
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,203,022 G AA8,203,027 A CM
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,203,027 A CM8,203,486 C TT
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,203,207 A TW8,210,389 G CS
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,203,216 A TW8,210,633 A GR
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,203,346 C TY
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,203,486 C TT
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,210,571 T CY
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr128,210,633 A GR
Malonyl-coenzyme Asp|Q8GSN8|3MAT_DAHPIGSVIVT01000461001 in chr12DEL25,483: from 829,838 to 8,235,429
NSNS/Sv
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,012,358 G AR7,011,626 T CC
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,012,365 A GR7,012,358 G AR
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,012,405 T GK7,025,216 G CS
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,020,752 G CS7,025,318 C TT
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,025,275 G AA7,025,356 T CY
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,025,356 T CY7,025,450 C TY
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,025,751 C TY7,025,692 A GR
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,025,772 A CC
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr167,025,782 G CC
Ethylene-responsive transcription factorsp|Q40478|ERF5_TOBACGSVIVT01013934001 in chr16DEL8,818: from 7,011,486 to 7,020,308
Sv
Ethylene-responsive transcription factorsp|Q94AW5|ERF03_ARATHGSVIVT01026334001 in chr4DEL12,460: from 14,176,993 to 14,189,460
Sv
ABSCISIC ACID-INSENSITIVE 5-like proteinsp|Q9LES3|AI5L2_ARATHGSVIVT01033216001 in chr4DEL7,576:from 9,840,430 to 9,847,952
SS/Sv/F
Developmental protein SEPALLATA 1sp|P29382|SEP1_ARATHGSVIVT01008139001 in chr17DEL4: from 5,464,406 to 5,464,422
NSNS/Sv
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,932,262 A GG21,932,208 C TY
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,932,385 T CY21,932,226 G AR
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,932,496 C TY21,932,253 A GR
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,939,640 A GR21,932,262 A GG
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,939,792 T GK21,932,389 A CM
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,940,202 T GK21,932,496 C TY
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,940,212 C AM21,933,149 C AA
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,940,573 T CY21,939,727 G AR
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,940,659 A CC21,939,792 T GK
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,940,702 G CS21,940,059 A CM
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,941,645 C TY21,940,118 G TK
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,941,697 C TY21,940,202 T GK
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,941,924 A GR21,940,212 C AM
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,941,948 A GR21,940,573 T CY
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,942,036 C AM21,940,659 A CC
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr1421,942,092 A TW21,940,702 G CS
Sugar carrier protein Csp|Q41144|STC_RICCOGSVIVT01036440001 in chr14DEL3,328: from 21,935,542 to 21,938,876
NS/SvNS
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr144,161,687 A GR4,161,760 G AR
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr144,161,760 G AR4,161,928 G CS
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr144,161,906 T CY4,161,979 C TT
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr144,161,928 G CS4,162,107 C TT
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr144,161,979 C TT4,164,395 C TT
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr144,162,107 C TT4,164,776 T GG
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr144,164,823 G AR
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr144,164,829 G CC
Sugar transportersp|Q3ECP7|ERDL5_ARATHGSVIVT01022025001 in chr14DEL1,903: from 4,164,061 to 4,165,992
NSNS/Sv
sugar phosphate/phosphate translocatorsp|Q8H184|PT106_ARATHGSVIVT01020780001 in chr122,066,209 A CM2,066,209 A CM
sugar phosphate/phosphate translocatorsp|Q8H184|PT106_ARATHGSVIVT01020780001 in chr122,068,143 A TT2,066,236 C TY
sugar phosphate/phosphate translocatorsp|Q8H184|PT106_ARATHGSVIVT01020780001 in chr122,068,152 C TY2,068,143 A TT
sugar phosphate/phosphate translocatorsp|Q8H184|PT106_ARATHGSVIVT01020780001 in chr122,071,379 C TY2,068,152 C TY
sugar phosphate/phosphate translocatorsp|Q8H184|PT106_ARATHGSVIVT01020780001 in chr122,071,382T CY2,071,379 C TY
sugar phosphate/phosphate translocatorsp|Q8H184|PT106_ARATHGSVIVT01020780001 in chr122,071,382 T CY
sugar phosphate/phosphate translocatorsp|Q8H184|PT106_ARATHGSVIVT01020780001 in chr12DEL17,541: from 2,095,192 to 2,112,841
NSNS/Sv
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr32,525,484 A TW2,525,618 C AM
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr32,525,502 T AW2,525,636 G TT
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr32,525,618 C AM2,525,670 T CC
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr32,525,636 G TT2,525,815 C GS
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr32,525,670 T CC2,525,831 T AW
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr32,525,815 C GS2,525,874 T CY
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr32,525,831 T AW
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr32,525,874 T CY
Auxin-induced proteinsp|P33080|AX10A_SOYBNGSVIVT01023911001 in chr3DEL7,836: from 2,525,478 to 2,533,405
NSNS/Sv
.Auxin-induced proteinsp|P33082|AXX15_SOYBNGSVIVT01024129001 in chr3888,891 C TY888,891 C TY
Auxin-induced proteinp|P33082|AXX15_SOYBNGSVIVT01024129001 in chr3888,934 G TK
Auxin-induced proteinp|P33082|AXX15_SOYBNGSVIVT01024129001 in chr3DEL9,447: from 886,035 to 895,516
NSNS/Sv
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,589,359 G CC11,589,359 G CC
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,589,455 T GK11,589,444 T GK
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,589,539 T GG11,589,455 T GK
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,589,680 C TT11,589,680 C TX
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,589,818 T GG11,589,818 T GG
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,589,955 T AA11,589,955 T AA
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,589,956 T CY11,589,956 T CY
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,590,009 G TK11,590,009 G TK
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,590,035 C AM11,590,283 G AR
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,590,697 C TY11,590,829 C TT
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,590,698 G AR
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr511,590,829 C TT
9-cis-epoxycarotenoid dioxygenasesp|Q9LRM7|NCED6_ARATHGSVIVT01029057001 in chr5DEL15: from 11,591,007 to11,591,031

aThe functional annotation of genes.

bGenes and proteins ID.

cGene ID and chromosome position.

dChange occurred inSBC.

eChange occurred inSBBM. S/NS/Sv/F represents synonymous/non-synonymous/structure variation/frameshift variation.

Fig 9

Flowchart for finding the causative different genes for ripening and coloring.

aThe functional annotation of genes. bGenes and proteins ID. cGene ID and chromosome position. dChange occurred inSBC. eChange occurred inSBBM. S/NS/Sv/F represents synonymous/non-synonymous/structure variation/frameshift variation.

Genes on the transcriptional level validated by qRT-PCR

Eight genes that correlated with ripening and colour were detected and qRT-PCR was performed to validate their relative expression (Fig 10). The eight genes were CHSY, 3MAT, ERF5, ERF03, AI5L2, SEP1, PT106, and NCED6.
Fig 10

The expression of eight genes that potentially relate to SBBM grape ripening via qRT-PCR.

The results showed that the expression of CHSY, 3MAT, ERF5, ERF03, AI5L2, SEP1, and PT106 was up-regulated and the expression of SEP1 was down-regulated during véraison compared to during the green period (period of fruit inflation).

Discussion

Sequence variation is commonly reflected by SNPs and SVs and small variations give rise to huge variations in horticultural traits in plants. Combined with a considerable number of SNPs and SVs in genes, proteins will be a major focus in future molecular biology-assisted breeding programs.

SNP and SV analysis

The results here show that the ratios of SNPs identified in intergenic, exonic, or intronic regions were 61.0% and 62.1%, 5.6% and 5.5%, 33.4% and32.4%, in SBC and SBBM, respectively. Compared with Arabidopsis, rice [30, 31], and sorghum [32], the intergenic regions of grape genes contain more SNPs. Furthermore, compared with the reference genome, SBBM had more SNPs, transversions (Trvs) and heterozygous genes than SBC on each chromosome. Moreover, the several high- or low-density SNP regions distributed on the SBC and SBBM chromosomes will be useful for further mapping, particularly to identify markers that are closely linked to QTLs. Similarly, some regions, such as the region between 13.34 and 13.66 Mb on chromosome 5 (16 SNPs) and the region between 15.78 and 16.10 Mb on chr18 (16 SNPs), had a higher SNP density only in SBC. The region between 7.06 and 7.40 Mb on chr11 (17 SNPs) had a higher SNP density in SBBM than in SBC. In contrast with in other species, the number of SNPs per 0.1 Mb region was lower in our two grape culti-species, as beans contain about 650 SNPs per Mb in [33]. Moreover, some chromosomes did not have high-density SNP regions, i.e., chr1, chr6, chr8, chr9, chr10, chr12, and chr13 in SBC, and chr1, chr5, chr6, chr8, chr9, chr10, chr11, chr12, chr13, and chr18 in SBBM. Research into grape berry colour has identified that the berry colour locus is a cluster of Myb and Myb-like genes within a 200-kb region located on chr2 [34]. Compared with the reference genome, one high-density SNP region was present on chr2. These results might be related to early colouring and early ripening in SBBM. The distribution of indel sizes shows that there were more insertions than deletions in the two genomes compared with the reference genome. Moreover, single-base indels accounted for a larger proportion than two- and three-base indels in the two cultivars. Interestingly, three-base indels in the CDS were present at a higher frequency than two-base indels. The GO enrichment analysis provided 62 categories in cellular components, molecular functions and biological processes that were significantly overrepresented. Notably, genes related to pigmentation, growth, and carbon-use, as well as the response to stimuli was found within the biological processes. Unigene homology analysis suggested that all the annotated unigenes were present in the NR database, followed by the Swissprot database, although the Swissprot database was more reliable, due to its manually entered homology unigenes. In total, 635 unigenes in SBC and 665 unigenes in SBBM were screened. Among these, we filtered 90 unigenes in SBC and 76 unigenes in SBBM that were related to colouring and ripening. Changes (NS, SV or Fin SBC and SBBM) were found in the unigenes for ABA (23 and 23), ETH (8 and 7), BRs (11 and 5), GAs (6 and 3), sugar (5 and 1), IAAs (14 and 12), CTKs (4 and 3), anthocyanidins (1 and 1), MYB (1 and 3), bHLH (1 and 1), WD40 (1 and 2), salicylic acid (13 and 12) and polyamine (2 and 4), respectively (S3 Table). These results might be mainly related to colour, early ripening, and disease resistance compared with the ‘Pinot noir’ reference genome, which shows the agronomic traits of lighter colouration, mid-maturation and weak disease resistance in Europe and America [35]. In addition to the functionally annotated genes, many new unnamed unigenes were detected that require further exploration to determine gene function.

The NCED6 gene appears to be involved in ABA biosynthesis and ABA might promote anthocyanin accumulation

The ABA concentration results show that the content of ABA increased at about the time of véraison [36]. Previous studies have shown that the content of ABA increases at the start of sugar accumulation and then reaches a peak two to three weeks later, after which the ABA content declines as the fruit approaches ripeness [37,38]. Zhang et al. found that the ABA content in grapes gradually increases at the onset of ripening and reaches the highest level at about 20 days before the harvest period [39]. We found that the endogenous ABA concentration in SBC berries decreased from the onset of ripening, but increased in SBBM from véraison to maturation (Fig 3). These observations agree well with those in the abovementioned studies and suggest that the concentrations of ABA lead to early ripening. Based on the physiological indices of the ABA content, gene expression led to a similar conclusion. A key enzyme in ABA biosynthesis in plants is NCED. Soar [40] found that the profile of ABA accumulation appears to roughly correlate with the expression of the VvNCED1 gene, and Mei et al. [41] subsequently found that VvNCED1 initiates ABA biosynthesis at the onset of fruit ripening. Also, during tomato fruit ripening, SlNCED1 has been implicated as a key gene involved in ABA metabolism [42]. The GlNCED1 gene is closely related to Arabidopsis thaliana NCED6 [43], which is involved in ABA biosynthesis in vivo [44]. The expression of NECD6 gene at véraison was lower than period of inflation in our experiment. This result suggest that NCED6 expression roughly like correlates with NCED1expression and is involved in ABA biosynthesis in grape, as the expression level is higher at the onset of fruit ripening, and is then down-regulated at véraison. Previous studies have shown that anthocyanin accumulation increases with colour deepened, although there is little effect of ABA on sugar accumulation [45, 46]. The effect of genes related to ABA biosynthesis in grapes has been described in some detail [47]. We have shown that the ABA level increased at the onset of ripening after decreasing at the beginning of grape maturation (Fig 3), and the qRT-PCR results (Fig 10) show that NCED6 expression was higher at the onset of ripening. Thus, ABA might promote anthocyanin accumulation.

ERF-related genes might regulate early berry ripening in SBBM

Ethylene response factor transcription factors might affect fruit ripening in climacteric fruits [48-50]. Similarly, it has been shown that ethylene is required for fruit ripening in strawberry [48]. As early as the 1970s, it was shown that ethylene levels do not increase in grape berries at véraison [49, 50]. On the contrary, a modest peak in ethylene was observed seven weeks after flowering, accompanied by a higher transcript level of 1-aminocyclopropane-1-carboxylic acid oxidase (ACO) and enzyme activity at berry colouration peaks [51]. These results support the view that at, or just prior to véraison, a modest increase in ethylene biosynthesis occurs. For example, several peaks in ethylene levels are observed in litchi at different stages of development [52], and increases in ethylene levels have also been reported in other non-climacteric fruit, e.g., strawberries [53]. ERF5is a transcriptional activator in Arabidopsis, whereas ERF3 is an active repressor in tobacco [54]. The ERF proteins have been implicated in ethylene synthesis [55] and the expression of ERF3 and ERF5correspond with ethylene production. In this study, although the expression of ERF3 was extremely lower than ERF5 gene, both ERF3 and ERF5 expression increased at the onset of ripening, and therefore, might promote early berry ripening in SBBM. The recombinant enzymatic profiles of Dv3MaTare closely associated with native anthocyanin malonyltransferase activity in dahlia flowers [56], as the Dv3MaT transcript is abundant in red stems and red petals. Similarly, the Dv3MaT transcript was abundant during véraison, and was expressed at a lower level in the period of inflationin SBBM. Our results shows 3MaTgene expressed at a lower level from period of inflation to véraison, but expressed increasing (Fig 10). Therefore, the role of 3MaTgene might not only function in pigment stability in the flowers, but also in grape berries. The function of PT106 gene is probable sugar phosphate/phosphate translocator, and we have measured the soluble solids were less than 10 BX from period of inflation to period of véraison, according with the expressed of PT106 gene showed very low level from period of inflation to period of véraison. The function of AI5L2 gene is abscisic acid-insensitive 5-like protein, the expression of AI5L2 gene was increased at a low level, with concentrate of ABA increasing from period of inflation to period of véraison.

The SEPALLATA (SEP) gene probably affects grape ripening via ethylene response

The CaMADS-RIN gene in pepper belongs to the SEP clade and affects fruit ripening both in ethylene-independent and ethylene-dependent pathways in tomato [57]. Previous research has suggested that the SlMADS-RIN gene is bound to the cis-element of ACS2, which controls biosynthesis of ethylene[58]. In apple trees that showed suppressed expression of MADS8/9, which belongs to the SEPALLATA1/2 (SEP1/2) class, severely reduced ripening and a low ethylene content were observed [59]. Seymour et al. [60] demonstrated that silencing a SEP1/2-like (FaMADS9) gene in strawberry led to suppressed ripening in achene and receptacle tissue. It can therefore be concluded that SEP genes play a key role in the regulation of ripening in both climacteric and non-climacteric fruits. In this study, compared to the locus in the reference genome, the SEPALLATA1 gene in SBC contained a synonymous mutation, whereas S/SV/F variation was present at the locus in SBBM, in which four bases were deleted (Table 8). The expression level of SEP1 (GSVIVT01008139001) was severely reduced when the pericarp darkened in SBBM, whereas expression of the ERF3 and ERF5 genes increased when the colour deepened. Therefore, we concluded that SEP probably down-regulated grape ripening via ethylene signaling or response.

The anthocyanin content and CHS gene expression are possibly associated with early berry ripening

Anthocyanin accumulation is considered to contribute to the colouration of red and black grapes, this has been studied extensively in grapes, and anthocyanins are usually located in the pericarp of the berry. As the concentration of anthocyanins increases, the berry colour deepens. Measurements of the final anthocyanin content of the three different colours of grape varied considerably in this study: CGF contained -0.003 ng•g-1FW anthocyanin, whereas BMRF contained 0.06433 ng•g-1FWand BMPF contained 0.34333 ng•g-1 FW(Fig 2). During berry colouring and ripening, when the content of anthocyanins and soluble solids increased, the organic acid content decreased. The anthocyanin concentration also showed significant differences in all three types of grape. Genes encoding phenylalanine ammonialyase (PAL), chalcone synthase (CHS), chalcone isomerase (CHI), flavanone 3-hydroxylase (F3H), dihydroflavonol 4-reductase (DFR), leucoanthocyanidin dioxygenase (LDOX), and flavonoid 3-O-glucosyltransferase (UFGT) and many other key genes are involved in the anthocyanin biosynthesis pathway [61]. Following isogene functional annotation and filtering, the CHS gene was screened and in SBBM, was found to contain a NS/SV deletion of 13,541 bases from 24,674,193 to 24,687,822 on chr14, whereas in SBC, the locus was NS compared with the reference genome. Previous research has suggested that the expression of CHS3 correlates with anthocyanin accumulation in berry skins [62]. In Camellia chekiangoleosa, the expression of CHS3 genes involved in anthocyanin biosynthesis was found, using transcriptome deep sequencing, to increase during véraison, after initially decreasing [63]. This observation agrees well with our results for grape berries. In addition, genes encoding WD40repeat proteins regulate anthocyanin biosynthesis during fruit development in red pear [64]. The MYB–bHLH–WD40 (MBW) complex consists of different bHLH and MYB transcription factors, as well as the WD40 factor TTG1, which activates anthocyanin synthesis in vegetative tissues and Arabidopsis seeds [65]. The MYB114 transcription factor participates in the MBW complex, and various MBW complexes can be formed by the interaction between MYB114, and TTG1 and others, as shown by yeast two-hybrid screens [65]. S3 Table shows the transcription factor MYB114 (GSVIVT01022657001) in SBC and the transcription factor MYB114 (GSVIVT01022661001) in SBBM, which were screened and found to possess S/NS variations. Our data suggest that a large number of genes possessed S/NS/SV/F variations compared with the reference genome. Among these, genes associated with hormones, sugar, anthocyanidins, polyamine, and other factors were filtered. In addition to hormones, sugars and anthocyanidins, polyamine is also related to fruit ripening. The specific relationships among these homologous genes with fruit ripening need to be further researched.

Conclusions

Based on whole genome re-sequence analysis, 3,692,777 SNPs and 81,223 SVs in SBC and 3,823,464 SNPs and 85,801 SVs in SBBM were screened and found to distinguish these genomes from the reference genome. We found 166 homologous unigenes related to ripening and colour in SBC and SBBM. Compared with the reference genome, the number of genes in the two lines (SBC and SBBM, respectively) that possessed NS, SVs or F was as follows: ABA (1,921 and 1,922), ETH (1,294 and 1,302), BRs (733 and 724), GAs (533 and 520), sugar (601 and 601), IAAs (1,397 and 1,405), CTKs (425 and 420), anthocyanidins (223 and 221), MYB (338 and 344), bHLH (122 and 116), WD40 (205 and 207), salicylic acid (1,504 and 1,504), and polyamine (269 and 271). The CHS (GSVIVT01032968001) gene was potentially related to early berry ripening and the ERF genes (GSVIVT01013934001 and GSVIVT01026334001) might regulate early berry ripening in SBBM. The SEP gene (GSVIVT01008139001) probably functions in grape ripening via ethylene response. The expression of NCED6 (GSVIVT01029057001) appeared to roughly correlate with NCED1 expression and might be involved in ABA biosynthesis in grape, because ABA promotes anthocyanin accumulation.

Additional Information

Accession codes: The two raw sequencing data of this paper have been deposited at SRA under accession SRP067230, the BioProject is PRJNA304490. The data for ‘Summer Black’ grape have been deposited at BioSample under accession SAMN04299023. The data for ‘Summer Black’ bud mutation grape have been deposited at BioSample under accession SAMN04299024.

Databases and linkage websites.

(DOCX) Click here for additional data file.

The 48 pairs of primers used to confirm differences in expression between SBC and SBBM.

(DOCX) Click here for additional data file.

Different uingenes associated with ripening and changes in colour in two cultivated species compared with the reference, respectively: 90 uingenes for SBC and 76 uingenes for SBBM.

(DOCX) Click here for additional data file.
  40 in total

1.  Comparison between conventional indirect competitive enzyme-linked immunosorbent assay (icELISA) and simplified icELISA for small molecules.

Authors:  Jing Zhao; Gang Li; Guo-Xiang Yi; Bao-Min Wang; Ai-Xing Deng; Tie-Gui Nan; Zhao-Hu Li; Qing X Li
Journal:  Anal Chim Acta       Date:  2006-04-29       Impact factor: 6.558

2.  Grapes on steroids. Brassinosteroids are involved in grape berry ripening.

Authors:  Gregory M Symons; Christopher Davies; Yuri Shavrukov; Ian B Dry; James B Reid; Mark R Thomas
Journal:  Plant Physiol       Date:  2005-12-16       Impact factor: 8.340

3.  Anthocyanins, phenolics, and color of Cabernet Franc, Merlot, and Pinot Noir wines from British Columbia.

Authors:  G Mazza; L Fukumoto; P Delaquis; B Girard; B Ewert
Journal:  J Agric Food Chem       Date:  1999-10       Impact factor: 5.279

4.  Genomewide SNP variation reveals relationships among landraces and modern varieties of rice.

Authors:  Kenneth L McNally; Kevin L Childs; Regina Bohnert; Rebecca M Davidson; Keyan Zhao; Victor J Ulat; Georg Zeller; Richard M Clark; Douglas R Hoen; Thomas E Bureau; Renee Stokowski; Dennis G Ballinger; Kelly A Frazer; David R Cox; Badri Padhukasahasram; Carlos D Bustamante; Detlef Weigel; David J Mackill; Richard M Bruskiewich; Gunnar Rätsch; C Robin Buell; Hei Leung; Jan E Leach
Journal:  Proc Natl Acad Sci U S A       Date:  2009-07-13       Impact factor: 11.205

5.  Deep sequencing of the Camellia chekiangoleosa transcriptome revealed candidate genes for anthocyanin biosynthesis.

Authors:  Zhong-Wei Wang; Cong Jiang; Qiang Wen; Na Wang; Yuan-Yuan Tao; Li-An Xu
Journal:  Gene       Date:  2014-01-21       Impact factor: 3.688

6.  Genetic structure and differentiation in cultivated grape, Vitis vinifera L.

Authors:  Mallikarjuna K Aradhya; Gerald S Dangl; Bernard H Prins; Jean-Michel Boursiquot; M Andrew Walker; Carole P Meredith; Charles J Simon
Journal:  Genet Res       Date:  2003-06       Impact factor: 1.588

7.  Cloning and functional analysis of 9-cis-epoxycarotenoid dioxygenase (NCED) genes encoding a key enzyme during abscisic acid biosynthesis from peach and grape fruits.

Authors:  Mei Zhang; Ping Leng; Guanglian Zhang; Xiangxin Li
Journal:  J Plant Physiol       Date:  2009-03-23       Impact factor: 3.549

8.  Genome-wide patterns of genetic variation in sweet and grain sorghum (Sorghum bicolor).

Authors:  Lei-Ying Zheng; Xiao-Sen Guo; Bing He; Lian-Jun Sun; Yao Peng; Shan-Shan Dong; Teng-Fei Liu; Shuye Jiang; Srinivasan Ramachandran; Chun-Ming Liu; Hai-Chun Jing
Journal:  Genome Biol       Date:  2011-11-21       Impact factor: 13.583

9.  Single strand conformation polymorphism based SNP and Indel markers for genetic mapping and synteny analysis of common bean (Phaseolus vulgaris L.).

Authors:  Carlos H Galeano; Andrea C Fernández; Marcela Gómez; Matthew W Blair
Journal:  BMC Genomics       Date:  2009-12-23       Impact factor: 3.969

10.  Genomic variants of genes associated with three horticultural traits in apple revealed by genome re-sequencing.

Authors:  Shijie Zhang; Weiping Chen; Lu Xin; Zhihong Gao; Yingjun Hou; Xinyi Yu; Zhen Zhang; Shenchun Qu
Journal:  Hortic Res       Date:  2014-10-01       Impact factor: 6.793

View more
  7 in total

1.  Catastrophic Unbalanced Genome Rearrangements Cause Somatic Loss of Berry Color in Grapevine.

Authors:  Pablo Carbonell-Bejerano; Carolina Royo; Rafael Torres-Pérez; Jérôme Grimplet; Lucie Fernandez; José Manuel Franco-Zorrilla; Diego Lijavetzky; Elisa Baroja; Juana Martínez; Enrique García-Escudero; Javier Ibáñez; José Miguel Martínez-Zapater
Journal:  Plant Physiol       Date:  2017-08-15       Impact factor: 8.340

2.  Evolutionary genomics of grape (Vitis vinifera ssp. vinifera) domestication.

Authors:  Yongfeng Zhou; Mélanie Massonnet; Jaleal S Sanjak; Dario Cantu; Brandon S Gaut
Journal:  Proc Natl Acad Sci U S A       Date:  2017-10-17       Impact factor: 11.205

3.  Genome-wide DNA polymorphisms in four Actinidia arguta genotypes based on whole-genome re-sequencing.

Authors:  Miaomiao Lin; Jinbao Fang; Chungen Hu; Xiujuan Qi; Shihang Sun; Jinyong Chen; Leiming Sun; Yunpeng Zhong
Journal:  PLoS One       Date:  2020-04-10       Impact factor: 3.240

4.  Identification of Selective Sweeps in the Domesticated Table and Wine Grape (Vitis vinifera L.).

Authors:  Ling Kui; Min Tang; Shengchang Duan; Shouling Wang; Xiao Dong
Journal:  Front Plant Sci       Date:  2020-05-14       Impact factor: 5.753

5.  Fruit ripening in Lycium barbarum and Lycium ruthenicum is associated with distinct gene expression patterns.

Authors:  Jianhua Zhao; Haoxia Li; Yue Yin; Wei An; Xiaoya Qin; Yajun Wang; Yunfang Fan; Yanlong Li; Youlong Cao
Journal:  FEBS Open Bio       Date:  2020-06-29       Impact factor: 2.693

Review 6.  Facing Climate Change: Biotechnology of Iconic Mediterranean Woody Crops.

Authors:  Carlos De Ollas; Raphaël Morillón; Vasileios Fotopoulos; Jaime Puértolas; Patrick Ollitrault; Aurelio Gómez-Cadenas; Vicent Arbona
Journal:  Front Plant Sci       Date:  2019-04-16       Impact factor: 5.753

7.  Genome Resequencing, Improvement of Variant Calling, and Population Genomic Analyses Provide Insights into the Seedlessness in the Genus Vitis.

Authors:  Myung-Shin Kim; Youn Young Hur; Ji Hong Kim; Soon-Chun Jeong
Journal:  G3 (Bethesda)       Date:  2020-09-02       Impact factor: 3.154

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.