Literature DB >> 30910819

Mapping Gene Markers for Apple Fruit Ring Rot Disease Resistance Using a Multi-omics Approach.

Fei Shen1, Zhenyu Huang1, Baoguo Zhang2, Yi Wang1, Xi Zhang1, Ting Wu1, Xuefeng Xu1, Xinzhong Zhang3, Zhenhai Han3.   

Abstract

Apple fruit ring rot (FRR), caused by Botryosphaeria dothidea, is a worldwide disease that impacts Asian apple production regions. However, no substantial progress has thus far been made toward the mapping of candidate genes or the development of effective genetic makers. In this five-year study, the resistance of 1,733 F1 hybrids from the cross 'Jonathan' × 'Golden Delicious' was phenotyped by non-wounding inoculation with four B. dothidea isolates. We first conducted systematic comparison of different analytic strategies for bulk segregant analysis by re-sequencing (BSA-Seq) and obtained suitable one for outbreeding species such as Malus Forty-six quantitative trait loci (QTL) for resistance/susceptibility to the four isolates, including one QTL 'hotspot' on chromosome 14, were identified via BSA-Seq. Using integrated multi-omics strategies including RNA-sequencing, parental re-sequencing, BSA-Seq and meta-analysis of RNA-sequencing, fifty-seven candidate genes and corresponding functional mutations from the QTL were predicted. Functional mutations located on the candidate genes were validated using kompetitive allele-specific PCR in hybrids and Malus germplasm accessions with extremely resistant/susceptible phenotypes. Ten effective markers for apple ring rot were developed. The results provide an example of rapid candidate gene mapping for complex traits in outbreeding species.
Copyright © 2019 Shen et al.

Entities:  

Keywords:  BSA-Seq; Botryosphaeria dothidea (Moug. ex Fr.) Ces. & De Not; Malus domestica Borkh.; RNA-Seq; multi-omics

Mesh:

Substances:

Year:  2019        PMID: 30910819      PMCID: PMC6505150          DOI: 10.1534/g3.119.400167

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Apple fruit ring rot (FRR), caused by Botryosphaeria dothidea, is a worldwide disease that severely impacts Asian apple production regions (Guo ). The phenotype of FRR incidence is qualitatively dominant against non-incidence, and the variation in susceptibility to FRR is attributed to the segregation of three major gene loci (Zhuang ). Thirty-four major gene loci and six quantitative trait loci (QTL) for bot canker and FRR resistance/susceptibility have been genetically mapped on the apple genome (Cui ). However, these QTL regions are too large for identifying candidate genes or to be effectively used in marker-assisted selection. Bulk segregant analysis (BSA) is a powerful method for identifying DNA markers that are tightly linked to the causal genes in inbreeding plant species (Michelmore ). Markers for both qualitative and quantitative trait loci can be identified by BSA, and BSA has been extensively used in cereal crops, vegetables, ornamentals, and some tree species (Lu ; Pandey ; Ries ; Singh ; Takagi ; Wang ; Win ; Yang ). BSA has been greatly improved in recent years by next-generation sequencing (NGS) technology, defined as BSA-Seq or QTL-Seq, and has been effectively used for QTL mapping in many plant species (Geng ; Illa-Berenguer ; Lu ; Pandey ; Ries ; Singh ; Takagi ; Win ). Integrated multi-omics analysis can significantly enhance the efficient identification of candidate genes from QTL regions. Deep re-sequencing is an effective and reliable technology for detecting genome-wide genetic variations among cultivars (Kunihisa ; Xing ; Zhang ). The transcriptome can reveal both the differentially expressed genes associated with biological processes and the genetic variations in coding sequences (CDS) (Balan ; Bonnet ; Hill ). Large online transcriptome datasets are now available, and the meta-analysis of RNA-sequencing (RNA-Seq) datasets can provide an important reference for characterizing gene functions (Balan ; Balan ; Guo ). For instance, copy number variation in a gene cluster encoding endopolygalacturonase, which mediates flesh texture and stone adhesion, was identified in peach using re-sequencing and RNA-Seq technology following traditional QTL mapping analysis (Gu ). The biosynthesis, regulation, and domestication of bitterness in cucumber were determined using integrated approaches, including genome-wide association studies (GWAS), re-sequencing, and transcriptome analysis (Shang ). In this study, we integrated whole genome parental re-sequencing, BSA-Seq, and RNA-Seq meta-analysis to identify QTL and screen candidate genes for FRR resistance/susceptibility. Markers in the regions of the candidate genes were validated by kompetitive allele-specific PCR (KASP).

Materials and Methods

Plant material

Malus germplasms (60) and biparental cross hybrids (1,773) (‘Jonathan’ × ‘Golden Delicious’) were used as segregating populations. The hybrid cross was performed in 2002 and the seedlings were planted in 2003. All plant materials were subjected to conventional cultivation management practices.

Phenotyping and sampling

In the years of 2008, 2009, 2014, 2015, and 2016, phenotyping for FRR resistance was undertaken under room temperature conditions using the ripened fruit and mycelia of four B. dothidea isolates (Zz026, Ls1, Lw023, and Lw048) (Guan ; Zhuang ).

Re-sequencing of ‘Golden Delicious’ and ‘Jonathan’ and data processing

Genomic DNA was extracted from the leaves of ‘Golden Delicious’ and ‘Jonathan’ using a Genomic DNA Isolation Kit (TianGen, Beijing, China). The Illumina sequencing libraries were constructed using NEBNext DNA Library Prep Master Mix. Paired-end sequencing was performed using the Illumina HiSeq2500 sequencer (Illumina, San Diego, CA). Burrows-Wheeler Aligner (BWA) was used to map clean reads to the apple double haploid (DH) genome (Li and Durbin 2009; Daccord ). SNP and InDel calling were conducted using SAMtools (Li ; Li 2011). SNPs and InDels with a lowest phred-scaled quality of 30 and a minimum read depth of 10 were kept. Structure variation calling was conducted using the SpeedSeq pipeline (Chiang ). InterproScan5.21was used to conduct functional domain analysis of all the genes in the apple genome (Zdobnov and Apweiler 2001). Cis-acting regulatory elements in upstream of the genes were analyzed using PlantCARE web tools (Rombauts ). Finally, all the variations were annotated with ANNOVAR based on the functional domain, cis-acting regulatory elements, and gene model information (Wang ).

DNA pooling and BSA data processing

The resistant/susceptible bulks were constructed using 1,773 hybrids of ‘Jonathan’ × ‘Golden Delicious’ (Guan ; Zhuang ). Based on the lesion length data after fruit non-wounding inoculation with four isolates for five years, hybrids exhibiting the non-incidence phenotype in at least two years were included in the resistance (R) bulk, while incident hybrids with random lesion lengths in at least two years were chosen as the susceptible (S) bulk. Genomic DNA was extracted from the leaves of the selected hybrids using a Genomic DNA Isolation Kit (TianGen). Eight DNA libraries were constructed after DNA pooling based on the group information. The method of Genomic DNA extraction and the DNA library construction were the same as above. Paired-end sequencing was performed using the Illumina HiSeq X Ten platform (Illumina). The clean reads were mapped to the apple DH genome using BWA, and only uniquely mapped reads with a minimum phred-scaled map quality score of 20 were kept. The SNPs showing polymorphisms between parents or which were double-heterozygous in the two parents were subdivided into three subsets: J-type, whereby the genotype of the markers is homozygous in ‘Golden Delicious’ and heterozygous in ‘Jonathan’; G-type, whereby the genotype of the markers is heterozygous in ‘Golden Delicious’ and homozygous in ‘Jonathan’; and H-type, whereby the genotype of the markers is heterozygous in both parents. Read counts with different genotypes from extremely resistant and susceptible pools were extracted and separated based on the G-, J-, and H-type subsets, respectively. Thus, for each isolate, three files (G, J, and H) with read counts of the coupled pools were generated. The allele frequency difference (AFD) between two coupled pools and G-value in each site were calculated based on the read counts. Nadaraya-Watson kernel regression was used as a smoothing function, and the G’-value at each site within the 1 Mbp sliding window size was calculated (Magwene ). The significance threshold of G’ was estimated using an empirical approach proposed by Magwene . Significant regions with false discovery rate (FDR) <0.01 were selected as candidate QTL. The QTL analysis was performed three times using each of the three subsets of data. The QTL could be mapped to the pollen, maternal, or both parents using the marker data subset G, J, or H, respectively. QTL with peak G’ value exceeding 30% of the significant threshold were defined as major QTL.

Estimating the impact of the reference genome on QTL detection

To investigate the impact of the reference genome on QTL detection, the clean sequencing reads from the two bulked DNA pools (resistant/susceptible to isolate Zz26) were processed to detect QTL using two versions of the apple reference genome (diploid version 1.0p and DH) (Velasco ; Daccord ). The DNA sequences without gaps (Ns) of the QTL were obtained from the corresponding genome (diploid version 1.0p or DH) using SAMtools software (Li ; Li 2011). The LASTZ sequence alignment program was used to align the sequences against the genome, and dot plots were visualized in the R statistical environment (Harris 2007). To verify the reliability of the QTL, 3-5 markers were selected from each QTL region, and all individuals in the two bulks were genotyped using the KASP assay. Chi-square tests of the genotypic frequency of a certain marker indicating significant differences (P-value <0.05) between the two bulks were considered reliable.

Comparison of different statistical methods for QTL identification

To determine the best statistical method, we compared the SNP index (SI), Euclidean distance (ED), and the G-value (GV) methods for QTL identification using the phenotype data of resistance/susceptibility to the Zz26 isolate. The absolute value of the △SNP index (|△SI|) was used for SI (Wang ). Where large numbers of SNPs exist, fitting the ED using a Loess curve with a polynomial exponent of one in the MMAPPR may be hugely time-consuming and computationally intensive (Hill ). Instead, we used Nadaraya-Watson kernel regression as a smoothing function in ED (Magwene ). To get a fair decision of the three statistical methods, we adopted the same threshold selection method. The median of all the scores plus three-times the standard deviation (SD) were used as thresholds for SI, ED, and GV.In the GV method, the G-value was calculated at each SNP using the standard G–statistic followed by smoothing using equation 1, where the sum includes all SNPs within the window W bracketing the SNP, and equation 2, where D is the standardized distance, with value 0 at the focal position and value 1 at the edge of the window. Similarly, in the ED method, the Euclidean distance is first calculated at each SNP location using equation 3, where the letters represent the corresponding allele frequency of the bases in each pool. This distance is then raised to four-times the power, following which the data are fit using Nadaraya-Watson kernel regression, i.e., equation 4. Also, for the SI method in this study, |△SI| was calculated at each maker and then smoothed using the same smoothing function.

RNA-Seq library construction, sequencing, and data processing

The fruit of three hybrids randomly chosen from each of the extremely resistant/susceptible bulks were used for the RNA-Seq analysis (as three independent biological replicates). Three ripened apples from each hybrid were inoculated without wounding with the mycelia of the B. dothidea Zz26 isolate, and healthy flesh from the very edge of the lesion was cut and sampled at 0 h, 24 h, 48 h, and 96 h after inoculation (Guan ). Three apples of each non-inoculated hybrid were also sampled as controls. Total RNA was extracted using the modified CTAB method (Moser ). For each sample, 5 μg total RNA was used to isolate mRNA for the preparation of an RNA-Seq library using NEBNext Poly(A) mRNA Magnetic Isolation Module and NEBNext Ultra Directional RNA Library Prep Kit for Illumina (New England Biolabs) following the manufacturer’s protocols. The cDNA library was sequenced (paired-end 150) from both the 5′ and 3′ ends on the Illumina HiSeq X Ten platform (Illumina) according to the manufacturer’s instructions. The RNA-Seq reads were mapped to the apple DH genome with HISAT2 (Kim ). StringTie was used to conduct transcript assembly and quantification (Pertea ). DESeq2 software was used to detect differentially expressed genes (DEGs) between the resistant and susceptible hybrids (Love ). Blast2GO was used for gene ontology (GO) classification and GO enrichment analysis (Conesa ). KOBAS2.0 was used for Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolism annotation and KEGG enrichment analysis (Xie ). The .bam files from one individual hybrid were merged, followed by genotyping by SAMtools (Li ; Li 2011). SNPs and InDels were called and clustered between resistant and susceptible hybrids.

Candidate gene mining From the QTL regions based on multi-omics data

Narrowing down the QTL regions with varied sliding windows sizes:

By using the BSA-Seq data, all markers located on the gene body or up-stream regions of genes were selected from ± 0.5 Mb regions of the QTL main peak and sub-peaks, if present. The markers were then filtered, and those makers with a G-value above the G’-value were selected for subsequent analysis. Sliding window analysis with 0.125 Mbp, 0.25 Mbp, 0.5 Mbp, 0.75 Mbp, and 1 Mbp window sizes was used. The standard deviation and mean of those peaks were calculated to measure the stability of the peak position. Overlapped peak regions with different window sizes were considered to be the defined region

Excluding genes from QTL intervals using the parental re-sequencing data:

Too many genes were present in the narrowed-down QTL intervals. To further reduce the gene number, another criterion for candidate functional single nucleotide variants (SNVs) and structural variations (SVs) screening is that all the variations must be consistent with Mendelian inheritance; i.e., if the QTL is located only in the pollen parent, then the genotype of the screened variant in the pollen parent must be heterozygous, the genotype in the maternal parent must be homozygous, and vice versa. By using the parental SNV and SV database, many genes within the narrowed-down QTL intervals were culled, except where upstream (2,000 bp) variations between the two parents may affect cis-element functioning, or where exonic variations between parents may cause stop-gain, frameshift, or domain non-synonymous altering.

Excluding genes from candidates using transcriptome data:

Gene transcription is often affected by variations in up-stream regions, especially on the cis-elements (Espley ; Kobayashi ; Zhang ). The exonic variation can be also detected in the corresponding cDNA samples if the gene is transcribed (Han ; Piskol ). By using transcriptomic data, the unassociated genes can be further culled from the above-mentioned candidate genes, including those absent in the unigene dataset and thus un-transcribable with those variations in upstream cis-elements, but absent in the DEG dataset; and those with exonic variations absent in the unigene mRNA SNP/InDel-calling dataset.

Excluding genes from candidates with functional alterations in the incorrect bulk pool:

We attempted to assemble haplotype blocks using paired-reads. The Hidden Markov Model-based algorithm integrated in SAMtools was adopted to assemble haplotype blocks from the DNA sequence reads (Li ; Li 2011). The haplotype blocks from the resistant/susceptible bulks were considered as main haplotype blocks representing the haplotype shared by most of the samples in the bulks. BSA markers were used as ‘anchor markers’ to assign the allele frequency to the haplotype. Thus, the haplotype blocks could be determined as enriched in resistant or susceptible pools based on the BSA makers.

Marker validation and gene expression analysis

To examine the RNA-Seq data, randomly selected genes were validated using real-time quantitative (RT-q)PCR, and the gene-specific primers were designed with primer premier software (version 5.0) (Premier Biosoft Interpairs, Palo Alto, CA) (Table S1). One microgram of total RNA was used in the reverse transcription in a total volume of 20 μL in the presence of 6-mer random primer and oligo primer according to the protocol of the TaKaRa kit (TaKaRa Biotechnology Co., Ltd., Japan). The PCR reactions were run in three replicates in a Bio-Rad Sequence Detection System (Bio-Rad Life Science Research, Hercules, CA, USA). Actin was chosen as an internal control gene for normalization. Quantifying the relative expression of the genes at the three sampling time points was performed using the delta-delta Ct method (Livak and Schmittgen 2001). All data were expressed as the mean ± SD after normalization.

Validation of SNPs and InDels

Primers were designed using Primer-blast software (NCBI, Maryland, USA) based on the 200-bp sequences flanking the SNPs and InDels to be validated. Then, monoclonal PCR products comprising SNP loci of interest were amplified in both parents and sequenced using Sanger sequencing (BGI, Beijing, China) to verify the SNPs obtained from the re-sequencing data between the parents.

KASP genotyping assay and statistical analysis

A total of 110 DNA samples, including 20 and 30 individuals in the bulks resistant and susceptible to isolate Zz26, respectively, and also 60 Malus germplasm accessions were genotyped using the KASP assay (LGC Genomics, Beverly, MA, USA). The primers were designed based on the 200-bp sequence flanking the SNPs (Table S2). The DNA of ‘Jonathan’ and ‘Golden Delicious’ was used as control. Then the genotypes were validated by Sanger sequencing. Data were analyzed using the “Endpoint Genotyping” method of the Light Cycler 480 Software (release 1.5.0). The segregation biases of markers between resistant and susceptible bulks were tested by Chi-square test. Chi-square test was run twice, the first, allele A was supposed to be completely/partially dominant on B, and the next, allele B completely/partially dominant on A. The scenario which Chi-square value was statistically significant was accepted.

Data Availability

The RNA-sequencing data and the whole genome re-sequencing data of the pooled DNA from the bulks and un-pooled DNA from the parental cultivars have been deposited in the NCBI Sequence Read Archive (SRA) with the accession number PRJNA392908. Gene expression data and other additional datasets and codes are also available at Figshare and File ‘Readme.txt’ contains detailed description of all the additional datasets. File S1 contains all the detailed descriptions of supplemental materials. Supplemental material available at Figshare: https://doi.org/10.25387/g3.7819688.

Results

Bulk construction and whole genome re-sequencing

Based on the phenotypes over five years (2008, 2009, 2014, 2015, and 2016), eight bulks were developed, each including 20∼37 hybrids with extremely resistant or susceptible phenotypes to each of the four B. dothidea isolates (Zz26, Ls1, Lw023, and Lw048). A total of 199 hybrids were included in the eight bulks, of which 17 and 21 exhibited extreme resistance or susceptibility to at least two pathogen isolates, respectively (Figure 1 and Table S3).
Figure 1

Venn diagrams of the hybrid numbers (‘Jonathan’ × ‘Golden Delicious’) included in the bulks with phenotypes of extremely resistant (R) and susceptible (S) to apple fruit ring rot (Botryosphaeria dothidea) isolates Ls1 (red), Lw023 (orange), Lw048 (green) and Zz26 (blue).

Venn diagrams of the hybrid numbers (‘Jonathan’ × ‘Golden Delicious’) included in the bulks with phenotypes of extremely resistant (R) and susceptible (S) to apple fruit ring rot (Botryosphaeria dothidea) isolates Ls1 (red), Lw023 (orange), Lw048 (green) and Zz26 (blue). The whole genome re-sequencing data of the pooled DNA from the bulks and un-pooled DNA from the parental cultivars, ‘Jonathan’ and ‘Golden Delicious’, have been deposited in the NCBI Sequence Read Archive (SRA) with the accession number PRJNA392908. A total of 163,934,426 and 172,856,412 clean reads were obtained by parental re-sequencing for ‘Golden Delicious’ and ‘Jonathan’, respectively (Table S3). About 329.4 M reads were mapped to the apple DH genome (Daccord ), of which about 207.5 M reads were uniquely mapped (Table S3). SNVs and SVs called and annotated using the parental re-sequencing data are included in Table S4–S7. The overall consistency between the NGS and Sanger sequencing platforms was 99.9% and 85% for SNVs and SVs, respectively (Table S8 and Table S9). Based on the re-sequencing data of the two parents, a total of 1,126,610 J-type SNPs, 500,397 G-type SNPs, and 3,098,152 H-type SNPs were identified (Table 1).
Table 1

Numbers of different types of markers used in bulk segregant analysis

IsolateJ-typeG-typeH-typeSum
J vs. G112661050039730981524725159
Zz26147550254502512722743292801
Ls17033212575157329351693771
Lw02310139743812929610132356279
Lw048153828353554714335173507347

Notes: G-type: the genotype of the marker is heterozygous in ‘Golden Delicious’ and homozygous in ‘Jonathan’; J-type: the genotype of the marker is homozygous in ‘Golden Delicious’ and heterozygous in ‘Jonathan’; H-type: the genotype of the marker is heterozygous in both parents;J vs. G:all the numbers of the three types of markers based on the genotype of ‘Golden Delicious’ and ‘Jonathan’.

Notes: G-type: the genotype of the marker is heterozygous in ‘Golden Delicious’ and homozygous in ‘Jonathan’; J-type: the genotype of the marker is homozygous in ‘Golden Delicious’ and heterozygous in ‘Jonathan’; H-type: the genotype of the marker is heterozygous in both parents;J vs. G:all the numbers of the three types of markers based on the genotype of ‘Golden Delicious’ and ‘Jonathan’. Sequencing of the eight bulks generated a total of 917,367,968 reads. The sequence reads of the bulked samples provided on average 39.6–60 X coverage for the DH apple genome (Table S3). For all the reads (937,097,528) from the eight bulks, 97.89% (917,367,968) were mapped to the reference genome and 72.8% (682,727,645) were uniquely mapped (Table S3). After filtering, the sum of the remaining J-, G-, and H-type SNPs ranged from 1,693,771 to 3,507,347, and the corresponding SNP density across the genome ranged from 2601/Mb to 5387/Mb in the extreme bulks of the four isolates (Table 1). The distribution of SNPs in the genome, shown as the number of SNPs per Mb with a 10 Kbp sliding window, was not even, and almost all regions in the genome excluding the gaps were covered by the sliding window analysis (Figure S1–S4).

Comparison of statistical methods for QTL identification

The profiles of the corresponding major QTL (J06.1, J08.1, J10.1, J10.2, J10.3, G11.1, G15.1, H03.1, H14.1, H14.2, and H16.1) identified through SI, ED, and GV coincided or overlapped well, indicating that the effects of the major QTL were relatively robust. For all statistical methods, the effects of noise were effectively reduced and AFD was measured between the two extreme pools (Figure S5, Figure S6 and Figure S7). However, the QTL by SI covered broader regions and were less significant than by ED or GV (Figure S5, Figure S6, Figure S7 and Table 2), implying that the QTL sensibility of SI was comparatively lower. As a consequence, one reliable minor QTL (G02.1-GV) was missed in SI (Figure S7 and Table 2). Distinct unbiasedness and low false positives in the QTL were achieved using GV, especially for H-type QTL. The QTL intervals by GV (2,551,554 bp) were much narrower than that by ED (2,618,442 bp) (Table 2). Additionally, some false positive QTL were found in ED (Figure S6 and Table S11). Collectively, GV was the best statistical method and was thus explored subsequently.
Table 2

Comparison of overlapped QTL for apple (Jonathan × Golden Delicious) resistance/susceptibility to Botryosphaeria dothidea isolate Zz26, detected by three statistic methods

Statistic methodQTLChromosomeOriginQTL startQTL endQTL regionPeak positionPeak (SI/G’/ED)ThresholdPercentage exceeding thresholdKASP confirmed (YES/NO)
GVH03.1Chr03H33952320357769741824654348226055.22693.865935.21%YES
SIH03.1Chr03H33397901358691122471211352033530.19730.170016.07%YES
EDH03.1Chr03H34041631358892321847601353892320.0170.012140.43%YES
EDJ06.1Chr06J15018027049918554811648000680.02380.0076211.49%YES
GVJ06.1Chr06J15018027049918554811648000688.9353.8199133.91%YES
SIJ06.1Chr06J16688637529246586038347998460.22760.150051.71%YES
EDJ08.1Chr08J10795726118431551047429112957260.00940.007623.60%YES
GVJ08.1Chr08J10795726117957261000000112957264.40683.819915.36%YES
SIJ08.1Chr08J1082489711743561918664112957260.16440.15009.59%YES
GVJ10.1Chr10J25782140271400111357871261112636.68543.819975.02%YES
EDJ10.1Chr10J25781888272799841498096266686700.01270.007666.85%YES
SIJ10.1Chr10J25477302271400111662709266007920.17870.150019.13%YES
EDJ10.2Chr10J29538171385278878989716377488930.01870.0076145.20%YES
SIJ10.4Chr10J30998909385229837524074377498740.20340.1500115.00%YES
GVJ10.4Chr10J36744410385059071761497377488937.3133.819991.45%YES
SIG11.1Chr11G9782493110212961238803104087190.16410.140017.24%YES
EDG11.1Chr11G9782493110389061256413104087190.0080.005545.00%YES
GVG11.1Chr11G9908719109497571041038104087194.54333.504629.64%YES
EDH14.1Chr14H14647346176830753035729152699580.02190.012180.77%YES
GVH14.1Chr14H14685326172830752597749154668645.9193.865953.11%YES
SIH14.1Chr14H14856674172890652432391154668640.20520.170020.68%YES
SIH14.3Chr14H25112559261125591000000256125590.23730.170039.61%YES
EDH14.2Chr14H25112559264361471323588256125590.03550.0121193.23%YES
GVH14.2Chr14H25112559262957111183152256125597.40293.865991.49%YES
EDG15.2Chr15G51474032533637761889744519740320.01090.005599.46%YES
GVG15.1Chr15G51484641531584771673836519846414.95193.504641.30%YES
SIG15.1Chr15G51659034549303213271287525140300.16950.140021.11%YES

Notes: Statistic method, GV: G value method, SI: SNP index method, ED: Euclidean distance method.

Notes: Statistic method, GV: G value method, SI: SNP index method, ED: Euclidean distance method.

The impact of the reference genome on QTL detection

By using the diploid (version 1.0p) and DH genomes, 14 and 12 significant QTL were identified, respectively (Figure 2 and Table S10). The seven reliable QTL in the DH genome coincided or overlapped precisely with nine reliable QTL from the diploid genome (Table S13). However, five reliable QTL (G02.1, J08.1, H16.1, H14.1, and H14.2) in the DH genome did not match any QTL from the diploid genome (Table S13). Conversely, two reliable QTL (G09.1 and G09.2) identified by using the diploid genome were missing from the results of the DH genome (Table S13). Based on the sequence alignment, gaps were clearly detected in two QTL regions between the two genome versions (Table S13 and Figure S9). One QTL (J05.1) in the diploid genome was aligned to chromosome 10 of the DH genome and overlapped with one QTL (J.10.2) (Table S13, Figure S9, and Figure S10). These data confirmed the reliability of the QTL detection and also suggested that using DH genome should generally produce better results, but a few QTL may be potentially missed due to the loss of segments/regions during DH process and recombination.
Figure 2

G’ value profile of the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistance/susceptibility to Botryosphaeria dothidea isolate Zz26. Y-axis represents G’ value. X-axis represents chromosome position. Red lines: ‘Jonathan’, blue lines: ‘Golden Delicious’, black lines: ‘Golden Delicious’ & ‘Jonathan.’

G’ value profile of the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistance/susceptibility to Botryosphaeria dothidea isolate Zz26. Y-axis represents G’ value. X-axis represents chromosome position. Red lines: ‘Jonathan’, blue lines: ‘Golden Delicious’, black lines: ‘Golden Delicious’ & ‘Jonathan.’

Identification of QTL for FRR resistance

For resistance/susceptibility to isolate Zz26, 12 significant QTL were obtained, six of which (J06.1-Zz26, J10.1-Zz26, J10.2-Zz26, J10.3-Zz26, H14.1-Zz26, and H14.2-Zz26) were considered as major QTL (Figure 2 and Table S10). For resistance/susceptibility to B. dothidea isolate Ls1, Lw023, and Lw048, 14, 8, and 12 significant QTL were detected, respectively. Of these QTL, seven (H14.1-Ls1, G15.1-Ls1; G02.1-Lw023, G13.1-Lw023, G14.2-Lw023, H14.1-Lw048, and J15.2-Lw048) constituted major QTL (Table S14 and Figures 2, 3, 4, and 5).
Figure 3

G’ value profile of the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistant/susceptible to Botryosphaeria dothidea isolate Ls1. Y-axis represents G’ value. X-axis represents chromosome position. Red lines: ‘Jonathan’, blue lines: ‘Golden Delicious’, black lines: ‘Golden Delicious’ & ‘Jonathan.’

Figure 4

G’ value profile of the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistant/susceptible to Botryosphaeria dothidea isolate Lw023. Y-axis represents G’ value. X-axis represents chromosome position. Red lines: ‘Jonathan’, blue lines: ‘Golden Delicious’, black lines: ‘Golden Delicious’ & ‘Jonathan.’

Figure 5

G’ value profile of the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistant/susceptible to Botryosphaeria dothidea isolate Lw048. Y-axis represents G’ value. X-axis represents chromosome position. Red lines: ‘Jonathan’, blue lines: ‘Golden Delicious’, black lines: ‘Golden Delicious’ & ‘Jonathan.’

G’ value profile of the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistant/susceptible to Botryosphaeria dothidea isolate Ls1. Y-axis represents G’ value. X-axis represents chromosome position. Red lines: ‘Jonathan’, blue lines: ‘Golden Delicious’, black lines: ‘Golden Delicious’ & ‘Jonathan.’ G’ value profile of the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistant/susceptible to Botryosphaeria dothidea isolate Lw023. Y-axis represents G’ value. X-axis represents chromosome position. Red lines: ‘Jonathan’, blue lines: ‘Golden Delicious’, black lines: ‘Golden Delicious’ & ‘Jonathan.’ G’ value profile of the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistant/susceptible to Botryosphaeria dothidea isolate Lw048. Y-axis represents G’ value. X-axis represents chromosome position. Red lines: ‘Jonathan’, blue lines: ‘Golden Delicious’, black lines: ‘Golden Delicious’ & ‘Jonathan.’ Of the significant QTL, five QTL regions coincided or overlapped with resistance/susceptibility to more than two isolates. The QTL G02.1-Zz26/G02.1-Ls1, G15.1-Ls1 / H15.1-Lw023 and G06.1-Ls1/G06.1-Lw048 coincided, while the QTL regions of J10.2-Zz26/G10.2-Lw023 and H14.2-Zz26/H14.1-Ls1/H14.1-Lw048 overlapped (Table S14 and Figure 6). H14.2-Zz26/H14.1-Ls1/H14.1-Lw048 were consistently mapped to the same region along with four previously reported major gene loci (sf-g14-ls1, sf-g14-lw048, Rb-g14-lw048, and Rb-g14-zz26) (Figure S11) (Cui ).
Figure 6

Circular overview of the QTL for apple resistance/susceptibility to fruit ring rot (Botryosphaeria dothidea) isolates Ls1 (red), Lw023 (orange), Lw048 (green), and Zz26 (blue) across the genome. The outer circle: QTL from ‘Golden Delicious’ (G); intermediate circle: QTL from ‘Jonathan’ (J); the inner circle: QTL from both ‘Golden Delicious’ and ‘Jonathan’. The colored dots in the central map indicate the geographical origin of the pathogen isolates

Circular overview of the QTL for apple resistance/susceptibility to fruit ring rot (Botryosphaeria dothidea) isolates Ls1 (red), Lw023 (orange), Lw048 (green), and Zz26 (blue) across the genome. The outer circle: QTL from ‘Golden Delicious’ (G); intermediate circle: QTL from ‘Jonathan’ (J); the inner circle: QTL from both ‘Golden Delicious’ and ‘Jonathan’. The colored dots in the central map indicate the geographical origin of the pathogen isolates

Narrowing down the QTL intervals with varied sliding window sizes

By using varied sliding window sizes, the 46 QTL for FRR resistance to the four pathogen isolates were significantly reduced (Table S15). The spanning regions of the 13 major QTL were narrowed down from 1,685,139 bp to 743,971 bp on average, and thus the significance of the average GV scores was also increased from 5.80 to 6.25 (Table S15). Several broader QTL intervals were split into two or more narrower sub-peaks; e.g., the QTL J06.1-Zz26 was spliced into two sub peaks, J06.1_sub1-Zz26 and J06.2_sub2-Zz26, respectively (Table S15). As a result of this splicing, the QTL interval was narrowed down from 4,800,068 bp to 1,000,000 bp and 500,000 bp, and the average GV scores increased from 8.93 to 8.34 and 9.71, respectively (Table S15).

Candidate gene mining from QTL regions based on multi-omics data

Excluding genes from the QTL intervals using parental re-sequencing data:

Of the 3,058 genes scattered on the 49 narrowed-down QTL regions including sub peaks (Table S15), 1,346 genes were by excluded using the parental SNV and SV databases, and 24,977 SNVs and 253 SVs were distributed in the 1,712 remaining candidate genes (Table S16). The RNA-Seq data were deposited in the NCBI Sequence Read Archive (SRA) with the accession number PRJNA392908. After removing low-quality reads and those derived from rRNA, the total reads input for mapping was 285.86 M, of which 207.9 M (80.63%) were aligned to the apple DH genome (Table S17). Overall, 136,983 transcripts from 86,808 genes (63,538 referenced and 23,270 non-referenced) were assembled. The Fragments Per Kilobase of transcript per Million mapped reads (FPKM) values of three representative genes, MD05G1236300, MD05G1261100, and MD08G1128800, were closely correlated with their relative expressions by RT-qPCR (R2 = 0.921-0.962, P < 0.01) (Figure S12). The correlation analysis and hierarchical cluster analysis revealed a close grouping of the same stage in both the extremely susceptible and resistant hybrids (Figure S13). In total, 1,570 DEGs were detected between the extremely susceptible and resistant hybrids across the sampling time points (Figure S14). SNVs, including 503,067 SNPs and 37,402 InDels, were identified in 36,796 genes. Following functional annotation, 103,510 non-synonymous SNVs were identified and were distributed across 24,533 genes. In addition to the transcriptomic data in this study, the RNA-Seq data of 25 transcriptome bio-projects and 432 samples from the SRA database were downloaded (Table S18). All the reads were mapped to the DH reference genome to quantify the expression of each gene. The expression data of the genes were also used to assist in the screening of organ-specific or non-expressive genes (Eide ). Using the transcriptomic data, 56 non-expressed genes and 46 non-DEGs without variations in their CDS were culled from the 503 candidates in the QTL intervals for resistance/susceptibility to isolate Zz26 (Table S16). In the QTL regions for resistance/susceptibility to isolate Ls1, Lw023, and Lw048, 221 of the 1,209 genes were excluded owing to non-expression in any apple fruit samples (Table S16).

Excluding genes from candidates by functional annotation:

One thousand and one of the 1,389 candidate genes were successfully assigned with Gene Ontology (GO) terms. Based on the detailed gene annotations, 386 genes were excluded due to the inconsistency of their organ/tissue/sub-cellular localization, developmental dynamics, and physiological pathway annotations, after which 1,003 genes remained (Table S16).

Excluding genes from candidates based on their functional alteration appearing in the incorrect bulk:

The allele frequency of the screened functional variations in the extremity pools can aid candidate gene prediction. However, only small parts of the SNPs used in the BSA analysis can be directly assigned with allele frequency. Here, we phased the variations and constructed 740,329 haplotype blocks in the two parents. BSA markers were used as ‘anchor markers’ to assign the AFD between two bulks to the haplotype. For the pollen parent ‘Golden Delicious’, 5,505,264 SNPs were phased into 364,768 haplotype blocks, and after anchoring, 162,184 (44.46%) haplotype blocks were assigned an AFD between bulks. For the maternal parent ‘Jonathan’, 6,122,520 SNPs were phased into 375,561 haplotype blocks, and 196,220 (52.25%) haplotype blocks were assigned an AFD to the bulk pools. Considering meiotic crossover, the haplotypes of the male and female parent were referenced to each other. As for resistance/susceptibility to isolate Zz26, in this study, 2,051 of the 3,211 variations in 278 candidate genes were assigned an AFD, and the 42 genes with unexpected AFD values between the two bulk pools were excluded. For example, a stop-gain variation was screened in the CDS of MD15G1419300 in QTL G15.1, which is annotated as ‘mitogen-activated protein kinase kinase kinase’ gene; however, the stop-gain variation was enriched in the R pool, which is inconsistent with its function. On the contrary, a stop-gain variation in the CDS of MD10G1288400 in J10.3, a serine/threonine-protein kinase-like protein, was enriched in the S pool, which was expected. Ultimately, 57 candidate genes from 46 significant QTL associated with resistance/susceptibility to four B. dothidea isolates were selected for further experimental validation (Table 3 and Table S14).
Table 3

Summary of the candidate genes mined from the QTL for apple (‘Jonathan’ × ‘Golden Delicious’) resistance/susceptibility to Botryosphaeria dothidea isolates

IsolateChromosomeQTLgene IDR/SMutation typeGene annotation
Lw048Chr02H02.1MD02G1036600Sframe-shift insertionProtein kinase superfamily protein
Lw048Chr02H02.1MD02G1040200RCREs & nonsynonymous SNVTranscriptional corepressor LEUNIG
Lw048Chr02H02.2MD02G1085900frame-shift deletionAnkyrin repeat family protein
Lw048Chr02H02.3MD02G1170800RCREsAP2/B3-like transcriptional factor family protein
Lw048Chr02H02.3MD02G1174000SCREs & nonsynonymous SNVProbable receptor-like serine/threonine-protein kinase
Lw048Chr02H02.3MD02G1191600Sstop-gainPathogenesis-related protein 1C
Lw048Chr03H03.1MD03G1067300Snonsynonymous SNVNB-ARC domain-containing disease resistance protein
Lw048Chr06G06.1MD06G1173700Sstop-gainSerine/threonine-protein kinase ATM
Lw048Chr08J08.1MD08G1229700frame-shift insertionProtein kinase family protein
Lw048Chr08J08.1MD08G1239000frame-shift insertionDisease resistance family protein
Lw048Chr08J08.1MD08G1241800nonsynonymous SNVLeucine-rich receptor-like protein kinase family protein
Lw048Chr08J08.1MD08G1244800Sstop-lossProtein kinase superfamily protein
Lw048Chr04J04.1MD04G1066000Sstop-gainCTR1_ARATH Serine/threonine-protein kinase CTR1
Lw048Chr14H14.1MD14G1162500CREsUnknown
Lw048Chr15J15.3MD15G1282900Sstop-gainLeucine-rich receptor-like protein kinase family protein
Ls1Chr06G06.1MD06G1173700Sstop-gainSerine/threonine-protein kinase ATM
Ls1Chr10G10.1MD10G1006400Sstop-gainWRKY transcription factor 19
Ls1Chr10G10.1MD10G1007300nonsynonymous SNVDisease resistance protein
Ls1Chr10G10.1MD10G1007600Snonsynonymous SNVdisease resistance protein
Ls1Chr10G10.1MD10G1006500nonsynonymous SNVDisease resistance protein
Ls1Chr10G10.1MD10G1006600Snonsynonymous SNVDisease resistance protein
Ls1Chr10G10.1MD10G1006800nonsynonymous SNVDisease resistance protein
Ls1Chr10G10.1MD10G1006900nonsynonymous SNVDisease resistance protein
Ls1Chr10G10.2MD10G1021000Sframe-shift insertionDisease resistance protein
Ls1Chr13H13.1MD13G1191200nonsynonymous SNVPLAT/LH2 domain-containing lipoxygenase family protein
Ls1Chr13J13.1MD13G1210700CREs & nonsynonymous SNVHR-like lesion-inducing protein-related
Ls1Chr14H14.1MD14G1162500CREsUnknown
Ls1Chr15G15.1MD15G1103400nonsynonymous SNVTranscriptional corepressor LEUNIG
Ls1Chr15G15.1MD15G1104000Snonsynonymous SNVDisease resistance protein
Ls1Chr15G15.1MD15G1103200nonsynonymous SNVAIG2-like (avirulence induced gene) family protein
Ls1Chr15G15.2MD15G1129100frame-shift deletionTOPLESS-related 1
Ls1Chr17J17.2MD17G1028400frame-shift deletionProtein TPR2
Ls1Chr17J17.2MD17G1028700Sstop-gainAnkyrin repeat-containing protein NPR4
Ls1Chr17J17.2MD17G1029100Sstop-gainAnkyrin repeat family protein
Ls1Chr17J17.2MD17G1029200frame-shift deletionAnkyrin repeat family protein
Ls1Chr02G02.1MD02G1213800Snonsynonymous SNVPentatricopeptide repeat-containing protein
Ls1Chr17J17.2MD17G1054100nonsynonymous SNVWRKY DNA-binding protein 3
Lw023Chr02G02.1MD02G1229600SupstreamTwo-component response regulator ORR22
Lw023Chr02G02.1MD02G1230200upstreamSeven transmembrane MLO family protein
Lw023Chr10G10.1MD10G1111400SCREs & nonsynonymous SNVBasic form of pathogenesis-related protein 1
Lw023Chr10G10.2MD10G1243000Snonsynonymous SNVWRKY transcription factor 14
Lw023Chr10G10.2MD10G1243400RupstreamReceptor-like serine/threonine-protein kinase ALE2
Lw023Chr14G14.2MD14G1236800nonsynonymous SNVHCP-like superfamily protein
Lw023Chr14G14.2MD14G1236600nonsynonymous SNVProstaglandin E synthase 2
Lw023Chr15H15.1MD15G1106800Sstop-gainLeucine-rich repeat protein kinase family protein
Lw023Chr15H15.1MD15G1103700Sstop-lossDisease resistance protein
Lw023Chr15H15.1MD15G1104000Sframe-shift deletionDisease resistance protein
Lw023Chr15H15.1MD15G1106600CREs & nonsynonymous SNVWRKY DNA-binding protein 11
Zz26Chr02G02.1MD02G1213800Snonsynonymous SNVPentatricopeptide repeat-containing protein
Zz26Chr11G11.1MD11G1116700SCREsAnkyrin repeat-containing protein
Zz26Chr15G15.1MD15G1416500Sstop-gainDisease resistance protein
Zz26Chr03H03.1MD03G1259600CREsSerine/threonine-protein kinase
Zz26Chr14H14.2MD14G1162500CREsUnknown
Zz26Chr16H16.1MD16G1236700RCREsSerine/threonine-protein kinase
Zz26Chr06J06.1_sub1MD06G1017400SCREsF-box/kelch-repeat protein
Zz26Chr06J06.1_sub2MD06G1037600SCREsNDR1/HIN1-like 8
Zz26Chr08J08.1MD08G1120500Sstop-gainPentatricopeptide repeat-containing protein
Zz26Chr08J08.1MD08G1120700Sframe-shiftPentatricopeptide repeat-containing protein
Zz26Chr10J10.1MD10G1169200CREsKinase-like protein TMKL1
Zz26Chr10J10.2MD10G1255900Sstop-gainF-box/kelch-repeat protein
Zz26Chr10J10.3MD10G1288400frame-shift deletionSerine/threonine-protein kinase-like protein
Zz26Chr10J10.3MD10G1288500nonsynonymous SNVReceptor-like protein kinase HSL1

Notes: Enriched pool (R/S/—): the enriched pools of the candidate functional variations, R: extremely resistant pools, S: extremely susceptible pools; —: unknown; CREs: cis-acting regulatory elements.

Notes: Enriched pool (R/S/—): the enriched pools of the candidate functional variations, R: extremely resistant pools, S: extremely susceptible pools; —: unknown; CREs: cis-acting regulatory elements.

Experimental validation of the candidate genes

The presence/absence of variations in the 14 candidate genes from 11 major QTL linked to resistance to FRR isolate Zz26 was confirmed by Sanger sequencing (Table S19 and Table S20). The segregation of the variants was then validated by KASP genotyping in hybrids with extreme phenotypes, and all the variants exhibited significant associations between genotypes and phenotypes (P < 0.05) (Table S21). The presence/absence of the markers was then verified in each 30 Malus germplasm accessions highly resistant/susceptible to FRR isolate Zz26 (Table S22). Ten KASP markers from nine QTL exhibited significant associations with the phenotypes (Table S23). Of the 14 candidate genes, five genes (MD03G1259600, MD16G1236700, MD10G1169200, MD10G1288400, and MD10G1288500), located in four QTL (H03.1, H16.1, J10.1, and J10.3), constituted protein kinases or kinase-like proteins (Table S19). The variations were validated by Sanger sequencing in the cis-element or transcription factor binding site of the promotors of MD03G1259600, MD16G1236700, and MD10G1169200 (Table S20), which is highly consistent with the finding that the expressions of these genes in fruit from the resistant bulk 48 h after inoculation were significantly higher than that in the susceptible bulk (Figure S15). Frame-shift deletions and stop-gain mutants were located in the CDS of the other two protein kinase genes, MD10G1288400 and MD10G1288500, respectively, which may cause functional loss of the genes (Table S20). From the major QTL J06.1, MD06G1037600, annotated as ‘NDR1/HIN1-like 8’ was ultimately screened as a candidate gene (Table S19). Re-sequencing and Sanger sequencing confirmed the variations in the cis-element and transcription factor (TF) binding site of this gene between parents (Table S20). The KASP analysis of the maker (KASP100) located in the promoter of MD06G1037600 exhibited significant association with the phenotype in the bulks (χ2= 7.4108 P = 0.006480 <0.05) and the 60 accessions with extreme phenotypes (χ2 = 5.93, P = 0.0146 <0.05) (Table S21, Table S23, and Table S20). The expression of this gene in the fruit from the resistant bulk 48 h after inoculation was consistently significantly higher than that in the susceptible bulk (Figure S15). Three pentatricopeptide repeat-containing genes, MD02G1213800, MD08G1120500, and MD08G1120700, were identified as candidate genes from QTL G02.1 and J08.1 (Table S19). Nonsynonymous SNVs were distributed in the functional domain of MD02G1213800, and the KASP analysis of one functional variation marker (KASP146) exhibited significant association with the phenotype of the two extreme pools (χ2 = 9.736, P = 0.001807 <0.05) (Table S18 and Table S21). Stop-gain and frame-shift variations were detected and validated in both MD08G1120500 and MD08G1120700 (Table S19 and Table S20). Furthermore, KASP analysis confirmed that the stop-gain mutation A (KASP107), enriched in the susceptible bulk, exhibited significant AFD values between the two pools (χ2= 5.333, P = 0.02090 <0.05) and the 60 extreme accessions (χ2= 4.593, P = 0.0321 <0.05) (Table S20, Table S21, and Table S23). At the same time, frame-shift variations (KASP117) were also significantly enriched in the susceptible bulk (χ2 = 3.8595, P = 0.04950 < 0.05) and extremely susceptible accessions (χ2 = 4.0074, P = 0.0453 <0.05) (Table S20, Table S21, and Table S23). In the region of QTL G15.1, a disease-resistance gene (MD15G1416500) was detected with the largest G’ value (Table S19). A functional SNP (C to A) causing stop-gain was present in its CDS, and allele A was enriched in the susceptible bulk (Table S19). Furthermore, the corresponding KASP maker (KASP66) was significantly associated with the phenotype in both the resistant/susceptible pools (χ2 =9.145, P = 0.0025 <0.05) and 60 extreme accessions (χ2= 5.079, P = 0.0242 <0.05) (Table S20, Table S21, and Table S23). Two F-box/kelch-repeat proteins (MD06G1017400 and MD10G1255900) were detected in the QTL J06.1_sub1 and J10.2, respectively (Table S19). This protein is a component of SCF (ASK-cullin-F-box) E3 ubiquitin ligase complexes, which may mediate the ubiquitination and subsequent proteasomal degradation of target proteins (Zhu ). Protein ubiquitination processes play an important role in plant–pathogen interactions (Lee ). Re-sequencing and Sanger sequencing confirmed the SNPs and InDels in the cis-element and TF binding site of MD06G1017400 between the parents (Table S18). One SNP (KASP140) located in the TF binding site was significantly associated with the phenotype in the two pools (χ2= 5.718, P = 0.0168 <0.05) and 60 extreme accessions (χ2= 5.880, P = 0.0153 <0.05) (Table S20, TableS21, and Table S23). The expression of this gene in the fruit of hybrids from the resistant bulk 48 h after inoculation was consistently significantly higher than that of the susceptible bulk (Figure S15). While for another gene, MD10G1255900, a functional SNP (G-to-A) (KASP87) causing stop-gain was present in its CDS, and an enriched allele A was also present in the susceptible bulk and exhibited significant AFD between the two pools (χ2 = 5.203, P = 0.02255 <0.05) and the 60 extremes accessions (χ2 = 9.72, P = 0.001823 <0.05) (Table S19, Table S20, Table S21, and Table S23). Ankyrin repeat-containing genes participate in the processes of signal transduction and protein kinase activity and thus may play an important role in the plant immune system (Mou ; Ngaki ). In the QTL of G11.1, an ankyrin repeat-containing gene (MD11G1116700) was identified in the peak (Table S19). The expression of this gene in the fruit of hybrids from the resistant bulk 48 h after inoculation was significantly higher than that of the susceptible bulk (Figure S15). The significant difference in expression was possibly caused by the validated mutants in the promoter region of this gene between parents (Figure S15 and Table S20). Furthermore, the KASP marker KASP126, developed in the candidate functional InDels, was significantly associated with the phenotype in the two pools (χ2= 5.966, P = 0.01459 < 0.05) and the 60 extreme accessions (χ2= 5.253, P = 0.02191 < 0.05) (Table S20, Table S21, and Table S23).

Discussion

The efficiency of the BAS-Seq strategy for QTL mapping in an outbreeding population

The double pseudo-testcross hypothesis was successfully applied to BSA mapping by separating the three types of markers. Data from either BSA-Seq or parental re-sequencing revealed three types of marker polymorphisms. The J- and G-type markers exhibit allelic heterozygosity and resemble the test cross for the maternal and paternal parent, respectively (Grattapaglia and Sederoff 1994). H-type markers exhibiting dual allelic heterozygosity are quite similar to the F2 progeny in inbreeding species (Ries ; Takagi ). The functional variations in the candidate genes controlling the target traits are also expected to possess the three types of polymorphisms (Bai ; Takos ). Here, 13 major QTL were detected, four of which were from G, five of which were from J, and four of which were from the H polymorphism types (Table S14 and Figure 2-5). The predicted and validated functional mutations in the candidate genes also exhibited J-, G-, and H-type polymorphisms, which facilitates further gene exploration (Table S19 and Table S20). The QTL detected by using 3-type method exhibited the narrowest QTL intervals or the highest G’ value (File S2). Hence, BSA mapping by separating the three marker types allowed for the full utilization of polymorphic markers as well as the direct mapping of QTL on certain parent(s). The GV method with a sliding window can realize noise reduction and unbiased statistical analysis for the three marker types. Three methods, namely SI, ED, and GV, are commonly used in BSA-Seq to reduce the noise and to measure the AFD between two extreme pools (Hill ; Magwene ; Takagi ). In this study, 11 of the 12 significant QTL detected using the three methods coincided perfectly, indicating comparable effectiveness (Figure S5, Figure S6, Figure S7, and Table 2). However, the QTL profiles detected by GV were both narrower and more significant than those detected by SI (Table 2) and exhibited better unbiasedness and a lower false-positive ratio than that of ED (Table S11). Sliding window analysis using different window sizes significantly narrowed down the QTL intervals. Fine-mapping based on large natural or pedigree segregation populations has been widely used but is both costly and labor-intensive/time-consuming (Bai ; Laurens ; Terakami ). It is very difficult to create recombinant inbred lines or near isogenic lines in outbreeding woody perennials, such as apple, pear, and peach, which greatly hinders fine mapping in these species (Di Pierro ). Here we showed that a large amount of markers derived from the deeper re-sequencing of the two DNA pools made it possible to narrow down the QTL intervals with varied sliding window sizes analysis, e.g., for resistance/susceptibility to isolate Zz26, the marker density is close to 1059/Mbp on average (Table 1 and Figure S1), which is almost 200 times more than conventional MapQTL methods (Gardner ; Sun ). By using varied sliding windows analysis, the spanning region of the QTL was narrowed down significantly from 1,639,342 bp to 707,246 bp, the averaged GV score increased from 5.55 to 6.03, and the stability of the QTL could also be evaluated (Figure S16 and Table S15). Compared with the previous studies using the same segregating population, more QTL with much narrow interval were detected and a large number of previously reported QTL or major loci were reproduced in this study (Figure S11) (Cui ; Zhuang ). H14.1-Ls1, H14.1-Lw048 and H14.2-Zz26 coincided with Sf-g14-ls1, Sf-g14-lw048 (Table S14 and Figure S11) (Cui ). Sf-j10-zz26 was split into three narrower QTL, J10.1-Zz26, J10.2-Zz26 and J10.3-Zz26, showing that the accuracy on QTL identification is higher by BSA-seq than MapQTL based on low density linkage maps (Cui ). The pathogen isolate Mx1 was not used in this study because its weak pathogenicity to apple fruit (Zhuang ). Therefore, a few QTL or major loci reported by Cui and Zhuang were not presented.

Multi-omics enables gene culling from candidates in QTL regions

Deep re-sequencing indicated a lot of genetic variation between the two parental cultivars, which is consistent with previous reports (Lee ; Xing ; Zhang ). In the QTL intervals, genes lacking potential function-altering variations can be excluded. Transcriptomes can simultaneously indicate DEGs and genetic variations in CDS (Balan ; Bonnet ; Hill ). In this study, 1,570 DEGs and 103,510 SNVs in the CDS of 24,533 genes were identified between the resistant and susceptible hybrids (Figure S14). These data, combined with large online transcriptome datasets, assisted the efficient excluding of many genes from the QTL intervals (Balan ; Balan ; Guo ).

The polygenetic signature of apple resistance to FRR provides evidence for the pathogenic variation among isolates

In this study, phenotyping was performed for five years, and hybrids with year-long robust resistance/susceptibility were selected for bulk creation. A total of 46 QTL, including 13 major QTL, were ultimately detected for resistance/susceptibility to the four B. dothidea isolates (Table S14). At least three major QTL for each pathogen isolate were found. These QTL demonstrated the apparent polygenetic signature of apple resistance to FRR, which corroborates previous reports (Cui ; Zhuang ). A variety of factors influence apple resistance to FRR, such as the size and density of the lenticels, the thickness of the wax layer, and the polyphenol content (Bai ; Bénaouf and Parisi 2000; Fan ; Li ; Yu ; Guan ). Protein kinase plays an important role in signal transduction and pathogen recognition in the plant immune system (Park ). Receptor-like protein kinases (RLPs) are widely reported to participate in plant–pathogen interactions (Kruijt ). The NDR1 gene was once reported to confer resistance to Pseudomonas syringae pv. tomato DC3000 and plays an important role in defense against bacteria and viruses, and in the response to salicylic acid (Varet ; Varet ). MD06G1037600 (MdNDR1), MD10G1288500 (RLP), MD10G1288400 (Serine/threonine-protein kinase-like protein), and MD10G1169200 (Kinase-like protein TMKL), which are associated with the activation of disease resistance, pathogen recognition, and disease resistance signaling, were predicted as the most likely candidates determining resistance/susceptibility to FRR (Table 3 and Table S19). Only four of the 46 QTL for resistance/susceptibility to the four isolates coincided or overlapped with others (Table S14,). The pathogenicity of B. dothidea differs significantly among isolates (Liu ; Zhuang ). The biological characteristics, pathogenicity, and internal transcribed spacer sequences differ significantly between pathogen strains of FRR (Lv ). The diversity in resistance/susceptibility of apple to varied pathogen isolates supports the genetic differentiation of the pathogen according to the gene-for-gene hypothesis (Keen 1990; Van der Biezen and Jones 1998). For a complex trait like apple ring rot disease resistance, large number of QTL each capture only a small proportion of the total genetic variance, and the pathogenicity varies dramatically among isolates (Cui ). The 46 QTL identified in this study can be developed into QTL based genomics-assisted breeding or can be used as diagnostic markers among high density SNP arrays, which enables empirical genomic selection (Bianco ; Bianco ). In addition, the candidate genes can help further research on the molecular mechanism of apple FRR resistance.
  71 in total

1.  Plant science. Biosynthesis, regulation, and domestication of bitterness in cucumber.

Authors:  Yi Shang; Yongshuo Ma; Yuan Zhou; Huimin Zhang; Lixin Duan; Huiming Chen; Jianguo Zeng; Qian Zhou; Shenhao Wang; Wenjia Gu; Min Liu; Jinwei Ren; Xingfang Gu; Shengping Zhang; Ye Wang; Ken Yasukawa; Harro J Bouwmeester; Xiaoquan Qi; Zhonghua Zhang; William J Lucas; Sanwen Huang
Journal:  Science       Date:  2014-11-28       Impact factor: 47.728

2.  QTL-seq identifies an early flowering QTL located near Flowering Locus T in cucumber.

Authors:  Hongfeng Lu; Tao Lin; Joël Klein; Shenhao Wang; Jianjian Qi; Qian Zhou; Jinjing Sun; Zhonghua Zhang; Yiqun Weng; Sanwen Huang
Journal:  Theor Appl Genet       Date:  2014-05-21       Impact factor: 5.699

3.  HISAT: a fast spliced aligner with low memory requirements.

Authors:  Daehwan Kim; Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2015-03-09       Impact factor: 28.547

4.  TATA Box Insertion Provides a Selection Mechanism Underpinning Adaptations to Fe Deficiency.

Authors:  Meiling Zhang; Yuanda Lv; Yi Wang; Jocelyn K C Rose; Fei Shen; Zhenyun Han; Xinzhong Zhang; Xuefeng Xu; Ting Wu; Zhenhai Han
Journal:  Plant Physiol       Date:  2016-11-23       Impact factor: 8.340

5.  Analysis of 'Fuji' apple somatic variants from next-generation sequencing.

Authors:  H S Lee; G H Kim; S I Kwon; J H Kim; Y S Kwon; C Choi
Journal:  Genet Mol Res       Date:  2016-08-12

6.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Authors:  Michael I Love; Wolfgang Huber; Simon Anders
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

7.  QTL-seq for rapid identification of candidate genes for 100-seed weight and root/total plant dry weight ratio under rainfed conditions in chickpea.

Authors:  Vikas K Singh; Aamir W Khan; Deepa Jaganathan; Mahendar Thudi; Manish Roorkiwal; Hiroki Takagi; Vanika Garg; Vinay Kumar; Annapurna Chitikineni; Pooran M Gaur; Tim Sutton; Ryohei Terauchi; Rajeev K Varshney
Journal:  Plant Biotechnol J       Date:  2016-05-26       Impact factor: 9.803

8.  A high-density, multi-parental SNP genetic map on apple validates a new mapping approach for outcrossing species.

Authors:  Erica A Di Pierro; Luca Gianfranceschi; Mario Di Guardo; Herma Jj Koehorst-van Putten; Johannes W Kruisselbrink; Sara Longhi; Michela Troggio; Luca Bianco; Hélène Muranty; Giulia Pagliarani; Stefano Tartarini; Thomas Letschka; Lidia Lozano Luis; Larisa Garkava-Gustavsson; Diego Micheletti; Marco Cam Bink; Roeland E Voorrips; Ebrahimi Aziz; Riccardo Velasco; François Laurens; W Eric van de Weg
Journal:  Hortic Res       Date:  2016-11-23       Impact factor: 6.793

9.  Combined biotic stresses trigger similar transcriptomic responses but contrasting resistance against a chewing herbivore in Brassica nigra.

Authors:  Christelle Bonnet; Steve Lassueur; Camille Ponzio; Rieta Gols; Marcel Dicke; Philippe Reymond
Journal:  BMC Plant Biol       Date:  2017-07-17       Impact factor: 4.215

10.  Fast and cost-effective genetic mapping in apple using next-generation sequencing.

Authors:  Kyle M Gardner; Patrick Brown; Thomas F Cooke; Scott Cann; Fabrizio Costa; Carlos Bustamante; Riccardo Velasco; Michela Troggio; Sean Myles
Journal:  G3 (Bethesda)       Date:  2014-07-16       Impact factor: 3.154

View more
  8 in total

1.  dQTG.seq: A comprehensive R tool for detecting all types of QTLs using extreme phenotype individuals in bi-parental segregation populations.

Authors:  Pei Li; Liu-Qiong Wei; Yi-Fan Pan; Yuan-Ming Zhang
Journal:  Comput Struct Biotechnol J       Date:  2022-05-14       Impact factor: 6.155

2.  Intricate genetic variation networks control the adventitious root growth angle in apple.

Authors:  Caixia Zheng; Fei Shen; Yi Wang; Ting Wu; Xuefeng Xu; Xinzhong Zhang; Zhenhai Han
Journal:  BMC Genomics       Date:  2020-12-01       Impact factor: 3.969

3.  Application of genome-wide insertion/deletion markers on genetic structure analysis and identity signature of Malus accessions.

Authors:  Xuan Wang; Fei Shen; Yuan Gao; Kun Wang; Ruiting Chen; Jun Luo; Lili Yang; Xi Zhang; Changpeng Qiu; Wei Li; Ting Wu; Xuefeng Xu; Yi Wang; Peihua Cong; Zhenhai Han; Xinzhong Zhang
Journal:  BMC Plant Biol       Date:  2020-11-30       Impact factor: 4.215

4.  Role of MdERF3 and MdERF118 natural variations in apple flesh firmness/crispness retainability and development of QTL-based genomics-assisted prediction.

Authors:  Bei Wu; Fei Shen; Xuan Wang; Wen Yan Zheng; Chen Xiao; Yang Deng; Ting Wang; Zhen Yu Huang; Qian Zhou; Yi Wang; Ting Wu; Xue Feng Xu; Zhen Hai Han; Xin Zhong Zhang
Journal:  Plant Biotechnol J       Date:  2021-01-06       Impact factor: 9.803

5.  PNGSeqR: An R Package for Rapid Candidate Gene Selection through Pooled Next-Generation Sequencing.

Authors:  Sihan Zhen; Hongwei Zhang; Yuxin Xie; Song Zhang; Yan Chen; Riliang Gu; Sanzhen Liu; Xuemei Du; Junjie Fu
Journal:  Plants (Basel)       Date:  2022-07-11

6.  PyBSASeq: a simple and effective algorithm for bulked segregant analysis with whole-genome sequencing data.

Authors:  Jianbo Zhang; Dilip R Panthee
Journal:  BMC Bioinformatics       Date:  2020-03-06       Impact factor: 3.169

7.  Parallel Bud Mutation Sequencing Reveals that Fruit Sugar and Acid Metabolism Potentially Influence Stress in Malus.

Authors:  Jirong Zhao; Fei Shen; Yuan Gao; Dajiang Wang; Kun Wang
Journal:  Int J Mol Sci       Date:  2019-11-28       Impact factor: 5.923

8.  Genomics-assisted prediction of salt and alkali tolerances and functional marker development in apple rootstocks.

Authors:  Jing Liu; Fei Shen; Yao Xiao; Hongcheng Fang; Changpeng Qiu; Wei Li; Ting Wu; Xuefeng Xu; Yi Wang; Xinzhong Zhang; Zhenhai Han
Journal:  BMC Genomics       Date:  2020-08-10       Impact factor: 3.969

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.