Literature DB >> 35231054

Linkage disequilibrium and population structure in a core collection of Brassica napus (L.).

Mukhlesur Rahman1, Ahasanul Hoque1,2, Jayanta Roy1.   

Abstract

Estimation of genetic diversity in rapeseed is important for sustainable breeding program to provide an option for the development of new breeding lines. The objective of this study was to elucidate the patterns of genetic diversity within and among different structural groups, and measure the extent of linkage disequilibrium (LD) of 383 globally distributed rapeseed germplasm using 8,502 single nucleotide polymorphism (SNP) markers. We divided the germplasm collection into five subpopulations (P1 to P5) according to geographic and growth habit-related patterns. All subpopulations showed moderate genetic diversity (average H = 0.22 and I = 0.34). The pairwise Fst comparison revealed a great degree of divergence (Fst > 0.24) between most of the combinations. The rutabaga type showed highest divergence with spring and winter types. Higher divergence was also found between winter and spring types. Admixture model based structure analysis, principal component and neighbor-joining tree analysis placed all subpopulations into three distinct clusters. Admixed genotype constituted 29.24% of total genotypes, while remaining 70.76% belongs to identified clusters. Overall, mean linkage disequilibrium was 0.03 and it decayed to its half maximum within < 45 kb distance for whole genome. The LD decay was slower in C genome (< 93 kb); relative to the A genome (< 21 kb) which was confirmed by availability of larger haplotype blocks in C genome than A genome. The findings regarding LD pattern and population structure will help to utilize the collection as an important resource for association mapping efforts to identify genes useful in crop improvement as well as for selection of parents for hybrid breeding.

Entities:  

Mesh:

Year:  2022        PMID: 35231054      PMCID: PMC8887726          DOI: 10.1371/journal.pone.0250310

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Rapeseed (Brassica napus L., AACC, 2n = 4x = 38), is a recent allopolyploid of polyphyletic origin that evolved from hybridization events between two parental ancestors of B. oleracea (Mediterranean cabbage, CC, 2n = 2x = 18) and B. rapa (Asian cabbage, AA, 2n = 2x = 20) [1]. Rapeseed genotypes having < 2% erucic acid in seed and < 30 μM glucosinolates in seed meal is known as canola, which is the second largest oilseed crops produced in the world after soybean [2]. Canola oil is mostly used in frying and baking, margarine, salad dressings, and many other products. Because of its fatty acid profile and the lowest amount of saturated fat among all other oils, it is commonly consumed all over the world and is considered a very healthy oil [3]. Canola oil is also rich with alpha-linolenic acid (ALA), which is associated to a lower risk of cardiovascular disease [3]. Additionally, canola is utilized as a livestock meal and is the second largest protein meal in the world after soybean [4]. Rapeseed oil has various industrial usages. The rapeseed oil, being simple alkyl esters is the best alternative to diesel fuel. It is more energy-economic and environment friendly than diesel fuel [5]. The high erucic acid content in rapeseed oil also made it suitable for using as lubricants [6] and surfactants [7]. Rapeseed expresses three growth habits, winter, spring, and semi-winter. The spring canola is planted in the early spring and harvested in the late spring of the same growing season [8]. The winter type canola is seeded in the fall, vernalized over the winter to induce flower and harvested in the summer [8]. The semi‐winter type is needed for a shorter period of vernalization to induce flower [9]. Rutabaga (Brassica napus ssp. napobrassica L.) is a cool-weather root crop, grown as table vegetable and fodder for animals [10]. Likewise rapeseed, rutabaga was also derived from natural or spontaneous hybridization between B. rapa and B. oleracea [11]. European immigrants brought rutabaga to North America [12] from its center of origin Sweden or Finland [10, 13]. Likewise most cruciferous vegetables, rutabaga bears anti-cancer properties [14] and showed considerable variability for morphology, biotic and abiotic stress resistance, seed yield and quality [10, 15]. In the United States of America (USA), the canola production increased 13.5 folds from five years average of 1991–1995 (0.11 m tons) to five years average of 2015–2019 (1.49 m tons) [16]. At the same time, canola oil consumption has increased rapidly in last few years. Statistics shows, though canola production increased, but it is not enough to meet the demand. That’s why, every year USA imports huge amount of canola oil (2.50 m tons in 2019) from other countries [2]. In USA, canola production is restricted to north-central region and North Dakota (ND) is the leading canola growing state, where 83% of US canola is grown. The North Dakota State University (NDSU) canola-breeding program could play a vital role in canola economy by developing high yielding varieties, shortening the breeding cycle and expanding canola growing acreage. NDSU canola breeding has already developed few varieties and handsome amount of breeding populations. However, in recent years, the low genetic diversity of the parental stock is hampering the sustainability of the program. This happened because of same sets of parents has already been crossed in different combinations. The recent origin of B. napus as a species and its very recent domestication (400 years ago), as well as selection on few phenotypes (e.g. low erucic and glucosinolate acids, seed yield) also accelerated the low diversity which threatens sustainable improvement of the crop [17]. The narrow genetic diversity might also limit the prospects for hybrid breeding where complementing genepools are needed for the optimal exploitation of heterosis [18]. Therefore, we want to expand the genetic base of NDSU stock by incorporating diversified germplasms to existing collection. To shorten the breeding cycle and maximize genetic gain, we want to use cutting-edge breeding techniques such genome wide association mapping (GWAS) and marker-assisted selection. The knowledge of population structure, genetic relatedness, and patterns of linkage disequilibrium (LD) are also prime requirements for genome-wide association study (GWAS) and genome selection directed breeding strategies [19, 20]. Therefore, it is crucial to study, preserve, and even introduce genetic diversity into rapeseed since the diversity ensures the variability for biotic and abiotic stress resistance, and various agronomical and morphological traits. We could assess the diversity of a germplasm collection by observing the phenotypic variations or genomic variations among the individuals. Before the advent of marker technology and next generation sequencing technique (NGS), crop diversity was usually assessed based on phenotypic performance. However, phenotyping is time consuming and labor intensive. Moreover, plant growth stages and environmental factors severely affect the phenotyping, results in erroneous prediction [21]. To overcome phenotyping limitations, researchers use DNA-based molecular markers for assessing the genetic diversity. Utilization of molecular markers accelerates the pre-breeding activities, as field phenotyping and pedigree information are not required [22]. Multiple genetic diversity and population structure studies, based on molecular markers [23-27], whole genome resequencing [28], transcriptome and organellar sequencing [29] have already provided information regarding genetic diversity in various B. napus collections around the world. However, the genetic diversity of the core collection maintained by the NDSU canola-breeding program has not been revealed yet. That is why; we carried out this research to explore the genetic diversity, population structure level and relatedness among the genotypes and to investigate the linkage disequilibrium (LD) and haplotype block pattern.

Materials and methods

Plant materials

A core collection of 383 rapeseed germplasm accessions was used for this study. The core is composed of 67 advanced breeding lines developed by NDSU canola breeding program, 252 germplasm accessions collected from North Central Regional Plant Introduction Station (NCRPIS), Ames, Iowa, USA and 64 varieties collected from different countries. The breeding lines are F7 generation genotypes, obtained by crossing different parents in different combinations. Initially, we collected 500 accessions from NCRPIS and phenotyped them under field conditions. No flowering occurred in case of winter type. Among them, we choose 252 relatively homogeneous genotypes for the core collection. Finally, the core collection was composed of 155 spring, 151 winter, 60 semi-winter, and 17 rutabaga types (S1 Table). The core collection is being and will be maintained through selfing. We grouped the core collection into five subpopulations (P1 to P5) according to their type and origin. Hereafter, we referred the European winter type as subpopulation-1 (P1), Asian semi-winter type as subpopulation-2 (P2), spring type NDSU genotypes (advanced breeding lines) as subpopulation-3 (P3), spring type from different countries other than NDSU breeding lines as subpopulation-4 (P4), and rutabaga type as subpopulation-5 (P5).

Genotyping and sequencing

DNA was extracted from young leaf tissue, collected from 30 days old plants. We collected three leaf samples per genotype in tubes and flash frozen in liquid nitrogen. Each sample was composed of leaves from three different plant of same genotype. Then we lyophilized leaf tissue and ground it in tubes with stainless beads using a plate shaker. Qiagen DNeasy Kit (Qiagen, CA, USA) was used for DNA extraction (3 samples per genotype) following the manufacturer’s protocol. DNA concentration was measured using a NanoDrop 2000/2000c Spectrophotometer (Thermofisher Scientific). The sample that contains good concentration of DNA was kept and other two discarded. Then we prepared the GBS library using ApekI enzyme [30]. Finally, Sequencing of the library was done at the University of Texas Southwestern Medical Center, Dallas, Texas, USA using Illumina HiSeq 2500 sequencer.

SNP calling

SNP calling was done by TASSEL 5 GBSv2 pipeline [31] was used for SNP calling using a 120-base kmer length and minimum kmer count of ten. For alignment of the reads the rapeseed reference genome [32] (available at: ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/686/985/GCF_000686985.2_Bra_napus_v2.0/) was used. The alignment was done using Bowtie 2 (version 2.3.0) alignment tool [33]. After passing all the required steps, TASSEL 5 GBSv2 pipeline yielded 497,336 unfiltered SNPs. To obtain high quality SNPs, we filtered the raw SNPs using VCFtools [34]. Filtering criteria: minor allele frequency (MAF) ≥ 0.05, missing values (max-missing) ≤ 50%, depth (minDP) ≥ 5, min-alleles = 2 and max-alleles = 2 was maintained to have bi-allelic SNPs. This filtering yielded 53,616 SNPs. To make SNP unlinked, we thinned out SNPs present within 1,000 bp distance. The SNPs that were located outside chromosomes (i.e., position unknown), were removed. As canola is a self-pollinating crop, the SNPs that were heterozygous in more than 25% of total genotypes, were also removed using TASSEL [35]. Finally, we selected 8,502 SNP markers for this study.

Data analysis

To investigate the population structure, the core collection was differentiated into clusters using STRUCTURE v2.3.4 [36] software. For this purpose, we used admixture model with various combinations of burn-in lengths (5,000 to 100,000) and Monte Carlo Markov Chain (MCMC) lengths (5,000 to 100,000). Each combination was replicated 10 times per K (K1-K10). As we grouped the collection into five subpopulations according to their type and origin, we ran each replication considering genotype assigned to specific subpopulation as well as no subpopulation i.e. genotype unassigned to any specific subpopulation. These were done to determine the parameters needed to reach convergence. We used DeltaK approach [37] to determine the ideal number of subpopulations, which was performed by Structure Harvester [38]. We also used median (MedMedK and MaxMedK) or mean (MedMeaK and MaxMeaK) estimators of the “best” K to group the subpopulations into optimum clusters [39, 40]. Ten replicates of Q matrix were assembled using CLUMPP [41] to get individual Q matrix. Structure output was visualized using the Structure Plot v2 software [42]. Principal component analysis (PCA) was conducted by covariance standardized approach in TASSEL [35]. We constructed phylogenetic tree using MEGAX program with 1,000 bootstraps [43] using neighbor-joining (NJ) algorithm. Resulting tree was displayed using FigTree V1.4.4 [44]. We calculated analysis of molecular variance (AMOVA) to partition the genetic variance among subpopulations in Arlequin3.5. To show the divergence, we calculated average pairwise between subpopulations F values using Arlequin3.5 [45]. Tajima’s D value of each group was calculated using MEGAX software [43]. GenAlex v6.5 [46] was used to estimate percentage of polymorphic loci, number of effective alleles, Shannon’s information index, expected heterozygosity and unbiased expected heterozygosity of each marker and subpopulation. To visualize SNP density, we developed a distribution plot of SNP using R package CMplot (available at: https://github.com/YinLiLin/R-CMplot). The polymorphism information content (PIC) of markers was calculated using software Cervus [47]. To show relatedness among individuals, we calculated kinship (IBS) matrix using software Numericware i [48] on a 1 to 2 scale. The kinship heatmap and histogram were visualized using R package ComplexHeatmap [49]. The correlation between level of relatedness (IBS coefficients) and Shannon’s information index (I) and diversity (H) was calculated in R v3.5.2 [50]. Linkage disequilibrium (LD) pattern of whole collection and different subpopulations were analyzed using PopLDdecay [51]. The mean linked LD was calculated by dividing total r value with total number of corresponding loci pair. In this case, r > 0.2 was considered only. Same procedure was followed to calculate mean unlinked LD where r ≤ 0.2 was considered. Haplotype block analysis was done using PLINK [52] with a window size of 5 Mb. Confidence interval (CI) method [53] was used to identify haplotype blocks with high LD. Haplotype blocks (>19 kb), observed in one subpopulation but not in the other, were considered to be subpopulation-specific block. Haplotype blocks (>19 kb) shared by more than one subpopulation, were considered to be common to corresponding subpopulations.

Results

SNP profile

We used 8,502 SNPs, covering 19 chromosomes for this study. The marker density was one per 99.5 kb. Highest number (685 SNPs, 8.06%) markers was situated on chromosome A3 and lowest (236 SNPs, 2.78%) was on chromosome A4. In terms of density, it was highest on chromosome A7 (71.1 kb) and was lowest on chromosome C9 (134.5 kb) (Table 1, Fig 1).
Table 1

Chromosome-wise distribution of SNP markers.

ChromosomeNo. of SNPs% SNPsStart position aEnd position bLength (Mb)Density c (Kb)
A14405.181491633580607535.781.0
A23924.61134303469290534.788.5
A36858.0627694910358349.171.7
A42362.78328052351767123.599.5
A54134.86186683143510531.476.1
A64485.271204093600510335.980.1
A73844.52858692738832227.371.1
A82813.312314272773441027.597.9
A95416.36814044584126845.884.6
A103053.591338532208573722.072.0
C14455.23866715066087250.6113.7
C25896.93924316826022268.2115.7
C36517.6638398036588980.36123.4
C46347.461389307050741770.4111.0
C53664.30267604412449744.1120.5
C64144.872751904547932745.2109.2
C75186.092711136230482762.0119.8
C83834.50579344631742946.3120.8
C93774.439208855162708650.7134.5
Mean447.4799.5

a Position of the 1st marker on a particular chromosome corresponding to reference genome

b Position of the last marker on a particular chromosome corresponding to reference genome

c Density was calculated by dividing the length with the marker number.

Fig 1

Chromosome-wise SNP density map.

Frequency of SNPs varies according to color gradient.

Chromosome-wise SNP density map.

Frequency of SNPs varies according to color gradient. a Position of the 1st marker on a particular chromosome corresponding to reference genome b Position of the last marker on a particular chromosome corresponding to reference genome c Density was calculated by dividing the length with the marker number. The transition SNPs (4,956 SNPs) was more frequent than transversions (3,546 SNPs) with a ratio of 1.40. The ratio of transitions to transversions SNPs was higher in A genome (1.41) than that of in C genome (1.38). In both genome, G/C transversions were lowest (4.33% and 4.29%), but A/G and C/T transitions occurred in almost similar frequencies (Table 2). The inbreeding coefficient within individuals (F), inbreeding coefficient within subpopulations (F), observed heterozygosity (Ho) and fixation index (F) of all the markers ranged from -0.45 to 1.00, 0 to 0.73, 0 to 0.57 and 0.40 to 1.00, respectively. The mean Shannon’s information index (I) of all markers 0.37 with a range from 0.10 to 0.69. The expected heterozygosity (He) was from 0.05 to 0.50 with a mean value of 0.27. The polymorphic information content (PIC) of all markers was less than 0.50 with a mean value of 0.22 (range: 0.05 to 0.37) (S2 Table). Subpopulation-wise marker diversity parameters are presented in S3 Table.
Table 2

Transition and transversion SNPs across the genome.

GenomeSNP typeModelNo. of sitesFrequencies (%)Total (percentage)
ATransitionsA/G119514.062416 (28.3%)
C/T122114.36
TransversionsA/T4575.381709 (20.1%)
A/C4244.99
G/T4605.41
G/C3684.33
CTransitionsA/G127314.972540 (29.9%)
C/T126714.90
TransversionsA/T4965.831837(21.6%)
A/C4825.67
G/T4945.81
G/C3654.29

Population structure

We did structure analysis seven times with accessions unassigned and seven times with accession assigned to their type and countries of origin. Delta K approach indicated 3 to 9 clusters (Fig 2A and 2B), while four alternative statistics (MedMedK, MedMeaK, MaxMedK, and MaxMeaK) determined following Puechmaille [39] and Li and Liu [40] indicated 3 clusters (Table 3, Fig 2C). For each run, Delta K approach showed differences in cluster number for both conditions: genotypes unassigned or assigned to their respective type and countries of origin. However, opposite scenario was found for MedMedK, MedMedK, MedMedK, and MaxMeaK statistic i.e., for each run it indicated three clusters. These outputs confirmed that Puechmaille [39] and Li and Liu [40] method was more consistent than Evanno [37] method (Table 3). Structure analysis revealed that 70.76% of genotypes belong to any of the three clusters at similarity coefficient of 0.7 and 29.24% of genotypes are admixed (Table 4, Fig 2D). Spring type accessions fall under cluster-1, whereas winter type European accessions fall under cluster-3. Cluster-2 consists of all rutabaga types and different type rapeseed accessions (Table 4). We performed principal component analysis (PCA) to show the genetic similarity among subpopulations and genotypes. The first two axes explained 21% (PCA1 13.5% and PCA2 7.22%) of the total observed variation (S4 Table). The PCA revealed that rutabaga (P5) and other types having Asian origin make one group, whereas spring type (P3, P4) and European winter type (P1) make two distinct groups (Fig 3). In addition to that, we also constructed unrooted phylogenetic tree based on neighbor joining (NJ) criteria (Fig 4). The output of neighbor-joining (NJ) tree analysis was in line with that of PCA.
Fig 2

Bayesian clustering of whole collection using 8,502 SNP markers in STRUCTURE v.

2.3.4. Graphical representation of optimal number of clusters (K) determined by Evanno’s method [37] with genotypes unassigned (A) and assigned (B) to their respective countries, as well as by Puechmaille [39] and Li and Liu [40] method (C). Estimated population structure of 383 rapeseed genotypes on K = 3 (D) using Puechmaille [39] and Li and Liu [40] method.

Table 3

Clustering of core collection based on Evanno et al. (2005) [37] and Puechmaille et al. (2016) [39] methods using different combinations of burn-in lengths and Markov Chain Monte Carlo (MCMC) lengths.

Run #Burn-in lengthsMCMC lengthsNumber of clusters (K)Number of RepsNumber of populationsαNumber of populationsβ
ΔK (Unassigned) aΔK (Assigned) bMedMedKMedMeaKMaxMedKMaxMeaK
1500050001010363334
210000100001010883333
320000200001010833333
420000500001010833333
550000500001010963333
6500001000001010933333
71000001000001010373334

α The ad hoc ΔK method [31]

aAccessions unassigned to any subpopulation

bAccessions assigned to subpopulation based on type and origin

βThe median (MedMedK and MaxMedK) or mean (MedMeaK and MaxMeaK) [33] estimators used to determine the number of cluster (K).

Table 4

Proportion of admixed and non-admixed accessions per subpopulation based on membership coefficients.

Cluster (K)Core collection subpopulation based on type and originaTotal Number
P1: Winter (151)P2: Semi-winter (60)P3: Spring_mixed origin (88)P4: Spring_NDSU (67)P5: Rutabaga (17)
K131258390112
K21512801752
K3996110107
Admixture b343021270112
In-Cluster77.48%50%76.13%59.71%100%70.76%
Admixture22.52%50%23.87%40.29%0%19.24%

a Number of genotypes having q ≥ 0.7 were assigned to specific cluster.

b Genotypes having q < 0.7 were considered as admixed genotype.

Fig 3

Principal component analysis of SNP diversity based on genetic distance.

Colors represent subpopulations.

Fig 4

Phylogenetic tree (unrooted) based on neighbor-joining (NJ) algorithm using information from 8,502 SNP markers based on 1000 bootstraps.

Each branch is color-coded according to genotype belongs to subpopulation P1 to P5. Genotypes were grouped into three clusters by dividing the tree using black solid lines according to structure output.

Bayesian clustering of whole collection using 8,502 SNP markers in STRUCTURE v.

2.3.4. Graphical representation of optimal number of clusters (K) determined by Evanno’s method [37] with genotypes unassigned (A) and assigned (B) to their respective countries, as well as by Puechmaille [39] and Li and Liu [40] method (C). Estimated population structure of 383 rapeseed genotypes on K = 3 (D) using Puechmaille [39] and Li and Liu [40] method.

Principal component analysis of SNP diversity based on genetic distance.

Colors represent subpopulations.

Phylogenetic tree (unrooted) based on neighbor-joining (NJ) algorithm using information from 8,502 SNP markers based on 1000 bootstraps.

Each branch is color-coded according to genotype belongs to subpopulation P1 to P5. Genotypes were grouped into three clusters by dividing the tree using black solid lines according to structure output. α The ad hoc ΔK method [31] aAccessions unassigned to any subpopulation bAccessions assigned to subpopulation based on type and origin βThe median (MedMedK and MaxMedK) or mean (MedMeaK and MaxMeaK) [33] estimators used to determine the number of cluster (K). a Number of genotypes having q ≥ 0.7 were assigned to specific cluster. b Genotypes having q < 0.7 were considered as admixed genotype.

Population diversity

Polymorphic loci percentage was greater than 75% in all subpopulations. P1 bears highest (99%) polymorphic loci, whereas it was lowest in P5 (75%). The diversity (H) was lowest in P4 and P5 (0.19) and was highest in P2 (0.25) with an average of 0.22. The Shannon’s information index (I) ranged from 0.31 (P4 and P5) to 0.40 (P2) with an average of 0.34. The Tajima’s D value ranged from -0.70 (P4) to 0.53 (P1) with an average of 0.13 (Table 5).
Table 5

Subpopulation-wise diversity parameters.

SubpopulationsPolymorphic loci (%)Na aNe bI cH dUh eTajima’s D*
P199.121.991.320.350.210.210.53
P294.321.941.400.400.250.250.30
P396.981.971.350.360.220.230.30
P480.671.811.300.310.190.19-0.70
P575.251.751.310.310.190.200.23
Mean89.271.891.340.340.220.220.13

a No. of different alleles

b No. of effective alleles

c Shannon’s information index

d Diversity

e Unbiased diversity. SE (standard error) was zero in all cases. Indices calculated using 8191 SNPs with GenAlex v. 6.5.

* was calculated with 1000 permutations.

a No. of different alleles b No. of effective alleles c Shannon’s information index d Diversity e Unbiased diversity. SE (standard error) was zero in all cases. Indices calculated using 8191 SNPs with GenAlex v. 6.5. * was calculated with 1000 permutations.

Population genetic differentiation

The analysis of molecular variance (AMOVA) revealed that variance among subpopulations covered 24% of total variation and rest of its was covered by among individual variance (Table 6) with a F and Nm value of 0.24 and 1.28, respectively.
Table 6

Summary of AMOVA.

Sources of variationd.f.Sum of squaresVariance components% of variation F st N m
Among subpopulations4130814.6228.5***23.50.241.28
Within subpopulations761565699.6743.476.5
Total765696514.1971.9

*** indicates p < 0.001 for 1023 permutations.

*** indicates p < 0.001 for 1023 permutations. We found significant (p < 0.01) between subpopulation F in all combinations. Except combinations P3 and P4 (0.11), P1 and P2 (0.19), we found F > 0.20 for all combinations. The pairwise F > 0.30 was observed between P1 and P5, P3 and P5, P4 and P5 (Table 7).
Table 7

Genetic differentiation among subpopulations.

Subpopulation pairwise Fst
 P1P2P3P4P5
P10
P20.19**0
P30.25**0.24**0
P40.21**0.24**0.11**0
P50.34**0.24**0.34**0.39**0

Diagonal values are pairwise F comparisons, performing 1000 permutations using Arlequin v. 3.5.

**indicates p < 0.01.

Diagonal values are pairwise F comparisons, performing 1000 permutations using Arlequin v. 3.5. **indicates p < 0.01. Kinship analysis showed that the IBS coefficients of the collection ranged from 1.21 to 1.94 with an average coancestry 1.47 between any two canola genotypes (Fig 5, S5 Table). Under P2 subpopulation, almost 50% of total genotypic pairs shows IBS coefficients less than 1.50. In case of other subpopulation, portion of genotypic pairs having IBS coefficient less than 1.50, was very low (Table 8, S1 Fig).
Fig 5

Heatmap of kinship matrix of entire collection.

Table 8

Summary of subpopulation-wise kinship (IBS) matrix.

SubpopulationsWhole collectionP1P2P3P4P5
IBS coefficients range1.21–1.941.40–1.941.27–1.931.29–1.931.46–1.941.35–1.92
Mean of IBS coefficients1.471.581.491.551.621.60
Pairs having ≤ 1.50 IBS coefficients (%)63.99.650.721.71.118.0
Pairs having ˃ 1.50 IBS coefficients (%)36.190.449.378.398.982.0
We also performed correlation analysis between mean pairwise relatedness (IBS coefficients) among individuals within subpopulation and Shannon’s information index (I), diversity (H). The I and H were significantly and negatively correlated with relatedness (r = -0.97, -0.98, and p < 0.01), respectively.

Linkage disequilibrium pattern

Subpopulation, genome, and chromosome-wise linkage disequilibrium (LD) pattern was investigated. LD = r values showed inverse relationship with distance i.e., mean LD was high (r > 0.22) at short distance bin (0–2 kb) and decreases with bin distance increment (S6 Table). In the entire collection considering both A and C genome, the mean linked LD and mean unlinked LD was 0.44 and 0.02 respectively; and loci pair under linked LD and unlinked LD was 1.81% and 98.20%, respectively. Subpopulation-wise mean linked LD ranged from r = 0.41 (P2) to r = 0.48 (P1). Subpopulation P5 harbored highest (8.76%) loci pair in linked LD and it was lowest in P1 (1.52%). The mean linked LD, mean LD and loci pair under linked LD was always higher in all cases in case of C genome than that of A genome (Table 9). We also compared the LD decay rate based on distance at which LD decayed to its half maximum (half-life), which is the point at which the observed r between sites decays to less than half the maximum r value. In the whole collection, LD decayed to its half maximum within < 45 kb distance for whole genome, < 21 kb for A genome, and < 93 kb for C genome. In all subpopulations, the distance for LD decay to its half maximum was always higher for C genome than A genome. LD decay rate also varied according to chromosome (S2 and S3 Figs). LD decay was lowest in chromosome C1 (348 kb) and C2 (244 kb), but was highest in chromosome A5 (13 kb) and A1 (16 kb) (S7 Table). LD decayed to its half–maximum within < 29 kb for P1, <45 kb for P2 & P3, <101 kb for P4, and <120 kb for P5. In all subpopulations, LD persisted also longest in all chromosomes of C genome than that of A genome (Fig 6, S7 Table).
Table 9

Linkage disequilibrium in the studied collection.

SubpopulationMean linked LD aMean unlinked LD bMean LD cLoci pairs in linked LD (%)Loci pairs in unlinked LD (%)
AC_Genome
Whole collection0.440.020.031.8198.2
P10.480.010.021.5298.5
P20.410.020.032.6597.4
P30.450.020.031.9498.1
P40.450.020.043.9896.0
P50.430.030.078.7691.2
A_Genome
Whole collection0.330.020.021.3498.7
P10.380.010.021.1298.9
P20.320.020.032.0298.0
P30.360.020.021.4198.6
P40.400.020.033.4596.6
P50.380.040.067.0692.9
C_Genome
Whole collection0.500.020.032.2197.8
P10.520.010.021.8398.2
P20.460.020.043.2796.7
P30.500.020.032.3597.7
P40.480.020.044.4195.6
P50.460.030.0810.5789.4

a Mean linked LD was calculated by dividing total r (r > 0.2 was considered) value with total number of corresponding loci pair.

b Mean unlinked LD was calculated by dividing total r (r ≤ 0.2 was considered) value with total number of corresponding loci pair.

c Mean LD was calculated by dividing total value with total number of corresponding loci pair.

Fig 6

Linkage disequilibrium (LD) differences and decay pattern among subpopulations.

a Mean linked LD was calculated by dividing total r (r > 0.2 was considered) value with total number of corresponding loci pair. b Mean unlinked LD was calculated by dividing total r (r ≤ 0.2 was considered) value with total number of corresponding loci pair. c Mean LD was calculated by dividing total value with total number of corresponding loci pair. We also performed haplotype block (HBs) analysis to investigate LD variation patterns across whole genome. A total 200 blocks covering 18 Mb out of the 976 Mb anchored B. napus reference genome [32], were identified. A and C genome contained 67 and 133 haplotype blocks, respectively. The total length of A and C genome specific HBs were 1.8 Mb and 16 Mb, respectively. The total length of HBs varied greatly from chromosome to chromosome. Total HBs length varies from 24 kb on A1 to 901 kb on A9 in A genome and in C genome it varies between 40 kb on C9 to 3,610 kb on C2. The haplotype block (HBs) number and size in C genome chromosome was always higher than that of A genome chromosome (Table 10). We analyzed subpopulation specific and common HBs. We found C genome chromosome bears more subpopulation specific HBs than A genome chromosome (Table 11). We also found some HBs were shared by different subpopulations, but we did not find any HBs blocks that was shared by all five subpopulations (Table 12). The shared HBs were usually located on C genome chromosome. Rutabaga type shared different HBs with other types also.
Table 10

Subpopulation-wise number and length of haplotype blocks (HBs) along chromosomes.

 Entire panelP1P2P3P4P5
Chr.NoaSizebNoaSizebNoaSizebNoaSizebNoaSize bNoaSizeb
A152467332565575208000
A2880796111865431500
A36465295448511092718
A4546127217001600
A575721335035138542711
A69308656442955255100
A7870772215741291092113
A843041232122346265400
A91090176214149366491393121
A105296502211654511
C1233099143314144638192947204295114
C22636111931419450322375616523773594
C31496913930131480161603910702192
C416344015535115533014450213606343989
C59423841010516711487267912
C61812049972711911312061533573954
C7172394172583770162479101414113
C848935865101289694171143173
C964174944387223311114
AC Genome2001793815519264127216151682097515833956268888
A Genome671865481648382160482168588688543
C Genome133160731071761689194551201880710025267218845

a Number of haplotype blocks on each chromosome.

b Total length of haplotype blocks for each chromosome in kb.

Table 11

Subpopulation specific number and length of haplotype blocks (HBs) along chromosomes.

 P1 specificP2 specificP3 specificP4 specificP5 specific
Chr.No1Size2No1Size2No1Size2No1Size2No1Size2
A11683.0600.0000.0021491.2400.00
A2139.0300.002598.9700.0000.00
A300.0000.0000.002878.6500.00
A4127.0200.0000.0000.0000.00
A500.002491.271109.372408.8500.00
A61522.4300.00123.0400.0000.00
A7128.4200.003392.3651054.8200.00
A800.0000.0000.0032411.2800.00
A900.0041409.29136.8731339.60120.50
A1000.0000.0000.00122.1500.00
C141723.8843236.731800.2972334.0700.00
C261537.9274464.1862741.6353464.3152796.14
C32616.6541024.204724.952378.7600.00
C441764.0762844.272237.7375148.8413439.16
C5129.992121.081771.3122315.2000.00
C63795.861691.724782.7642310.661715.45
C732085.4300.004767.172511.8800.00
C800.001390.001885.2141121.11173.33
C9127.362262.00132.6100.0000.00

1 Number of specific haplotype blocks longer than 19 kb on each chromosome

2 Total length (kb) of specific haplotype blocks longer than 19 kb on each chromosome.

Table 12

Shared haplotype blocks (HBs) among subpopulation along chromosomes.

Chr.Shared HBs (size and corresponding subpopulation) a
A119.991 (P1, P4), 515.231 (P3, P4)
A20
A30
A40
A50
A60
A70
A8232.016 (P3, P4)
A919.986 (P1, P2, P4)
A100
C120.457 (P1,P2, P3, P4), 38.038 (P1, P3), 134.065 (P2, P3, P4), 241.815 (P2, P3), 260.121 (P2, P3, P4), 336.839 (P1, P3), 374.884 (P3, P4), 438.341 (P1, P4), 652.629 (P3, P4), 718.372 (P1, P2)
C220.408 (P1, P2, P4), 28.252 (P1, P4), 164.414 (P3, P4), 729.678 (P1, P3, P4), 781.808 (P1, P4, P5)
C339.715 (P1, P4), 191.098 (P2, P5), 202.569 (P1, P2, P3), 611.558 (P3, P4)
C498.616 (P3, P5), 149.89 (P1, P2, P3), 378.898 (P1, P3), 436.853 (P1, P2, P3), 447 (P1, P3, P5), 601.227 (P2, P3), 867.133 (P1, P3, P4), 1265.92 (P1, P2, P3)
C5337.986 (P1, P2, P3, P4)
C696.217 (P4, P5), 136.404 (P3, P4), 142.519 (P2, P5), 237.159 (P3, P4), 308.114 (P2, P4)
C7390.085 (P1, P3), 828.502 (P3, P4)
C8255.517 (P1, P2), 592.433 (P1, P2)
C9171.465 (P2, P3)

a Length (kb) of common NBs longer than 19 kb on each chromosome with their corresponding subpopulation shown in parenthesis.

a Number of haplotype blocks on each chromosome. b Total length of haplotype blocks for each chromosome in kb. 1 Number of specific haplotype blocks longer than 19 kb on each chromosome 2 Total length (kb) of specific haplotype blocks longer than 19 kb on each chromosome. a Length (kb) of common NBs longer than 19 kb on each chromosome with their corresponding subpopulation shown in parenthesis.

Discussion

Genotyping-by-sequencing [30] is one approach to obtain high frequency SNPs. The strategy has been used for population genetic studies, association mapping, and proven to be a powerful tool to dissect multiple genes/QTL in many plant species [54-56]. We obtained 497,336 unfiltered SNPs markers of which 8,502 high quality SNP markers were used for genetic diversity analysis of 383 genotypes. Delourme et al. (2013) [23] conducted genetic diversity analysis in B. napus using 7,367 SNP markers of 374 genotypes. However, different marker technologies such as Single Sequence Repeat (SSR), Sequence Related Amplified Polymorphism (SRAP) markers have been used by other researchers for genetic diversity analysis in B. napus. Chen et al. (2020) [57] used 30 SSR markers, Wu et al. (2014) [58] utilized 45 SSR markers, Ahmad et al. (2014) [59] used 20 SRAP markers for genetic diversity and population structure analysis of B. napus. Earlier, our group conducted a genetic diversity study of flax using 373 germplasm accessions with 6200 SNP markers [60]. The SNP markers were distributed throughout 19 chromosomes of B. napus and the marker density was one per 99.5 kb. This is comparable density to earlier study conducted by Delourme et al. (2013) [23]. Therefore, this marker density provides a sufficient resolution to estimate genome-wide diversity as well as the extent of LD within the genome. This marker density will also help in association mapping studies to identify a causal locus/loci or linked loci that can be further used either in MAS or to pinpoint the causative locus [61] especially for oligogenic traits. However, for polygenic traits such as seed yield, it is better to incorporate more markers for genome wide association studies. The core collection utilized in this study represents mostly adapted lines from various breeding programs. Therefore, sources of variation, markers of interest identified in the collection can be directly used in breeding programs. We have identified higher frequency of transition SNPs over transversion SNPs that is an agreement with Bus et al. (2012) [62], Clarke et al. (2013) [63], and Huang et al. (2013) [64] in B. napus. Higher number of transition SNPs over transversion is also reported in other crop species such as Hevea brasiliensis [65], Camellia sinensis [66], Camelina sativa [67], and Linum usitatissimum [60]. To assess the suitability of marker for linkage analysis and diversity, we calculated PIC and expected heterozygosity (He) of markers [68]. In our research, the PIC value is ranged from 0.05 to 0.35 indicating that the markers are modestly informative. The similar lower PIC value (0.1 to 0.35) was reported by Delourme et al. (2013) [23] in B. napus. The lower PIC value is a result of bi-allelic nature of SNP markers and probable low mutation rate [69]. In our study, the He value of each marker was always greater than corresponding PIC value indicating an average lower allele frequency in the population [68].

Population diversity and structure

We have identified a moderate diversity (average H = 0.22) within the subpopulations. B. napus is capable of self-pollination, and little cross-pollination may be occurred by insect. Being a mostly self-pollinated crop a low to moderate subpopulation diversity in B. napus is expected. Low to moderate diversity was also found in previous studies [70-72]. Along with the reproduction system, one needs to look at evolution and domestication history for explaining low to moderate levels of diversity in B. napus. This allopolyploid species originated at Mediterranean coast because of a natural cross between B. rapa and B. oleracea which occurred approximately 0.12–1.37 million years ago [73, 74]. The domestication of B. napus occurred very recently, around 400 years ago with the first rapeseed being most likely a semi-winter type due to the mild climate in the region [75, 76]. Later on, European growers developed the winter and spring type Brassicas through selection for cold hardiness or early flowering to expand its cultivation in further North in the last century [77]. Therefore, the low to moderate diversity in winter and spring B. napus can be mostly explained by a recent history of the species, followed by infrequent exchange of genetic material with other Brassicas [23], as well as by the traditional breeding practices selecting for only few phenotypes. In our study, the more diversity in semi-winter type (P2, H = 0.25) than winter (P1, H = 0.21) and spring (P4, H = 0.19) type is supported by its domestication history. The N value was greater than one, which indicates that there was enough gene flow among semi-winter, winter, and spring types. These findings also support the evolution of winter and spring types from semi-winter type. In this research, Tajima’s D value was calculated to identify the extent of availability rare and unique alleles [78]. Recently, the NDSU canola-breeding program developed the P4 advanced breeding lines through crossing different genetic resources including winter, spring, and semi winter types and subsequent selection. This current expansion of P4 was supported by its negative Tajima’s D value, which harbors more rare alleles [79]. The subpopulation P1, P2, P3, and P5 showed positive Tajima’s D value indicating an excess of intermediate frequency alleles, which may be caused by balancing selection, population bottleneck, or population subdivision. Previously, negative Tajima’s D values were found in spring and winter type B. napus accessions [80]. The negative correlation between diversity indices (H and I) and relatedness (average IBS coefficients) indicates that subpopulation differentiation was also due to selfing and genetic drift. Flax [60] and Arapaima gigas species [81] also showed same scenario. To exploit diversity and transgressive segregation, parents from divergent group should be crossed. Pairwise F statistic, a parameter describing population structure differentiation [82], was estimated among five subpopulations. In the present study all pairwise F values comprising both low and high values, were statistically significant. Similar results were also found in other studies [80, 83–85]. Lower pairwise F (0.11) was identified between spring type originated in USA (P4) and spring type originated in other countries (P3). This is reasonably justified as both subpopulations comprise of spring type genotypes and germplasm exchanged occurred between USA and other countries. It also indicating that we will not get higher genetic diversity in population if we use only spring types in the crossing program. But this combination is good for accumulating specific elite trait if the targeted trait is found in members of one and missing from the members of another group. We found spring type (P3 and P4) genotypes are greatly divergent (F > 0.20) from winter and semi-winter type (P1 and P2) genotypes. Utilization of genotypes from these groups in crossing program will broaden the genetic base of developed population results in high heterosis. This potentiality has already been proved as hybrids between the Chinese semi-winter and European (including Canada) spring type exhibited high heterosis for seed yield [86]. The P5 (rutabaga type) showed the higher F with other subpopulations such as the highest F was observed between P5 and P4 (NDSU spring type) followed by P3 (other spring type), P1 (winter type) and P2 (semi-winter type). This outcome clearly shows that rutabaga is genetically distinct from spring and winter type canola that is confirmed by previous studies [75, 87, 88]. This distinctness of rutabaga can be exploited through heterosis breeding. Several previous studies have already showed rutabaga as a potential gene pool for the improvement of spring canola [89, 90]. NDSU canola breeding program also utilized winter and rutabaga types in the breeding program for increasing genetic diversity and for improvement of spring canola. AMOVA showed that variation among individual within subpopulation captured greater portion of total variation, than that by among subpopulation. This finding is also supported by earlier researches [56, 72, 91, 92]. This finding supports that within subpopulation genotype from P2, P3, and P1 could be crossed as they showed high diversity (H > 0.20) for cultivar development. Principal component analysis and distance-based population structure analysis such as NJ tree yielded three subgroups in the core collection. Here, we ran structure analysis many times to obtain convergence before the best number of clusters was determined. It was done because previous studies [39, 93] reported that STRUCTURE program did not depict the main clusters within a collection. Based on Evanno’s ΔK method [37] and MedMedK, MedMeaK, MaxMedK and MaxMeaK statistics [39], structure analysis divided the core collection into three distinct clusters. Cluster-1 contains spring type NDSU advanced breeding lines (P4) and spring type (P3) other than those. This finding is supported by low genetic differentiation (F = 0.11) between P3 and P4 due to sharing of parents by advanced breeding lines from P3. Cluster-3 is solely dominated by European winter type (P1) genotypes. This is also supported by high F between winter and other types which may be due to geographic barriers between Europe and America, Asia. Cluster-2 contained all rutabaga types as well as other type Asian genotypes which indicates that all types share considerable amount of SNP markers attributing to this cluster. These findings also indicate that there is gene flow among different types, which is also supported by N > 1. Structure analysis revealed that all clusters contained both non-admixed as well as admixed (share alleles attributed to different subpopulation) genotypes. For broadening genetic diversity of population, non-admixed genotypes should be crossed. However, for improving or introgression of specific traits, admixed genotypes could also be crossed which will reduce the population size required for phenotypic screening. However, structure analysis may overestimate the differentiation among individuals, as the individuals may not share alleles from same ancestors [94]. Since a breeder would like to combine historically never combined favorable alleles, IBS values directs which individuals should be crossed. Low IBS is the best. However, self-pollinated crops exhibit higher kinship values than cross-pollinated crops, as homozygosity increases probability of being identical by state [95]. We found approximately 64% of pairwise coancestry ranged from 1.21 to 1.50. Crossing among genotypes from subpopulation P2 will demonstrate more diversity, than that of other subpopulations, as most genotypic combinations of P2 shows low IBS coefficients than others do. This finding is in line with the evolutionary history of B. napus where semi-winter type is the base population containing more divergence. Gradually this diversity is narrowed down in P3 (spring type, mixed origin) and P1 (winter type), because genotypic pairs belong to P3 and P1 having high IBS values evolved from semi-winter type [77]. Subpopulation P4 exhibited highest number of pairs having IBS > 1.5, which is obvious as these genotypes are advanced breeding lines developed from crossing of same set of parents in different combinations. Genotypic pairs of P5 (rutabaga type) also showed high coancestry may be due to the duplicates which is supported by low genetic differentiation of Nordic rutabaga accessions [27]. We could discard the duplicates during the crossing program.

Linkage disequilibrium

Linkage disequilibrium can be defined as the correlation among polymorphisms in a given population [96]. The strength of association mapping relies on the degree of LD between the genotyped marker and the functional variant. Linkage disequilibrium analysis provides insight into the history of both natural and artificial selection (breeding) and can give valuable guidance to breeders seeking to diversify crop gene pools [17]. SNPs in strong LD are organized into haplotype blocks, which can extend even up to few Mb based on the species and the population used. Genetic variation across the genome is defined by these haplotype blocks. Haplotypes, which are subpopulation-specific, are defined by various demographic parameters like population structure, domestication, and selection in combination with mutation and recombination events. Conserved haplotype structure can then be used for the identification and characterization of functionally important genomic regions during evolution and/or selection [97]. In addition, the extent of LD needs to be quantified across the genome at high resolution (down to approximately one Kbp) [98]. The information is important for choosing crossing schemes, association studies and germplasm preservation strategies [99-102]. We used markers from across the genome to quantify the LD for the core collection. Low level of LD was evident for each individual subpopulation in A, C, and whole gnome. The low level of LD can be due to multiple factors. First, canola is a partially outcrossing species with an average of 21–30% of cross-pollination [103-105]. The outcrossing occurring in canola leads to more recombination and to a breakdown of haplotype blocks. Secondly, the ancestral history of canola is limited in comparison with other crops, such as rice, common bean, wheat, and corn, restricting the selection of desirable haplotypes during the evolution. In other words, there was no adaptation or domestication pressure on the species, which would lead towards positive selection. Third, the only selection pressure imposed on the species for a relatively short time was breeding. However, the breeding practices were biased towards selection of only few phenotypes. Additionally, the short period under selection pressure might have not been sufficient to select favorable haplotypes in the genome. Fourth, since canola cultivars with different growth habits are compatible there has been always gene flow present between them contributing to the low level of LD. The N >1 was observed in this study, which supports this gene flow. Fifth, the restriction enzyme used to develop the libraries for sequencing of the core collection helped in identification of SNPs largely residing in genic regions, which are prone to high recombination, contributing to the low level of LD. Finally, the low level of LD may be due to thinning of markers, as we did not use all markers (53,616) for LD analysis rather used 8,502 markers after thinning. That can be confirmed by analysis the LD using whole marker set in further analysis. In this study, we have identified that the LD decay in B. napus varied across chromosomes of both A and C genomes. In addition, LD in C genome decayed much slower than A genome. C genome also contained larger haplotype blocks than A genome. This LD patterns are consistent with previous findings [17, 26, 106–108]. The slower LD decay and presence of long haplotype blocks in C genome indicates that high level of gene conservation could have resulted from limited natural recombination or could be exchanged of large chromosomal segment during evolution. In the whole genome, presence of subpopulation specific haplotype blocks suggests that these regions had been experienced selection pressure for specific geographic regions adaptation. In all subpopulations, presence of shorter haplotype blocks in A genome than C genome reveals that B. rapa progenitor of B. napus containing A genome, which has been used as oilseed crop and probably being used in hybridization process. Sharing haplotype blocks by different subpopulations especially in C genome also confirms its conserved nature. The low level of LD or haplotype blocks has implications for association mapping and a proper experimentation design is necessary for utilizing a reduced set of markers by tagging major haplotypes [109]. Though low LD of A genome requires more markers to pinpoint the location of various QTL, but once a marker is found to be significantly associated with a phenotype, there might be a higher probability of identifying the casual gene than that of C genome.

Conclusions

This study provides a new insight to select the best parents in crossing plan to maximize genetic gain in the population. The population structure analysis showed a clear geographic and growth habit related clustering. The rutabaga type showed the highest genetic divergence with spring and winter types accessions. Therefore, the breeding strategies to increase the genetic diversity may include generating population from rutabaga and spring crosses, or using rutabaga and winter crosses. The linkage disequilibrium analysis revealed the decay pattern and haplotype blocks in A and C genome. This output will help the breeder to formulate breeding strategies to develop improved cultivars using modern breeding tools by utilizing this collection and SNP markers.

List of the genotypes analyzed in this study.

(XLSX) Click here for additional data file.

Marker diversity parameters.

(XLSX) Click here for additional data file.

Subpopulation-wise marker diversity parameters.

(XLSX) Click here for additional data file. (XLSX) Click here for additional data file.

Kinship matrix.

(XLSX) Click here for additional data file.

Mean LD values according to distance.

(XLSX) Click here for additional data file.

Subpopulation-wise and chromosome-wise LD decay rate (Kb) within each subpopulation.

(XLSX) Click here for additional data file.

Histogram of IBS coefficients.

(TIFF) Click here for additional data file.

Chromosome-wise LD decay rate (Kb) in A genome considering whole collection.

(TIFF) Click here for additional data file.

Chromosome-wise LD decay rate (Kb) in C genome considering whole collection.

(TIFF) Click here for additional data file. 3 Jun 2021 PONE-D-21-09997 Linkage disequilibrium and population structure in a core collection of Brassica napus (L.) PLOS ONE Dear Dr. Rahman , Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jul 18 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see:  http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at  https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols . We look forward to receiving your revised manuscript. Kind regards, Tzen-Yuh Chiang Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. In your revised cover letter, please address the following prompts: 2a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. 2b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. We will update your Data Availability statement on your behalf to reflect the information you provide. 3. Thank you for stating the following financial disclosure: The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. At this time, please address the following queries: a)              Please clarify the sources of funding (financial or material support) for your study. List the grants or organizations that supported your study, including funding received from your institution. b)              State what role the funders took in the study. If the funders had no role in your study, please state: “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.” c)              If any authors received a salary from any of your funders, please state which authors and which funders. d)              If you did not receive any funding for this study, please state: “The authors received no specific funding for this work.” Please include your amended statements within your cover letter; we will change the online submission form on your behalf. 4. Thank you for submitting the above manuscript to PLOS ONE. During our internal evaluation of the manuscript, we found significant text overlap between your submission and the following previously published works, some of which you are an author. https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-020-06922-2 We would like to make you aware that copying extracts from previous publications, especially outside the methods section, word-for-word is unacceptable. In addition, the reproduction of text from published reports has implications for the copyright that may apply to the publications. Please revise the manuscript to rephrase the duplicated text, cite your sources, and provide details as to how the current manuscript advances on previous work. Please note that further consideration is dependent on the submission of a manuscript that addresses these concerns about the overlap in text with published work. We will carefully review your manuscript upon resubmission, so please ensure that your revision is thorough. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: No Reviewer #2: Partly Reviewer #3: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: Yes Reviewer #3: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: No Reviewer #3: No ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: No Reviewer #2: No Reviewer #3: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors proposed to analyse the genetic diversity present in a collection of 383 accessions of Brassica napus representing genotypes present worldwilde and aims at characterizing the genepool present in US breeding programs. This seems quite interesting and the size of of the populations should be sufficient to draw interesting conclusions. However hypotheses and research questions are not clearly expressed at the end of the introduction, and too many extra data are provided and not used in the mansucript (for instance flowering time data). Moreover, many analyses are performed , but may be too much without really refering to a global strategy and the global aim of each batch of analyses. Detailled analysis and questions are reported in the attached file A rapid check at table S1 showed some problems considering geogagraphic origins of different accessions for instance Lembkes, Krapphauser, fertodi => Germany instead of South Korea Dramor (Poland) should be Darmor (France) I think Bienvenu (USA) should be Bienvenu (France) Jet-neuf (Canada) should be Jet-neuf (France) .... I didn't check carefully all accessions but these mistakes may lead to inappropriate interpertations of subpopulations composition Reviewer #2: The manuscript describes an attempt to study genetic diversity and population differentiation inherent in a germplasm assemblage of Brassica napus. Study is of interest for plant breeders engaged in the improvement of this important oilseed crop. Abstract and introduction are very poorly written. The method and material section must include some information about the procedure followed to maintain germplasm lines used for genotyping. In the absence of any information, one would suspect these to be the products of open pollination. Inherent heterozygosity can be a problem for genetic studies in any often cross-pollinated crop like B.napus. Filtering out of a large proportion of heterozygous SNPs may be the cause for the retention of only 8,502 SNPs out 497336 discovered initially. Structure analysis shows lot of admixing. Heterozygosity inherent in the germ plasm will affect all genetic inferences e.g. polymorphic loci, population variation, LD, genetic differentiation and even genetic diversity. It is always better to use homozygous lines. One may not like to use the term core collection for any germplasm assemblage. Authors may like to provide reference or provide details regarding method used to extract the core ( if it is actually a core collection) from a large germplasm base. In spite of these limitations, it was interesting to find clear geographic and growth habit related clustering from STRUCTURE analysis. The manuscript also suffers from narrative coherence. Reviewer #3: Manuscript PONE-D-21-09997 describes genetic diversity, population structure and LD in Brassica napus - the second major oilseed crop grown worldwide. Here are some comments: This study does not provide details on sampling (how many plants were sampled for DNA isolation, how SNPs were scored codominant/dominant; how heterozygosity was handled etc). I could not see genotypic data of 383 accessions. Have authors submitted genotypic data to any public database repository or provided in a supplementary Table? L172-173: Table 1 and Fig 1 does not support this claim. L191-192: It is worthwhile to provide statistics of transitions and transversions across different chromosomes L340: GBS is one of the approaches (Illumina SNP, resequencing, sequencing and DArTseq) L360: There are no wild accessions of B. napus. L460: Favourable alleles? Fig 3: How much genetic variation is explained by PC1 and PC2? What these lines indicate? Fig4: I could not see bootstrap values here. Please label all accessions (name or serial number 1 to 383 and support with supplementary Table) Figure 5 is not legible. You may move to supplementary information. Minor comments L52: Use either rapeseed or canola (it is already described in L49) L72: There are several studies to support collinearity between Arabidopsis and Brassica (see publications of Parkin and Chris Pires groups, https://doi.org/10.1111/pce.12644 etc) L527: There are several reports. You meant NSDU accessions? ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. Submitted filename: PONE-D-21-09997_reviewer.pdf Click here for additional data file. 31 Aug 2021 Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. Response: We have checked and tried our best to format the manuscript according to the PLOS ONE’s style. 2. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. In your revised cover letter, please address the following prompts: 2a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. Response: No restriction 2b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repo-sitory and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Response: Datasets are available online. 3. Thank you for stating the following financial disclosure: The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. At this time, please address the following queries: a) Please clarify the sources of funding (financial or material support) for your study. List the grants or organizations that supported your study, including funding received from your institution. Response: The study was funded by the U.S. Department of Agriculture - National Institute of Food and Agriculture (Hatch Project No. ND01581). b) State what role the funders took in the study. If the funders had no role in your study, please state: “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.” Response: The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. c) If any authors received a salary from any of your funders, please state which authors and which funders. Response: No salary received. d) If you did not receive any funding for this study, please state: “The authors received no specific funding for this work.” Response: Not applicable. 4. Thank you for submitting the above manuscript to PLOS ONE. During our internal evaluation of the manuscript, we found significant text overlap between your submission and the following previously published works, some of which you are an author. https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-020-06922-2 We would like to make you aware that copying extracts from previous publications, especially outside the methods section, word-for-word is unacceptable. In addition, the reproduction of text from published reports has implications for the copyright that may apply to the publications. Please revise the manuscript to rephrase the duplicated text, cite your sources, and provide details as to how the current manuscript advances on previous work. Please note that further consideration is dependent on the submission of a manuscript that addresses these concerns about the overlap in text with published work. Response: We have rephrased the duplicated text in revised throughout the manuscript. Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: No Reviewer #2: Partly Reviewer #3: Yes Response: We edited revised manuscript to solve these issues. ________________________________________ 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: Yes Reviewer #3: Yes ________________________________________ 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: No Reviewer #3: No Response: Data are available online ________________________________________ 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: No Reviewer #2: No Reviewer #3: Yes Response: We edited and rearranged text in revised manuscript to solve these issues. 5. Review Comments to the Author Reviewer #1: The authors proposed to analyse the genetic diversity present in a collection of 383 accessions of Brassica napus representing genotypes present worldwilde and aims at characterizing the genepool present in US breeding programs. This seems quite interesting and the size of of the populations should be sufficient to draw interesting conclusions. However hypotheses and research questions are not clearly expressed at the end of the introduction, and too many extra data are provided and not used in the manuscript (for instance flowering time data). Moreover, many analyses are performed , but may be too much without really refering to a global strategy and the global aim of each batch of analyses. Detailed analysis and questions are reported in the attached file A rapid check at table S1 showed some problems considering geogagraphic origins of different accessions for instance Lembkes, Krapphauser, fertodi => Germany instead of South Korea Dramor (Poland) should be Darmor (France) I think Bienvenu (USA) should be Bienvenu (France) Jet-neuf (Canada) should be Jet-neuf (France) .... I didn't check carefully all accessions but these mistakes may lead to inappropriate interpertations of subpopulations composition Response: Thank you for pointing these out. In revised manuscript, we have tried to incorporate the suggestions of the reviewer to make the hypotheses and research questions more clear (at later part of introduction). We remove the flowering time data and clarified how the core collection was obtained (Line: 107-116 in manuscript). We think performed analyses strengthen each other as well as the objective. In case of table S1, data was provided according to USDA (GRIN-Global) website (source: https://npgsweb.ars-grin.gov/gringlobal/search). We did a google search to know the origin of many cultivars and failed to get the information. USDA-GRIN recorded many genotypes on the basis of country of collection – and we used this recorded information in this manuscript. Therefore, to clear it out, we have written as “Country or origin/collected”. Reviewer #2: The manuscript describes an attempt to study genetic diversity and population differentiation inherent in a germplasm assemblage of Brassica napus. Study is of interest for plant breeders engaged in the improvement of this important oilseed crop. Abstract and introduction are very poorly written. The method and material section must include some information about the procedure followed to maintain germplasm lines used for genotyping. In the absence of any information, one would suspect these to be the products of open pollination. Inherent heterozygosity can be a problem for genetic studies in any often cross-pollinated crop like B.napus. Filtering out of a large proportion of heterozygous SNPs may be the cause for the retention of only 8,502 SNPs out 497336 discovered initially. Structure analysis shows lot of admixing. Heterozygosity inherent in the germ plasm will affect all genetic inferences e.g. polymorphic loci, population variation, LD, genetic differentiation and even genetic diversity. It is always better to use homozygous lines. One may not like to use the term core collection for any germplasm assemblage. Authors may like to provide reference or provide details regarding method used to extract the core ( if it is actually a core collection) from a large germplasm base. In spite of these limitations, it was interesting to find clear geographic and growth habit related clustering from STRUCTURE analysis. The manuscript also suffers from narrative coherence. Response: Thanks for your critical inputs. In revised manuscript, we have made the changes according to reviewer’s suggestions. We rewrite the introduction section to make it clear. To improve the narrative coherence, we edited almost all section in revised manuscript. How core collection has been obtained and maintained, was described in materials and methods section (Line 107 to 116). SNP number reduced, because of following strict filtering criteria (Line 134 to 139) Reviewer #3: Manuscript PONE-D-21-09997 describes genetic diversity, population structure and LD in Brassica napus - the second major oilseed crop grown worldwide. Here are some comments: This study does not provide details on sampling (how many plants were sampled for DNA isolation, how SNPs were scored codominant/dominant; how heterozygosity was handled etc). I could not see genotypic data of 383 accessions. Have authors submitted genotypic data to any public database repository or provided in a supplementary Table? Response: Thank you for pointing these points. In revised manuscript, we have incorporated the information according to reviewer’s suggestion. For DNA extraction, we used three leaf samples per genotype (Line 119) For diversity analysis we do not need to score the codominant/dominant markers. To run structure, we scored the homozygous major, homozygous minor and heterozygous SNP as 1, 0 and 0.5 respectively using Tassel. Data availability: GBS and SNP data are available at: PRJNA687906 (https://www.ncbi.nlm.nih.gov/biosample/17159566) and PRJEB42419 (https://www.ebi.ac.uk/eva/?eva-study=PRJEB42419), respectively. Reviewer #3: L172-173: Table 1 and Fig 1 does not support this claim. Response: While we appreciate the reviewer’s feedback, we respectfully disagree. The SNP density was highest on chromosome A7 as every 71.1 kb distance contains at least one SNP and was lowest on chromosome C9 as every 134.5 kb distance contains at least one SNP. Reviewer #3: L191-192: It is worthwhile to provide statistics of transitions and transversions across different chromosomes Response: As our primary objective is not to describe the SNPs in details, we think statistics of transitions and transversions across genome is enough to give an idea to the reader. Reviewer #3: L340: GBS is one of the approaches (Illumina SNP, resequencing, sequencing and DArTseq) Response: Yes. Reviewer #3: L360: There are no wild accessions of B. napus. Response: Yes, we agree the reviewer’s comment. We deleted this information from revised manuscript Reviewer #3: L460: Favorable alleles? Response: We changed positive alleles to favorable alleles. Reviewer #3: Fig 3: How much genetic variation is explained by PC1 and PC2? What these lines indicate? Response: PC1 and PC2 explained 13.50% and 7.22% of variation respectively. Lines indicate regression. (Line 226) Reviewer #3: Fig4: I could not see bootstrap values here. Please label all accessions (name or serial number 1 to 383 and support with supplementary Table) Response: bootstrap value added. But did not add label as it make the figure congested. Reviewer #3: Figure 5 is not legible. You may move to supplementary information. Response: We would like to keep it as it is for the better understanding of readers. Minor comments Reviewer #3: L52: Use either rapeseed or canola (it is already described in L49) Response: In revised manuscript, we used rapeseed. Reviewer #3: L72: There are several studies to support collinearity between Arabidopsis and Brassica (see publications of Parkin and Chris Pires groups, https://doi.org/10.1111/pce.12644 etc) Response: Yes. Reviewer #3: L527: There are several reports. You meant NSDU accessions? Response: Yes, we mean NDSU accessions i.e. NDSU advanced breeding lines. Submitted filename: Response to editor and reviewer.docx Click here for additional data file. 21 Oct 2021
PONE-D-21-09997R1
Linkage disequilibrium and population structure in a core collection of Brassica napus (L.)
PLOS ONE Dear Dr. Rahman , Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Dec 05 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Tzen-Yuh Chiang Academic Editor PLOS ONE [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #2: All comments have been addressed Reviewer #4: (No Response) ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #2: Yes Reviewer #4: No ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #2: Yes Reviewer #4: N/A ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes Reviewer #4: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #2: Yes Reviewer #4: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #2: (No Response) Reviewer #4: (No Response) ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #2: No Reviewer #4: Yes: Rudolph Fredua-Agyeman [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. Submitted filename: Reviewer Comments.pdf Click here for additional data file. 29 Nov 2021 The manuscript describes studies the population structure and genetic diversity of 383 Brassica napus (canola) accessions using 8502 SNP. The authors inferred six groups based on STRUCTURE, principal component analysis and NJ analyses. The manuscript is well written. However, the STRUCTURE analysis was not comprehensive enough. In addition, the interpretation of the results was not consistent with the Figures 2 and 3. Therefore, I recommend rejection of the MS and re-consideration for publication after major revisions. Major issues 1. Introduction The literature cited by the authors are dated (2012 to 2014). More recent population studies on B. napus and other Brassica exist e.g., An et al. (2019) Nature Communications; Lu et al. (2019) Nature Communications and Yu et al (2021) BMC Genomics. The authors should include how their study will add to the current knowledge on genetic diversity of canola. Response: We have added the references mentioned by the reviewer in revised manuscript. The study was done to exploit the diversity of NDSU canola core collection. The results will be used to enrich NDSU canola parental stock. 2. Materials and Methods The authors run the STRUCTURE program using the admixture model at 10,000 burning period and 50,000 Monte Carlo Markov Chain. Yu et al (2021) showed that Puechmaille (2016) method (MedMedK, MedMeaK, MaxMedK and MaxMeaK statistics) was a better estimator of the number of clusters compared to the Evanno et al. (2005) method (DeltaK statistics) which was used by the authors. The authors should re-run STRUCTURE under different parameters to verify that the inference on population sub-populations is robust. Response: We re-ran the structure analysis according to reviewer suggestion. We discussed it in methodology section (Line151 to line160). 3. Results (a) A requirement for the STRUCTURE program is that markers be unlinked. Inclusion of multiple closely linked markers may have a large effect on calculation of population structure. Also, it is not known whether these SNPs are in non-coding region or are synonymous, to eliminate the bias introduced by selection when inferring population structure. The authors should comment on the appropriateness of using the selected SNP markers for their study. Response: To break the linkage between markers we considered the markers which are located > 1000bp from each other (L145 to L146). We didn’t consider whether SNPs are synonymous or non- synonymous as dN/dS ratio is insensitive to selection pressure in intraspecific individuals or individual from same population (Kryazhimskiy and Plotkin, 2008). (b) The authors did not interpret their STRUCTURE and PCA data correctly: (i) Figure 2 shows that the maximum peak is at K=3. The second and smaller peak is at K=6. Therefore, the number of inferred clusters using STRUCTURE was likely 3 and not 6. (ii) Figure 3-The Principal component analysis shows 3 clusters and not 6. Group 1-red (winter Europe), Group 2 -green and blue (spring_mixed origin and spring_NDSU) and Group 3-violet and yellow (Rutabaga, winter and semi winter Asia) NB: the miscellaneous group (black) which are scattered is in all 3 groups cannot be considered as a different subgroup (Authors own admission Lines 220-221). Response: In revised manuscript, we corrected and discussed the figures according to reviewer suggestion (L218 to L237). (c) The authors should indicate the NJ using rooted and unrooted trees. The authors should also indicate UPGMA rooted and unrooted trees. Response: Unrooted NJ tree was used. We belief that Structure, PCA and NJ tree already addressed our targets. (d) Please could the numbers on the trees be related back to the accession identifiers? Response: Yes, We hope the provided new picture will clarify the issue. (e) Some accessions had the same name or variation of it but different accession number. Some accessions were also advanced breeding lines. Can the authors comment on the genetic similarity of such accessions in the results section? Will these influence the calculation of population structure? Response: Before formulating the core collection, we discarded the accession, which has same name with different PI number or same PI number with different name. So that we did not have any accession like that. The similarity among advanced breeding lines as well as among other genotypes has been discussed in discussion section. Discussion The authors need to be update after the suggested changes. Response: We updated the discussion section. Minor issues 1. Line 112-Did the authors use leaves from single plants or bulk plants? Response: Bulk plants 2. Line 136-reword the core collection was divided into subgroups-Do you mean the number of sub-groups were inferred using STRUCTURE. Response: Rewording done in revised manuscript. Reference Kryazhimskiy, S., and J.B. Plotkin. 2008. The population genetics of dN/dS. PLoS Genet. 4(12): e1000304. Submitted filename: Response to reviewer.docx Click here for additional data file. 13 Jan 2022
PONE-D-21-09997R2
Linkage disequilibrium and population structure in a core collection of Brassica napus (L.)
PLOS ONE Dear Dr. Rahman , Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.
 
Please submit your revised manuscript by Feb 27 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Tzen-Yuh Chiang Academic Editor PLOS ONE [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #2: All comments have been addressed Reviewer #5: (No Response) ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #2: Yes Reviewer #5: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #2: Yes Reviewer #5: No ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #2: Yes Reviewer #5: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #2: Yes Reviewer #5: No ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #2: The authors have revised the manuscript as per the suggestions by the reviewers. The revised manuscript can be accepted for publication. Reviewer #5: Review of Rahman et al. Rahman et al. analyzed the genotypes of 383 germplasms of Brassica napus, a part of which consists of NDSU core collection using the genotype by sequencing method. They identified population structures and evaluated the level of linkage disequilibrium. They also observed that the C genome have much higher level of LD than the A genome. This manuscript has been revised twice, but this is the first time for me to review the paper. I would agree that the manuscript would provide valuable information for this research area and the data presented here are mostly well analyzed, but the manuscript lacks coordination in writing and still has a lot of typos, some of which would be described later. I therefore cannot recommend the manuscript to be published in PLOS ONE in its current form. Major comments 1. Although the introduction part is very detailed. Some important information seems missing. In particular, it is not clear how the authors defined the five subpopulations is not clear to me. There is no description for rutabaga type, which would be difficult to understand for non-specialist of B. napus. 2. I initially thought that 8502 high-quality SNPs out of 497,336 unfiltered SNPs are too small, but probably the number is the one after the thinning. Please describe the number of high-quality SNPs before the thinning. 3. There are several comparisons of statistics, but there are little statistical tests. For example, in the line 193, the authors stated that ts/tv ratio in A genome was higher than that in C genome, but the difference was not properly tested. So many of the analyses were more or less too descriptive. 4. In the STRUCTURE anslysis, I do not get what is the meaning of “accession assigned/unassigned”. STRUCTURE usually does not require any a priori assumption, in my understanding. Please add more description in the method section. 5. In the line 313, the meanings of “the mean linked LD”, “mean unlinked LD”, and “loci pair under linked LD”, are not clear to me. In the method section, the mean lined LD was calculated using SNPs with r2<0.2 (line 179), but it means they are unlinked. In addition, % is not added to 0.44, 0.02? It is somewhat confusing. 6. In the line 401, the authors mentioned that the higher transition to transversion ratio is due to natural selection, but I do not agree with it. The pattern is ubiquitously found in many organisms including humans, and in non-coding regions, and people usually think that this is due to higher mutation rate of transition than transversion. 7. The sentence in the line 427-428 seems contradicting to the previous sentence. Probably the words, “homogeneity of the diversity indices” is not appropriate. 8.In the line 471, NJ is certainly distance-based method, but PCA is not. 9.In the line 524, the authors mentioned that the level of LD in B. napus is low compared with the other crops, but there is no quantitative statement. I also wonder whether authors used the thinned data to calculate the mean LD. If ones remove the linked SNPs from the analysis, the level of LD obviously decreases. Minor comments, 1. L143: row SNPs would be raw SNPs 2. L153: burn-in length 3. L405: Rephrase the words “moderate or low informative”. 4. L446: the word “type” is not necessary? 5. The assignment of Ks in Figure 4 is not clear. Are they determined using the presented tree? ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #2: No Reviewer #5: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 25 Jan 2022 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #2: The authors have revised the manuscript as per the suggestions by the reviewers. The revised manuscript can be accepted for publication. Response: Thank you very much for accepting the manuscript. Reviewer #5: Review of Rahman et al. Rahman et al. analyzed the genotypes of 383 germplasms of Brassica napus, a part of which consists of NDSU core collection using the genotype by sequencing method. They identified population structures and evaluated the level of linkage disequilibrium. They also observed that the C genome have much higher level of LD than the A genome. This manuscript has been revised twice, but this is the first time for me to review the paper. I would agree that the manuscript would provide valuable information for this research area and the data presented here are mostly well analyzed, but the manuscript lacks coordination in writing and still has a lot of typos, some of which would be described later. I therefore cannot recommend the manuscript to be published in PLOS ONE in its current form. Major comments 1. Although the introduction part is very detailed. Some important information seems missing. In particular, it is not clear how the authors defined the five subpopulations is not clear to me. There is no description for rutabaga type, which would be difficult to understand for non-specialist of B. napus. Response: We added description of rutabaga type in introduction (line 68 to 74). We assumed the subpopulation of collection according to their type and origin. Details has been added in materials and method section (line 129 to 134). 2. I initially thought that 8502 high-quality SNPs out of 497,336 unfiltered SNPs are too small, but probably the number is the one after the thinning. Please describe the number of high-quality SNPs before the thinning. Response: We added the number (line 151 to 164) in materials and method section 3. There are several comparisons of statistics, but there are little statistical tests. For example, in the line 193, the authors stated that ts/tv ratio in A genome was higher than that in C genome, but the difference was not properly tested. So many of the analyses were more or less too descriptive. Response: ts/tv ratio is a simple descriptive statistics and ratio of two numbers. It does not need any statistical test. Other statistical test such as Tajima's D (line 277: 1000 permutations), AMOVA (line 291: 1023 permutations, p ˂ 0.001), Fst comparisons (line 296: 1000 permutations, p ˂ 0.01), NJ (line 251: 1000 bootstraps) has been performed accordingly. Structure analysis has been done in details. 4. In the STRUCTURE analysis, I do not get what is the meaning of “accession assigned/unassigned”. STRUCTURE usually does not require any a priori assumption, in my understanding. Please add more description in the method section. Response: Yes, structure does not require any a priori assumption. But as we divided the collection into subpopulation according to their type and origin, that’s why we ran structure considering accession assigned to specific subpopulation and accession unassigned to specific subpopulation to determine the exact cluster number of the collection. Details added in material and method section (line 166 to 182). 5. In the line 313, the meanings of “the mean linked LD”, “mean unlinked LD”, and “loci pair under linked LD”, are not clear to me. In the method section, the mean lined LD was calculated using SNPs with r2<0.2 (line 179), but it means they are unlinked. In addition, % is not added to 0.44, 0.02? It is somewhat confusing. Response: That was typo. We changed it accordingly (line 197 to 205 and line 335 to 354). 6. In the line 401, the authors mentioned that the higher transition to transversion ratio is due to natural selection, but I do not agree with it. The pattern is ubiquitously found in many organisms including humans, and in non-coding regions, and people usually think that this is due to higher mutation rate of transition than transversion. Response: We agreed to reviewer comment and deleted the information (line 431-432). 7. The sentence in the line 427-428 seems contradicting to the previous sentence. Probably the words, “homogeneity of the diversity indices” is not appropriate. Response: We agreed to reviewer comment and deleted the information (line 457 to 459). 8. In the line 471, NJ is certainly distance-based method, but PCA is not. Response: We agreed to reviewer comment and changed accordingly (line 503 to 504). 9. In the line 524, the authors mentioned that the level of LD in B. napus is low compared with the other crops, but there is no quantitative statement. I also wonder whether authors used the thinned data to calculate the mean LD. If ones remove the linked SNPs from the analysis, the level of LD obviously decreases. Response: We modified the information according to reviewer comments (line 558 to 561 and line 574 to 577) Minor comments, 1. L143: row SNPs would be raw SNPs 2. L153: burn-in length 3. L405: Rephrase the words “moderate or low informative”. 4. L446: the word “type” is not necessary? Response: We tried our best to correct typos and changed accordingly (line 151, line 161, line 411 and line 450) 5. The assignment of Ks in Figure 4 is not clear. Are they determined using the presented tree? Response: Each branch is color-coded according to genotype belongs to subpopulation P1 to P5. Genotypes were grouped into three clusters by dividing the tree using black solid lines according to structure output (line 250 to 253). Submitted filename: Response to reviewer.docx Click here for additional data file. 15 Feb 2022 Linkage disequilibrium and population structure in a core collection of Brassica napus (L.) PONE-D-21-09997R3 Dear Dr. Rahman , We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Tzen-Yuh Chiang Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #5: All comments have been addressed Reviewer #6: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #5: Yes Reviewer #6: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #5: Yes Reviewer #6: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #5: Yes Reviewer #6: No ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #5: Yes Reviewer #6: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #5: This manuscript is a revised manuscript that I have reviewed before. The authors addressed all the issues I raised so the manuscript is ready for publication. Reviewer #6: (No Response) ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #5: No Reviewer #6: Yes: Reza Talebi 21 Feb 2022 PONE-D-21-09997R3 Linkage disequilibrium and population structure in a core collection of Brassica napus (L.) Dear Dr. Rahman: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Tzen-Yuh Chiang Academic Editor PLOS ONE
  75 in total

1.  A genome-wide association study reveals novel elite allelic variations in seed oil content of Brassica napus.

Authors:  Sheng Liu; Chuchuan Fan; Jiana Li; Guangqin Cai; Qingyong Yang; Jian Wu; Xinqi Yi; Chunyu Zhang; Yongming Zhou
Journal:  Theor Appl Genet       Date:  2016-02-25       Impact factor: 5.699

2.  The high-quality genome of Brassica napus cultivar 'ZS11' reveals the introgression history in semi-winter morphotype.

Authors:  Fengming Sun; Guangyi Fan; Qiong Hu; Yongming Zhou; Mei Guan; Chaobo Tong; Jiana Li; Dezhi Du; Cunkou Qi; Liangcai Jiang; Weiqing Liu; Shunmou Huang; Wenbin Chen; Jingyin Yu; Desheng Mei; Jinling Meng; Peng Zeng; Jiaqin Shi; Kede Liu; Xi Wang; Xinfa Wang; Yan Long; Xinming Liang; Zhiyong Hu; Guodong Huang; Caihua Dong; He Zhang; Jun Li; Yaolei Zhang; Liangwei Li; Chengcheng Shi; Jiahao Wang; Simon Ming-Yuen Lee; Chunyun Guan; Xun Xu; Shengyi Liu; Xin Liu; Boulos Chalhoub; Wei Hua; Hanzhong Wang
Journal:  Plant J       Date:  2017-10-09       Impact factor: 6.417

3.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors:  Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

4.  GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update.

Authors:  Rod Peakall; Peter E Smouse
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

5.  Heterotic patterns in rapeseed (Brassica napus L.): I. Crosses between spring and Chinese semi-winter lines.

Authors:  W Qian; O Sass; J Meng; M Li; M Frauen; C Jung
Journal:  Theor Appl Genet       Date:  2007-04-24       Impact factor: 5.574

6.  Extending the rapeseed gene pool with resynthesized Brassica napus II: Heterosis.

Authors:  Andreas Girke; Antje Schierholt; Heiko C Becker
Journal:  Theor Appl Genet       Date:  2011-12-13       Impact factor: 5.699

7.  Genome-Wide SNP Markers Based on SLAF-Seq Uncover Breeding Traces in Rapeseed (Brassica napus L.).

Authors:  Qinghong Zhou; Can Zhou; Wei Zheng; Annaliese S Mason; Shuying Fan; Caijun Wu; Donghui Fu; Yingjin Huang
Journal:  Front Plant Sci       Date:  2017-04-28       Impact factor: 5.753

8.  Genetic Diversity and Population Structure of a Camelina sativa Spring Panel.

Authors:  Zinan Luo; Jordan Brock; John M Dyer; Toni Kutchan; Daniel Schachtman; Megan Augustin; Yufeng Ge; Noah Fahlgren; Hussein Abdel-Haleem
Journal:  Front Plant Sci       Date:  2019-02-20       Impact factor: 5.753

9.  Genome-wide investigation of genetic changes during modern breeding of Brassica napus.

Authors:  Nian Wang; Feng Li; Biyun Chen; Kun Xu; Guixin Yan; Jiangwei Qiao; Jun Li; Guizhen Gao; Ian Bancroft; Jingling Meng; Graham J King; Xiaoming Wu
Journal:  Theor Appl Genet       Date:  2014-06-20       Impact factor: 5.699

10.  Genome-wide association study dissects the genetic architecture of seed weight and seed quality in rapeseed (Brassica napus L.).

Authors:  Feng Li; Biyun Chen; Kun Xu; Jinfeng Wu; Weilin Song; Ian Bancroft; Andrea L Harper; Martin Trick; Shengyi Liu; Guizhen Gao; Nian Wang; Guixin Yan; Jiangwei Qiao; Jun Li; Hao Li; Xin Xiao; Tianyao Zhang; Xiaoming Wu
Journal:  DNA Res       Date:  2014-02-07       Impact factor: 4.458

View more
  1 in total

1.  "Late" effectors from Leptosphaeria maculans as tools for identifying novel sources of resistance in Brassica napus.

Authors:  Audren Jiquel; Elise J Gay; Justine Mas; Pierre George; Armand Wagner; Adrien Fior; Sébastien Faure; Marie-Hélène Balesdent; Thierry Rouxel
Journal:  Plant Direct       Date:  2022-08-05
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.