Literature DB >> 35333299

Mapping the sex determination region in the Salix F1 hybrid common parent population confirms a ZW system in six diverse species.

Dustin G Wilkerson1, Bircan Taskiran1, Craig H Carlson1, Lawrence B Smart1.   

Abstract

Within the genus Salix, there are approximately 350 species native primarily to the northern hemisphere and adapted to a wide range of habitats. This diversity can be exploited to mine novel alleles conferring variation important for production as a bioenergy crop, but also to identify evolutionarily important genes, such as those involved in sex determination. To leverage this diversity, we created a mapping population by crossing 6 Salix species (Salix viminalis, Salix suchowensis, Salix integra, Salix koriyanagi, Salix udensis, and Salix alberti) to common male and female Salix purpurea parents. Each family was genotyped via genotyping-by-sequencing and assessed for kinship and population structure as well as the construction of 16 backcross linkage maps to be used as a genetic resource for breeding and selection. Analyses of population structure resolved both the parents and F1 progeny to their respective phylogenetic section and indicated that the S. alberti parent was misidentified and was most likely S.suchowensis. Sex determining regions were identified on Salix chromosome 15 in the female-informative maps for seven of the eight families indicating that these species share a common female heterogametic ZW sex system. The eighth family, S. integra × S. purpurea, was entirely female and had a truncated chromosome 15. Beyond sex determination, the Salix F1 hybrid common parent population (Salix F1 HCP) introduced here will be useful in characterizing genetic factors underlying complex traits, aid in marker-assisted selection, and support genome assemblies for this promising bioenergy crop.
© The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America.

Entities:  

Keywords:  zzm321990 Salixzzm321990 ; zzm321990 Salix purpureazzm321990 ; zzm321990 Salix suchowensiszzm321990 ; zzm321990 Salix viminaliszzm321990 ; QTL mapping; linkage map; sex determination

Mesh:

Year:  2022        PMID: 35333299      PMCID: PMC9157088          DOI: 10.1093/g3journal/jkac071

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.542


Introduction

The establishment of genomic resources is an important step in developing a fully realized breeding program, reinforced by modern tools for trait mapping, candidate gene identification, and marker-assisted selection. Salix and Populus (poplars) comprise the majority of species in the family Salicaceae, which consists of dioecious trees, shrubs, and subshrubs that are highly heterozygous. Shrub willow (Salix spp.) are grown in northern latitudes as a sustainable, high-yielding, carbon neutral, bioenergy crop that can grow on marginal land and provide multiple ecosystem services (Smart ; Stoof ; Clifton‐Brown ; Fabio and Smart 2020). While shrub willow breeding in the United States has been active since the 1980’s, most of the nearly 350 species have yet to be tapped as a source of genetic diversity (Dickmann and Kuzovkina 2014; Stanton ). Genomic resources developed for poplar were used in early genomic studies in Salix, because their genomes are largely colinear (Hanley ; Berlin ). As sequencing technologies became increasingly more affordable, more Salix genome sequencing was completed and now there are several high-quality assemblies. Genomic resources for Salix are currently centered around a few key species. In Europe, Salix viminalis is an important bioenergy crop species with a recently published, high-quality genome assembly (Almeida ). Salix viminalis has been used in several QTL mapping studies for resistance to willow leaf rust (Melampsora larici-epitea) (Rönnberg-Wästljung ; Samils ; Sulima ), drought tolerance (Rönnberg-Wästljung ), and growth and phenology (Hallingbäck ; Hallingbäck ). While in the United States, Salix purpurea is the model species for bioenergy willow breeding, genetics, and genomics. The US Department of Energy Joint Genome Institute has produced the highest quality annotated Salix reference genomes assembled in the genus on male and female S. purpurea, available on Phytozome (Zhou ; Zhou ; https://phytozome-next.jgi.doe.gov/info/SpurpureaFishCreek_v3_1; last accessed 29 March 2022; https://phytozome-next.jgi.doe.gov/info/Spurpurea_v5_1; last accessed 29 March 2022). Using joint linkage and association mapping approaches focused on S. purpurea, Carlson identified numerous QTL for a wide range of morphological, physiological, insect and disease resistance and biomass composition traits. A naturalized species in North America, S. purpurea is a potential donor of broad adaptability traits for species susceptible to pests and diseases in the Northeast United States. There is value in studying the genomes of less-characterized Salix species for phylogenomic analysis and to discover diverse sources of alleles for introgression into elite yielding cultivars. Here, we introduce the Salix F1 hybrid common parent population (Salix F1 HCP). The parents, described in Fabio and Crowell , represent a diverse selection of species from Salix subgenus Vetrix (Dickmann and Kuzovkina 2014). Six Salix species (S. viminalis, Salix suchowensis, Salix integra, Salix koriyanagi, Salix udensis, and Salix alberti) were crossed to common male and female S. purpurea parents to form eight species hybrid families. Literature describing these species ranges from the high-quality reference genomes available for S. purpurea and S. viminalis to the scarcely studied S. alberti (Rosso ). Salix suchowensis is native to China and has been used recently to generate a chromosome scale genome assembly (Wei ). This species has been assessed for its response to drought stress (Jia ) and was one of the first Salix species used to map the sex determination region (SDR) (Liu ). Salix udensis, formally known as S. sachalinensis, has been described as a Japanese riparian willow species that acts as a natural nest cavity for fish owls (Niiyama 2008; Slaght ) and is suggested to have sexually dimorphic characteristics (Ueno and Seiwa 2003; Ueno ). Some genomic resources are available for the Korean S. koriyanagi, as its chloroplast genome have been sequenced (Kim ; Park ) while S. integra has been assessed for its phytoremediation potential (Cao ; Yang ). Developed to interrogate the genetics of several understudied Salix species, the Salix F1 HCP is an important step in the development of genomic resources in Salix. The Salicaceae represents an interesting family for the study of the evolution of dioecy and the mechanisms of sex determination. Genetic mapping of SDRs in the Salicaceae have revealed considerable intra- and inter-chromosomal variability and contrasting sex determination systems (Yang ). Although Populus alba is female heterogametic (ZW) (Paolucci ; Müller ; Sabatti ) and Populus trichocarpa is male heterogametic (XY) (Geraldes ), both of their SDRs are located on chr19 while the SDR of Populus euphratica was identified on chr14 and also has an XY system (Yang ). Conversely, the SDR of S. purpurea has been mapped to a large, pericentromeric region on Salix chr15, but has been found to maintain a few orthologous regions present within the P. trichocarpa chr19 SDR (Zhou ). However, it is unclear whether dioecy evolved before or after their divergence (Hou ). The sex systems of Salix also lack a consensus model. For instance, similar to S. purpurea, both S. suchowensis (Hou ; Chen ) and S. viminalis (Pucholt ; Pucholt ; Hallingbäck ) have female heterogametic (ZW) systems with SDRs located on Salix chr15, yet tree-form S. nigra has an XY system SDR on chr07 (Sanderson ). Considering the variability in sex determination systems and SDR locations already discovered within Salix, elucidating the mechanisms of sex determination in more species could help to build a more cohesive understanding of SDR evolution. Further investigation could provide evidence to implicate SDR turnovers as a factor in the considerable species diversity within the genus. To establish the Salix F1 HCP as a genetic resource, this study sought to: (1) describe the genetic relationships between the parent species, (2) develop female and male parent informative linkage maps for each family, and (3) define and compare their respective SDRs.

Materials and methods

Plant material

Females of S. viminalis, S. integra, S. alberti, and S. suchowensis were crossed with male S. purpurea clone ID 94001, while males of S. viminalis, S. udensis, S. koriyanagi, and S. suchowensis were crossed with female S. purpurea clone ID 94006 creating eight F1 species hybrid families (Fig. 1). The common S. purpurea parents, 94006 and 94001, were chosen based upon their differential resistance to willow leaf rust (Crowell ), adaptation to the Northeastern U.S., and availability of high-quality reference genomes (Zhou ).
Fig. 1.

Crossing scheme of the Salix F1 HCP. There were four full-sib families with 94001 as the paternal parent and four full-sib families with 94006 as the maternal parent, both common parents are S. purpurea. Reciprocal crosses were made with male and female S. viminalis and S. suchowensis while S. integra, S. alberti, S. udensis, and S. koriyanagi were crossed only once. *Received as S. alberti.

Crossing scheme of the Salix F1 HCP. There were four full-sib families with 94001 as the paternal parent and four full-sib families with 94006 as the maternal parent, both common parents are S. purpurea. Reciprocal crosses were made with male and female S. viminalis and S. suchowensis while S. integra, S. alberti, S. udensis, and S. koriyanagi were crossed only once. *Received as S. alberti. Crosses were made by forcing floral catkins of the parent genotypes from dormant shoots in a greenhouse. All crosses were made in isolation to prevent pollen contamination. When anthers began to dehisce, male catkins were excised and placed in falcon tubes for pollen extraction using toluene, as described in Kopp , then stored in 2 mL microcentrifuge tubes at ‒20°C until female catkins were receptive. Seedlings were established in a standard peat-based potting mix in a greenhouse, then transplanted to nursery beds near Cornell AgriTech (Geneva, NY, USA). Winter dormant cuttings from all parents and progeny were collected from 1-year-old stems and hand-planted in the field in a randomized complete block design with 3 plants per plot and 4 replicate blocks. Field trials were established with 1.83 m spacing between rows and 40.6 cm between plants within rows.

DNA extraction and genotyping-by-sequencing

Shoot tips for DNA extraction were collected from plants in nursery beds and stored in desiccant. Dried shoot tips were ground to a fine powder with a Geno/Grinder (SPEX SamplePrep, Metuchen, NJ, USA) before genomic DNA extraction using the DNeasy Plant Mini Kit (QIAGEN Inc., Valencia, CA, USA). After checking the DNA quality using gel electrophoresis, DNA quantity was estimated using the Invitrogen Qubit dsDNA Broad Range Assay kit on a Qubit Fluorometer (ThermoFisher Scientific, Waltham, MA, USA). Genomic DNA was submitted to the University of Wisconsin Biotechnology Center (Madison, WI, USA) for 96-plex Genotyping-by-Sequencing (GBS) library preparation using the ApeKI restriction enzyme and sequenced (1 × 100 bp) on the Illumina HiSeq 2500 (Illumina, Inc., San Diego, CA, USA) platform.

Variant discovery and imputation

Initial variant discovery and filtering were performed using the TASSEL GBS v2 Discovery Pipeline (Bradbury ). Sequence reads were trimmed to 64 bp and aligned using BWA mem (Li and Durbin 2009) under default parameters to the S. purpurea v5.1 reference genome (Zhou ). As this genome includes both chr15Z and 15W, chr15Z was excluded to reduce mapping errors within the pseudoautosomal regions surrounding the SDR in S. purpurea. This process was repeated once for all eight families together and then for each of the eight F1 families separately to identify population-wide and family-specific SNPs. By calling variants on all eight families and then on each family separately, variants were based on inter- and intra- familial genetic differences, respectively. The resulting VCF files contained 684,412 SNPs for the full analysis and ranged from 174,762 to 266,797 SNPs depending on the family. On the eight F1 datasets, SNPs with >70% missing data and minor allele frequency <0.01 and F1 individuals with >80% missing data or determined to be outliers based on principal component analysis were dropped. Missing genotype calls and low read depth are common in GBS (Elshire ), therefore imputation was performed separately on each family to validate calls in haplotype blocks. Using LinkImputeR (Money ), genotypes called with a read depth <5 were set to missing before filtering again for missingness >70%, resulting in 95,281 to 145,944 imputed SNPs with accuracies ranging from 84.3 to 93.1%.

Population structure

SNPs for the population-wide analysis were filtered to retain markers and individuals with ≤20% missing data and SNPs with a minor allele frequency >0.01, which resulted in 55,398 SNPs. Multiple sequence runs of each of the parents were included as technical replicates for greater confidence in GBS calls. Analysis of principal components, once with just the parents and then again with the full population, was performed using default parameters in Tassel 5 and visualized in R (R Core Team 2020). Using only the parents, Tassel 5 was also used to generate a distance matrix and subsequent unrooted neighbor-joining tree for phylogenetic analysis. The parental runs and 10 randomly selected F1 progeny were analyzed using fastSTRUCTURE (Raj ). Only a subset of the F1 were used in this analysis to manage file size and computation requirements. Multiple analyses of “structure.py” were completed (K = 3–10). Using “chooseK.py,” K = 6 represented the model complexity that maximized the marginal likelihood and best explained the data structure, suggesting six separate populations.

Linkage map construction and analysis

Unless otherwise indicated, linkage map construction and analysis were performed using custom R code, available on Github (link in Data Availability). Multiple runs of each parent were used to form consensus genotypes for each SNP. In the absence of a clear consensus, the genotype was set to missing. If both parents were set to missing, the SNP was removed from the analysis. If only one parent was set to missing, its genotype was inferred based on the genotype of the known parent and the segregation of the F1. These parental consensus genotypes were used to identify the female informative (AB × AA) and male informative (AA × AB) markers used to generate backcross linkage maps for each parent using a combination of R/qtl (Broman ) and ASMap (Taylor and Butler 2017). Co-located markers and those exhibiting extreme segregation distortion were removed using ASMap function “pullCross.” Linkage groups were then created using “mstmap” with default parameters except for objective.fun = “ML” and bychr = FALSE. The p-value for determining linkage groups varied between families, ranging from 1e-6 to 1e-12, depending on the demarcation of the 19 expected linkage groups. A custom R function was then used to perform simple error correction to reduce the number of double crossovers and deflate map distances before reforming linkage groups. Briefly, this function relies on the marker order determined after formation of linkage groups, identifying double crossovers of ≤2 SNPs, and correcting them. Map quality was checked using two strategies. The “heatMap” function in ASMap, which plots LOD linkage between markers on the upper triangle and estimated recombination frequency on the bottom, reveals markers that are problematic or out of phase. Then by comparing the physical position (Mb) based on alignment to the S. purpurea reference genome and the genetic distance (cM) within each linkage group shows issues with marker order or potential chromosomal rearrangements. To delimit the SDR in each family, the sex of all F1 individuals was recorded by inspecting flowering catkins during two growing seasons, on plants with 2 and 3 years of post-coppice regrowth. Sex ratio bias was tested using a chi-square test for a 1:1 sex ratio. Using functions in R/qtl, genotype probabilities were calculated using “calc.genoprob” (step = 0, error.prob = 0.01, map.function = “kosambi,” stepwidth = “fixed”). Next, QTL mapping for individual sex was performed using “scanone”: model = “binary,” method = “em.” Genome-wide significance thresholds were determined based on the results of “scanone.perm” (n.perm = 1000). QTL positions were refined using “refineqtl” (method = “hk,” model = “binary”), then the 1.5 LOD support intervals were calculated using “lodint” with default parameters.

Results

A combination of PCA, phylogenetic analysis, and fastSTRUCTURE were used to describe the population structure of the Salix F1 HCP. For clarity when referring to specific individuals, the plant species is abbreviated to its first letter and includes its sex, M or F, in parenthesis. A PCA of multiple sequence runs of the parents revealed three distinct clusters, formed by two PCs accounting for 36.8% and 22.5% of the total genetic variation (Fig. 2a). The two common parents, S. p(F) 94006 and S. p(M) 94001, formed a single group and were separated from the other parents by PC2. The S. viminalis parents, ‘Jorr’ (M) and 07-MBG-5027 (F), formed a cluster with S. u(M) 04-BN-051, which was differentiated from the remaining species by PC1. Including the F1 progeny from each family into the analysis, each PC accounted for 26% and 10.1% of the total variation (Fig. 2b). PC2 split S. udensis from the two S. viminalis parents and separated S. koriyanagi from S. suchowensis, S. integra, and S. alberti. As expected, the F1 individuals were intermediate between the common parent and family-specific species. The F1 progeny derived from the female parents S. s(F) P295, S. a(F) P294, and S. i(F) P336 (all crossed with S. p(M) 94001), co-localized.
Fig. 2.

Results of PC, phylogenetic, and fastSTRUCTURE analysis of the Salix F1 HCP. Multiple technical replicates of the parents were included. a) PCA of the parents; b) PCA of the F1 and the parents; c) Unrooted neighbor-joining tree of the parents; d) Distruct plot using fastSTRUCTURE results. The species legend in panel (a) applies to panels (b) and (c) while colors in panel (d) are based on fastSRUCTURE results, affecting only S. alberti. *Received as S. alberti.

Results of PC, phylogenetic, and fastSTRUCTURE analysis of the Salix F1 HCP. Multiple technical replicates of the parents were included. a) PCA of the parents; b) PCA of the F1 and the parents; c) Unrooted neighbor-joining tree of the parents; d) Distruct plot using fastSTRUCTURE results. The species legend in panel (a) applies to panels (b) and (c) while colors in panel (d) are based on fastSRUCTURE results, affecting only S. alberti. *Received as S. alberti. Kinship analysis grouped the two S. viminalis parents with S. udensis, the two S. purpurea parents together, and S. koriyanagi with S. integra near S. suchowensis and S. alberti (Fig. 2c). These results closely mirrored the separation attributable to PC1 (Fig. 2a). In both PC and phylogenetic analyses, S.a(F) P294 grouped closely with both S. suchowensis parents. Admixture analysis included the parents and a subset of the F1 individuals of each family. The eight groups of F1 individuals were comprised of roughly half the genetic background of S. purpurea and half the other species parent, as expected (Fig. 2d). Salix viminalis, S. integra, S. koriyanagi, S. udensis and S. purpurea formed distinct populations, while S. a(F) P294 grouped together with the two S. suchowensis parents. Since there is considerable divergence between parents and pedigrees, variant discovery and marker filtration were performed for each family separately. Using consensus genotypes derived from multiple sequencing runs of the parents, markers were split into female (AB × AA) and male (AA × AB) informative backcross markers for linkage map construction, which resulted in 16 linkage maps (Fig. 3). Each linkage map consisted of 19 linkage groups with total map lengths ranging from 3939.9–6957.3 cM containing between 2035 and 3852 total markers (Supplementary Table 1). Recombination frequency and genetic to physical distance plots generated for each linkage map revealed that marker order and phase within each linkage group were reasonably linear (Supplementary Fig. 1). Sex phenotypes were used for QTL mapping of the SDR. Six of the eight families displayed significant sex ratio bias toward females based on a simple chi-square test (P < 0.05), with female to male ratios ranging from 1.4 to 1.6 (Table 1). Neither of the S. viminalis families displayed sex ratio bias, while the S. integra × S. purpurea family was entirely female. Single QTL for sex were identified on chr15 within seven of the eight maternal maps, excluding S. i(F) P336 of the S. integra × S. purpurea family. Each QTL explained between 60.7% and 74.1% of the total phenotypic variation (Table 2). LOD scores for the female and male linkage maps are in Supplementary Tables 2 and 3, respectively. Genome-wide LOD significance thresholds ranged from 3.39 to 3.69, depending on the map. On the four maternal S. p(F) 94006 maps, the QTL as determined by the permuted significance thresholds accounted for 10.3–12.03 Mb of the roughly 15.5 Mb chr15 due to suppressed recombination (Fig. 4). QTL refinement and 1.5 LOD support intervals narrowed this down to span 1.17–7.31 Mb with peak markers located at 2.91, 2.92, 7.33, and 8.65 Mb when S. s(M) P63, S. v(M) Jorr, S. k(M) 04-FF-016, and S. u(M) 04-BN-051 were the paternal parent, respectively (Table 2; Fig. 4). Of the three remaining maternal maps, QTL determined by LOD significance accounted for 12.3, 11.45, and 13.55 Mb of chr15 on the S. a(F) P294, S. s(F) P295, and S. v(F) 07-MBG-5027 maps, respectively, reflecting suppressed recombination. Upon refinement, these ranges were narrowed to account for 1.59, 6.91, and 6.41 Mb of the chromosome, with peaks centered at 9.73, 2.57, and 2.92 Mb.
Fig. 3.

Linkage maps for each of the parents within the Salix F1 HCP. Female maps (first and third columns) were constructed using female informative markers (AB × AA), while male maps (second and forth columns) were constructed using male informative markers (AA × AB). Each map title can be read as: abbreviated species (sex)—parent name (family pedigree). Total and chromosome-specific summary statistics for each map are provided in Supplementary Table 1. *Received as S. alberti.

Table 1.

Sex phenotype statistics for the eight families in the Salix F1 HCP.

FamilyMaternal speciesPaternal species N FemalesMalesF:M Ratio P-valuea
S.p × S.u S. purpurea S. udensis 15091591.6:10.01*
S.p × S.s S. purpurea S. suchowensis 8853331.6:10.03*
S.p × S.v S. purpurea S. viminalis 10060401.5:10.05NS
S.p × S.k S. purpurea S. koriyanagi 15093571.6:1<0.001*
S.v × S.p S. viminalis S. purpurea 10053471.1:10.55NS
S.i × S.p S. integra S. purpurea 15015001:0<0.001*
S.s × S.p S. suchowensis S. purpurea 15091581.6:10.01*
S.a × S.p S. alberti b S. purpurea 15087621.4:10.04*

indicates significance in chi-square test for sex ratio bias (P < 0.05).

Not significant.

Received as S. alberti.

Table 2.

Sex QTL associated with the SDR within the maternal linkage maps.

FamilyMaternal SpeciesChrPeak (cM)Peak LODPeak (Mb)Low-LSI (Mb)High-LSI (Mb)VE (%)
S.p × S.u S. purpurea 1585.329.38.654.519.2273.7
S.p × S.s S. purpurea 1581.116.12.912.619.9261.8
S.p × S.v S. purpurea 15114.218.62.922.884.0560.7
S.p × S.k S. purpurea 1570.135.17.332.638.2971.1
S.v × S.p S. viminalis 1586.021.12.922.929.3666.0
S.s × S.p S. suchowensis 1578.426.72.572.449.3574.1
S.a × S.p S. albertia 15145.835.69.738.6310.2270.3

LSI, 1.5 LOD support interval; VE, variation explained; cM, centiMorgan position within the linkage map; Mb, physical position based on alignment to S. purpurea reference genome.

Received as S. alberti.

Fig. 4.

Genetic (cM) and physical (Mb) distances of the Salix F1 HCP maternal parent’s chromosome 15 with mapped QTL for sex. Each parent has a set of maps, genetic on the top and physical on the bottom with lines connecting them indicating a marker's relative position in both maps. On the genetic maps, each marker is represented by a gray vertical line while the 1.5 LOD support interval of the QTL is shown with a purple box. On the physical map, gray vertical lines are markers not associated with the QTL, yellow lines are markers whose LOD score was above the permuted significance threshold, purple lines are markers within the 1.5 LOD support interval, and red lines indicate the position of the peak marker. The cM distance of S.i(F) P336's linkage map was artificially increased for figure clarity. *Received as S. alberti.

Linkage maps for each of the parents within the Salix F1 HCP. Female maps (first and third columns) were constructed using female informative markers (AB × AA), while male maps (second and forth columns) were constructed using male informative markers (AA × AB). Each map title can be read as: abbreviated species (sex)—parent name (family pedigree). Total and chromosome-specific summary statistics for each map are provided in Supplementary Table 1. *Received as S. alberti. Genetic (cM) and physical (Mb) distances of the Salix F1 HCP maternal parent’s chromosome 15 with mapped QTL for sex. Each parent has a set of maps, genetic on the top and physical on the bottom with lines connecting them indicating a marker's relative position in both maps. On the genetic maps, each marker is represented by a gray vertical line while the 1.5 LOD support interval of the QTL is shown with a purple box. On the physical map, gray vertical lines are markers not associated with the QTL, yellow lines are markers whose LOD score was above the permuted significance threshold, purple lines are markers within the 1.5 LOD support interval, and red lines indicate the position of the peak marker. The cM distance of S.i(F) P336's linkage map was artificially increased for figure clarity. *Received as S. alberti. Sex phenotype statistics for the eight families in the Salix F1 HCP. indicates significance in chi-square test for sex ratio bias (P < 0.05). Not significant. Received as S. alberti. Sex QTL associated with the SDR within the maternal linkage maps. LSI, 1.5 LOD support interval; VE, variation explained; cM, centiMorgan position within the linkage map; Mb, physical position based on alignment to S. purpurea reference genome. Received as S. alberti.

Discussion

Salix is a very diverse genus, consisting of more than 350 species. That diversity extends to variation between species even in traits as evolutionarily important as sex determination (Yang ). By generating mapping populations that include characterized species crossed with those less studied, we will increase the number of Salix species available for trait introgression in breeding programs. We developed the Salix F1 HCP as a resource to characterize the interactions between alleles from different, but related species, and to map variation in important traits. Using GBS, we analyzed the population structure among the eight families, generated linkage maps of each of the parents using backcross markers and mapped the SDR in seven of the eight families using phenotypes collected from repeated field surveys. PCA predominately resolved the population by section with the F1 clustering between the parents as expected. In both phylogenetic and fastSTRUCTURE analyses, S. a(F) P294 was found to be very closely related to the S. suchowensis parents. Given these results and the limited publicly available information about S. alberti, P294 is likely S. suchowensis and will be described as such. Among the 16 linkage maps produced, all QTL for sex were detected on chr15 in seven of the eight families and only in the maternal maps. In a recent study mapping sex in S. triandra using backcross markers, Li were also only able to detect QTL within the maternal map, which is indicative of a ZW sex determination system on chr15. While this had been known for S. purpurea (Zhou ), S. viminalis (Pucholt ; Hallingbäck ), and S. suchowensis (Chen ), this is the first study to report that both S. koriyanagi and S. udensis have ZW SDR on chr15. Although the location of the SDR has also not been reported in S. integra, the S. integra × S. purpurea family was entirely female and therefore was excluded in the linkage analysis. The chr15 map from S. i(F) P336 is considerably smaller than the other families, aligning to the 14.2–15.5 Mb region of the S. purpurea reference genome, while maps from the other families had near complete coverage. The region of S. i(F) P336’s chr15 that did have segregating markers was outside the SDR intervals of the other female maps and is likely in the pseudoautosomal region of the sex chromosome. Six of the eight families were female biased—a feature prevalent in Salix. Of those studied here, S. purpurea (Gouker ), S. viminalis (Alström-Rapaport ; Pucholt ), S. suchowensis (Yang ), S. udensis (Ueno and Seiwa 2003), and S. integra (Tozawa ) have documented cases of sex ratio bias, yet this is the first time it has been reported in S. koriyanagi. The genetic basis of sex ratio bias could be a result of secondary sex dimorphisms, such as higher mortality rates in males, increased herbivory and pathogen resistance in females, or the presence of a sex distorter locus (Pucholt ). Mapping of the SDR in this study resulted in marker associations that covered a majority of chr15 prior to refinement. Of the four families with S. p(F) 94006 as the common female parent, refined QTL shared similar ranges of the SDR described by Zhou (6.8 Mb in length, starting at 2.3 Mb) with the exception of the S. purpurea × S. viminalis family, which was 1.17 Mb in length. Carlson mapped the SDR to 4.5–11.4 Mb on chr15 using a S. purpurea F2 population while this study localized the SDR to the same pericentromeric region of chr15 in S. purpurea using F1 hybrid families. The most recent delimitation of the SDR in S. viminalis spanned roughly 3.4 Mb (approx. 2.3–5.7 Mb) of chr15 (Almeida ) and through Hallingbäck , is most likely controlled by a single locus. Our mapping of the SDR overlaps this region by 2.75 Mb, including the position of our peak marker, even though we aligned markers to a different reference genome. Almeida aligned the SDR to the chr15 of both S. purpurea v1.0 and S. viminalis reference genomes and found overall synteny between species, yet with several structural rearrangements. This contrasts with what is seen in our results, where the chr15 of S.v(F) 07-MBG-5027 showed only minor rearrangements, likely due to alignment to the high-quality S. purpurea v5.1 reference genome. Based on annotations from the P. trichocarpa genome, differential gene expression analysis in S. suchowensis between male and female plants led to predictions that the SDR was originally on chr14 (Liu ). However, later work repositioned the SDR to the centromeric region of chr15 when based on Salix alignment (Chen ). Our study defined the physical distance of the SDR at 2.44–9.35 Mb in S.s(F) P295 and 8.63–10.22 Mb in S.s(F) P294 with an overlap of 0.72 Mb. These two linkage maps show the greatest amount of rearrangement on chr15 when aligned to the S. purpurea reference. It is fair to conclude that alignment to a future S. suchowensis reference genome would aid in improving the mapping resolution in this region. Suppressed recombination is a hallmark of chromosomes containing an SDR. Comparing the map of chr15 from each family, the SDR extends across a region of sparce marker density, approximately 3–9 Mb and is flanked by regions with greater marker density. As described above, this centromeric region with suppressed recombination is often associated with the SDR in Salix. In all seven families with mapped SDR, the peak marker was located within this region although its position varied. In the two S. viminalis families and two of the three S. suchowensis families, the peak marker was located within a 0.38 Mb region (2.57–2.95 Mb), while the remaining three families were less consistent and located proximal to the centromere. The generation of additional reference genomes for use in mapping will add context to these results and further refine the structure of the SDR among variable species. This study described the population structure among the eight families within the Salix F1 HCP, constructed linkage maps for each parent, and mapped the SDR to the maternal chr15 in seven of the eight families. The introduction of the Salix F1 HCP provides the opportunity to map QTL for phenotypic traits beyond sex determination, while the linkage maps could be used to anchor and scaffold contigs in the generation of new reference genomes for each of the parents. While all species have a ZW sex determination system with an SDR that maps to chr15, these genetic resources provide a foundation for further characterization of the mechanism of sex determination and mapping of other key traits in these related species.

Data availability

The GBS data used in the population structure analysis and the eight family-specific files prior to linkage map construction are available in hapmap format through figshare at https://figshare.com at “GBS Data from Wilkerson et al.” (https://figshare.com/articles/dataset/GBS_Data_from_Wilkerson_et_al_2021/16926043; last accessed 29 March 2022). R code used to format and create the linkage maps is available on the Willowpedia Github site located at https://github.com/Willowpedia/Wilkerson_etal_SalixLinkageMaps; last accessed 29 March 2022. Supplementary Table 1 contains statistics on each of the 16 linkage maps created in this study, including marker count and cM length for each linkage group and the total markers and cM length for each map. Supplementary Fig. 1 is a PDF slide show that, one map per slide, shows the marker cM by physical position for each linkage group and a heatmap visualizing recombination frequency and linkage. Supplementary Tables 2 and 3 contain the cM distance and sex QTL LOD scores of markers within each female and male map, respectively. Supplemental material is available at G3 online. Click here for additional data file.
  35 in total

1.  R/qtl: QTL mapping in experimental crosses.

Authors:  Karl W Broman; Hao Wu; Saunak Sen; Gary A Churchill
Journal:  Bioinformatics       Date:  2003-05-01       Impact factor: 6.937

2.  Recent Y chromosome divergence despite ancient origin of dioecy in poplars (Populus).

Authors:  A Geraldes; C A Hefer; A Capron; N Kolosova; F Martinez-Nuñez; R Y Soolanayakanahally; B Stanton; R D Guy; S D Mansfield; C J Douglas; Q C B Cronk
Journal:  Mol Ecol       Date:  2015-04-02       Impact factor: 6.185

3.  Collection and storage of pollen from Salix (Salicaceae).

Authors:  Richard F Kopp; Charles A Maynard; Patricia Rocha de Niella; Lawrence B Smart; Lawrence P Abrahamson
Journal:  Am J Bot       Date:  2002-02       Impact factor: 3.844

4.  Characterization of a large sex determination region in Salix purpurea L. (Salicaceae).

Authors:  Ran Zhou; David Macaya-Sanz; Eli Rodgers-Melnick; Craig H Carlson; Fred E Gouker; Luke M Evans; Jeremy Schmutz; Jerry W Jenkins; Juying Yan; Gerald A Tuskan; Lawrence B Smart; Stephen P DiFazio
Journal:  Mol Genet Genomics       Date:  2018-07-18       Impact factor: 3.291

5.  High-density linkage mapping and evolution of paralogs and orthologs in Salix and Populus.

Authors:  Sofia Berlin; Ulf Lagercrantz; Sara von Arnold; Torbjörn Ost; Ann Christin Rönnberg-Wästljung
Journal:  BMC Genomics       Date:  2010-02-23       Impact factor: 3.969

6.  Allelic incompatibility can explain female biased sex ratios in dioecious plants.

Authors:  Pascal Pucholt; Henrik R Hallingbäck; Sofia Berlin
Journal:  BMC Genomics       Date:  2017-03-23       Impact factor: 3.969

7.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

8.  Transcriptome analysis of the differentially expressed genes in the male and female shrub willows (Salix suchowensis).

Authors:  Jingjing Liu; Tongming Yin; Ning Ye; Yingnan Chen; Tingting Yin; Min Liu; Danial Hassani
Journal:  PLoS One       Date:  2013-04-01       Impact factor: 3.240

9.  Association mapping in Salix viminalis L. (Salicaceae) - identification of candidate genes associated with growth and phenology.

Authors:  Henrik R Hallingbäck; Johan Fogelqvist; Stephen J Powers; Juan Turrion-Gomez; Rachel Rossiter; Joanna Amey; Tom Martin; Martin Weih; Niclas Gyllenstrand; Angela Karp; Ulf Lagercrantz; Steven J Hanley; Sofia Berlin; Ann-Christin Rönnberg-Wästljung
Journal:  Glob Change Biol Bioenergy       Date:  2015-07-29       Impact factor: 4.745

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.