Literature DB >> 26400405

Evidence of intense chromosomal shuffling during conifer evolution.

Marina de Miguel1, Jérôme Bartholomé1, François Ehrenmann1, Florent Murat2, Yoshinari Moriguchi3, Kentaro Uchiyama4, Saneyoshi Ueno4, Yoshihiko Tsumura5, Hélène Lagraulet1, Nuria de Maria6, José-Antonio Cabezas6, Maria-Teresa Cervera6, Jean Marc Gion7, Jérôme Salse2, Christophe Plomion8.   

Abstract

While recent advances have been gained on genome evolution in angiosperm lineages, virtually nothing is known about karyotype evolution in the other group of seed plants, the gymnosperms. Here we used high density gene-based linkage mapping to compare the karyotype structure of two families of conifers (the most abundant group of gymnosperms) separated around 290 million years ago: Pinaceae and Cupressaceae. We propose for the first time a model based on the fusion of 20 ancestral chromosomal blocks that may have shaped the modern karyotpes of Pinaceae (with n=12) and Cupressaceae (with n=11). The considerable difference in modern genome organization between these two lineages contrasts strongly with the remarkable level of synteny already reported within the Pinaceae. It also suggests a convergent evolutionary mechanism of chromosomal block shuffling that has shaped the genomes of the spermatophytes.
© The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  Cuppressaceae; Pinaceae; chromosomal rearrangement; comparative mapping; gymnosperm; synteny

Year:  2015        PMID: 26400405      PMCID: PMC4684699          DOI: 10.1093/gbe/evv185

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Knowledge about genome structure and evolution is a fundamental step toward understanding species adaptation and evolution. Genome evolution is based on genetic variability generated by mutation, recombination, and the acquisition of new genes. New genes can be acquired by interspecific hybridization or the duplication of some or all the existing genes of an organism (Renny-Byfield and Wendel 2014; Cong et al. 2015). Plant genomes, unlike those of animals, have evolved through frequent, rapid chromosomal rearrangements, including whole-genome duplications (WGDs) followed by nested chromosome fusions in particular (Abrouk et al. 2010; Salse et al. 2015). The sequencing of plant genomes has made it possible to construct evolutionary models for various angiosperm lineages (Murat et al. 2010; Salse 2012; Murat et al. 2015). These studies revealed that angiosperms have experienced successive common and lineage-specific WGDs, which have governed increases in genome size and shaped the genome structure and composition of extant species (Renny-Byfield and Wendel 2014). Evolutionary genome shuffling events (chromosomal fusions and fissions) have been identified during the course of angiosperm evolutionary history, making it possible to reconstruct the karyotypes of the common ancestors of eudicots, with seven protochromosomes, and of monocots, with five or seven protochromosomes (Abrouk et al. 2010; Salse 2012). However, we still know little about karyotype evolution in the other group of seed plants, the gymnosperms. Conifers are the most abundant group of gymnosperms. Genome structure and evolution differ between conifers and angiosperms. Conifers have extremely large genomes (ranging from 18 to 35 Gb) characterized by the presence of repetitive elements (Kovach et al. 2010; Mackay et al. 2012; Neale et al. 2014). These features have hindered attempts to sequence the genomes of these plants and the recently released draft genome sequences are highly fragmented (Nystedt et al. 2013; Zimin et al. 2014; Warren et al. 2015). Consequently, the evolutionary mechanisms shaping the composition and structure of conifer genomes remain to be determined. One ancient WGD event is known to have occurred before the angiosperm–gymnosperm split around 350 Ma (Jiao et al. 2011). However, there is no evidence to suggest that other WGD events have occurred during the evolutionary history of conifers (Kovach et al. 2010; Nystedt et al. 2013), by contrast to what has been reported for angiosperms. Conifer genome size seems to have increased due to the accumulation of large numbers of retrotransposons (Morse et al. 2009; Nystedt et al. 2013). Polyploidy is exceptional in gymnosperms, with only two species from the Cupressaceae known to be polyploids, one of these species being hexaploid (Sequoia semperviens 2n = 66) and the other tetraploid (Juniperus chinensis 2n = 44). The haploid number of chromosomes in conifers ranges from 9 to 19, but karyotypes are highly conserved across species and genera, with most having 11 or 12 chromosomes (Wang and Ran 2014). Pinaceae, the largest family of conifers, has been studied more thoroughly than other conifers, for ecological and economic reasons. Pinus and Picea, the main genera within Pinaceae, separated around 87–193 Ma (Morse et al. 2009). Comparative mapping between Pinaceae species has revealed high levels of interspecific and intergeneric synteny and macrocollinearity (Krutovsky et al. 2004; Pelgas et al. 2006; Pavy et al. 2012), suggesting a lack of chromosomal rearrangement within this family, despite the ancient nature of the divergence between some taxa. Nevertheless, the issue of the conservation of synteny across different families of conifers has not yet been addressed. Further studies of the evolution of conifer karyotypes are therefore required to determine whether it has followed a pattern similar to that in angiosperms or more similar to that in the Pinaceae. In the absence of a completely contiguous reference genome in conifers, high-density comparative mapping provides an opportunity to compare genomes from different lineages, thereby improving our understanding of conifer karyotype evolution. In this work, we analyzed conifer karyotype evolution through comparative mapping, making use of published genetic linkage maps for two families: Pinaceae (n = 12) and Cupressaceae (n = 11). The aims of this study were 1) to analyze the degree of gene synteny and collinearity at the interfamily level, and 2) to set a likely scenario of karyotype evolution between both families.

Results and Discussion

We carried out a literature review to select high-density gene-based linkage maps for conifers for which sequence information is publicly available. We included 18 genetic maps (table 1) for six different species from two botanical families—Pinaceae (n = 12) and Cupressaceae (n = 11)—in this study. We made use of the high degree of synteny and macrocollinearity within Pinaceae (Krutovsky et al. 2004; Pavy et al. 2012) to establish a gene-based composite map for this botanical family. A stepwise strategy, from species to family level, was used to maximize the number of mapped markers common to different maps, thereby maximizing the number of anchor markers for the construction of the composite map for Pinaceae (fig. 1a). We began by constructing a composite linkage map for Pinus pinaster from 14 maps (table 1). We then generated two genus-level composite maps: 1) For Pinus sp., by combining the P. pinaster composite map generated in this study with a published map for Pinus taeda (Eckert et al. 2010); and 2) for Picea sp., based on published maps for Picea abies (Lind et al. 2014), Picea glauca and Picea mariana (Pavy et al. 2012). The composite maps for Pinus sp. and Picea sp. were then merged into a unified composite map for Pinaceae. This composite map for Pinaceae was then compared with a published gene-based map for Cryptomeria japonica (Moriguchi et al. 2012), a representative member of Cupressaceae. The strategy used to combine and compare the genetic maps is illustrated in figure 1a and supplementary figure S1, Supplementary Material online.
Table 1

Characteristics of the Genetic Linkage Maps Used in This Study to 1) Establish Composite Maps for P. pinaster (from 14 Individual Maps), and for the Pinaceae Family (combining Pinus pinaster, Pinus taeda, Picea glauca, Picea mariana, and Picea abies linkage maps), and 2) Compare the Pinaceae Composite Map with the Map of One Representative (Cryptomeria japonica) of the Cupressaceae Family

SpeciesPedigree NameLinkage Map IDNumber of IndividualsNumber of MarkersLength (cM)Mean Marker Interval (cM)Reference
Pinus pinasterC×LC1065741,4882.8Lagraulet (2015)
L8261,8632.3
M×LM1176271,6582.8
L9201,9532.2
C×MC947281,8862.6
M6301,6192.6
F2F2_O691,4811,6880.98Plomion et al. (2015)
F2_N922,0521,9931.17
G2G2M836191,4252.3Chancerel et al. (2013)
G2F5621,4452.57
C14×C15C14638121,7142.1extended from de Miguel et al. (2012)
C159231,5771.71
Gal1056×Oria6Gal1056696661,4262.14modified from de Miguel et al. (2014)
Oria67551,2961.72
Pinus taedaqtlPtaeda1721,8151,8991.1Eckert et al. (2010)
Picea glaucaD, PPglauca500, 2602,2702,0831.1Pavy et al. (2012)
Picea mariana9920002283
Picea abiesS21K7622162×S21K7621678Pabies2476861,8892.8Lind et al. (2014)
Cryptomeria japonicaYICjaponica1501,2621,4051.1Moriguchi et al. (2012)
F

Flowchart of the comparative analysis between Pinaceae and C. japonica. (a) Bioinformatic analysis developed for homologous genes identification. (b) Results of the comparative analysis between Pinaceae and C. japonica testing different confidence thresholds applied at two different steps: Sequence comparison for homolog unigne identification and comparative gene position for orthologs identification. Pathways that allowed the identification of orthologs covering the full set of chromosomes from C. japonica and Pinaceae are marked in bold.

Flowchart of the comparative analysis between Pinaceae and C. japonica. (a) Bioinformatic analysis developed for homologous genes identification. (b) Results of the comparative analysis between Pinaceae and C. japonica testing different confidence thresholds applied at two different steps: Sequence comparison for homolog unigne identification and comparative gene position for orthologs identification. Pathways that allowed the identification of orthologs covering the full set of chromosomes from C. japonica and Pinaceae are marked in bold. Characteristics of the Genetic Linkage Maps Used in This Study to 1) Establish Composite Maps for P. pinaster (from 14 Individual Maps), and for the Pinaceae Family (combining Pinus pinaster, Pinus taeda, Picea glauca, Picea mariana, and Picea abies linkage maps), and 2) Compare the Pinaceae Composite Map with the Map of One Representative (Cryptomeria japonica) of the Cupressaceae Family The P. pinaster composite map comprised 3,491 unigenes of the Pine V3 Unigene set (Canales et al. 2014) as well as 182 AFLP (amplified fragment length polymorphism) or SAMPL (selective amplification of microsatellite polymorphic locus) markers (table 2 and supplementary fig. S2, Supplementary Material online). There were 3,639 unique markers, 60% of which were present on at least two component maps. Overall, we found high degrees of synteny and collinearity between all the P. pinaster component maps and the composite map (supplementary fig. S3, Supplementary Material online). The mean proportion of markers noncollinear (inversion greater than 5 cM) between the composite map and a component map was 1.3%. The large number of markers common to different component linkage maps and the high levels of collinearity observed increased the degree of certainty concerning the relative positions of the mapped unigenes. The P. pinaster composite map contributed the largest number of mapped markers for construction of the composite map for Pinaceae (tables 1 and 2).
Table 2

Details of the Composite Genetic Linkage Maps Generated in This Study

Pinus pinasterPinusPiceaPinaceae
Nb of LGs12121212
Size (cM)1,721.71,9432,032.92,094.9
Nb of markers3,6735,1952,3256,912
Nb of markers corresponding to PineV3a unigenes3,4914,6391,9405,971
Nb of unique unigenes3,4574,6051,9315,927
Nb of unique positions1,8192,3362,0063,077
Mean marker interval (cM)0.470.390.880.30
Mean unique position interval (cM)0.960.931.020.68

aFrom Canales et al. (2014).

Details of the Composite Genetic Linkage Maps Generated in This Study aFrom Canales et al. (2014). The composite map for Pinaceae contained 6,912 mapped markers over 2,094.9 cM (table 2 and supplementary fig. S4, Supplementary Material online). As single nucleotide polymorphisms (SNPs) mapped in this composite map were identified from a variety of transcritptomic assays in diverse species, we considered as different gene loci only those that had an homolog in Pine V3, the gene catalog used as reference. Following this criterion, at least 5,927 different unigenes were mapped in the Pinaceae composite map (table 2 and supplementary table S1, Supplementary Material online). On average, 42 unigenes per linkage group (LG) were common to the Pinus sp. and Picea sp. maps, identifying a total of 513 othologous unigenes between both species (fig. 2 and supplementary table S1, Supplementary Material online). Only 5.9% of unigenes were nonsyntenic and 2.8% presented an inversion of more than 15 cM (supplementary table S1, Supplementary Material online). These markers were identified and removed from the Pinaceae composite map. As expected, there was a high degree of synteny and collinearity between members of the Pinaceae, providing support for the strategy followed in this study. A small fraction of mapped unigenes in the Pinaceae composite map may be originated by paralogy as revealed by the 44 unigenes mapped in more than one LG (supplementary table S2, Supplementary Material online). Finally, the P. pinaster, Pinus, and Pinaceae composite maps generated in this study are the densest linkage maps ever produced for conifers at species, genus, and family levels, respectively.
F

Comparison between the composite linkage maps for Pinus sp. and Picea sp. The Pinus sp. composite map is shown in blue and the Picea sp. composite map is shown in green. Orthologous markers are linked by black lines connecting the two maps. The number of orthologous markers is indicated at the top of each LG.

Comparison between the composite linkage maps for Pinus sp. and Picea sp. The Pinus sp. composite map is shown in blue and the Picea sp. composite map is shown in green. Orthologous markers are linked by black lines connecting the two maps. The number of orthologous markers is indicated at the top of each LG. Sequence comparisons between unigenes mapped on the Pinaceae and C. japonica genetic maps resulted in the identification of 257 and 229 homologous loci depending on the e-value cutoff applied (fig. 1b). Homologous unigenes were identified for all LGs whatever the threshold considered, from 17 on LG4 to 28 on LG3 for the Pinaceae map and from 13 on LG8 to 35 on LG3 for the C. japonica map (for an e-value cutoff of 1e−30). Linkage maps were aligned using homologous unigenes as anchor points. The alignment of both genetic maps enabled the identification of common genomic regions. Dotplot representation for map alignment (supplementary fig. S5, Supplementary Material online) showed a threshold of four shared unigenes between both maps suitable for othologous LG block determination. A more stringent criterion for ortholog selection was additionally tested, which consisted in a minimum of six shared unigenes between two regions to be orthologs. A total of 12–20 othologous LG blocks were identified depending on the threshold used (fig. 1b). However, orthologous LG blocks covered the complete set of chromosomes of the analyzed species only when a threshold of four shared markers was used (fig. 1b). Consequently, this threshold was considered the most appropriate in view of the level of resolution of available genetic maps, and further discussion of the results is based on this threshold. The use of a threshold of four homolgous unigenes to consider an orthologous LG block resulted in the identification of 143 orthologous loci (i.e., 55.6% of the homologous markers) for an e-value cutoff lower than 1e−30 and 124 (i.e., 54.2% of the homologous markers) for an e-value cutoff of 1e−35. Thus, the use of a more stringent selection criterion for the identification of homologous sequences did not decrease significantly the proportion of identified orthologous unigenes. As a result, we found that each of the LGs on the Pinaceae map corresponded to one or two different LG blocks on the Cupressaceae map, and that each LG on the Cupressaceae map corresponded to one to three LG blocks on the Pinaceae map (fig. 3a and supplementary fig. S5, Supplementary Material online). Each pair of orthologous LG blocks determined a contiguous ancestral region (CAR). Most CARs were identified whatever the e-value threshold applied with the exception of CARs #14 and #19 (table 3) that could not be confirmed using the most stringent criterion. Mean number of unigenes per CAR was 6 and 7 depending of the e-value cutoff applied. The number of orthologous unigenes per CAR was slightly reduced in eight CARs for the most stringent e-value cutoff, but the size of CARs was maintained with the exception of two CARs that were reduced by 20.1 and 48.1 cM, respectively (table 3). Thus, the number and size of identified CARs were consistent under the two different thresholds tested for homolog identification. Therefore, our results suggest the existence of 18–20 CARs that may have shaped the 12 chromosomes of modern Pinaceae species and the 11 chromosomes of modern Cupressaceae species through a different number of fusions (fig. 3b). Taking the Pinaceae map as the reference and inspecting the 20 proposed CARs (e-value threshold of 1e−30), seven C. japonica LGs displayed crossed CARs. Taking the noncrossing CARS as a measurement of collinearity, 40% of the CARs identified were considered to be collinear. Besides, high levels of collinearity were found within CARs, with only 18.1% of orthologous markers presenting an inversion of more than 15 cM within a CAR (supplementary fig. S5, Supplementary Material online). These results suggest an intense shuffling of orthologous LG blocks during the evolution of Pinaceae and Cupressaceae, but a higher conservation of gene order within these blocks.
F

Pinaceae–Cupressaceae comparative mapping. Results for an e-value cutoff of e−30 for homolog unigene identification and a threshold of at least four homologs shared between the two maps to determine an orthologous LG block. (a) Positions of the 143 orthologous unigenes mapped for representative species of both Pinaceae and Cupressaceae. Orthologous LG blocks are indicated by color-coded ribbons connecting the Pinaceae (in gray) and Cupressaceae (in white) LGs. LG number and genetic distance in centimorgans are indicated outside the circle. Pinaceae LGs are ordered from 1 to 12 and C. japonica LGs are ordered to facilitate graphical visualization. (b) Representation of the more parsimonious model of evolution of the identified orthologous LG blocks mapped on Pinaceae and C. japonica. Each orthologous LG block determined a CAR. Identified CARs are numbered from 1 to 20.

Table 3

Number of Unigenes and Size (cM) of Identified Orthologous LG blocks (CARs) at Two e-Value Cutoffs for Homologous Unigenes Identification

e−30
e−35
CARNb UnigenescM (Pinaceae)Nb UnigenescM (Pinaceae)
1910.9–155.8910.9–155.8
21011.9–84.9911.9–84.9
3120–146.5110–146.5
4765.3–147.5765.3–147.5
5527.6–69427.6–69
6513.5–123513.5–123
7612.2–70.8612.2–70.8
8737.4–157.9737.4–157.9
9842.9–83.7742.9–83.7
10677.2–146.44125.3–146.4
11427.7–64.7427.7–64.7
124116.6–161.84116.6–161.8
13630.9–69.9530.9–69.9
14525.9–740
15859.8–132859.8–132
16105–163.795–163.7
1766.7–41.266.7–41.2
18941.9–124.7862–124.7
19439.5–62.60
201214.9–163.31114.9–163.3

Note.—Changes in the number of unigenes or size of CARs following the most stringent threshold are indicated in bold.

Pinaceae–Cupressaceae comparative mapping. Results for an e-value cutoff of e−30 for homolog unigene identification and a threshold of at least four homologs shared between the two maps to determine an orthologous LG block. (a) Positions of the 143 orthologous unigenes mapped for representative species of both Pinaceae and Cupressaceae. Orthologous LG blocks are indicated by color-coded ribbons connecting the Pinaceae (in gray) and Cupressaceae (in white) LGs. LG number and genetic distance in centimorgans are indicated outside the circle. Pinaceae LGs are ordered from 1 to 12 and C. japonica LGs are ordered to facilitate graphical visualization. (b) Representation of the more parsimonious model of evolution of the identified orthologous LG blocks mapped on Pinaceae and C. japonica. Each orthologous LG block determined a CAR. Identified CARs are numbered from 1 to 20. Number of Unigenes and Size (cM) of Identified Orthologous LG blocks (CARs) at Two e-Value Cutoffs for Homologous Unigenes Identification Note.—Changes in the number of unigenes or size of CARs following the most stringent threshold are indicated in bold. Previous comparative genomics studies in Pinaceae have reported high levels of synteny and collinearity for genes (Chagné et al. 2003; Krutovsky et al. 2006; Pelgas et al. 2006; Pavy et al. 2012). The conservative genome macrostructure among Pinaceae species has been interpreted as evidence that genome rearrangement events are rare in conifers (Diaz-Sala et al. 2013; Nystedt et al. 2013). The results presented here revise this view of conifer genome evolution, which was inferred essentially from comparisons of Pinaceae species. Our findings also support a new hypothesis that substantial chromosome rearrangements have occurred between families. Molecular phylogenetic studies support the splitting of conifers into two groups: Pinaceae and Coniferales II, corresponding to all conifer families other than Pinaceae (Bowe et al. 2000; Gugerli et al. 2001; Lu et al. 2014). The observed chromosomal rearrangements may have generated a reproductive barrier separating the two lineages around 290 Ma (Burleigh et al. 2012). On the other hand, Pinus and Picea display remarkable levels of synteny and collinearity despite their ancient divergence, confirming the exceptionally high degree of genome structure conservation within Pinaceae. According to Gernandt et al. (2011), conifers (Coniferales) can be grouped into six different families: Pinaceae, Podocarpaceae, Araucariaceae, Sciadopityaceae, Taxaceae, and Cupressaceae. Comparative genomics studies with representatives of other conifer families are crucial to determine whether the lack of genome rearrangement observed in Pinaceae is a feature common to other conifers. The adaptive radiation of some Cupressaceae species dates from the Oligocene (23–33 Ma), but the first fossil record of C. japonica dates from 55 to 65 Ma (Yang et al. 2012). The study of genome structure in other species from Cupressaceae with a shorter life history could provide new insight into the mechanisms and patterns of genome evolution in conifers. The n = 12 karyotype is considered the most primitive in the Pinaceae family, based on chromosome morphometrics (Nkongolo et al. 2012). However, we were unable to reconstruct the karyotype of the common ancestor of Pinaceae and Cupressaceae in this study due to the lack of a suitable outgroup species phylogenetically close to conifers and with a well-assembled genome. The candidate species best matching these criteria is the basal angiosperm Amborella trichopoda (Amborella Genome Project 2013). A comparison between basal angiosperms and conifers should open up promising perspectives for the construction of a model of karyotype evolution. Comparative genomics and phylogenetic studies based on genome-wide comparisons with conifers will be crucial to bridge the gaps in our understanding of the evolution of plant genomes from cryptogams to flowering plants.

Conclusion

The results reported here take us a step beyond the “stasis” already described for the Pinaceae, opening up new avenues of research into the evolution of conifer genomes. We propose a possible scenario for conifer genome evolution, based on the fusion of chromosomal blocks, serving as a prelude to the modern karyotype configuration in Pinaceae and C. japonica. However, further improvements in our knowledge of basal angiosperms and gymnosperms will be required to reconstruct the karyotype of the common ancestor of seed plants.

Materials and Methods

Description of the Genetic Linkage Maps Used in This Study

The following terms were used to describe the different kinds of genetic maps used in this study, as suggested by Hudson et al. (2012): 1) Sex-averaged map: a consensus map for both parents of a pedigree; 2) consensus map: an integrated map based on segregation data from individual component maps; 3) composite map: an integrated map of different individual component maps built by a marker-merging method; and 4) component map: each of the maps used in the construction of a composite map. The graphics and the representations of genetic maps were produced with R 3.1.0 (R Core Team 2014).

Pinus pinaster

We used 14 base maps generated from seven different crosses to generate a composite genetic linkage map for P. pinaster. The first six maps were obtained from three controlled crosses (pedigrees #1, #2, and #3 in supplementary fig. S1, Supplementary Material online) between three different genotypes: Corsica × Landes (C×L), Morocco × Landes (M×L), and Corsica × Morocco (C×M). In total, 106, 117, and 94 full-sibs were genotyped with the 9k SNP-array described by Plomion et al. (2015) for C×L, M×L, and C×M, respectively. The regression mapping algorithm of JoinMap 4.1 (Van Ooijen 2011) was used to produce two maps for each parental genotype (one per cross), according to a two-way pseudotestcross mapping strategy (Grattapaglia and Sederoff 1994), using testcross markers (i.e., segregating in a 1:1 Mendelian ratio) only. The genetic maps were then combined into sex-averaged maps (Corsica, Landes and Morocco, see supplementary fig. S1, Supplementary Material online) with the function “combine groups for map integration” of JoinMap 4.1. More details on the construction of the maps can be found in Lagraulet (2015). Four other maps from two different mapping populations were also used. The first population was a three-generation inbred pedigree consisting of an F2 population (#4 in supplementary fig. S1, Supplementary Material online) resulting from the selfing of an interprovenance tree (Landes × Corsica). The second population was a three-generation outbred pedigree (G2, #5) resulting from a controlled cross of two intraprovenance hybrid trees (Landes × Landes). The construction of these maps was described by Plomion et al. (2015) for the F2 population and Chancerel et al. (2013) for the G2 population. For the F2 population, two different sets of individuals were used to generate two maps (F2_O and F2_N) with the RECORD algorithm (Van Os et al. 2005). For the G2 population, one map for each parent (G2M and G2F) was produced with the regression mapping algorithm of JoinMap 4.1 (Van Ooijen, 2011). The F2_O, G2M, and G2F maps included different marker types: AFLP, single sequence repeat (SSR), expressed sequence tag (EST), and SNPs from different arrays (Chancerel et al. 2011, 2013), whereas the F2_N map contained only SNPs from the 9k SNP-array (Plomion et al. 2015). We made use of the large number of common markers and the high level of collinearity between the two F2 maps to construct a composite map (referred to as F2C by Plomion et al. 2015). The last four maps were generated from two different F1 crosses: C14×C15 (#6 in supplementary fig. S1, Supplementary Material online) and Gal1056×Oria6 (#7). From the initial parental maps of the C14×C15 mapping population described by de Miguel et al. (2012), we mapped an additional set of 980 SNPs from the 12k SNP-array described by Chancerel et al. (2011) and 273 SNPs from a 1,536 SNP-array (Sáez-Laguna et al. unpublished) here to increase the number of anchor markers common to other maps. The parental maps of the Gal1056×Oria6 population used by de Miguel et al. (2014) were reconstructed in this study with the most informative individuals. For both pedigrees, we used the maximum-likelihood mapping algorithm of JoinMap 4.1 to generate individual genetic maps and sex-averaged maps. The four maps from the C14×C15 and Gal1056×Oria6 crosses contained different types of molecular markers (SSRs, ESTs, SAMPLs, and SNPs). For all maps, genetic distances in centimorgans (cM) were calculated with the Kosambi mapping function (Kosambi 1943).

Other Pinaceae

We carried out a literature review to identify previously published high-density gene-based linkage maps for members of the Pinaceae family, for which sequence information was publicly available. Only four studies satisfied these criteria. Eckert et al. (2010) provided a sex-averaged linkage map for a two-generation outbred pedigree based on SNPs for P. taeda (accession number: TG091, http://dendrome.ucdavis.edu/cmap/, last accessed May 21, 2015). The map provided by Pavy et al. (2012) is the densest genetic map published to date for Picea. This map was a consensus of the white spruce (Pic. glauca) and black spruce (Pic. mariana) linkage maps. The white spruce pedigree was an F1 full-sib family, whereas the black spruce pedigree was a backcross representing the hybridization species complex Pic. mariana × Pic. rubens. Pic. glauca and Pic. mariana linkage maps were constructed with the regression algorithm in JoinMap 3.0 software. Lind et al. (2014) provided the most saturated and gene-rich map to date for Pic. abies. This map was a sex-averaged map of the two parents of an F1 controlled cross and was also constructed with the regression algorithm of JoinMap 3.0. A detailed list of mapping features for each component map included in this study is available in table 1.

Cryptomeria japonica

A high-density linkage map for C. japonica was incorporated into this study, as a representative species from the Cupressaceae family (Moriguchi et al. 2012). This map was constructed from an F1 full-sib family (table 1), with the regression mapping algorithm implemented in JoinMap v 3.0.

Identification of Homologous Genes within Pinaceae

The 17 maps described above were mostly constructed with SNP markers (100% of the markers for Picea sp., 98% for P. pinaster, and 90% for P. taeda). The flanking sequences of each SNP marker were compared with the most recent Unigene sets available for each species to obtain the sequence of unigenes containing the mapped SNPs: Canales et al. (2014) for P. pinaster, Rigault et al. (2011) for Pic. glauca, Pic. mariana and Pic. abies, and Lorenz et al. (2012) for P. taeda. This comparison was carried out with the BLASTn tool (the BLAST 1 step in fig. 1a). Unigenes with a percentage identity exceeding 95% with mapped SNP flanking sequences were retained for the next step. Pinus pinaster Unigene set from Canales et al. (2014), Pine V3, was then used as the reference for the identification of homologous unigenes within Pinaceae species. A second sequence comparison (R-BLAST 2 in fig. 1a) was performed, between the mapped unigenes of each species and the unigene sequences of Pine V3. For this interspecific comparison, a stringent reciprocal tBLASTx analysis was performed. Only sequences with a reciprocal best hit with a percentage identity exceeding 85%, an e value below e−65, and an alignment of more than 200 bp were retained as homologous unigenes. Homologous unigenes between different species were considered as orthologs if they were positioned in the same LG (i.e., syntenic unigenes). Identified orthologous unigenes were used as anchor markers to construct a composite linkage map for each genus (Pinus and Picea), as a preliminary step in the construction of a composite map for the Pinaceae family including both genera.

Construction of Composite Genetic Linkage Maps

We used the R package LPmerge (Endelman and Plomion 2014) to integrate component linkage maps into a composite map without the use of segregation data. LPmerge assessed the goodness of fit of the composite map by calculating a root mean squared error (RMSE) per LG, by comparing the position (in cM) of all markers on the composite map with that on the component maps. We calculated this metric for different maximum interval sizes (parameter K in the algorithm), ranging from 1 to 10. The value of K minimizing the mean RMSE per LG was selected for construction of the composite map. This method was used for the construction of all the composite species maps reported here. Further details about the production of each composite map are described below. Before integrating the 14 base genetic linkage maps into a single composite map, we established consensus maps (supplementary fig. S1, Supplementary Material online) based on markers common to different accessions across pedigrees (Corsica, Landes, Morocco genotypes for pedigrees #1, #2, and #3, respectively), or accessions within pedigrees (Coca and GxO for pedigrees #6 and #7, respectively), or based on the merging of different data sets of the same pedigree (F2 for pedigree #4). This process, designed to increase the number of markers common to component maps, was facilitated by the use of the same 12k (Chancerel et al. 2013) and 9k (Plomion et al. 2015) SNP-arrays for some pedigrees. The SNP marker ID of each component map was replaced by the corresponding maritime pine unigene ID from Canales et al. (2014). This step, which was essential for the use of LPmerge (i.e., same marker name for orthologous markers), also made it possible to check the collinearity between maps. Thus, nonsyntenic unigenes between different P. pinaster linkage maps were removed from the analysis with the exception of those validated for at least two other component maps (supplementary table S2, Supplementary Material online). Finally, LPmerge was used to create the composite map for P. pinaster. Given that similar numbers of genotypes were used to obtain the base maps and the high degree of synteny between base maps, each component map was assigned the same weight in LPmerge.

Pinaceae

The SNP marker ID of each species component map was replaced by the corresponding homologous unigene ID of P. pinaster (Canales et al. 2014). We established composite maps at genus level before integrating the four genetic maps for each species into a single composite map for Pinaceae. This process was designed to increase the number of markers common to the component maps for each genus (Pinus and Picea). LPmerge was used to build these two composite maps, following the same procedure as described for P. pinaster. We discarded nonsyntenic unigenes, except for nonsyntenic unigenes validated by at least two component maps in the P. pinaster composite map, from the construction of composite linkage maps. Noncollinear unigenes with inversions of more than 15 cM were also excluded from the construction of the composite linkage map. A large inversion of a group of markers was detected in LG7 of Pic. abies (Lind et al. 2014), when the map for this species was compared with that for Pic. glauca (Pavy et al. 2008). Pic. abies LG7 (renamed LG2 after comparison with the P. pinaster reference map) was reconstructed from genotyping data provided as supplementary material by Lind et al. (2014), using the same parameters described in Lind’s article and the same mapping software (JoinMap v4.1). Two markers with a −Log10 (p) > 1 that produced a large number of double recombinants were excluded from this LG map. We were thus able to map 16 additional markers, and a much higher degree of synteny and collinearity was found between the homologous LGs of Pic. abies and Pic. glauca (supplementary fig. S3, Supplementary Material online).

Comparison with C. japonica

The available map for C. japonica consisted of 77% of SNP markers (Moriguchi et al. 2012). Sequences of unigenes containing the mapped SNPs were retrieved from the Unigene set developed by Ueno et al. (2012). Then, unigene sequences from Ueno et al. (2012) mapped in C. japonica linkage map (Moriguchi et al. 2012) were compared with those of P. pinaster (Canales et al. 2014) mapped in the Pinaceae composite map using tBLASTx (BLAST 3, fig. 1a). Different e-value cutoff for homologous unigenes identification between Pinaceae and C. japonica was tested: Lower than 1e−30 and 1e−35. Selected homologs from the Pinaceae and C. japonica linkage maps were used for comparative mapping. We established orthologous blocks within LGs where several homologous unigenes were shared between both families. Different thresholds were also tested to consider an orthologous block within an LG: Blocks with at least four and six shared unigenes. Each pair of orthologous chromosomal blocks determined a CAR between the two families. The most parsimonious evolutionary model between Pinaceae and Cupressaceae considering the existence of the identified CARs was proposed. Circular genetic maps used in interfamily comparative mapping were drawn with Circos software (Krzywinski et al. 2009).

Data Accessibility

All the linkage maps described here are available from the Pinus portal (a European genetic and genomic resource for Pinus) through the PinusMap application (https://w3.pierroton.inra.fr/PinusPortal/index.php). Accession numbers for marker sequences used in this study are available in supplementary table S3, Supplementary Material online.

Supplementary Material

Supplementary figures S1–S5 and tables S1–S3 are available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/).
  42 in total

Review 1.  In silico archeogenomics unveils modern plant genome organisation, regulation and evolution.

Authors:  Jérôme Salse
Journal:  Curr Opin Plant Biol       Date:  2012-01-24       Impact factor: 7.834

2.  Three genome-based phylogeny of Cupressaceae s.l.: further evidence for the evolution of gymnosperms and Southern Hemisphere biogeography.

Authors:  Zu-Yu Yang; Jin-Hua Ran; Xiao-Quan Wang
Journal:  Mol Phylogenet Evol       Date:  2012-05-18       Impact factor: 4.286

3.  Ancestral grass karyotype reconstruction unravels new mechanisms of genome shuffling as a source of plant evolution.

Authors:  Florent Murat; Jian-Hong Xu; Eric Tannier; Michael Abrouk; Nicolas Guilhot; Caroline Pont; Joachim Messing; Jérôme Salse
Journal:  Genome Res       Date:  2010-09-28       Impact factor: 9.043

4.  RECORD: a novel method for ordering loci on a genetic linkage map.

Authors:  Hans Van Os; Piet Stam; Richard G F Visser; Herman J Van Eck
Journal:  Theor Appl Genet       Date:  2005-10-14       Impact factor: 5.699

Review 5.  Karyotype evolution in the Pinaceae: implication with molecular phylogeny.

Authors:  K K Nkongolo; M Mehes-Smith
Journal:  Genome       Date:  2012-10-25       Impact factor: 2.166

6.  The evolutionary split of Pinaceae from other conifers: evidence from an intron loss and a multigene phylogeny.

Authors:  F Gugerli; C Sperisen; U Büchler; I Brunner; S Brodbeck; J D Palmer; Y L Qiu
Journal:  Mol Phylogenet Evol       Date:  2001-11       Impact factor: 4.286

7.  Phylogeny of seed plants based on all three genomic compartments: extant gymnosperms are monophyletic and Gnetales' closest relatives are conifers.

Authors:  L M Bowe; G Coat; C W dePamphilis
Journal:  Proc Natl Acad Sci U S A       Date:  2000-04-11       Impact factor: 11.205

8.  A reference linkage map for Eucalyptus.

Authors:  Corey J Hudson; Jules S Freeman; Anand R K Kullan; César D Petroli; Carolina P Sansaloni; Andrzej Kilian; Frank Detering; Dario Grattapaglia; Brad M Potts; Alexander A Myburg; René E Vaillancourt
Journal:  BMC Genomics       Date:  2012-06-15       Impact factor: 3.969

9.  Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine.

Authors:  Emilie Chancerel; Camille Lepoittevin; Grégoire Le Provost; Yao-Cheng Lin; Juan Pablo Jaramillo-Correa; Andrew J Eckert; Jill L Wegrzyn; Diana Zelenika; Anne Boland; Jean-Marc Frigerio; Philippe Chaumeil; Pauline Garnier-Géré; Christophe Boury; Delphine Grivet; Santiago C González-Martínez; Pierre Rouzé; Yves Van de Peer; David B Neale; Maria T Cervera; Antoine Kremer; Christophe Plomion
Journal:  BMC Genomics       Date:  2011-07-18       Impact factor: 3.969

10.  High-density linkage mapping in a pine tree reveals a genomic region associated with inbreeding depression and provides clues to the extent and distribution of meiotic recombination.

Authors:  Emilie Chancerel; Jean-Baptiste Lamy; Isabelle Lesur; Céline Noirot; Christophe Klopp; François Ehrenmann; Christophe Boury; Grégoire Le Provost; Philippe Label; Céline Lalanne; Valérie Léger; Franck Salin; Jean-Marc Gion; Christophe Plomion
Journal:  BMC Biol       Date:  2013-04-18       Impact factor: 7.431

View more
  7 in total

1.  High-density linkage mapping and distribution of segregation distortion regions in the oak genome.

Authors:  Catherine Bodénès; Emilie Chancerel; François Ehrenmann; Antoine Kremer; Christophe Plomion
Journal:  DNA Res       Date:  2016-03-23       Impact factor: 4.458

2.  Linkage and Association Mapping for Two Major Traits Used in the Maritime Pine Breeding Program: Height Growth and Stem Straightness.

Authors:  Jérôme Bartholomé; Marco Cam Bink; Joost van Heerwaarden; Emilie Chancerel; Christophe Boury; Isabelle Lesur; Fikret Isik; Laurent Bouffier; Christophe Plomion
Journal:  PLoS One       Date:  2016-11-02       Impact factor: 3.240

3.  Effective BAC clone anchoring with genotyping-by-sequencing and Diversity Arrays Technology in a large genome cereal rye.

Authors:  Ewa Borzęcka; Anna Hawliczek-Strulak; Leszek Bolibok; Piotr Gawroński; Katarzyna Tofil; Paweł Milczarski; Stefan Stojałowski; Beata Myśków; Małgorzata Targońska-Karasek; Agnieszka Grądzielewska; Miłosz Smolik; Andrzej Kilian; Hanna Bolibok-Brągoszewska
Journal:  Sci Rep       Date:  2018-05-30       Impact factor: 4.379

4.  An Ultra-Dense Haploid Genetic Map for Evaluating the Highly Fragmented Genome Assembly of Norway Spruce (Picea abies).

Authors:  Carolina Bernhardsson; Amaryllis Vidalis; Xi Wang; Douglas G Scofield; Bastian Schiffthaler; John Baison; Nathaniel R Street; M Rosario García-Gil; Pär K Ingvarsson
Journal:  G3 (Bethesda)       Date:  2019-05-07       Impact factor: 3.154

5.  The Mitogenome of Norway Spruce and a Reappraisal of Mitochondrial Recombination in Plants.

Authors:  Alexis R Sullivan; Yrin Eldfjell; Bastian Schiffthaler; Nicolas Delhomme; Torben Asp; Kim H Hebelstrup; Olivier Keech; Lisa Öberg; Ian Max Møller; Lars Arvestad; Nathaniel R Street; Xiao-Ru Wang
Journal:  Genome Biol Evol       Date:  2020-01-01       Impact factor: 3.416

6.  Evolution of complex genome architecture in gymnosperms.

Authors:  Tao Wan; Yanbing Gong; Zhiming Liu; YaDong Zhou; Can Dai; Qingfeng Wang
Journal:  Gigascience       Date:  2022-08-10       Impact factor: 7.658

7.  Performance of genomic prediction within and across generations in maritime pine.

Authors:  Jérôme Bartholomé; Joost Van Heerwaarden; Fikret Isik; Christophe Boury; Marjorie Vidal; Christophe Plomion; Laurent Bouffier
Journal:  BMC Genomics       Date:  2016-08-11       Impact factor: 3.969

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.