| Literature DB >> 24192835 |
Leandro Gomide Neves1, John M Davis, William B Barbazuk, Matias Kirst.
Abstract
Loblolly pine (Pinus taeda L.) is an economically and ecologically important conifer for which a suite of genomic resources is being generated. Despite recent attempts to sequence the large genome of conifers, their assembly and the positioning of genes remains largely incomplete. The interspecific synteny in pines suggests that a gene-based map would be useful to support genome assemblies and analysis of conifers. To establish a reference gene-based genetic map, we performed exome sequencing of 14729 genes on a mapping population of 72 haploid samples, generating a resource of 7434 sequence variants segregating for 3787 genes. Most markers are single-nucleotide polymorphisms, although short insertions/deletions and multiple nucleotide polymorphisms also were used. Marker segregation in the population was used to generate a high-density, gene-based genetic map. A total of 2841 genes were mapped to pine's 12 linkage groups with an average of one marker every 0.58 cM. Capture data were used to detect gene presence/absence variations and position 65 genes on the map. We compared the marker order of genes previously mapped in loblolly pine and found high agreement. We estimated that 4123 genes had enough sequencing depth for reliable detection of markers, suggesting a high marker conversation rate of 92% (3787/4123). This is possible because a significant portion of the gene is captured and sequenced, increasing the chances of identifying a polymorphic site for characterization and mapping. This sub-centiMorgan genetic map provides a valuable resource for gene positioning on chromosomes and guide for the assembly of a reference pine genome.Entities:
Keywords: copy number variation; exome sequencing; high-density genetic map; high-throughput genotyping; loblolly pine
Mesh:
Year: 2014 PMID: 24192835 PMCID: PMC3887537 DOI: 10.1534/g3.113.008714
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Description of markers segregating and mapped in the population
| Marker Class | Segregating 1:1 | Genes Mapped | |
|---|---|---|---|
| No. Marker | No. Genes | ||
| Sequence variation | 7434 | 3787 | 2776 |
| SNP | 6857 | − | 2622 |
| Short deletions | 210 | − | 63 |
| Short insertions | 208 | − | 53 |
| MNP | 159 | − | 38 |
| Presence/absence variation | 408 | 408 | 65 |
| Total | 7842 | 4195 | 2841 |
Markers identified based on sequence variation were subdivided in single-nucleotide polymorphism (SNP), short insertions and deletions (short indels) and multiple-nucleotide polymorphism (MNP). No. Markers represent the number of markers that segregated 1:1 in the mapping population, and No. Genes is the number of unique genes represented by these markers
Figure 1High-density genetic map of loblolly pine. The final map is composed of 2841 genes distributed among the 12 LGs with an average distribution of a gene every 0.6 cM and a total length of 1637.4 cM. For easy comparison, the LG name and orientation of the map follow that published by Eckert , and common genes between the two maps are shown in green. Genes for which presence/absence variation was identified and used to map the gene is shown in green and have suffix PAV (e.g., 2_6131_PAV, for unigene 2_6131). The remaining markers were coded as “unigene_marker position” (e.g., 0_2563_337, for unigene 0_2563 and a polymorphic marker at position 337 of the unigene). The genetic map graphical visualization was prepared using MapChart 2.2 (Voorrips 2002).
Description of genetic map
| Linkage Group | No. Genes Mapped | Length, cM | Intermarker Distance, cM | |
|---|---|---|---|---|
| Mean | SD | |||
| 1 | 193 | 144.72 | 0.75 | 0.65 |
| 2 | 260 | 123.12 | 0.48 | 0.39 |
| 3 | 205 | 113.29 | 0.56 | 0.44 |
| 4 | 195 | 124.79 | 0.64 | 0.54 |
| 5 | 263 | 105.07 | 0.4 | 0.62 |
| 6 | 230 | 117.54 | 0.51 | 0.83 |
| 7 | 242 | 125.03 | 0.52 | 0.71 |
| 8 | 291 | 118.53 | 0.41 | 0.41 |
| 9 | 223 | 169.32 | 0.76 | 0.91 |
| 10 | 228 | 165.71 | 0.73 | 0.58 |
| 11 | 271 | 190 | 0.7 | 0.62 |
| 12 | 240 | 140.29 | 0.59 | 0.59 |
| Average | 236.75 | 136.45 | 0.58 | 0.61 |
| Total | 2841 | 1637.41 | − | − |
A total of 2841 genes were mapped to the expected 12 linkage groups of loblolly pine based on recombination data from 72 haploid samples.
Marker validation at the grouping level: comparison of the maps from Eckert and the present map
| LG | No. Genes at Same Group | No. Genes at Different Group | Genes Linked to Different Groups |
|---|---|---|---|
| 1 | 23 | 0 | |
| 2 | 40 | 0 | |
| 3 | 35 | 0 | |
| 4 | 24 | 1 | Gene = 0_9680; Neves = LG 4, Eckert = LG 8 |
| 5 | 33 | 1 | Gene = 0_7496; Neves = LG 5, Eckert = LG 11 |
| 6 | 32 | 0 | |
| 7 | 31 | 1 | Gene = 0_5740; Neves = LG 7, Eckert = LG 12 |
| 8 | 45 | 0 | |
| 9 | 30 | 1 | Gene = UMN_213; Neves = LG 9, Eckert = LG 7 |
| 10 | 35 | 1 | Gene = 0_6106; Neves = LG 10, Eckert = LG 6 |
| 11 | 30 | 0 | |
| 12 | 34 | 0 | |
| Total | 392 | 5 |
The few genes mapped to different linkage groups (LG) between the two studies are provided in the last column.
Figure 2Comparison of the normalized relative order of shared genes used in our study (X-axis) and that of Eckert (Y-axis). Because of space constraints, only plots for LGs 2 (left) and 4 (right) are shown. Assuming genes syntheny between the two populations, a straight line would illustrate perfect agreement at the gene ordering level between the two maps. Graphs for all 12 LGs are provided in Figure S1 and Figure S2.