| Literature DB >> 26068575 |
Jared W Westbrook1, Vikram E Chhatre2, Le-Shin Wu3, Srikar Chamala4, Leandro Gomide Neves5, Patricio Muñoz6, Pedro J Martínez-García7, David B Neale7, Matias Kirst1, Keithanne Mockaitis8, C Dana Nelson2, Gary F Peter1, John M Davis9, Craig S Echt10.
Abstract
A consensus genetic map for Pinus taeda (loblolly pine) and Pinus elliottii (slash pine) was constructed by merging three previously published P. taeda maps with a map from a pseudo-backcross between P. elliottii and P. taeda. The consensus map positioned 3856 markers via genotyping of 1251 individuals from four pedigrees. It is the densest linkage map for a conifer to date. Average marker spacing was 0.6 cM and total map length was 2305 cM. Functional predictions of mapped genes were improved by aligning expressed sequence tags used for marker discovery to full-length P. taeda transcripts. Alignments to the P. taeda genome mapped 3305 scaffold sequences onto 12 linkage groups. The consensus genetic map was used to compare the genome-wide linkage disequilibrium in a population of distantly related P. taeda individuals (ADEPT2) used for association genetic studies and a multiple-family pedigree used for genomic selection (CCLONES). The prevalence and extent of LD was greater in CCLONES as compared to ADEPT2; however, extended LD with LGs or between LGs was rare in both populations. The average squared correlations, r(2), between SNP alleles less than 1 cM apart were less than 0.05 in both populations and r(2) did not decay substantially with genetic distance. The consensus map and analysis of linkage disequilibrium establish a foundation for comparative association mapping and genomic selection in P. taeda and P. elliottii.Entities:
Keywords: linkage disequilibrium; linkage mapping; pine; population structure
Mesh:
Substances:
Year: 2015 PMID: 26068575 PMCID: PMC4528325 DOI: 10.1534/g3.115.019588
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Composition of input maps used to construct consensus genetic map for Pinus taeda and Pinus elliottii
| Input Map | Cohort Structure | Marker Type | Avg. GIC | Weight | Length, cM | |||
|---|---|---|---|---|---|---|---|---|
| QTL-BASE1 | BASE | 97 | SSR | 233 | 197 | 402 | 0.21 | 1413 |
| QTL | 170 | RFLP | 123 | 113 | ||||
| ESTP | 104 | 96 | ||||||
| QTL-BASE2 | BASE1 | 92 | SNP | 2307 | 1895 | 462 | 0.24 | 1476 |
| BASE2 | 110 | RFLP | 124 | 124 | ||||
| QTL1 | 180 | ESTP | 35 | 35 | ||||
| QTL2 | 307 | |||||||
| BC1 | 1 cohort | 490 | SNP | 803 | 803 | 941 | 0.49 | 1378 |
| 10-5 | 1 cohort | 72 | SNP | 2776 | 1359 | 121 | 0.06 | 1910 |
| PAV | 65 | 16 | ||||||
Data source for input map: QTL-BASE1 (Echt ), QTL-BASE2 (Martínez-García ), BC1 (Westbrook ), 10-5 (Neves ); Nind, number of individuals in the mapping populations; Marker type: SNP, single nucleotide polymorphism; PAV, presence/absence variants of genes; RFLP, restriction fragment length polymorphisms, ESTP, expressed sequence tag polymorphism; SSR, simple sequence repeat; Nmarkers original, total number of markers in the original input map, including all redundancies, by marker type and total; Nmarkers selected, number of nonredundant markers selected for maximum informativeness in consensus mapping, by marker type and total; Avg. GIC, input map’s average marker genotype information content; Weight, input map weight value used to resolve marker order conflicts in consensus mapping, scaled in proportion to Avg. GIC; Length, input map’s genome length in cM(Kosambi) units.
Figure 1Comparisons of orders of shared markers between input maps and the MergeMap consensus genetic map of Pinus taeda and Pinus elliottii. Linkage group (LG) lengths were standardized to 100 units for comparison among maps.
Summary of the consensus genetic map for Pinus taeda and Pinus elliottii by linkage group (LG)
| LG | Length, cM | Average Marker Spacing, cM | Maximum Marker Spacing, cM | Average 95% C.I. Marker Positions, cM | |
|---|---|---|---|---|---|
| 1 | 305 | 184.89 | 0.61 | 4.94 | 1.75 |
| 2 | 351 | 222.00 | 0.63 | 5.85 | 1.22 |
| 3 | 342 | 186.88 | 0.55 | 4.05 | 0.66 |
| 4 | 306 | 186.32 | 0.61 | 6.77 | 0.47 |
| 5 | 376 | 216.41 | 0.58 | 6.77 | 1.55 |
| 6 | 326 | 193.57 | 0.60 | 7.65 | 0.69 |
| 7 | 304 | 193.43 | 0.64 | 6.42 | 0.15 |
| 8 | 338 | 189.56 | 0.56 | 4.39 | 0.57 |
| 9 | 323 | 172.00 | 0.53 | 5.98 | 0.41 |
| 10 | 331 | 211.00 | 0.64 | 4.20 | 1.44 |
| 11 | 275 | 146.89 | 0.54 | 5.35 | 1.07 |
| 12 | 279 | 202.48 | 0.73 | 8.47 | 0.68 |
| Total | |||||
| Average | 321 | 192.12 | 0.60 | 5.87 | 0.90 |
Figure 2Pinus taeda and Pinus elliottii consensus map features; centimorgan (cM) scale left of bar, locus mean consensus position along bar; GeneID right of bar; marker type (font color): SNP and PAV (black), SSR (red), ESTP and RFLP (green); the variable next to GeneID indicates whether the locus was present (x) or absent (0) on the four input maps, listed in the following order: QTL-BASE1, QTL-BASE2, BC1, 10-5; far right column denotes cM range of upper and lower bounds of the 95% C.I. for the locus position: 0–1 cM (+) and 1–2 cM (++). Detail shown is from LG-4. For full map graphic, see Figure S9.
Figure 3Kernel density estimation of mapped markers in the MergeMap consensus genetic map and in two genotype-phenotype discovery populations of Pinus taeda. (A) Marker densities in the consensus map (red lines) were compared to the 95% C.I. of a Poisson distribution (gray regions) of random deviations from uniform marker densities. The densities of SNPs mapped in (B) ADEPT2 (unrelated association) or (C) CCLONES (multiple-family pedigree) populations (black lines) were compared against marker densities in the consensus map (red lines).
Figure 4Distributions of r2 between pairs of SNP loci within genes in the ADEPT2 (unrelated association) and CCLONES (multiple-family pedigree) populations of P. taeda
Number and percentage of SNP pairs among different genes with r2 > 0.1 for three linkage classes
| MAF | ADEPT2 Unrelated Association | CCLONES Multiple-Family Pedigree | |||
|---|---|---|---|---|---|
| No. (total) | % | No. (total) | % | ||
| Within LGs, <1 cM | |||||
| Before r2 adjustment | 0.001 | 110 (2617) | 4.2 | 210 (2921) | 7.2 |
| 0.1 | 81 (1247) | 6.5 | 134 (1286) | 10.4 | |
| 0.2 | 63 (501) | 12.6 | 92 (504) | 18.1 | |
| After r2 adjustment | 0.001 | 103 (2617) | 3.9 | 186 (2921) | 6.4 |
| 0.1 | 79 (1247) | 6.3 | 115 (1284) | 9.0 | |
| 0.2 | 58 (501) | 11.6 | 62 (507) | 12.2 | |
| Within LGs, >1 cM | |||||
| Before r2 adjustment | 0.001 | 142 (174,032) | <0.1 | 3498 (198,910) | 1.8 |
| 0.1 | 88 (82,576) | 0.1 | 1274 (85,524) | 1.5 | |
| 0.2 | 73 (32,452) | 0.2 | 621 (33,007) | 1.9 | |
| After r2 adjustment | 0.001 | 125 (174,032) | <0.1 | 1434 (198,910) | 0.7 |
| 0.1 | 80 (82,576) | 0.1 | 412 (85,524) | 0.5 | |
| 0.2 | 60 (32,452) | 0.2 | 116 (33,007) | 0.4 | |
| Between LGs | |||||
| Before r2 adjustment | 0.001 | 305 (1,933,979) | <0.1 | 16,999 (2,208,279) | 0.8 |
| 0.1 | 8 (922,248) | <0.1 | 5256 (953,810) | 0.6 | |
| 0.2 | 0 (360,875) | 0 | 2485 (368,881) | 0.7 | |
| After r2 adjustment | 0.001 | 229 (1,933,979) | <0.1 | 3052 (2208,279) | 0.1 |
| 0.1 | 0 (922,248) | 0 | 558 (953,810) | 0.1 | |
| 0.2 | 0 (360,875) | 0 | 370 (368,881) | 0.1 | |
Number and percentage of SNP pairs among different genes with r2 > 0.1 for three linkage classes before and after adjusting r2 for subpopulation structure in ADEPT2 (unrelated) and kinship in CCLONES (multiple-family pedigree). MAF, minimum minor allele frequency threshold for SNP pairs; LG, linkage group.
Figure 5Linkage disequilibrium r2 values between loci on the same LGs as a function of genetic distance in ADEPT2 (unrelated association) and CCLONES (multiple-family pedigree) populations of P. taeda. Displayed are values of r2 between mapped SNPs in different ESTs with minor allele frequencies greater than 0.1 and with less than 50% missing data. Red lines are kernel regressions of r2 vs. genetic distance.
Figure 6Extended linkage disequilibrium within linkage groups in the ADEPT2 (unrelated association) and CCLONES (multiple-family pedigree) populations of P. taeda. Values of r2 vs. genetic distance are plotted for SNP pairs with r2 > 0.1 that were more than 1 cM apart (A) before and (B) after accounting for population structure in ADEPT2 and kinship in CCLONES. The 95% confidence ellipses of r2 vs. genetic distance are depicted in red.
Figure 7Linkage disequilibrium between SNPs on different linkage groups in the ADEPT2 (unrelated association) and CCLONES (multiple-family pedigree) populations of P. taeda. Distributions of r2 were plotted for SNP pairs on different LGs with r2 > 0.1. The distributions were compared (A) before and (B) after adjusting r2 values for subpopulation structure in ADEPT2 and kinship in CCLONES.