| Literature DB >> 18510738 |
Steve R Bischoff1, Shengdar Tsai, Nicholas E Hardison, Abby M York, Brad A Freking, Dan Nonneman, Gary Rohrer, Jorge A Piedrahita.
Abstract
BACKGROUND: Genome-wide detection of single feature polymorphisms (SFP) in swine using transcriptome profiling of day 25 placental RNA by contrasting probe intensities from either Meishan or an occidental composite breed with Affymetrix porcine microarrays is presented. A linear mixed model analysis was used to identify significant breed-by-probe interactions.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18510738 PMCID: PMC2442091 DOI: 10.1186/1471-2164-9-252
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1a, b: Theoretical outcomes of probe hybridization. A) Affymetrix short oligonucleotide arrays are fabricated with sets of eleven 25-mer probes to interrogate the expression level of a particular mRNA. Because of the short probe length, a SNP falling in the middle of the probe sequence will result in that probe's failure to hybridize. B) Three potential hybridization scenarios within a single probe set are depicted: 1) mRNA to target probes is not expressed in either breed, 2) mRNA to target probes is equivalent in both breeds or, 3) intensity of probes is expressed in one breed and negligible in the other breed. In this latter case, a probe-by-breed effect is seen with Meishan RNA hybridized to probe 2 of the probe set. The green hash represents the SFP located within a central nucleotide position within the probe.
Figure 2Volcano plot of corrected probe-by-breed effect. The volcano plot provides a visualization of significant changes in probe-by-breed expression values between the two swine populations. Using Storey's q-value procedures to correct for multiple testing, 4,635 probes were identified at an estimated false discovery rate of 5%. The points in green are expressed SFPs that have been confirmed by sequencing. The points in red are probes identified as false positives by sequencing. The horizontal red line is a q < 0.05 significance threshold with 4,635 (~1.8%) that are above the threshold. The x-axis shows the log2 estimate of Meishan minus occidental probe-by-breed effect. Vertical red lines represent 2-fold expression differences.
Figure 3Probe level plots of expression intensities illustrating a probe-by-breed interaction effect. These plots demonstrate clear examples of a breed-by-probe interaction. A) Probe 2 of TPT1 gene for all Meishan RNA samples fails to hybridize. B) In the example of PEG10, a non-breed specific polymorphism is detected and confirmed by pyrosequencing. One of the three Meishan samples, '25 MB' is grouped with the occidental placental samples as visualized by the intensity plots.
Sequence data from pyrosequencing
| TAATCACTGGTGT | |
| TAATCACTGGTGT | |
| TAATCACTGGTGT | |
| TAATCACTGGTGT | |
| TAATCACTGGTGT | |
| TAATCACTGGTGT | |
| CACTGGTGT | |
| GATATCTCT | |
| GATATCTCT | |
| GATATCTCT | |
| GATATCTCT | |
| GATATCTCT | |
| GATATCTCT | |
| GATATCTCT | |
Pyrosequencing data confirms the intensity plot where a G/C = S polymorphism is identified (top).
Pyrosequencing confirms the non-breed specific nature of the polymorphism, namely A/G = R (bottom).
Figure 4Pyrograms of PEG10. The spectral output of a pyrosequencing reaction is a representative pyrogram. PEG10 [rank 732, Ssc.13476.1.A1_at, probe 1] is depicted with an 'R' SFP detected by pyrosequencing of Meishan and occidental placental cDNA. The top panel is Meishan (red 'G' = guanine).
Summary of pyrosequencing confirmation, arbitrary probes.
| Ssc.14003.2.S1_at | 9 | RPS4X | R, Y | Transition | 15, 19 | -5.09342 | 9.90E-35 | 1 | 1 |
| Ssc.14343.1.S1_at | 2 | TPT1 | Y | Transition | 10 | -3.66704 | 8.02E-32 | 2 | 2 |
| Ssc.1341.1.S1_at | 8 | TKT | Y | Transition | 13 | -3.05487 | 4.13E-29 | 3 | 3 |
| Ssc.16710.1.S1_at | 4 | Q9BTR0 | T/- | INDEL | 7 | -3.07201 | 1.67E-26 | 9 | 9 |
| Ssc.11554.1.S1_at | 11 | MRPL39 | TAAG/---- | INDEL | 17 | -3.13603 | 4.80E-26 | 10 | 10 |
| Ssc.2756.1.A1_at | 10 | MRPL22 | Y | Transition | 20 | -3.28928 | 6.28E-25 | 15 | 15 |
| Ssc.20986.2.S1_at | 4 | GRN | R | Transition | 13 | -3.23164 | 1.66E-24 | 20 | 20 |
| Ssc.24770.1.S1_at | 3 | PLAGL1 | Y | Transition | 14 | -1.23548 | 6.85E-19 | 61 | 61 |
| Ssc.4324.2.A1_at | 4 | CAMLG | S | Transversion | 6 | -1.8121 | 4.49E-15 | 124 | 124 |
| Ssc.55.1.S1_at | 9 | EGFR | Y | Transition | 10 | -1.5839 | 3.27E-14 | 154 | 154 |
| Ssc.422.1.S1_at | 10 | IGF2R | W | Transition | 21 | -1.41267 | 1.60E-11 | 259 | 253 |
| Ssc.21595.2.S1_at | 5 | GNAS | R | Transition | 19 | -1.59188 | 2.45E-09 | 401 | 371 |
| Ssc.2315.1.A1_at | 11 | Q8N3R3 | S | Transversion | 8 | -1.77102 | 3.35E-08 | 521 | 447 |
| Ssc.26344.1.S1_at | 11 | PAK3 | R | Transition | 9 | -1.12922 | 6.41E-08 | 561 | 474 |
| Ssc.21857.1.S1_at | 11 | GRB10 | R | Transition | 5 | -0.88725 | 7.90E-08 | 576 | no rank |
| Ssc.6989.1.A1_at | 5 | SFXN1 | Y | Transition | 17 | -1.33493 | 1.75E-07 | 627 | 509 |
| Ssc.939.1.A1_at | 7 | RPL12 | K | Transversion | 20 | -3.21892 | 1.86E-07 | 637 | 514 |
| Ssc.26344.1.S1_at | 4 | PAK3 | None | None | 1.068089 | 2.30E-07 | 655 | 523 | |
| Ssc.12365.1.A1_at | 11 | ADAMTS1 | Y | Transition | 5 | -1.13449 | 0.000135 | 1345 | 718 |
| Ssc.13476.1.A1_at | 1 | PEG10 | R | Transition | 10 | -1.94654 | 0.000221 | 1444 | 732 |
| Ssc.2716.1.A1_at | 6 | NP_055322 | None | None | -1.39706 | 0.000241 | 1472 | 736 | |
| Ssc.7604.2.A1_a_at | 11 | C14orf111 | M | Transversion | 16 | -1.52629 | 0.000994 | 1871 | 783 |
| Ssc.1595.1.S1_at | 1 | PHEMX | None | None | 0.973961 | 0.001016 | 1884 | no rank | |
| Ssc.26460.1.A1_at | 1 | SNRPN | None | None | -0.66999 | 0.00147 | 2003 | no rank | |
| Ssc.11508.1.A1_at | 7 | GABRA5 | None | None | 0.473879 | 0.00433 | 2450 | no rank | |
| Ssc.3850.1.S1_at | 11 | NNAT | None | None | -1.00118 | 0.008733 | 2860 | 822 | |
| Ssc.3802.1.S1_at | 2 | NAP1L4 | A/- | INDEL | 6 | -0.59859 | 0.02431 | 3704 | no rank |
Twenty-seven putative SFPs were selected, and were pyrosequenced to confirm or disprove predicted SNPs at probe locations. The column labeled 'SFP' describes the character of the polymorphism discovered, such as transitions/transversions, INDELs (MRPL39, NAP1L4), or false positives ("None"). When filtering out probes with less than 2-fold expression difference between breeds, 87% of predicted SNPs were true.
Summary of sequencing confirmation, random probes.
| Ssc.19486.1.S1_at | 3 | CCNB1IP1 | 7.36E-23 | 28 | 28 | none | |
| Ssc.15621.1.A1_at | 6 | PCYOX1 | 4.32E-19 | 59 | 59 | none | |
| Ssc.24184.1.S1_at | 4 | GPD1L | 1.32E-16 | 94 | 94 | S | 7 |
| Ssc.7266.1.A1_at | 2 | POSTN | 1.71E-15 | 114 | 114 | none | |
| Ssc.18458.1.S1_at | 6 | Q86V57 | 4.92E-15 | 126 | 126 | Y | 11 |
| Ssc.1447.3.S1_at | 4 | NP_116255 | 1.51E-13 | 174 | 174 | K | 14 |
| Ssc.16422.2.A1_at | 2 | PLAA | 2E-13 | 175 | 175 | S | 10 |
| Ssc.21553.1.S1_at | 8 | HDGF | 2.76E-13 | 179 | 179 | R | 10 |
| Ssc.16422.2.A1_at | 1 | PLAA | 3.26E-13 | 185 | 185 | S | 17 |
| Ssc.13948.1.S1_at | 3 | GET1_HUMAN | 4.42E-13 | 188 | 188 | W | 9 |
| Ssc.10435.1.A1_at | 1 | Q14156 | 1.47E-12 | 210 | 210 | M, Y | 8,17 |
| Ssc.17315.1.S1_at | 3 | EIF3S2 | 4.64E-12 | 236 | 233 | K | 10 |
| Ssc.19651.1.S1_at | 10 | WDR42A | 8.3E-12 | 249 | 244 | Y | 11 |
| Ssc.20060.1.A1_at | 8 | TRIP12 | 2.62E-11 | 269 | 262 | Y | 4 |
| Ssc.13817.1.A1_at | 6 | VDAC3 | 5.89E-11 | 286 | 276 | Y | 13 |
| Ssc.24025.1.A1_at | 9 | Q96E16 | 6.17E-11 | 287 | 277 | S | 19 |
| Ssc.11787.2.A1_at | 9 | RASSF2 | 1.96E-10 | 326 | 312 | M | 9 |
| Ssc.21613.1.S1_at | 4 | TIGD2 | 8.38E-10 | 357 | 339 | R | 12 |
| Ssc.19447.1.A1_at | 2 | ARMCX3 | 1.37E-09 | 377 | 353 | Y | 9 |
| Ssc.30685.1.A1_at | 10 | C5orf3 | 2.48E-09 | 402 | 372 | Y | 5 |
| Ssc.16654.1.A2_at | 3 | HNRPK | 3.15E-09 | 413 | 379 | none | |
| Ssc.6356.1.S1_at | 5 | ODC1 | 1.23E-08 | 470 | 419 | R | 10 |
| Ssc.1442.1.S1_at | 8 | PHF3 | 1.98E-08 | 495 | 434 | Y | 7 |
| Ssc.2042.1.S1_at | 3 | NP_079516 | 3.39E-08 | 522 | 448 | none | |
| Ssc.20974.1.A2_at | 7 | GNS | 5.15E-08 | 549 | 467 | R | 16 |
| Ssc.11173.1.A1_at | 1 | DEOC_HUMAN | 7.88E-08 | 575 | 482 | Y | 17 |
| Ssc.2042.1.S1_at | 4 | NP_079516 | 1.83E-07 | 632 | 512 | none | |
| Ssc.12013.1.A1_at | 1 | SYTL4 | 6.35E-07 | 726 | 554 | R, Y | 15,16 |
| Ssc.10949.1.S1_at | 6 | GBAS | 9.35E-07 | 757 | 566 | none | |
| Ssc.5052.1.S1_at | 4 | KIAA0674 | 9.74E-07 | 761 | 570 | R | 11 |
| Ssc.25584.1.S1_at | 8 | CDW92 | 1.62E-06 | 797 | 589 | W | 6 |
| Ssc.1031.1.S1_at | 7 | OAS1 | 3.2E-06 | 861 | 612 | Y | 25 |
| Ssc.28305.1.A1_at | 11 | TACC1 | 3.22E-06 | 862 | 613 | none | |
| Ssc.17313.2.S1_at | 7 | CALD1 | 7.14E-06 | 938 | 638 | Y | 17 |
| Ssc.17423.1.S1_at | 4 | YWHAZ | 1.18E-05 | 990 | 661 | R | 8 |
| Ssc.12029.1.S1_at | 6 | LAMA2 | 1.37E-05 | 1018 | 670 | R | 10 |
| Ssc.2466.1.S1_at | 5 | LRP10 | 4.59E-05 | 1177 | 698 | Y | 5 |
| Ssc.2466.1.S1_at | 5 | LRP10 | 4.59E-05 | 1177 | 698 | R | 5 |
| Ssc.12365.1.A1_at | 11 | ADAMTS1 | 0.000135 | 1345 | 718 | R | 5 |
| Ssc.28305.1.A1_at | 1 | TACC1 | 0.000376 | 1585 | 750 | none | |
| Ssc.18850.1.S1_at | 2 | GOLPH2 | 0.000442 | 1640 | 758 | none | |
| Ssc.1911.1.A1_at | 9 | PSMC1 | 0.001044 | 1893 | 788 | none | |
| Ssc.28305.1.A1_at | 8 | TACC1 | 0.001842 | 2108 | 797 | none | |
| Ssc.1911.1.A1_at | 10 | PSMC1 | 0.005343 | 2571 | 816 | none |
This table summarizes the results of sequencing of randomly selected predicted SFPs meeting the criteria of q ≤ 0.05 and a 2-fold expression difference between interacting probes. We estimate the unbiased confirmation rate from this random sample to be 70% using 6 arrays.
Confusion matrix for random SFPs sequenced
| 205 | 13 | ||
| 17 | 31 | ||
This is a confusion matrix comparing predicted versus actual SFPs. Sensitivity: 0.65, specificity: 0.94, FDR: 0.30.
Comparison of predicted SFPs using same versus divergent tissues
| 4,873 | 259, 743 | |||
| 857 | 588 | 269 | ||
| 263,759 | 4,285 | 259,474 | ||
Predicted SFPs and non-SFPs from analysis of microarray data from same tissues are in rows; predicted SFPs and non-SFPs from divergent tissues are in columns. 588 are predicted by both analyses, where 16 are expected by random chance.
Chi-squared = 21,205, df = 1, p-value < 0.00001