| Literature DB >> 22377632 |
Wynn K Meyer1, Barbara Arbeithuber, Carole Ober, Thomas Ebner, Irene Tiemann-Boege, Richard R Hudson, Molly Przeworski.
Abstract
Children of a heterozygous parent are expected to carry either allele with equal probability. Exceptions can occur, however, due to meiotic drive, competition among gametes, or viability selection, which we collectively term "transmission distortion" (TD). Although there are several well-characterized examples of these phenomena, their existence in humans remains unknown. We therefore performed a genome-wide scan for TD by applying the transmission disequilibrium test (TDT) genome-wide to three large sets of human pedigrees of European descent: the Framingham Heart Study (FHS), a founder population of European origin (HUTT), and a subset of the Autism Genetic Resource Exchange (AGRE). Genotyping error is an important confounder in this type of analysis. In FHS and HUTT, despite extensive quality control, we did not find sufficient evidence to exclude genotyping error in the strongest signals. In AGRE, however, many signals extended across multiple SNPs, a pattern highly unlikely to arise from genotyping error. We identified several candidate regions in this data set, notably a locus in 10q26.13 displaying a genome-wide significant TDT in combined female and male transmissions and a signature of recent positive selection, as well as a paternal TD signal in 6p21.1, the same region in which a significant TD signal was previously observed in 30 European males. Neither region replicated in FHS, however, and the paternal signal was not visible in sperm competition assays or as allelic imbalance in sperm. In maternal transmissions, we detected no strong signals near centromeres or telomeres, the regions predicted to be most susceptible to female-specific meiotic drive, but we found a significant enrichment of top signals among genes involved in cell junctions. These results illustrate both the potential benefits and the challenges of using the TDT to study transmission distortion and provide candidates for investigation in future studies.Entities:
Mesh:
Year: 2012 PMID: 22377632 PMCID: PMC3338262 DOI: 10.1534/genetics.112.139576
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Samples and SNPs remaining after quality control and sample size cutoffs
| Sample | Offspring | SNPs (combined) | SNPs (paternal) | SNPs (maternal) | Genotyping array |
|---|---|---|---|---|---|
| FHS | 2,362 | 353,116 | 328,855 | 335,466 | Affymetrix GeneChip® Mapping 500K (Affymetrix, Santa Clara, CA) |
| HUTT | 848 | 538,139 | 343,945 | 353,047 | Affymetrix GeneChip® Mapping 500K and SNPChips, 5.0 and 6.0 (Affymetrix) |
| AGRE | 1,518 | 491,632 | 452,468 | 462,981 | Illumina Human Hap550 BeadChip (Illumina Inc., San Diego, CA) |
For sample names and descriptions, see Materials and Methods.
Figure 1Manhattan plots for the TDT in AGRE. The TDT was performed separately considering (A) all transmissions, (B) paternal transmissions only, and (C) maternal transmissions only. The horizontal line in each plot indicates the permutation-based genome-wide significance threshold of 1.10 × 10−7 (combined), 1.20 × 10−6 (paternal), or 1.34 × 10−6 (maternal).
Figure 2Region surrounding genome-wide significant SNP in AGRE combined TDT. (A) The region (shaded) is shown with the nearest upstream and downstream genes. All SNPs with P < 0.01 for the combined TDT are plotted as black points. (B) A close-up of the region (shaded) is shown, with SNPs colored by their LD with the focal SNP (rs748001), which is circled in red. SNPs with | iHS | > 2 are starred. The most highly conserved regions of all conserved in vertebrates (Siepel ) are denoted in orange. SNP positions are as mapped in HG18.
Top 10 SNPs in each test with regional signals of TD
| TDT | Focal SNP | Chr | p | Rate | OA | MAF | Length | Start | No. | Genes | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| C | rs748001 | 10 | 4.55 × 10−8 | 0.585 | Anc | 1036 | 0.248 | 27.586 | 127199116 | 8 | None |
| C | rs12661087 | 6 | 3.65 × 10−6 | 0.601 | Anc | 524 | 0.124 | 203.859 | 120861096 | 6 | None |
| C | rs13112011 | 4 | 4.90 × 10−6 | 0.624 | Anc | 338 | 0.074 | 20.956 | 25233826 | 4 | None |
| C | rs1941852 | 11 | 7.91 × 10−6 | 0.613 | Der | 388 | 0.069 | 40.57 | 97380183 | 4 | None |
| C | rs10766778 | 11 | 9.74 × 10−6 | 0.570 | Anc | 1002 | 0.210 | 2.297 | 21219167 | 2 | |
| C | rs1249667 | 1 | 1.33 × 10−5 | 0.637 | Der | 251 | 0.051 | 39.316 | 75491425 | 2 | |
| C | rs16910190 | 11 | 1.35 × 10−5 | 0.578 | Der | 773 | 0.158 | 52.365 | 22773148 | 5 | |
| C | rs12582514 | 12 | 1.59 × 10−5 | 0.558 | Der | 1374 | 0.483 | 99.547 | 21469826 | 13 | |
| C | rs10935427 | 3 | 1.80 × 10−5 | 0.556 | Anc | 1445 | 0.396 | 97.379 | 142396949 | 9 | |
| C | rs10136259 | 14 | 1.94 × 10−5 | 0.566 | Anc | 1059 | 0.220 | 7.736 | 32663208 | 3 | |
| P | rs12199720 | 6 | 1.77 × 10−5 | 0.593 | Anc | 532 | 0.409 | 710.622 | 44779449 | 46 | |
| P | rs10795851 | 10 | 2.28 × 10−5 | 0.596 | Der | 482 | 0.198 | 31.105 | 11265343 | 8 | |
| P | rs2921031 | 8 | 3.29 × 10−5 | 0.601 | Der | 419 | 0.190 | 22.967 | 8365511 | 5 | None |
| P | rs2166013 | 5 | 4.11 × 10−5 | 0.606 | Der | 371 | 0.147 | 109.672 | 119407151 | 8 | None |
| P | rs1377210 | 4 | 4.11 × 10−5 | 0.627 | Der | 259 | 0.100 | 40.571 | 110022271 | 2 | |
| P | rs6084148 | 20 | 6.62 × 10−5 | 0.584 | Der | 567 | 0.237 | 36.55 | 2632924 | 5 | |
| P | rs4261974 | 4 | 7.97 × 10−5 | 0.586 | Der | 532 | 0.220 | 9.772 | 120221079 | 3 | |
| P | rs295263 | 9 | 8.46 × 10−5 | 0.603 | Der | 364 | 0.134 | 19.792 | 4829511 | 4 | |
| P | rs10506080 | 12 | 9.63 × 10−5 | 0.592 | Der | 453 | 0.174 | 69.518 | 31825919 | 5 | |
| P | rs321202 | 5 | 1.10 × 10−4 | 0.570 | Anc | 765 | 0.414 | 122.722 | 143107350 | 6 | |
| M | rs17807087 | 1 | 6.14 × 10−6 | 0.633 | Anc | 290 | 0.106 | 21.172 | 229092422 | 4 | None |
| M | rs2282315 | 6 | 1.60 × 10−5 | 0.645 | Anc | 220 | 0.086 | 53.056 | 132808430 | 3 | |
| M | rs12229163 | 12 | 2.43 × 10−5 | 0.649 | Anc | 202 | 0.076 | 24.574 | 113358516 | 2 | None |
| M | rs4646421 | 15 | 2.58 × 10−5 | 0.621 | Der | 301 | 0.108 | 302.84 | 72503662 | 6 | 6 |
| M | rs10868142 | 9 | 1.38 × 10−4 | 0.597 | Anc | 387 | 0.171 | 14.718 | 84150265 | 6 | |
| M | rs952893 | 2 | 1.41 × 10−4 | 0.589 | Der | 453 | 0.175 | 125.338 | 51727031 | 13 | None |
| M | rs17066329 | 4 | 1.54 × 10−4 | 0.614 | Anc | 277 | 0.112 | 11.921 | 179845588 | 2 | None |
| M | rs1372679/rs1372680 | 4 | 1.64 × 10−4 | 0.616 | Anc | 262 | 0.094 | 20.547 | 100658226 | 3 | None |
| M | rs10216366 | 8 | 1.75 × 10−4 | 0.593 | Anc | 410 | 0.157 | 64.351 | 126776817 | 4 | None |
| M | rs16973745 | 18 | 1.82 × 10−4 | 0.627 | Anc | 216 | 0.081 | 124.932 | 36610407 | 5 | None |
Category of TDT (C, combined; P, paternal; M, maternal)
Rate at which the overtransmitted allele was transmitted.
Overtransmitted allele (Anc, ancestral; Der, derived)
Sample size (number of transmissions)
Minor allele frequency in AGRE founders.
Length of the region of TD (see Materials and Methods)
Start of the region of TD (see Materials and Methods)
Number of SNPs within the TD region that have P < 0.01.
This region contains six genes: SEMA7A, UBL7, ARID3B, CLK3, EDC3, and CYP1A1.
Gene ontology terms most enriched among genes nearest top TD signals
| Test | Term | No. genes | Fold enrichment | FDR (%) | Annotation source | |
|---|---|---|---|---|---|---|
| Combined | Alternative splicing | 13 | 1.590 | 0.0459 | 38.91 | SP_PIR_KEYWORDS |
| Combined | Splice variant | 13 | 1.586 | 0.0467 | 41.31 | UP_SEQ_FEATURE |
| Paternal | Vitamin metabolic process | 2 | 30.062 | 0.0593 | 54.26 | GOTERM_BP_FAT |
| Paternal | Cell maturation | 2 | 30.062 | 0.0593 | 54.26 | GOTERM_BP_FAT |
| Maternal | Vinculin, conserved site | 2 | 362.152 | 0.00527 | 5.31 | INTERPRO |
| Maternal | Vinculin/alpha-catenin | 2 | 289.722 | 0.00659 | 6.59 | INTERPRO |
| Maternal | Fascia adherens | 2 | 159.775 | 0.0117 | 10.95 | GOTERM_CC_FAT |
| Maternal | Intercalated disc | 2 | 106.517 | 0.0175 | 15.97 | GOTERM_CC_FAT |
| Maternal | Cell–cell junction | 3 | 12.614 | 0.0203 | 18.35 | GOTERM_CC_FAT |
| Maternal | Cell–cell adherens junction | 2 | 45.65 | 0.0403 | 33.39 | GOTERM_CC_FAT |
Uncorrected P-values are presented for comparative purposes.
Figure 3Region of suggestive paternal TD in AGRE. The region (shaded) is shown with SNPs colored by their LD with the focal SNP (rs12199720), which is circled in red. In the lower half of the figure are the genes in the region, as well as lines indicating in orange the region previously reported by Santos in CEU males and in purple the amplicon used for genotyping single sperm. All positions are as mapped in HG18.
Sperm typing data from single DNA molecules characterized for transmission distortion
| Donor ID | SNPs | OHap/
UHap | Blood | Unselected sperm | Fast sperm | Prop fast sperm (%) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 26 | rs9381373 | TA/CC | 129 | 153 | 0.171 | 142 | 119 | 0.0866 | NA | NA | NA | NA |
| rs2093903 | ||||||||||||
| 1 | rs9381373 | TA/CC | 134 | 141 | 0.718 | 143 | 140 | 0.453 | NA | NA | NA | NA |
| rs2093903 | ||||||||||||
| 1006 | rs9381373 | TA/CC | 123 | 131 | 0.661 | 150 | 155 | 0.634 | NA | NA | NA | NA |
| rs2093903 | ||||||||||||
| 19 | rs9381373 | TG/CA | 186 | 210 | 0.248 | 249 | 251 | 0.553 | NA | NA | NA | NA |
| rs1284965 | ||||||||||||
| 21 | rs9381373 | TA/CC | NA | NA | NA | 173 | 200 | 0.926 | NA | NA | NA | NA |
| rs2093903 | ||||||||||||
| 37 | rs9381373 | TA/CC | NA | NA | NA | 109 | 124 | 0.853 | 101 | 111 | 0.775 | 0.922 |
| rs2093903 | ||||||||||||
| 45 | rs1284965 | G/A | NA | NA | NA | 222 | 211 | 0.361 | 121 | 116 | 0.398 | 5.155 |
| 2 | rs9381373 | TA/CC | NA | NA | NA | NA | NA | NA | 123 | 136 | 0.808 | 0.015 |
| rs2093903 | ||||||||||||
| 6 | rs9381373 | TGA/CAC | NA | NA | NA | NA | NA | NA | 112 | 134 | 0.929 | 0.007 |
| rs1284965 | ||||||||||||
| rs2093903 | ||||||||||||
| 8 | rs9381373 | TGA/CAC | NA | NA | NA | NA | NA | NA | 110 | 149 | 0.994 | 0.005 |
| rs1284965 | ||||||||||||
| rs2093903 | ||||||||||||
| Total | 572 | 635 | 0.0743 | 1067 | 1084 | 0.651 | 567 | 646 | 0.989 | |||
Haplotype expected to be overrepresented/haplotype expected to be underrepresented on the basis of the TDT in AGRE.
Count of haplotype expected to be overrepresented on the basis of the TDT.
Count of haplotype expected to be underrepresented on the basis of the TDT.
Probability of observing at least as strong a deviation from 50% each allele (two-sided binomial)
Probability of observing at least as many of the allele expected to be overrepresented as observed (1-sided binomial)
Proportion of total sperm selected by motility assay for fast sperm.
Figure 4No strong signals of maternal-specific TD near centromeres or telomeres in AGRE. Genetic distances to (A) the centromere and (B) the telomere for all SNPs with paternal TDT P > 0.01 within 3 cM of the centromere (A) or telomere (B) are plotted against the SNPs, maternal TDT P-values. All SNPs with P < 10−3 are colored, with the number of SNPs separating them from the centromere (A) or telomere (B) listed next to them. *The genetic distance to the most telomeric SNP in HapMap (Frazer ) is used to approximate the genetic distance to the telomere.