| Literature DB >> 33772539 |
Brecca R Miller1,2, Alison M Morse1,3, Jacqueline E Borgert1,4, Zihao Liu1,3, Kelsey Sinclair1,3, Gavin Gamble1, Fei Zou4,5, Jeremy R B Newman1,6, Luis G León-Novelo7, Fabio Marroni8, Lauren M McIntyre1,3.
Abstract
Allelic imbalance (AI) occurs when alleles in a diploid individual are differentially expressed and indicates cis acting regulatory variation. What is the distribution of allelic effects in a natural population? Are all alleles the same? Are all alleles distinct? The approach described applies to any technology generating allele-specific sequence counts, for example for chromatin accessibility and can be applied generally including to comparisons between tissues or environments for the same genotype. Tests of allelic effect are generally performed by crossing individuals and comparing expression between alleles directly in the F1. However, a crossing scheme that compares alleles pairwise is a prohibitive cost for more than a handful of alleles as the number of crosses is at least (n2-n)/2 where n is the number of alleles. We show here that a testcross design followed by a hypothesis test of AI between testcrosses can be used to infer differences between nontester alleles, allowing n alleles to be compared with n crosses. Using a mouse data set where both testcrosses and direct comparisons have been performed, we show that the predicted differences between nontester alleles are validated at levels of over 90% when a parent-of-origin effect is present and of 60%-80% overall. Power considerations for a testcross, are similar to those in a reciprocal cross. In all applications, the testing for AI involves several complex bioinformatics steps. BayesASE is a complete bioinformatics pipeline that incorporates state-of-the-art error reduction techniques and a flexible Bayesian approach to estimating AI and formally comparing levels of AI between conditions. The modular structure of BayesASE has been packaged in Galaxy, made available in Nextflow and as a collection of scripts for the SLURM workload manager on github (https://github.com/McIntyre-Lab/BayesASE).Entities:
Keywords: zzm321990 cis-regulatory variation; Bayesian analysis; allele specific expression; allelic imbalance
Mesh:
Year: 2021 PMID: 33772539 PMCID: PMC8104932 DOI: 10.1093/g3journal/jkab096
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Schematic representation of the crosses used in this study. (A) Maternal genotypes as columns (PWK, WSB, and CAST) and paternal genotypes in rows (PWK, WSB, and CAST). We focus here on the comparison of PWK vs WSB; the approach is valid for all the possible comparisons. The test crosses are outlined with red boxes and the direct crosses with blue boxes. H1 is a test within any single cross of the null for allelic balance. H3 represents a test of the null hypothesis that the AI is the same between two crosses. (B) Alleles PWK and WSB, can be compared directly with a test of H1 in either of the two reciprocal crosses in the blue boxes in (A). These tests can be labeled as H1 and H2 with the label of 1 or 2 assigned arbitrarily. The test of the (H3) is then a test of whether the AI is the same between the two reciprocal crosses. A test of H3 for the two crosses in the red box can also be conducted, here the test of H3 is an indirect inference for difference in cis effects contributed by PWK and WSB alleles. Note that the comparison of PWK vs WSB in the testcross compares the two alleles inherited from the same parent, while the direct cross is confounded by the parent of origin.
Figure 2Flowchart of the Bayesian analysis of allele imbalance process for testing for allele imbalance.
Percentage of genes showing AI in different crosses
| AI (%) | ||
|---|---|---|
| Cross | Females | Males |
| PWK WSB | 7.38 | 8.03 |
| WSB PWK | 8.34 | 7.87 |
| PWK CAST | 7.06 | 6.44 |
| CAST PWK | 7.23 | 5.38 |
| CAST WSB | 8.38 | 10.7 |
| WSB CAST | 8.72 | 11.23 |
Figure 3Percentage of genes showing AI, tested via the testcross approach. The x-axis represents the two alleles being compared. CAST and WSB are compared by testing H3 in the two crosses CAST × PWK and WSB × PWK (to test Maternal contribution, red) or PWK × CAST and PWK × WSB (to test Paternal contribution, blue). Results for female offspring are shown as circles, and for male offspring are shown as squares.
Validation rates of testcrosses using testcrosses
| Alleles compared | Sex |
| Validated total |
| Validated POO |
|---|---|---|---|---|---|
| PWK CAST | Female | 691 | 68.89 | 510 | 91.96 |
| PWK CAST | Male | 776 | 63.14 | 533 | 90.81 |
| PWK WSB | Female | 938 | 72.6 | 721 | 92.93 |
| PWK WSB | Male | 916 | 73.58 | 713 | 92.85 |
| CAST WSB | Female | 879 | 74.63 | 699 | 92.56 |
| CAST WSB | Male | 787 | 46.25 | 388 | 92.53 |
Number of genes showing different levels of expression using testcrosses (N total). Percentage of genes showing different levels of expression using testcrosses validated in direct comparison, in total (validated total). Number of genes showing different levels of expression using testcrosses and having parent of origin effect in the reciprocal crosses (N POO). Percentage of genes showing different levels of expression using testcrosses and having parent of origin effect, validated in direct comparison (validated POO).