| Literature DB >> 28185566 |
Zhe Yu1, David Sankoff2.
Abstract
BACKGROUND: We propose a new, continuous model of the fractionation process (duplicate gene deletion after polyploidization) on the real line. The aim is to infer how much DNA is deleted at a time, based on segment lengths for alternating deleted (invisible) and undeleted (visible) regions.Entities:
Keywords: Analysis of runs; Duplicate gene deletion; Genomics; Probability modeling; Whole genome duplication
Mesh:
Year: 2016 PMID: 28185566 PMCID: PMC5123346 DOI: 10.1186/s12859-016-1265-5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Processes pertinent to first sweep and t-th sweep. Solid horizontal bars represent the visible regions of the genome. Grey curves represent invisible regions. Dashed markers represent deletion points, solid markers represent end of deletion segments. ν and μ are the means of the deletion point spacing and deletion segment length variables, while λ ( is the mean space (= λ in the text) between visible deletion points after the t−1-st sweep
Fig. 2Cullen-Frey diagrams for length distributions of invisible (top) and visible (bottom) segments
Simulated values of shape and rate when , for a range of values of μ, and t=2
|
|
| shape | rate | 1/ |
|---|---|---|---|---|
| 1 | 3 | 0.8994863 | 0.7801536 | 1.281798866 |
| 2 | 6 | 0.8713054 | 0.3645944 | 2.742773888 |
| 3 | 9 | 0.8943245 | 0.2557653 | 3.909834524 |
| 4 | 12 | 0.8551860 | 0.1863732 | 5.365578313 |
| 5 | 15 | 0.8479933 | 0.1504409 | 6.647128540 |
| 6 | 18 | 0.8673458 | 0.1250687 | 7.995605615 |
| 7 | 21 | 0.8793444 | 0.1044622 | 9.572840702 |
| 8 | 24 | 0.91907486 | 0.09607099 | 10.40896945 |
| 9 | 27 | 0.91503151 | 0.08817842 | 11.34064321 |
| 10 | 30 | 0.82931206 | 0.07483308 | 13.36307419 |
Fig. 3Linear relation between 1/α and t−1 for fixed
Fig. 4Relation between slope of 1/α as a function of t, and
Fig. 5Relation between 1/rate (1/β) as a function of t for fixed
Fig. 6Relation between slope of ln1/β as a function of t, and
μ=1,ν=3,t=5,λ −1=0.16656,α=0.6711,β=0.3504
| par ∖time | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|
|
| 1.231661635 | 1.063543459 | 1.021578166 | 1.018615845 | 1.033112259 |
|
| 1.801158145 | 2.401544193 | 3.001930241 | 3.602316289 | 4.202702338 |
|
| 0.30056014 | 0.338494894 | 0.338654644 | 0.325314125 | 0.306788889 |
| 100 | 14.23403354 | 3.409208693 |
| 7.170390806 | 12.4566366 |
Bold entry indicates the t most consistent with the observed data on α,β and λ
μ = 6,ν = 12,t = 2,λ −1=0.17,α=0.8488,β=0.12063
| par ∖time | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|
|
| 5.7892 | 4.7235 | 4.7286 | 4.9648 | 5.2990 | 5.6560 |
|
| 12 | 18 | 24 | 30 | 36 | 42 |
|
| 0.1195 | 0.1406 | 0.1342 | 0.1220 | 0.1094 | 0.0976 |
| 100 |
| 16.5215 | 11.2325 | 1.1655 | 9.3410 | 19.0791 |
Bold entry indicates the t most consistent with the observed data on α,β and λ
μ=1,ν=3,t=3,λ −1=1.017737,α=0.7977859,β=0.5649623
| par ∖time | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|
|
| 1.4006 | 1.0332 | 0.9909 | 1.0102 | 1.0523 |
|
| 1.9651 | 2.9477 | 3.9303 | 4.9129 | 5.8954 |
|
| 0.4271 | 0.5606 | 0.5596 | 0.5246 | 0.4810 |
| 100 | 24.39 |
| 0.9497 | 7.15 | 14.87 |
Bold entry indicates the t most consistent with the observed data on α,β and λ
μ=5,ν=15,t=8,λ −1=0.532632,α=0.53869147,β=0.03107084
| par ∖time | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|
|
| 6.2529 | 5.4956 | 5.2256 | 5.1247 | 5.1051 | 5.1313 | 5.1861 |
|
| 7.5099 | 9.3873 | 11.2648 | 13.1423 | 15.0198 | 16.8972 | 18.7747 |
|
| 0.0281 | 0.0317 | 0.0324 | 0.0318 | 0.0306 | 0.0292 | 0.0277 |
| 100 | 9.43 | 2.18 | 4.22 | 2.33 |
| 6.00 | 10.97 |
Bold entry indicates the t most consistent with the observed data on α,β and λ