| Literature DB >> 27297469 |
Ting Xie1, Qing-Yong Yang1, Xiao-Tao Wang1, Aoife McLysaght2, Hong-Yu Zhang3.
Abstract
Ohnologs -paralogous gene pairs generated by whole genome duplication- are enriched for dosage sensitive genes, that is, genes that have a phenotype due to copy number changes. Dosage sensitive genes frequently occur in the same metabolic pathway and in physically interacting proteins. Accumulating evidence reveals that functionally related genes tend to co-localize in the three-dimensional (3D) arrangement of chromosomes. We query whether the spatial distribution of ohnologs has implications for their dosage balance. We analyzed the colocalization frequency of ohnologs based on chromatin interaction datasets of seven human cell lines and found that ohnolog pairs exhibit higher spatial proximity in 3D nuclear organization than other paralog pairs and than randomly chosen ohnologs in the genome. We also found that colocalized ohnologs are more resistant to copy number variations and more likely to be disease-associated genes, which indicates a stronger dosage balance in ohnologs with high spatial proximity. This phenomenon is further supported by the stronger similarity of gene co-expression and of gene ontology terms of colocalized ohnologs. In addition, for a large fraction of ohnologs, the spatial colocalization is conserved in mouse cells, suggestive of functional constraint on their 3D positioning in the nucleus.Entities:
Keywords: copy number variation; disease-associated genes.; dosage balance; ohnologs; spatial colocalization
Mesh:
Year: 2016 PMID: 27297469 PMCID: PMC4989111 DOI: 10.1093/molbev/msw108
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.The colocalization frequency of interchromosomal ohnolog pairs and SSD pairs in seven cell lines. In each panel, the x-axis represents the percentage of colocalized interchromosomal ohnolog pairs and SSD pairs in the different cell lines. The brown, yellow, and green vertical dashed lines indicate the observed colocalization frequency for ohnologs pairs, young SSD pairs, and old SSD pairs, respectively. The curves show the colocalization frequency distributions for 10,000 permuted randomizations of the same number of pairs as in the real data.
Basic Information for Colocalized and Noncolocalized Ohnolog Pairs in Seven Human Cell Lines.
| Cell lines | |||||||
|---|---|---|---|---|---|---|---|
| GM12878 | HMEC | HUVEC | IMR90 | K562 | KBM7 | NHEK | |
| Colocalized ohnolog pairs (%) | 4086 (53.38) | 2478 (32.37) | 3078 (40.21) | 3216 (42.01) | 3094 (40.42) | 3096 (40.44) | 2839 (37.09) |
| Non-colocalized ohnolog pairs (%) | 3569 (46.62) | 5177 (67.63) | 4577 (59.79) | 4439 (57.99) | 4561 (59.52) | 4559 (59.56) | 4816 (62.91) |
| Genes involved in colocalized ohnolog pairs | 4638 | 2984 | 3639 | 3721 | 3647 | 3698 | 3366 |
| Genes involved in non-colocalized ohnolog pairs | 4296 | 5609 | 5048 | 4877 | 4972 | 5048 | 5352 |
| Genes involved in colocalized ohnolog pairs with CNVs (%) | 3031 (63.20) | 1882 (63.07) | 2320 (63.75) | 2125 (63.13) | 2377 (63.88) | 2293 (62.87) | 2351 (63.57) |
| Genes involved in non-colocalized ohnolog pairs with CNVs (%) | 2878 (66.99) | 3765 (67.12) | 3398 (67.31) | 3619 (67.62) | 3280 (67.25) | 3362 (67.62) | 3394 (67.23) |
| Colocalized ohnolog pairs with GWAS disease genes (%) | 1064 (26.04) | 664 (26.80) | 812 (26.38) | 752 (26.49) | 854 (26.55) | 831 (26.86) | 820 (26.49) |
| Non-colocalized ohnolog pairs with GWAS disease genes (%) | 894 (23.82) | 1314 (24.13) | 1166 (24.07) | 1226 (24.11) | 1124 (23.88) | 1147 (23.75) | 1158 (23.99) |
| Colocalized ohnolog pairs with OMIM disease genes (%) | 929 (22.74) | 558 (22.52) | 702 (22.81) | 640 (22.54) | 768 (23.88) | 696 (22.50) | 739 (23.87) |
| Non-colocalized ohnolog pairs with OMIM disease genes (%) | 775 (20.20) | 1146 (21.05) | 1002 (20.68) | 1064 (20.93) | 936 (19.89) | 1008 (20.87) | 1025 (21.23) |
aPercentage is with respect to ohnolog pairs (the number is 7,655).
bPercentage is with respect to genes involved in colocalized ohnolog pairs.
cPercentage is with respect to genes involved in non-colocalized ohnolog pairs.
dPercentage is with respect to colocalized ohnolog pairs.
ePercentage is with respect to non-colocalized ohnolog pairs.
Fig. 2.Percentage of colocalized and noncolocalized ohnologs of seven cell lines that are (a) present in CNVs and (b) disease-associated genes identified in OMIM or by GWAS. P-values shown in the panels were calculated using Fisher’s exact test; ** represents P < 0.05, * represents 0.05 < P < 0.10 and # represents P > 0.10.
Fig. 3.Frequency of coexpression of colocalized and noncolocalized ohnologs of seven cell lines. P-values shown in the panels were calculated using Fisher’s exact test; ** represents P < 0.05 and * represents 0.05 < P < 0.10.
Comparison of GO-Based Czekanowski–Dice Distance (average ± sd) for Unlinked Ohnologs.
| Cell lines | |||||||
|---|---|---|---|---|---|---|---|
| GM12878 | HMEC | HUVEC | IMR90 | K562 | KBM7 | NEHK | |
| 0.512 ± 0.066 | 0.512 ± 0.068 | 0.509 ± 0.065 | 0.506 ± 0.066 | 0.508 ± 0.066 | 0.501 ± 0.066 | 0.515 ± 0.067 | |
| 0.549 ± 0.073 | 0.538 ± 0.070 | 0.543 ± 0.072 | 0.546 ± 0.072 | 0.544 ± 0.072 | 0.548 ± 0.071 | 0.538 ± 0.071 | |
| 6.57 × 10−8 | 3.54 × 10−4 | 1.63 × 10−6 | 5.64 × 10−8 | 3.10 × 10−7 | 2.38 × 10−11 | 4.73 × 10−3 | |