| Literature DB >> 24023392 |
Yuchao Zhang1, Atahualpa Castillo-Morales, Min Jiang, Yufei Zhu, Landian Hu, Araxi O Urrutia, Xiangyin Kong, Laurence D Hurst.
Abstract
In female mammals most X-linked genes are subject to X-inactivation. However, in humans some X-linked genes escape silencing, these escapees being candidates for the phenotypic aberrations seen in polyX karyotypes. These escape genes have been reported to be under stronger purifying selection than other X-linked genes. Although it is known that escape from X-inactivation is much more common in humans than in mice, systematic assays of escape in humans have to date employed only interspecies somatic cell hybrids. Here we provide the first systematic next-generation sequencing analysis of escape in a human cell line. We analyzed RNA and genotype sequencing data obtained from B lymphocyte cell lines derived from Europeans (CEU) and Yorubans (YRI). By replicated detection of heterozygosis in the transcriptome, we identified 114 escaping genes, including 76 not previously known to be escapees. The newly described escape genes cluster on the X chromosome in the same chromosomal regions as the previously known escapees. There is an excess of escaping genes associated with mental retardation, consistent with this being a common phenotype of polyX phenotypes. We find both differences between populations and between individuals in the propensity to escape. Indeed, we provide the first evidence for there being both hyper- and hypo-escapee females in the human population, consistent with the highly variable phenotypic presentation of polyX karyotypes. Considering also prior data, we reclassify genes as being always, never, and sometimes escape genes. We fail to replicate the prior claim that genes that escape X-inactivation are under stronger purifying selection than others.Entities:
Keywords: X-inactivation; expression evolution; rate of evolution
Mesh:
Year: 2013 PMID: 24023392 PMCID: PMC3840307 DOI: 10.1093/molbev/mst148
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
The 114 Escape Genes and the Nature of the Replication Evidence.
| Genes | SNPs | Persons | Reported | Genes | SNPs | Persons | Reported | Genes | SNPs | Persons | Reported |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Yes | Yes | Heter | No | Yes | Yes | Yes | Inactive | ||||
| No | Yes | Escape | No | Yes | Heter | No | Yes | Escape | |||
| No | Yes | Heter | Yes | Yes | Escape | Yes | Yes | Heter | |||
| Yes | Yes | Inactive | No | Yes | Inactive | Yes | Yes | ||||
| No | Yes | Inactive | Yes | Yes | Inactive | No | Yes | Inactive | |||
| No | Yes | No | Yes | Inactive | No | Yes | Inactive | ||||
| No | Yes | Inactive | Yes | Yes | Inactive | No | Yes | Heter | |||
| No | Yes | Inactive | Yes | Yes | Inactive | No | Yes | Inactive | |||
| Yes | Yes | Inactive | No | Yes | Yes | Yes | Escape | ||||
| No | Yes | Heter | No | Yes | No | Yes | Inactive | ||||
| No | Yes | Heter | Yes | Yes | Inactive | No | Yes | Inactive | |||
| Yes | Yes | Heter | No | Yes | Inactive | No | Yes | ||||
| No | Yes | Inactive | No | Yes | Inactive | No | Yes | Inactive | |||
| Yes | Yes | Inactive | Yes | Yes | Inactive | Yes | Yes | Heter | |||
| No | Yes | Escape | No | Yes | Inactive | No | Yes | Heter | |||
| No | Yes | Escape | No | Yes | Inactive | No | Yes | ||||
| Yes | Yes | No | Yes | Escape | Yes | Yes | Heter | ||||
| Yes | Yes | Escape | No | Yes | Inactive | No | Yes | Escape | |||
| No | Yes | Inactive | Yes | Yes | Heter | Yes | Yes | ||||
| No | Yes | No | Yes | Inactive | No | Yes | Inactive | ||||
| No | Yes | Inactive | Yes | Yes | Heter | Yes | Yes | Escape | |||
| No | Yes | Escape | No | Yes | Inactive | Yes | Yes | Escape | |||
| No | Yes | Inactive | No | Yes | Inactive | Yes | Yes | Inactive | |||
| Yes | Yes | Inactive | No | Yes | Yes | Yes | Escape | ||||
| Yes | Yes | Inactive | Yes | Yes | Inactive | No | Yes | Heter | |||
| No | Yes | Heter | Yes | No | Inactive | Yes | Yes | Inactive | |||
| No | Yes | Inactive | No | Yes | Inactive | Yes | Yes | Inactive | |||
| No | Yes | Heter | No | Yes | Inactive | No | Yes | Inactive | |||
| No | Yes | Escape | No | Yes | Heter | No | Yes | Escape | |||
| Yes | Yes | Escape | Yes | Yes | Escape | No | Yes | Inactive | |||
| Yes | Yes | No | Yes | Inactive | Yes | Yes | Escape | ||||
| No | Yes | Inactive | Yes | No | Inactive | No | Yes | Inactive | |||
| No | Yes | Inactive | No | Yes | Inactive | Yes | Yes | Inactive | |||
| Yes | Yes | Inactive | Yes | Yes | Heter | Yes | No | Inactive | |||
| No | Yes | Inactive | Yes | Yes | Inactive | ||||||
| Yes | Yes | Inactive | Yes | Yes | Heter | ||||||
| No | Yes | No | Yes | Inactive | |||||||
| No | Yes | Escape | Yes | No | Inactive | ||||||
| No | Yes | No | Yes | Inactive | |||||||
| No | Yes | Inactive | No | Yes | Inactive |
Note.—The SNP column indicates whether genes have multi-SNPs within one individual that all support the hypothesis of X-inactivation escape. The Persons column indicates whether genes have replication by being identified as escaping in multiple individuals. The Reported column indicates the reported state in previously reported rodent/human somatic cells (Carrel and Willard 2005). Escape genes are those that escape X-inactivation in all females tested; Heter are heterogeneous genes, i.e., genes that exhibit XCI in some, but not all, females assayed. For the cases with “No” in persons column, all of them are able to attempt verification. So, here No indicates that these cases are potentially able to be replicated but actually not supported.
FThe location of escape genes in CEU and YRI cluster in similar chromosomal locations. The genes found in more than three individuals and in greater than 50% of the potentially informative samples are considered to be common escape genes (red) in each population, whereas the others are rare escape genes (blue) in the populations. The genes solely replicated via more than one SNPs per gene are not included. Their inclusion makes no difference to qualitative trends. The x axis refers to the count of individuals with evidence for escape in the corresponding genes. Note Xist is within the Xic domain.
Escape Genes and Their Proportion of Escape among Individuals in CEU and YRI.
Note.—The percentages indicate the proportion of informative individuals showing evidence of escape in the population (or populations) in which they escape in the population. Escaping genes that are common to both CEU and YRI within each range are colored in red.
Genes That Potentially Show Differences between the Two Populations in Escape Profile.
| Gene | CEU | YRI | |
|---|---|---|---|
| 4/14 | 0/34 | 0.00729 | |
| 3/5 | 1/23 | 0.02046 | |
| 3/8 | 0/20 | 0.02228 | |
| 7/13 | 5/35 | 0.02229 | |
| 4/17 | 0/21 | 0.03858 | |
| 3/16 | 0/29 | 0.04549 | |
| 4/13 | 0/14 | 0.05364 | |
| 7/9 | 3/14 | 0.05575 | |
| 2/9 | 0/28 | 0.05874 | |
| 3/12 | 1/33 | 0.06052 | |
| 6/13 | 3/24 | 0.07425 | |
| 4/12 | 1/19 | 0.07787 | |
| 2/4 | 2/26 | 0.08774 | |
| 3/6 | 1/13 | 0.09626 | |
| 3/12 | 0/14 | 0.09837 | |
| 1/2 | 0/18 | 0.09976 |
Note.—The fractions in the CEU and YRI columns indicate the proportion of individuals with the gene escaping X-inactivation. The numerator is the number of escape samples, and the denominator is the number of heterozygous individuals at the DNA level. The differences between CEU and YRI were compared, and the P values were calculated (here, only genes with P < 0.1 are shown, and those with P < 0.05 are shown above the line). P values are from the randomization test as described in Materials and Methods.
Females Differ in Their Propensity to Allow Genes to Escape Inactivation.
Note.—In the escape column, there are two numbers N:M. N is the number of escape genes and M is the number of the other potentially informative genes that show no evidence of escape. Significance after Holm’s correction is marked in red and blue, red for hyper-escape and blue for hypo-escape.
FKa/Ks ratios of genes in the three X-inactivation classes in the merged data set. P values indicate significance on pairwise Mann–Whitney U tests. There are 35 that always escape, 206 always inactivated, and 205 heterogeneous (N = 446). Evolutionary rates are from the human–macaque orthologous genes with numbers taken from Ensembl or from Park et al. (2010). Outliners are not shown. Transverse lines indicate the median value.
FLocation of escape genes and haploid expressed genes on the X chromosome of one individual of CEU (NA12004) and YRI (NA18511). Genes marked as a “potential site” are those where there is exonic heterozygozity at the DNA level and transcripts that pass the coverage threshold but that do not show evidence of escape (i.e., no evidence of biallelic expression). Those marked in blue/red show evidence of escape. The sum height of the colored bar indicates the net read depth summing over both alleles. The proportion of blue to red indicates the proportion of expression from the inactive X chromosome (blue) and the active X chromosome (we always presume the minority allele is from the inactive X chromosome). The data for the pattern of escape from the remaining individuals are shown in supplementary figure S5, Supplementary Material online.