| Literature DB >> 34155038 |
Huixin Xu1,2, Qi Zhen1,3,4,5, Mingzhou Bai1, Lin Fang6,7, Yong Zhang6,7, Bao Li1,3, Huiyao Ge1,3,4,5, Sunjin Moon8, Weiwei Chen1,3,4,5, Wenqing Fu9, Qiongqiong Xu1,3,4,5, Yuwen Zhou10, Yafeng Yu1,3,4,5, Long Lin10, Liang Yong1,3,4,5, Tao Zhang7, Shirui Chen1,3,4,5, Siyang Liu11, Hui Zhang1,3,4,5, Ruoyan Chen12, Lu Cao1,3,4,5, Yuanwei Zhang1, Ruixue Zhang1,3,4,5, Huanjie Yang1, Xia Hu1,3,4,5, Joshua M Akey8, Xin Jin2, Liangdan Sun1,3,4,5.
Abstract
Protein-truncating variants (PTVs) have important impacts on phenotype diversity and disease. However, their population genetics characteristics in more globally diverse populations are not well defined. Here, we describe patterns of PTVs in 1320 genes sequenced in 10,539 healthy controls and 9434 patients with psoriasis, all of Han Chinese ancestry. We identify 8720 PTVs, of which 77% are novel, and estimate 88% of all PTVs are deleterious and subject to purifying selection. Furthermore, we show that individuals with psoriasis have a significantly higher burden of PTVs compared to controls (P = 0.02). Finally, we identified 18 PTVs in 14 genes with unusually high levels of population differentiation, consistent with the action of local adaptation. Our study provides insights into patterns and consequences of PTVs.Entities:
Year: 2021 PMID: 34155038 PMCID: PMC8256863 DOI: 10.1101/gr.267963.120
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Protein-truncating variants of 19,973 Chinese in 1320 targeted genes. (A) Allele frequency spectrum of known and novel PTVs. (B) Venn diagram of PTVs in the Chinese (CHN, n = 19,973), ExAC (n = 60,706), and East Asian (EAS-ExAC, n = 4327) population in 1320 genes. (C) Allele frequency comparisons of 892 overlapped PTVs between CHN and EAS-ExAC. The inset shows those of PTVs with a maximum allele frequency of 1%. (D–F) Individual level summary of PTVs in variant types (D), genotypic composition (E), and consequence (F). (G) Number of pPTVs per individual in heterozygous and homozygous genotypes. (H) Relative proportions of pPTVs in heterozygous and homozygous genotypes. (I) Number of PTVs and pPTVs per individual in singletons and doubletons.
Figure 2.Deleteriousness of PTVs. (A) Fractions of variants that are potentially deleterious (CADD ≥ 30) for each functional category and location of variants. Note, head, body, and tail are defined as CDS position of <10%, 10%–90%, and >90%, respectively. (B) Fractions of deleterious variants under purifying selection (f) for protein-truncating (PTV, red) and missense (Mis, green) variants across 19,973 Chinese (CHN, dark) and 4327 East Asians (EAS-ExAC, light). (C–F) Fractions of deleterious variants that are subject to purifying selection (f) for PTVs across CHN from B grouped by CDS positions (C), variant types (D), consequence (E), and gene sets (F). Note that theoretical 95% confidence intervals are obtained via bootstrapping of 1000 replications and shown as error bars.
Figure 3.Gene constraint of PTVs. (A) Increased proportions of constrained (pLI ≥ 0.9), unconstrained (pLI ≤ 0.1), and constraint-classified (pLI ≥ 0.9 or pLI ≤ 0.1) genes with greater sample sizes. (B) (Left) Proportions of constrained versus unconstrained genes of 19,973 Chinese across gene sets. (Right) Enrichment or depletion of constrained genes in the gene sets of interest. P-values and odds ratios with 95% confidence intervals were calculated via two-sided Fisher's exact tests. The dots and lines would appear in red if P < 0.05, otherwise in black. Detailed information is shown in Supplemental Table S8.
Figure 4.PTV enrichments in psoriasis patients compared to healthy controls. (A) Per-individual rates of heterozygous and homozygous PTVs and across cases and controls. (B) Per-individual rates of PTVs due to SNVs, deletions, and insertions across cases and controls. (C) Per-individual rates of heterozygous and homozygous pPTVs and across cases and controls. (D) Per-individual rates of pPTVs of SNVs, deletions, and insertions across cases and controls. Note that P-values are calculated through logistic regressions and shown in red if there exists significant difference between psoriasis cases and controls. (*) P < 0.05, (**) P < 0.01, (***) P < 0.001. Detailed information is shown in Supplemental Table S9. (Het) Heterozygotes, (Hom) homozygotes, (Del) deletions, (Ins) insertions.
Gene-based association results for PTVs and missense variants in 9434 cases and 9434 controls
Figure 5.PTVs with high population differentiation in allelic frequency between Chinese (CHN), non-Finnish European (NFE), and African (AFR) populations. (A) Target-wide Z(FST) Manhattan plot. Note that Weir and Cockerham FST was normalized and transformed into Z values and the Bonferroni correction set the significant Z(FST) cutoff at 4.3. (B) (Top) Gene structure of FUT2. (Bottom) Nucleotide sequence and allele frequency distribution of a stop-gained G-A PTV (rs601338) with a Z(FST) of 18.5. (C) (Top) Gene structure of EFCAB13 with five PTVs of significant Z(FST). (Bottom) Nucleotide sequence and allele frequency distribution of a splice-acceptor G-A PTV (rs76299620) with a Z(FST) of 12.2.