| Literature DB >> 35313924 |
Qi Ni1,2, Xiang Chen2,3, Ping Zhang2, Lin Yang4, Yulan Lu2, Feifan Xiao2, Bingbing Wu2, Huijun Wang2, Wenhao Zhou5,6,7, Xinran Dong8,9.
Abstract
BACKGROUND: Cystic fibrosis (CF) is a common, life-threatening genetic disease in Caucasians but rarely reported in Chinese population. The prevalence and population-specific genetic spectrum of CF in China needs to be systematically estimated and compared with Caucasians.Entities:
Keywords: Allele frequency comparison; Cystic fibrosis; Haplotype construction and comparison; Prevalence estimation; Screening panel
Mesh:
Substances:
Year: 2022 PMID: 35313924 PMCID: PMC8935702 DOI: 10.1186/s13023-022-02279-9
Source DB: PubMed Journal: Orphanet J Rare Dis ISSN: 1750-1172 Impact factor: 4.123
Fig. 1Overall Study design. This study consisted of four parts, and within each part, the systematically comparisons among our two cohorts (representing Chinese population) with public reported cohorts from gnomAD-NFE, 1000genome-NFE (representing Caucasian population) and gnomAD-EAS, 1000genome-EAS (control for east Asian population) were performed: (1) CFTR variant pathogenicity curation. (2) CF retrospectively identification. (3) CF affected frequency estimation. (4) CFTR gene haplotype estimation
Children cohort and parent’s cohort with estimated affected frequency by three methods
| Children cohort | Parents cohort | |
|---|---|---|
| Total number | 20,905 | 10,038 |
| Gender (Male/female) | 12,773/8132 | 5012/5026 |
| CES/WES | 15,869/5036 | 334/9704 |
| Heterozygous of P/LP variants | 110 (73 males, 37 females) | 60 (38 males, 22 females) |
| Carrier frequency | 1/190 | 1/167 |
| Couple’s carrier risk | 1/36,117 | 1/27,989 |
| Method 1: estimated affected frequency by carrier frequency | 1/144,469 | 1/111,957 |
| Method 2: estimated affected frequency by permutation-and-combination | 1/153,825 | 1/120,528 |
| Method 3: estimated affected frequency by Bayesian framework (95% confidence interval) | 1/143,171 (1/213,769–1/101,160) | 1/110,127 (1/192,111–1/69,638) |
| Average estimated prevalence (mean value of the above six frequencies) | 1/128,434 | |
Fig. 2Estimated affected frequency on two screening panels (Caucasian, Chinese) by Bayesian framework. The Venn-graph shows the screening variants intersection between Chinese-specific panel and Caucasian-specific panel. Each cohort has two bars with each showing the estimated affected frequency if only use the pathogenic variants in the panel, and the P-value above shows the significance for difference. The vertical line shows the 95% confidence interval
Fig. 3Allele frequencies of P/LP variants in different domains of CFTR protein. P/LP variants are mapped to CFTR protein domains. Different color label different domains. The height of each bar represents the allele frequency of each variant in each cohort. The violin plot summarizes the total allele frequency of pathogenic variants in each domain
Fig. 4Haplotype comparison between CCGT, EAS and NFE. A Haplotype structure for 1000genome-NFE, 1000genome-EAS and CCGT-WGS cohorts. The shared blocks with tagged SNPs (vertical line) are the intersected regions of these three cohorts. Only haplotype blocks with length larger than 10 kb were remained. Five haplotype blocks in 1000genome-NFE, three blocks in 1000genome-EAS and four blocks in CCGT-WGS cohorts were found, resulting in five shared blocks. B The distribution of haplotype frequency for each shared block in three cohorts. For each shared block, top five high frequency haplotypes were shown in color and the rest were combined as “other” in grey. The Sankey ribbon between each of the adjacent blocks showed the haplotype intersection statistics. For example, in 1000genome-NFE, 100% of the top 1 haplotype in the shared block 1 were accompanied by top 1 haplotype in the shared block 2. C Pie chart for the haplotype frequencies in the three cohorts. The colored chain rectangles indicated the combined haplotype construction of the five shared blocks for the adjacent sector
Fig. 5Joint genotype analysis for CFTR gene. A Location of two exon-tag SNPs (rs213950, rs1042077) in the shared haplotype block 3. B Odds ratio of allele frequencies for different rs213950-rs1042077 joint genotype (column) in each cohort (row). C Odds ratio of allele frequencies for different rs213950-rs1042077 joint genotype for alleles carry 53 pathogenic variants in CCGT exome sequencing cohorts