Literature DB >> 34172528

Prevalence and spectrum of DNA mismatch repair gene variation in the general Chinese population.

Li Zhang1, Zixin Qin1, Teng Huang1, Benjamin Tam1, Yongsen Ruan2, Maoni Guo1, Xiaobing Wu1, Jiaheng Li1, Bojin Zhao1, Jia Sheng Chian1, Xiaoyu Wang1, Lei Wang1, San Ming Wang3.   

Abstract

BACKGROUND: Identifying genetic disease-susceptible individuals through population screening is considered as a promising approach for disease prevention. DNA mismatch repair (MMR) genes including MLH1, MSH2, MSH6 and PMS2 play essential roles in maintaining microsatellite stability through DNA mismatch repair, and pathogenic variation in MMR genes causes microsatellite instability and is the genetic predisposition for cancer as represented by the Lynch syndrome. While the prevalence and spectrum of MMR variation has been extensively studied in cancer, it remains largely elusive in the general population. Lack of the knowledge prevents effective prevention for MMR variation-caused cancer. In the current study, we addressed the issue by using the Chinese population as a model.
METHODS: We performed extensive data mining to collect MMR variant data from 18 844 ethnic Chinese individuals and comprehensive analyses for the collected MMR variants to determine its prevalence, spectrum and features of the MMR data in the Chinese population.
RESULTS: We identified 17 687 distinct MMR variants. We observed substantial differences of MMR variation between the general Chinese population and Chinese patients with cancer, identified highly Chinese-specific MMR variation through comparing MMR data between Chinese and non-Chinese populations, predicted the enrichment of deleterious variants in the unclassified Chinese-specific MMR variants, determined MMR pathogenic prevalence of 0.18% in the general Chinese population and determined that MMR variation in the general Chinese population is evolutionarily neutral.
CONCLUSION: Our study provides a comprehensive view of MMR variation in the general Chinese population, a resource for biological study of human MMR variation, and a reference for MMR-related cancer applications. © Author(s) (or their employer(s)) 2022. No commercial re-use. See rights and permissions. Published by BMJ.

Entities:  

Keywords:  DNA repair; genetic variation; genetics; genomic instability; human genetics; population

Mesh:

Substances:

Year:  2021        PMID: 34172528      PMCID: PMC9252855          DOI: 10.1136/jmedgenet-2021-107886

Source DB:  PubMed          Journal:  J Med Genet        ISSN: 0022-2593            Impact factor:   5.941


Introduction

Cancer prevention is one of the ultimate goals in medicine.1 Populational screening for cancer-causing genetic predisposition has been proposed as a promising approach to reach the goal as it allows comprehensive identification of the predisposition carriers for early prevention before cancer development.2 3 The rapid progress of DNA sequencing technology is making populational screening for cancer prevention closer to reach the reality.4–7 However, the scientific base of population screening needs to be well established.8–10 One of the issues relating to the coming paradigm switching is the basic knowledge for cancer predisposition in the general population. DNA mismatch repair (MMR) genes including MLH1, MSH2, MSH6 and PMS2 play essential roles in maintaining microsatellite stability through repairing DNA mismatch errors.11 Pathogenic variants in MMR genes damage their normal function, resulting in microsatellite instability and increased risk for developing multiple types of cancer as best represented by the Lynch syndrome (LS), the cancer caused by MMR disorder affecting the gastrointestinal system.12–14 Since the relationship between MMR variation and cancer was revealed, MMR variation has been extensively characterised in patients with cancer. The resulting MMR variation data are widely used to guide clinical diagnosis and treatment. Most of the MMR variation data currently available were derived from cancer samples, however, the information reflects mainly the status of MMR variation in patients with cancer. Knowledge of MMR variation in the general population without cancer is essential in order to prevent cancer development in the population. Referring the variation data from patients with cancer to the general population can be largely erratic, as MMR variation such as the spectrum, frequency and penetrance could be substantially different between patients with cancer and the general population.15 16 Furthermore, the issue of ethnic-specific MMR variation further increases the complexity as the current MMR variation data were predominantly derived from the Caucasian populations.17 In the present study, we addressed MMR variation in the general population by using the ethnic Chinese population as a model, for which MMR variation has not been systematically characterised so far. We mined MMR variant data from 18 844 ethnic Chinese individuals, determined the prevalence and spectrum, and characterised the features of the variants including ethnic specificity, deleteriousness and evolution selection. Data generated from our study establish a scientific foundation for the use of population screening to prevent MMR-related cancer, a resource to study human MMR variation and a reference for MMR-related clinical applications.

Materials and methods

Samples and variant data collection

Genomic sequence data from 18 844 ethnic Chinese individuals were used in the study. These included the whole genome sequences from 10 588 Chinese individuals by the ChinaMAP project (https://creativecommons.org/licenses/by/4.0/),7 the whole genome sequences from 2657 Singapore ethnic Chinese by the Singapore SG10K project,18 the whole genome sequences from 597 Chinese by the Chinese Academy of Sciences Precision Medicine Initiative project,19 the whole genome sequences from 90 Chinese by the Han Chinese study,20 the whole exome sequences from 610 normal Chinese control by the Chinese breast cancer study21 and the MMR-targeted sequences from 4302 Macau Chinese by our own study (approved by the University of Macau Institutional Review Board, BESRE17‐APP014‐FHS) (table 1A).
Table 1

Summary of mismatch repair (MMR) variants identified in the general Chinese population

A. MMR variants identified
Data sourceCasesVariants identifiedTotal
MLH1 MSH2 MSH6 PMS2
ChinaMAP study7 10 588306565862140222214 013
Singapore 10k study18 2657100630127158555588
597 Han individuals19 59718573159165
90 Han individuals20 9015454789169959
Normal control21 6101457011691761191
Macau individuals430214323900382
Total18 844453111 1423144348122 298

*Ts/Tv ratio was calculated by 2×Ts/Tv.

Summary of mismatch repair (MMR) variants identified in the general Chinese population *Ts/Tv ratio was calculated by 2×Ts/Tv.

Data analysis

The quality of sequence data was checked by FastQC and duplicates were removed by Trimmatic. Sequencing reads were mapped to human genome reference sequences (hg19) by Burrows-Wheeler Aligner.22 Sequencing bias was marked by Base Quality Score Recalibration and sorted by Picard. Variants were called using GATK 4.1 following GATK best practices protocol,23 annotated and classified by ANNOVAR.24 Mutalyzer name checker and position converter tool were used for cDNA position and coding change with the following reference sequences: Genome: NC_000003.11 (MLH1), NC_000002.11 (MSH2 and MSH6), NC_000007.13 (PMS2); cDNA: NM_000249.3 (MLH1), NM_000251.2 (MSH2), NM_000179.2 (MSH6), NM_000535.5 (PMS2); protein: NP_000240.1 (MLH1), NP_000242.1 (MSH2), NP_000170.1 (MSH6), NP_000526.2 (PMS2). The following databases were used for comparative analyses: gnomAD (http://gnomad.broadinstitute.org/),25 ExAC (http://exac.broadinstitute.org/)26 and database of 1000 Genomes Project (https://www.internationalgenome.org)27 were used for comparing with non-Chinese general population; ClinVar (http://www.ncbi.nlm.nih.gov/clinvar/),28 InSiGHT (https://www.insight-group.org/variants/databases/),29 COGR (http://opengenetics.ca/)30 and UMD (http://www.umd.be)31 were used for comparing with non-Chinese cancer cohort; dbSNP150 (https://www.ncbi.nlm.nih.gov/snp/)32 and the databases listed above were used for novelty determination. The variants reported in any of the databases mentioned above were classified as known variants; the variants not reported were classified as novel variants. Power calculation was performed following procedures.33

Deleterious impact of variants on protein structural stability

Ramachandran Plot Molecular Dynamic Simulation (RPMDS) method was used following the procedures described in detail.34 Briefly, the N terminus of MLH1 crystal structure (PDB:4P7A, 2.30 A, residues 1–340) was used as the template. Altered amino acid residues caused by missense variants were incorporated into the wild-type MLH1 structure to generate mutant structures by using the Chimera software.35 Molecular dynamics simulation composed of five different programs of RMSD (root-mean-square-deviation),36 RMSF (root-mean-square-fluctuations),37 Rg (radius of gyration),38 SASA (solvent accessible surface area)39 and NH bond (hydrogen bond)40 were applied to measure the structural changes of equilibrium state, flexibility, shape, hydrogen bond, surface accessibility and structural expansion caused by the variants. Ramachandran plot was used to qualify and quantify the changes. By comparing the data from known pathogenic and benign variants, the variants with deleterious impact on protein structure stability were identified.

Evolutionary analysis

Variants from the Singapore SG10K project were used for the analysis. Ka/Ks ratio was calculated by using dNdSloc model in the dNdScv R package.41 42 Tajima’s D,43 the normalised Fay and Wu’s H test,44 the DH test and the E test45 were performed by using the Readms package of DH software.

Results

MMR variants identified in the general Chinese population

A total of 18 844 ethnic Chinese individuals were included in this study, including 11 885 (61.8%) mainland Chinese, 4302 (23.6%) Macau Chinese and 2657 (14.6%) Singapore Chinese (table 1A, figure 1A, online supplemental table S1). Online supplemental figure S1 outlines the analytical process. From the sequence data, we identified 17 687 distinct variants in MMR genes of MLH1, MSH2, MSH6 and PMS2, of which 1166 (6.6%) were located in coding regions and 16 521 (93.4%) were located in non-coding regions (table 1, online supplemental tables S2, S3). Of the 1166 coding variants, 210 were in MLH1, 248 in MSH2, 493 in MSH6 and 215 in PMS2 (table 1B); 57.4% were singleton, 27.0% with 2–5 carriers, 6.1% with 6–10 carriers, 5.8% with 11–50 carriers, 0.4% with 51–100 carriers and 3.3% with >100 carriers (figure 1B, online supplemental table S2); of the 13 variation types, missense variant had the highest rate (50.3%), 24.1% remained as unclassified variants (table 1B) and 45.2% as novel variants (table 1C). Ts/Tv ratio was 3.53. G>A had the highest frequency of 18.8% (table 1D). Multiple variation hotspots were identified in MMR genes: 300th to 500th residues in MLH1, 400th to 600th residues in MSH2, and several hotspots in different functional domains in MSH6 and PMS2 (figure 2). Of the 16 521 non-coding variants, 3416 were in MLH1, 8629 in MSH2, 2052 in MSH6 and 2424 in PMS2; 90.4% (14 941) were located in intron (table 1E), 69.8% (11 528) were not present in dbSNP, 62.9% (10 392) were singleton (online supplemental table S3) and 52.2% were within repetitive sequences (online supplemental table S4). Ts/Tv ratio was 3.38, lower than the 3.53 in the coding variants. C>T had the highest frequency of 20.7% (table 1D).
Figure 1

Mismatch repair (MMR) variants in ethnic Chinese population. (A) Sources of Chinese MMR variation data from mainland China, Macau and Singapore. (B) Distribution frequency of MMR variation. (C) Comparison of MMR variation between general Chinese population and patients with cancer. (a) Overall overlapping; (b) individual MMR gene overlapping.

Figure 2

Location and frequency of coding variants in the coding region of each mismatch repair gene. The map was constructed by using the ProteinPaint program.56

Mismatch repair (MMR) variants in ethnic Chinese population. (A) Sources of Chinese MMR variation data from mainland China, Macau and Singapore. (B) Distribution frequency of MMR variation. (C) Comparison of MMR variation between general Chinese population and patients with cancer. (a) Overall overlapping; (b) individual MMR gene overlapping. Location and frequency of coding variants in the coding region of each mismatch repair gene. The map was constructed by using the ProteinPaint program.56

MMR variation between general population and cancer cohort

We compared the MMR coding variants between the general Chinese population and the Chinese patients with cancer generated by our previous study (online supplemental table S5).17 We observed low overlapping rates between the two datasets: only 16 variants in MLH1, 18 in MSH2, 12 in MSH6 variants and 23 in PMS2 variants were shared in between (figure 1C, online supplemental table S6). We also searched the 1166 coding variants in ClinVar and InSiGHT databases. Of the 225 coding variants matched in the databases, colorectal cancer had the highest (142 variants) and LS was the second highest (100 variants) (online supplemental table S7), indicating that the coding variants were more oncogenic in colorectal tissue.17

MMR variation between Chinese and non-Chinese populations

We compared MMR data between the Chinese and non-Chinese populations. We first compared the data of general populations using the gnomAD, ExAC and 1000 Genome databases after removing the variants derived from ethnic Chinese in these databases. We observed that only 42.1% (491) of the 1166 coding variants and 1.9% (319) of the 16 582 non-coding variants were shared between the general Chinese and non-Chinese populations (table 2A, B). Next, we compared the Chinese population with the non-Chinese cancer data from the ClinVar, InSiGHT, COGR and UMD databases after removing the variants derived from ethnic Chinese in these databases. We observed that only 47.4% (553) coding variants and 1.1% (181) non-coding variants in the Chinese population were shared with MMR variants derived from non-Chinese patients with cancer. The shared variants had the highest overlapping rate in ClinVar for both coding (624, 53.5%) and non-coding (176, 1.1%) variants, followed by InSiGHT database for coding (200, 17.2%) and non-coding (119, 0.7%) variants (table 2A, B). In total, 63% (735) of Chinese coding variants and 4.1% (688) of Chinese non-coding variants were shared with non-Chinese populations. Of the 37% (431) of coding variants present in Chinese only, 34.3% (148) were nonsynonymous variants and 2.6% (11) were loss-of-function variants (4 stopgain, 3 frameshift insertion, 2 frameshift deletion and 2 no frameshift insertion) (online supplemental table S2). The differences between the Chinese and non-Chinese population imply that MMR variation is highly ethnic-specific.
Table 2

Comparison of mismatch repair variants between Chinese and non-Chinese populations

A. Coding variants
Non-Chinese*Chinese
MLH1 (210) MSH2 (248) MSH6 (493) PMS2 (215)Total (1166)
Total†Reported (%)TotalReported (%)TotalReported (%)TotalReported (%)TotalReported (%)
General population
gnomAD128692 (43.8)1685109 (44.0)2044185 (37.5)1531121 (56.3)6546507 (43.5)
ExAC94972 (34.3)100787 (35.1)1378136 (27.6)91698 (45.6)4250393 (33.7)
1000 genomes7201 (0.5)185210 (12.9)42911 (2.2)1936 (2.8)319428 (2.4)
Subtotal†225884 (40.0)2549109 (44.0)2646181 (36.7)1732117 (54.4)6927 491 (42.1)
Caner cohort
ClinVar2382114 (54.3)2961142 (57.3)3465220 (44.6)1791148 (68.8)10 599624 (53.5)
InSiGHT137551 (24.3)135242 (16.9)50655 (11.2)41452 (24.2)3647200 (17.2)
COGR12011 (5.2)12213 (5.2)10915 (3.0)447 (3.3)39546 (3.9)
UMD970 (0.0)24618 (7.3)512 (0.4)00 (0.0)39420 (1.7)
Subtotal†3644104 (49.5)4312121 (48.8)4061200 (40.6)2165128 (59.5)10 538 553 (47.4)
Total‡5852129 (61.4)7383168 (67.7)6184271 (55.0)3983167 (77.7)23 402 735 (63.0)

The bold values refer to the sum in each and combined populations.

*254 MMR variants derived from ethnic Chinese in these databases were excluded for the comparison.

†Distinct variants in non-Chinese databases after combination.

‡Distinct variants after combination of all variants in each column.

Comparison of mismatch repair variants between Chinese and non-Chinese populations The bold values refer to the sum in each and combined populations. *254 MMR variants derived from ethnic Chinese in these databases were excluded for the comparison. †Distinct variants in non-Chinese databases after combination. ‡Distinct variants after combination of all variants in each column.

Pathogenic variants in the Chinese population

Using ClinVar as the reference, 898 of the 1166 (77.0%) coding variants matched in ClinVar were classified into different clinical classes including 24 (2.0%) as Pathogenic and Likely pathogenic, 507 (43.5%) as Variants of Unknown Significance (VUS), 282 (24.2%) as Benign and Likely benign, 83 (7.3%) as Conflicting classification and 268 (23.0%) as Unclassified (table 3A). There were 33 carriers for the 24 Pathogenic/Likely pathogenic variants in the 18 844 general Chinese individuals included in the study; 26 (78.8%) carriers were in MSH6 and PMS2 (table 3B). This resulted in the prevalence of 0.18% for MMR pathogenic variants in the tested general Chinese population, including 0.005% in MLH1, 0.032% in MSH2, 0.053% in MSH6 and 0.085% in PMS2 (p<0.005). Power calculation showed that screening 18 844 individuals at a prevalence of 0.18% provides a 97.8% probability of detecting all pathogenic variants in the studied population. The 0.18% prevalence implies the presence of 2.52 million MMR pathogenic variant carriers estimated in the 1.4 billion Chinese population. We performed the same analysis for the MMR data in the gnomAD database, and observed a 0.11% prevalence of MMR pathogenic variants in the general non-Chinese population, composed of 0.020% in MLH1, 0.015% in MSH2, 0.024% in MSH6 and 0.052% in PMS2. Therefore, the general Chinese population had a higher prevalence than general non-Chinese populations in MSH2, MSH6 and PMS2 but lower prevalence in MLH1. In the Chinese population, the pathogenic variants c.3226C>T (MSH6), c.825A>G (PMS2) and c.82G>A (MSH2) had 5, 4 and 3 carriers, respectively, suggesting that they may be the potential founder mutations. We further collected cancer distribution information from the ClinVar database for the identified Pathogenic and Likely pathogenic variants. Similar to the distribution of MMR non-pathogenic variants, the 24 Pathogenic/Likely pathogenic variants had the highest frequency in colorectal cancer (table 3C).
Table 3

Clinical classification of coding variants

A. Classification
ClassGenes (%)Total
MLH1 MSH2 MSH6 PMS2
Pathogenic0 (0.0)3 (1.2)2 (0.4)8 (3.7)13 (1.1)
Likely pathogenic1 (0.5)1 (0.4)4 (0.8)5 (2.3)11 (0.9)
VUS113 (53.8)118 (47.6)193 (39.1)83 (38.6)507 (43.5)
Likely benign48 (22.9)50 (20.2)89 (18.1)53 (24.7)240 (20.6)
Benign7 (3.3)11 (4.4)8 (1.6)17 (7.9)42 (3.6)
Conflicting8 (3.8)22 (8.9)24 (4.9)31 (14.4)85 (7.3)
Unclassified33 (15.7)43 (17.3)173 (35.1)19 (8.8)268 (23.0)
Total210 (100.0)248 (100.0)493 (100.0)215 (100.0)1166 (100.0)

*Variants marked ‘–’ were predicted by InterVar.

HNPCC, hereditary nonpolyposis colorectal cancer; LS, Lynch syndrome; VUS, variants of unknown significance.

Clinical classification of coding variants *Variants marked ‘–’ were predicted by InterVar. HNPCC, hereditary nonpolyposis colorectal cancer; LS, Lynch syndrome; VUS, variants of unknown significance.

Ethnic-specific pathogenic variants

Of the 1166 coding variants, 507 (43.5%) were classified as VUS and 268 (23.0%) remained as Unclassified (table 3A). The presence of 66.5% as functional unknown variants raises the question if these unclassified variants had any biological significance. We tested this possibility by measuring the impact of the unclassified variants on MLH1 protein structure stability using the structure-based RPMDS method.34 By referring to the cut-off values from wild-type, known benign and known pathogenic variants to differentiate deleterious and non-deleterious variants, we tested the Unclassified variants (online supplemental table S8). We selected the Unclassified variants for the test under the following conditions: (1) the variants must be within the known MLH1 structure as the RPMDS method relies on the known protein structure in its analysis. Of the 756 amino acid residues in MLH1, the known structure (PDB:4P7A) covers 340 aa (1–340). (2) The variant must be missense as the RPMDS is designed for missense variant analysis. Under these conditions, we identified 16 Unclassified MLH1 variants for the analysis. We identified six variants as deleterious that significantly disturbed MLH1 structure (online supplemental table S9). For example, K241T had deleterious effects as it destabilised the two alpha helixes formed between residues 234 and 340 in MLH1 (online supplemental figure S2). The results indicate that the Unclassified variants were enriched with Chinese-specific deleterious variants.

Evolutionary analysis of Chinese MMR variation

We performed evolutionary analysis for MMR variation in the Chinese population. We calculated Ka/Ks ratio for each MMR gene with 0.83 in MLH1, 0.92 in MSH2, 1.07 in MSH6 and 1.17 in PMS2 (figure 3A). We observed that A/S ratios were all close to 2.5 (2.08 in MLH1, 2.29 in MSH2, 2.80 in MSH6 and 2.73 in PMS2). We validated the results by using multiple tests of Tajima D, Fay & Wu H, DH and Zeng E (figure 3B), and concluded that MMR genes in the Chinese population were evolutionarily neutral.
Figure 3

Evolutionary analysis of mismatch repair (MMR) gene variation in the general Chinese population. (A) Ka/Ks ratio of MMR variation in the Chinese population. Most of the genes in the genomes were neutral (left part, Ka/Ks ratio<1), of which MLH1 had 0.83, MSH2 had 0.92, MSH6 had 1.07 and PMS2 had 1.17 (right part). (B) MMR neutral tests. The A/S ratios for each MMR gene variation were tested by methods of Tajima D, Fay & Wu H, DH and Zeng E tests, confirming that each MMR gene variation was evolutionarily neutral.

Evolutionary analysis of mismatch repair (MMR) gene variation in the general Chinese population. (A) Ka/Ks ratio of MMR variation in the Chinese population. Most of the genes in the genomes were neutral (left part, Ka/Ks ratio<1), of which MLH1 had 0.83, MSH2 had 0.92, MSH6 had 1.07 and PMS2 had 1.17 (right part). (B) MMR neutral tests. The A/S ratios for each MMR gene variation were tested by methods of Tajima D, Fay & Wu H, DH and Zeng E tests, confirming that each MMR gene variation was evolutionarily neutral.

Discussion

Several conclusions can be made from our current study: Modest prevalence of MMR variation in the general population. Of the 1166 MMR coding variants identified in the general Chinese population, we identified 24 Pathogenic/Likely pathogenic variants with 33 carriers. Based on the data, we determined the 0.18% prevalence of MMR pathogenic variation in the 18 844 ethnic Chinese individuals, highlighting the presence of 2.52 million MMR pathogenic variant carriers in the 1.4 billion Chinese population. It is important to note that a group of autosomal dominant cancer predisposition mutations has much higher prevalence in the general population than these in many non-cancer hereditary diseases, such as spinocerebellar ataxia (SCA), in which there are only a few carriers per 100 000 individuals.46 For example, the prevalence of pathogenic mutation in BRCA1/BRCA2 (BRCA) is 0.26% in Japanese (1 in 384),47 0.38% in Chinese (1 in 265),48 0.38% in Mexicans (1 in 265),49 0.39% in Malaysians (1 in 556),50 0.53% in US population (1 in 189)51 and 2.17% in the Ashkenazi Jewish (1 in 46).52 The 0.18% prevalence in the Chinese population is the combination of four MMR genes of MLH1, MSH2, MSH6 and PMS2. With 1 in every 556 Chinese individuals an MMR pathogenic variant carrier, it indicates the serious threat of MMR-related cancer risk for the public health and the importance of preventing MMR-related cancer in the general Chinese population. However, the 0.18% prevalence is lower than the 0.38% prevalence for BRCA pathogenic variation in the general Chinese population.48 Although MMR and BRCA are the two groups of cancer predisposition genes with the most significant clinical value over other cancer predisposition genes,53 priority will be given to BRCA screening first if only one choice can be made when planning a population screening for cancer prevention. Alternatively, MMR and BRCA can be combined as one panel for the screening with the expected outcome of identifying twice more of BRCA pathogenic variant carriers over MMR carriers. Different spectra of MMR variation between the general population and patients with cancer. This is reflected by the low overlapping of MMR variants between the two cohorts, and most of the pathogenic variants in the general population were in MSH6 and PMS2 but in patients with cancer were in MLH1 and MSH2. This implies that the MMR variation data derived from patients with cancer cannot be directly applied as the standard reference to judge MMR variation in the general population, as the former reflects the MMR pathogenic variation enriched in the cancer cohort whereas the latter reflects the genetic variation distributed in the general population. Highly ethnic-specific MMR variation in the general population. This is reflected by the presence of 68.2% of novel MMR variants in the Chinese population (table 1C). A similar situation in BRCA variation was also present in the general Chinese population48 and may also exist in other ethnic populations. This feature reminds the limitation of the MMR data currently available as they were mostly derived from the Caucasian populations, and highlights the need to collect variation information in cancer predisposition genes from different ethnic populations. The challenge of classifying ethnic-specific MMR variants. Over 66% of Chinese MMR variants remain unclassified. It is difficult to classify the ethnic-specific variants and identify the potential ethnic-specific pathogenic variants due to the lack of clinical evidence, resources, expertise and references available in existing MMR databases. Substantial efforts need to be made to improve the situation. Human MMR system is not under obvious evolution selection. Our study demonstrated that MMR genes in the Chinese population were neutral without obvious positive or negative selection. This is in contrast to the situation in human BRCA1, in which strong positive selection is present.54 It is interesting to indicate that evolution selection can act differentially in different DNA damage repair genes/pathways for better fitness. Potential value of MMR non-coding variants. Of the 17 687 variants identified in MMR genes, 16 521 (93.4%) were in non-coding regions mostly in introns, and 12 055 (68.2%) were not present in dbSNP. Their rich presence highlights the highly variable nature in MMR non-coding regions and further exploring their potential clinical relevance is warranted. Certain limitations are present in our study, such as lack of sex and age information in the variation data as these were not available in the original data sources. For the variants detected only in single individuals, possibility exists that certain "singleton" MMR variants could be generated by sequencing errors instead of true genetic variants. In addition, the actual prevalence of MMR pathogenic variants could be higher than observed as genomic data used in our study were mainly collected by short sequences from the next-generation sequencing platform, which is sensitive in detecting single-base and small indel variants but lacks power to detect large structural variations. Further functional test of the rich MMR variants will help to identify the driver variants contributing to the oncogenic process.55

Conclusion

In summary, our study provides a populational view for MMR variation in an ethnic human population, a scientific basis in planning population screening for MMR-related cancer prevention, and a reference resource for biological study of human MMR variation and MMR-related clinical applications.
  54 in total

1.  Population genetic screening efficiently identifies carriers of autosomal dominant diseases.

Authors:  J J Grzymski; G Elhanan; J A Morales Rosado; E Smith; K A Schlauch; R Read; C Rowan; N Slotnick; S Dabe; W J Metcalf; B Lipp; H Reed; L Sharma; E Levin; J Kao; M Rashkin; J Bowes; K Dunaway; A Slonim; N Washington; M Ferber; A Bolze; J T Lu
Journal:  Nat Med       Date:  2020-07-27       Impact factor: 53.440

Review 2.  Social and ethical implications of BRCA testing.

Authors:  A Surbone
Journal:  Ann Oncol       Date:  2011-01       Impact factor: 32.976

3.  Exploring genomic alteration in pediatric cancer using ProteinPaint.

Authors:  Xin Zhou; Michael N Edmonson; Mark R Wilkinson; Aman Patel; Gang Wu; Yu Liu; Yongjin Li; Zhaojie Zhang; Michael C Rusch; Matthew Parker; Jared Becksfort; James R Downing; Jinghui Zhang
Journal:  Nat Genet       Date:  2016-01       Impact factor: 38.330

4.  A comparison of multiscale methods for the analysis of molecular dynamics simulations.

Authors:  Noah C Benson; Valerie Daggett
Journal:  J Phys Chem B       Date:  2012-04-24       Impact factor: 2.991

5.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data.

Authors:  Kai Wang; Mingyao Li; Hakon Hakonarson
Journal:  Nucleic Acids Res       Date:  2010-07-03       Impact factor: 16.971

6.  Variants of DNA mismatch repair genes derived from 33,998 Chinese individuals with and without cancer reveal their highly ethnic-specific nature.

Authors:  Li Zhang; Shanmuga Priya Bhaskaran; Teng Huang; Hui Dong; Khyati Chandratre; Xiaobing Wu; Zixin Qin; Xiaoyu Wang; Wenming Cao; Tianhui Chen; Henry Lynch; San Ming Wang
Journal:  Eur J Cancer       Date:  2019-12-09       Impact factor: 9.162

7.  Tools for integrated sequence-structure analysis with UCSF Chimera.

Authors:  Elaine C Meng; Eric F Pettersen; Gregory S Couch; Conrad C Huang; Thomas E Ferrin
Journal:  BMC Bioinformatics       Date:  2006-07-12       Impact factor: 3.169

8.  Germline pathogenic variants of 11 breast cancer genes in 7,051 Japanese patients and 11,241 controls.

Authors:  Yukihide Momozawa; Yusuke Iwasaki; Michael T Parsons; Yoichiro Kamatani; Atsushi Takahashi; Chieko Tamura; Toyomasa Katagiri; Teruhiko Yoshida; Seigo Nakamura; Kokichi Sugano; Yoshio Miki; Makoto Hirata; Koichi Matsuda; Amanda B Spurdle; Michiaki Kubo
Journal:  Nat Commun       Date:  2018-10-04       Impact factor: 14.919

Review 9.  Spinocerebellar ataxia: an update.

Authors:  Roisin Sullivan; Wai Yan Yau; Emer O'Connor; Henry Houlden
Journal:  J Neurol       Date:  2018-10-03       Impact factor: 4.849

10.  Whole Genome Analyses of Chinese Population and De Novo Assembly of A Northern Han Genome.

Authors:  Zhenglin Du; Liang Ma; Hongzhu Qu; Wei Chen; Bing Zhang; Xi Lu; Weibo Zhai; Xin Sheng; Yongqiao Sun; Wenjie Li; Meng Lei; Qiuhui Qi; Na Yuan; Shuo Shi; Jingyao Zeng; Jinyue Wang; Yadong Yang; Qi Liu; Yaqiang Hong; Lili Dong; Zhewen Zhang; Dong Zou; Yanqing Wang; Shuhui Song; Fan Liu; Xiangdong Fang; Hua Chen; Xin Liu; Jingfa Xiao; Changqing Zeng
Journal:  Genomics Proteomics Bioinformatics       Date:  2019-09-05       Impact factor: 7.691

View more
  1 in total

1.  Distinct landscapes of deleterious variants in DNA damage repair system in ethnic human populations.

Authors:  Zixin Qin; Teng Huang; Maoni Guo; San Ming Wang
Journal:  Life Sci Alliance       Date:  2022-05-20
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.