Literature DB >> 34035850

Clinical application of whole-exome sequencing: A retrospective, single-center study.

Qiang Zhang1, Zailong Qin1, Shang Yi1, Hao Wei1, Xun Zhao Zhou1, Jiasun Su1.   

Abstract

The aim of the present study was to assess the practical diagnostic value of whole-exome sequencing (WES) in patients with different phenotypes and to explore possible strategies to increase the capability of WES in identifying disease-causing genes. A total of 1,360 patients (aged from 1 day to 42 years old) with manifestations of genetic diseases were genotyped using WES and statistical analysis was performed on the results obtained. Within this cohort, the overall positive rate of identification of a disease-causing gene alteration was 44.41%. The positive identification rate where trio-samples were used (from the proband and both parents) was higher than that where a single proband sample was used (50.00 vs. 43.71%), and 604 positive cases with 150 genetic syndromes, 510 genes and 718 mutations were detected. Missense mutations were the most common variations (n=335, 45.27%) and visual or auditory abnormalities (58.51%) had the highest rate of association with a genetic abnormality. The positive detection rate of WES was elevated with the increase in the number of clinical symptoms from 1 to 8. The present study indicated that WES may be used as a valuable tool in the clinic and the positive rate depends more on the professional experience of clinicians rather than on the analytical capabilities of the data analyst. At the same time, particular attention must be paid to certain possible factors (such as the age of the patients as well as possible exon deletions), which may affect the diagnostic rate while applying this process. Copyright: © Zhang et al.

Entities:  

Keywords:  human phenotype ontology; pathogenicity assessment; phenotypic abnormality; whole-exome sequencing

Year:  2021        PMID: 34035850      PMCID: PMC8135134          DOI: 10.3892/etm.2021.10185

Source DB:  PubMed          Journal:  Exp Ther Med        ISSN: 1792-0981            Impact factor:   2.447


Introduction

Based on World Health Organization estimates, China's birth defect rate is 5.6% at present, which is significantly higher than the birth defect rate of 4.72% in developed countries (1). In addition, birth defects have risen from fourth place in 2000 to 2nd place in 2011 in the league table of child mortality causes in China, accounting for 19.10% of child deaths (2). In the 21st century, the development of genetic testing technology and the ability to reveal the causes of genetic diseases have promoted the revolutionary transformation of medical practice from the symptoms-based diagnostics of traditional medicine to the cause-based diagnostics of modern medicine (3). With the large population in China and the increasing number of patients carrying genetic diseases, inadequate numbers of qualified genetic professionals and testing capabilities represent significant challenges (1). The diagnosis and treatment of inherited metabolic disorders has become an important task for clinicians (4,5). In 2009, Ng et al (6) first applied next-generation sequencing to capture the sequences of human genome exons in order to analyze the pathogenic variations of single-gene disorders. Since then, the technique of whole-exome sequencing (WES) has been widely used, due to its ability to provide potential molecular genetic proof for genetic diseases in suspected cases after clinical assessment (7). Guangxi, a province located in southwestern China, has a population including several ethnic minorities that is genetically heterogeneous (8). Based on our experience, the application of WES in the local population with a genetic disease presentation has proven to be beneficial. Guangxi Maternal and Child Health Hospital (Nanning, China) is one of the leading hospitals responsible for the diagnosis and treatment of genetic diseases in this region (9). In the present study, the data of 1,360 cases of WES in a single center were reviewed and summarized to assess the practical diagnostic value of WES and to explore how to improve the ability of this technique to find disease-causing genes.

Patients and methods

Basic information

A total of 1,360 cases subjected to WES at the Guangxi Maternal and Child Health Hospital (Nanning, China) between January 2017 and July 2019 were reviewed. Their age ranged from 1 day to 42 years (4.72±7.67) and the ratio of males to females was 1.74:1. Among them, 456 were inpatients and 904 were outpatients. The clinical characteristics of all patients were evaluated. The patients' ethnicities included Han, Zhuang, Yao, Uygur, Miao as well as other ethnic minorities which is representative of the ethnic composition of this region (10). Therefore, the present cohort may be considered as a representative sample. The present study was a retrospective study, and the data were collected as part of the routine clinical procedure and no informed consent is required. However, each patient or guardians signed the informed consent before performing the WES. The publication of the article is approved by the Ethics Committee of Guangxi Zhuang Autonomous Region Maternal and Child Health Hospital and the Children's Hospital (Reference File No.:2017, [2-11]).

WES and biometric filtering DNA extraction

Blood samples (2 ml of peripheral blood) were collected from the patients and their parents for WES. Genomic DNA was isolated using the Lab-Aid DNA kit (Zeesan Biotech Co., Ltd) and stored at -80˚C.

WES and copy number variation (CNV) analysis

Human exome sequencing libraries were constructed using the Agilent SureSelect Human All Exon V5 kit (Agilent Technologies, Inc.), and amplicons were generated and sequenced with the Illumina HiSeq 2500 system (Illumina, Inc.). All procedures were performed according to the manufacturer's instructions. After sequencing, reads were aligned to an indexed human reference genome (GRCh37/hg19) with Burrows-Wheeler transformation 0.7.15-r1140(11). Duplicate reads were removed using Picard v.1.85 (http://picard.sourceforge.net) prior to further processing. Base recalibration and variant calling were performed using the Genome Analysis Toolkit v.2.3-4Lite (12). Finally, identified variants were saved in a variant call format. In addition, in the present study, there was an attempt to reveal CNVs with a read depth-based CNV detection method (13-16). The detection methods can be separated into four steps: Raw coverage normalization, correction for sample-specific coverage biases, CNV calling, and partially-mapped read analysis and all copy number variants were finally confirmed by Illumina Human Cyto SNP12 kit. Translational Genomics Expert (LifeMap Sciences, Inc.) was used for variant prioritization. Integrative Genomics Viewer (IGV; v.2.4.15; http://software.broadinstitute.org/software/igv/) was used to visualize WES data and assess the coverage of the exons.

Quality control

The capture area of WES was 60M. The average sequencing depth of each sample was >120X and produced 12G clean data. At the same time, the actual qualified data output of the library was >95% with an average Q30≥80% for each sample and the accurate coverage of the exon region (>20X) should be ≥97%; base type distribution was uniform with no GC separation. Sanger sequencing was used to verify the mutations and their origins. In order to reduce the interference of noise signals in the results, a series of measures was implemented. First, in the experimental process, each sample was detected under the same experimental conditions to reduce any intra-batch differences. Furthermore, normalization of WES data was performed to avoid any differences caused by the capture or sequencing processes. In addition, mass correction on the exome region of the sample was performed including a GC correction. Finally, biological information processing on the same exome region of different samples on the same batch was routinely performed.

Sequencing data analysis

The Genome Analysis Toolkit (GATK) was used for variant calling (GATK HaplotypeCaller). The TGex software version 3.4.1 (LifeMap Sciences) was used to annotate the selected single nucleotide variants (SNVs) and indels. ‘Rare deleterious’ mutations were defined as those that met the following criteria: a) They led to a stop-gain, stop-loss, nonsynchronous, frameshift or splice-site mutation and (b) the reference genome GRCh37/hg19 from the Genome Reference Consortium was used (https://www.ncbi.nlm.nih.gov/grc).

Interpretation of variations

Pathogenicity assessment was based on the American College of Medical Genetics and Genomics (ACMG)/Association for Molecular Pathology (AMP) 2015 guidelines and the variants were classified as either ‘benign’, ‘likely benign’, ‘uncertain significance’, ‘likely pathogenic’ or ‘pathogenic’ based on InterVar (http://wintervar.wglab.org/) (17).

Statistical analysis

Data analysis was conducted by χ2 test using the IBM SPSS version 19.0 statistical analysis software. P<0.05 was considered significant. χ2 test was used to compare the frequencies of positive (disease-related genes found)/negative (No association was found between disease and gene) detection among medical staff.

Results

Baseline characteristics

In the present study information and WES data was collected on 1,360 patients from the Pediatric Intensive Care Unit and the Pediatrics, Eugenics, Child Rehabilitation, Neonatal, Otolaryngology and General Pediatric Inpatient Departments as well as the Surgery Clinic of Guangxi Maternal and Child Health Hospital (Nanning, China) between January 2017 and July 2019. Of the 1,360 patients, 456 were hospitalized and 904 were out-patients. The positive rates (detection rate of disease-associated genes) of hospitalized and out-patients were 43.20% (198/456) and 45.02% (407/904), respectively. Among all of the cases, 1,217 were single-sample patients (where only the proband was tested) and 144 were trio-samples (the proband and their parents). Of the 1,217 single-sample patients, 532 were positive (43.71%), while 72 positive samples were detected within 144 family samples (50.00%). Among the 604 positive patients, including 532 single and 72 family samples, 89 cases had CNVs with an associated positive rate of 14.74%, 83.28% were the non-CNVs and 1.98% (114 cases) carried compound heterozygous mutations. The deletions of one or more exons with merge point mutations accounted for 10 cases and CNV merged with SNV accounted for 6 cases. Among them, 478 mutations had been reported previously and 240 mutations were novel (unpublished). A total of 198 mutations were de novo variations. A flow chart of the analysis of all of the cases studied is presented in Fig. 1.
Figure 1

Test results of the 1,360 subjects. CNV, copy number variant; SNV, single nucleotide variant.

Genetic variants and pathogenicity assessment

A total of 150 genetic syndromes, involving 510 genes (with respect to abnormal CNVs and point mutations) and 718 variations were detected in the 604 positive cases, of which 89 were CNVs, including variations associated with deletions and duplications. Based on the ACMG/AMP 2015 guidelines, 718 mutations were classified as being pathogenic (65.13%), likely pathogenic (25.05%) and as having uncertain significance (9.82%). Even if certain genes may be consistent with clinical manifestations, they may not have been extensively studied and may only be identified as variants of uncertain significance. In such instances, the progress of research requires them to be tracked on a regular basis to determine whether the rating may be updated in order to provide patients with treatment and genetic counseling. At present, the ‘consensus recommendations for the clinical application of genetic testing for children's genetic diseases’ recommends that only pathogenic and likely pathogenic mutations may be used for prenatal or preimplantation diagnosis (18).

Classification of clinical symptoms and statistics

Among the patients, certain probands had a single symptom while others had multiple systemic phenotypes. The patients were divided into 20 categories according to their major clinical symptoms. The positive detection rate for each clinical manifestation is presented in Table I. The major clinical manifestations included short stature, motor deterioration, language development retardation, mental retardation, skeletal deformity, seizures and gonadal dysgenesis or abnormal genital organs. In order to investigate whether the positive rate of WES increased with the number of clinical symptoms, the number of clinical symptoms exhibited by each of the subjects was determined, and according to those numbers, the patients were assigned to categories 1-9 and the positive rates in those groups were then compared in a graph. Through analysis and comparison, it was determined that the positive rate of WES increased with the number of clinical symptoms in the range of 1 to 8. The positive rate was ~40% when the number of symptoms was 1-3, but when the number of symptoms was 4-8, the positive rate was ~50% (Fig. 2). It is recommended, wherever possible, that the interpretation of WES data should be accompanied by clinical information (19). In the positive cases, the incidence of autosomal variation was 4.59 times [491 (autosomal variation cases): 107 (sex chromosomes variation cases)] that of sex chromosome abnormality. The patients with sex chromosome abnormalities were mainly male children (<14 years old) and the male-to-female ratio was 15.75:1. Clinical manifestations of sex chromosome abnormalities included cryptorchidism, micro-penis and hypospadias. There were 629 non-CNVs, namely nonsense, missense, frameshift, splicing, in-frame deletions, in-frame duplications and exon-deletions. The proportion of each variant is presented in Fig. 3. In order to determine whether the capabilities of clinicians and data analysts affect the positivity rate of WES, a pediatric outpatient department, which has the largest number of test samples, was also selected to observe whether there were any differences in the positive status of the samples submitted by individual doctors. The positive detection rates among the samples analyzed by the data analysts were also determined. As presented in Table II, different clinicians had different professional levels of expertise and experience to evaluate patients and this directly affected the WES-positive rate, but there was no significant difference between different data analysts.
Table I

Positive rate under different clinical symptoms.

Phenotypic abnormalityHPONumber of subjects testedPositive cases, n (%)
Growth abnormalityHP:0000002239118 (49.37)
Abnormality of the nervous system   
     SeizuresHP:0001250229102 (44.54)
     Intellectual disabilityHP:000124910438 (36.54)
     Delayed speech and languageHP:0000750  
Development or autistic behaviorHP:000072911638 (32.76)
     Global developmental delayHP:00012639351 (54.84)
     Cerebral palsyHP:010002114359 (41.26)
     Motor deteriorationHP:000233320699 (48.06)
Abnormality of the respiratory systemHP:000208616567 (40.61)
Abnormality of the genitourinary system   
     Abnormality of the genital systemHP:000007818266 (36.26)
     Abnormality of the urinary systemHP:000007919566 (33.85)
Abnormality of the skeletal systemHP:0000924215116 (53.95)
Abnormality of head or neckHP:0000152290161 (55.52)
Abnormality of the skinHP:000095110858 (53.70)
Abnormality of the endocrine systemHP:0000818225112 (49.78)
Hearing or vision abnormalityHP:00003649455 (58.51)
 HP:0000504  
Abnormality of blood and blood-forming tissuesHP:00018717527 (36.00)
Abnormality of the digestive systemHP:00250315214 (26.92)
Abnormality of the cardiovascular systemHP:000162612554 (43.20)

HPO, human phenotype ontology which provides a standardized vocabulary of phenotypic abnormalities encountered in human disease. Positive cases, detection rate of disease-associated genes.

Figure 2

Positive rates for different numbers of symptoms.

Figure 3

Proportion of variant types in the total variations. CNV, copy number variant; del, deletion; dup, duplication.

Table II

Clinicians and data analysts' ability to influence the impact of whole-exome sequencing results.

Staff group/IDPositive casesNegative casesPositive rate (%)Chi-squareP-value
Clinicians   14.430.025
     A1401940.42  
     B32170.65  
     C21340.38  
     D12190.39  
     E480.33  
     F8120.40  
     G170.13  
Data analysts   3.460.75
     A11271500.46  
     B174820.47  
     C123330.41  
     D118230.44  
     E189940.49  
     F114240.37  
     G170690.50  

A-G represents different clinicians, and A1-G1 different analysts. Positive cases (as diagnosed by clinicians) are defined as the number of cases identified with disease-associated genes based on the clinician's assessment. Negative cases are defined as the opposite. Positive cases (as judged by data analysts) are defined by the number of cases identified with suspected disease-related genes by analysts by using the whole exome sequencing data.

Discussion

In general, Mendelian (single-gene) diseases are considered to be rare, occurring at a rate of 40-82 cases per 1,000 live births, with an estimated 7.9 million infants being born annually with serious birth defects of genetic or partially genetic origin (20,21). The US Food and Drug Administration has approved ~400 types of orphan drugs, but these drugs have limited efficacy and are suitable for treatment of only ~5.00% of rare diseases. Most rare diseases may only rely on symptomatic or placebo treatments, as the causes of most of these genetic diseases still remain elusive (22-24). In recent years, with the advancement of precision medicine and the rapid development of large-scale parallel sequencing technology, the application of WES has increased, not only for research purposes but also for clinical diagnosis. The effectiveness of WES has been well documented for certain diseases, such as those relating to the nervous system, dermatology and seizures (25-27). This may help to elucidate the pathogenesis of these diseases. In general, genetic variations may affect biological function by enhancing (e.g., dose effects and alterations of transcriptional activity) or decreasing (e.g., haplo-insufficiency and dominant-negative mutation) the function of key genes (28). WES was performed on all 1,360 patients of the present study. The examination and analytical processes are depicted in a schematic in Fig. 4, including those performed prior to, during and after WES. The overall positive rate obtained with this protocol was 47.05%, which was slightly higher than that for a previous case series, in which yields of 25.00-26.00% were reported (29) and it was higher than the positive rate of other genetic testing methods, such as whole-genome sequencing analysis (34.00-42.00%), karyotype analysis (4.20-10.00%), chromosome microarray analysis (6.50-20.00%) and fluorescence in situ hybridization analysis (3.00%) (30-36). Among them, the trio-samples positive rate was 6.29% higher than the rate obtained for single samples, and this was similar to the results obtained from a previous study (37).
Figure 4

Whole-exome sequencing analysis flow. CNV, copy number variant.

Analysis of trio-samples may not only identify the mutations associated with the disease detected based on the clinical manifestations but may also, in turn, be compared and evaluated from the severity of the mutations to the clinical performance of the patient. At the same time, it is possible to rapidly eliminate irrelevant variations by using the genetic model, allowing for the identification of suspected disease-causing genes, ultimately increasing the positive detection rate and efficiency. In addition, trios facilitate the detection of de novo variants and allow for appraisal of compound heterozygous variants during interpretation (rather than during the confirmatory testing process). When WES was performed, factors to be considered by clinicians included cost, time-to-result and the presence of consanguinity or family history. For certain diseases that are rapidly progressing and have serious clinical symptoms, this study would suggest that priority should be given to performing WES where trios are available. In the present study, the major clinical manifestations of 1,360 patients were classified into 20 main categories in order to determine which symptoms contributed the most to the positive rate of WES. Among the 20 classifications, the positive rate of audio or visual abnormalities was the highest (58.51%), followed by abnormality of the head or neck (55.52%) and global developmental delay (54.84%). The manifestations with the lowest positive rate were abnormalities of the digestive system (26.92%). A total of 52 patients with abnormality of the digestive system were analyzed, revealing that the major symptoms were diarrhea, constipation, vomiting and gastro-esophageal reflux. The age of the affected children ranged from several days to several months. At this stage, the appearance of such symptoms was mainly related to the developmental characteristics of the infants with an immature digestive system and minimal establishment of microflora. Therefore, it would be difficult to adapt to changes in the quality and quantity of food. In addition, those subjects are likely to have low body immunity and incorrect feeding regimens employed by certain parents as well as other factors may lead to these symptoms. However, such symptoms should not be ignored during the analysis, as they may also be characteristic of certain hereditary diseases, such as Hirschsprung disease 2 [online mendelian inheritance in man (OMIM) ID, 600155]. The three most common variations were missense, frameshift and nonsense mutations and the least common ones were in-frame duplications. It is worth noting that of the 604 positive cases, 22 had exon deletions, such as ASCC1 (activating signal cointegrator 1 complex subunit 1) c.932C>G(p.Ser311Ter) or exon5 deletion. These small deletions should always be noted, as they may be missed during routine and low-depth CNV analyses. During the analysis of an autosomal recessive disease, when one mutation is found in a certain gene, which is relevant to the clinical symptoms, we recommended that the exons of this particular gene should be carefully viewed using IGV software. WES may help clinicians to determine the cause of the disease and can also influence the treatment strategy, such as pyridoxine-dependent epilepsy caused by the ALDH7A1 gene (aldehyde dehydrogenase 7 family member A1; OMIM ID, 266100). Patients with this type of epilepsy are generally insensitive to treatment with anticonvulsants but may be treated effectively with large doses of pyridoxine (vitamin B6). WES, which Raffan and Semple (38) called ‘game-changing technology’ in 2011, is becoming an essential tool for clinicians dealing with rare and common genetic disorders. It may be assumed that due to the maturity of WES technology and the standardization of the process, the data analysis performed by different analysts according to standardized procedures does not exhibit any obvious differences in the results obtained. In order to avoid the influence of subjective factors, indicators of different diseases were quantified in the present study, so as to minimize variations caused by those subjective factors. For instance, short stature was evaluated according to items including the standard deviation, the annual increase in height and growth hormone stimulation test results. There are numerous challenges associated with WES. First of all, there is an increasing number of non-paternity infants (from donor gametes), which makes it difficult to identify and interpret suspected pathogenic variants. Furthermore, there are complex issues surrounding genetic counseling associated with WES (e.g., variants of unknown significance). Finally, establishing a close cooperation network between analysts and clinicians in the clinical environment may be challenging. In summary, there are numerous difficulties and challenges that require to be overcome to successfully apply WES sequencing technology in the fields of healthcare and disease prevention. Fortunately, the situation is constantly improving. In the foreseeable future, it is anticipated that WES will become a valuable and possibly even a first-line diagnostic tool in the clinical setting for the diagnosis of complex genetic diseases.
  36 in total

Review 1.  The genetic basis of disease.

Authors:  Maria Jackson; Leah Marks; Gerhard H W May; Joanna B Wilson
Journal:  Essays Biochem       Date:  2018-12-02       Impact factor: 8.000

2.  Does the geographical gradient of ApoE4 allele exist in China? A systemic comparison among multiple Chinese populations.

Authors:  Peng Hu; Yuan Han Qin; Cheng Xue Jing; Ling Lu; Bo Hu; Peng Fei Du
Journal:  Mol Biol Rep       Date:  2010-03-31       Impact factor: 2.316

3.  Genetic services and testing in China.

Authors:  Xinliang Zhao; Peirong Wang; Xuguang Tao; Nanbert Zhong
Journal:  J Community Genet       Date:  2013-04-18

4.  From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.

Authors:  Geraldine A Van der Auwera; Mauricio O Carneiro; Christopher Hartl; Ryan Poplin; Guillermo Del Angel; Ami Levy-Moonshine; Tadeusz Jordan; Khalid Shakir; David Roazen; Joel Thibault; Eric Banks; Kiran V Garimella; David Altshuler; Stacey Gabriel; Mark A DePristo
Journal:  Curr Protoc Bioinformatics       Date:  2013

Review 5.  Rare Diseases: Drug Discovery and Informatics Resource.

Authors:  Mingzhu Zhao; Dong-Qing Wei
Journal:  Interdiscip Sci       Date:  2017-11-01       Impact factor: 2.233

Review 6.  Next generation sequencing: Coping with rare genetic diseases in China.

Authors:  David S Cram; Daixing Zhou
Journal:  Intractable Rare Dis Res       Date:  2016-08

Review 7.  The identification of microdeletion syndromes and other chromosome abnormalities: cytogenetic methods of the past, new technologies for the future.

Authors:  Lisa G Shaffer; Bassem A Bejjani; Beth Torchia; Susan Kirkpatrick; Justine Coppinger; Blake C Ballif
Journal:  Am J Med Genet C Semin Med Genet       Date:  2007-11-15       Impact factor: 3.908

8.  Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application.

Authors:  C Li; W Hung Wong
Journal:  Genome Biol       Date:  2001-08-03       Impact factor: 13.583

9.  The survey of birth defects rate based on birth registration system.

Authors:  Min Yu; Zhiguang Ping; Shuiping Zhang; Yuying He; Rui Dong; Xiong Guo
Journal:  Chin Med J (Engl)       Date:  2015-01-05       Impact factor: 2.628

10.  Targeted capture and massively parallel sequencing of 12 human exomes.

Authors:  Sarah B Ng; Emily H Turner; Peggy D Robertson; Steven D Flygare; Abigail W Bigham; Choli Lee; Tristan Shaffer; Michelle Wong; Arindam Bhattacharjee; Evan E Eichler; Michael Bamshad; Deborah A Nickerson; Jay Shendure
Journal:  Nature       Date:  2009-08-16       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.