| Literature DB >> 35912641 |
Martina Witsch-Baumgartner1, Gunda Schwaninger1, Simon Schnaiter1, Franziska Kollmann1, Silja Burkhard1, Rebekka Gröbner1, Beatrix Mühlegger1, Esther Schamschula1, Peter Kirchmeier2, Johannes Zschocke1.
Abstract
Genotyping arrays are by far the most widely used genetic tests but are not generally utilized for diagnostic purposes in a medical context. In the present study, we examined the diagnostic value of a standard genotyping array (Illumina Global Screening Array) for a range of indications. Applications included stand-alone testing for specific variants (32 variants in 10 genes), first-tier array variant screening for monogenic conditions (10 different autosomal recessive metabolic diseases), and diagnostic workup for specific conditions caused by variants in multiple genes (suspected familial breast and ovarian cancer, and hypercholesterolemia). Our analyses showed a high analytical sensitivity and specificity of array-based analyses for validated and non-validated variants, and identified pitfalls that require attention. Ethical-legal assessment highlighted the need for a software solution that allows for individual indication-based consent and the reliable exclusion of non-consented results. Cost/time assessment revealed excellent performance of diagnostic array analyses, depending on indication, proband data, and array design. We have implemented some analyses in our diagnostic portfolio, but array optimization is required for the implementation of other indications.Entities:
Keywords: DNA array; genetic test; genotyping; screening; single nucleotide variants
Mesh:
Year: 2022 PMID: 35912641 PMCID: PMC9482391 DOI: 10.1002/mgg3.2016
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.473
GSA results for specifically targeted common pathogenic disease variants in 212 individuals
| Gene | Cases | Target variants | Observed variants | Correct GSA results | GSA result incorrect | GSA results inconclusive | |||
|---|---|---|---|---|---|---|---|---|---|
| Hom | Comp het | Het | Hom WT | ||||||
|
| 26 | 4 | 2 | 0 | 0 | 9 | 17 | 0 | 0 |
|
| 21 | 1 | 1 | 0 | 0 | 1 | 20 | 0 | 0 |
|
| 35 | 1 | 1 | 2 | 0 | 7 | 26 | 0 | 0 |
|
| 19 | 1 | 1 | 4 | 0 | 11 | 4 | 0 | 0 |
|
| 22 | 3 | 1 | 0 | 0 | 1 | 21 | 0 | 0 |
|
| 3 | 2 | 2 | 0 | 0 | 3 | 0 | 0 | 0 |
|
| 22 | 2 | 2 | 2 | 2 | 8 | 10 | 0 | 0 |
|
| 15 | 1 | 1 | 6 | 0 | 8 | 1 | 0 | 0 |
|
| 28 | 15 | 10 | 1 | 6 | 8 | 13 | 0 | 1 |
|
| 21 | 2 | 2 | 2 | 1 | 10 | 8 | 0 | 0 |
|
|
|
|
|
|
|
|
|
|
|
Notes: See Supplementary Table S1 for detailed variant and transcript information. 23 target variants were observed in 92 samples. Multiplying the number of SNP targets with the number of cases results in a total of 772 individual genotypes (homozygous, heterozygous or wild type at a particular position). All variant and wild type genotypes denoted conclusive by GSA analysis (n = 771) were identical with the genotypes identified by standard sequencing.
Bold font is used to highlight relevant (summary) information.
Abbreviations: Comp het, compound heterozygous; Het, heterozygous; Hom, homozygous.
One sample contained three heterozygous variants, another contained a homozygous variant (c.442G > C) and two heterozygous variants.
An inconclusive results was obtained at one position (MEFV variant c.2084A > G) in a single individual who did not carry this variant, but was homozygous for variant c.2082G > A (not covered by the GSA, identified by sequencing).
Pathogenic variants identified by GSA analysis in 116 individuals with cystic fibrosis (CF, n = 33), phenylketonuria (PKU, n = 39) or one of 8 other autosomal recessive disease (N = 44)
| Disease genes | Cases | Variants | Variants on GSA | Diagnostic GSA results | incorrect | inconclusive | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| No. | No. | Alleles | No. | % | Cases | % | Variants | Alleles | Variants | Alleles | |
|
| 32 | 22 | 62 | 18 | 82 | 0 | 0 | 2 | 2 | ||
|
| 3 | 2 | 4 | (2) | (100) | ||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 | 4 | 12 | 1 | 25 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 6 | 5 | 12 | 4 | 80 | 5 | 83 | 0 | 0 | 0 | 0 |
|
| 3 | 4 | 6 | 3 | 75 | 2 | 67 | 0 | 0 | 0 | 0 |
|
| 3 | 2 | 6 | 2 | 100 | 3 | 100 | 0 | 0 | 0 | 0 |
|
| 4 | 4 | 8 | 2 | 50 | 2 | 50 | 0 | 0 | 0 | 0 |
|
| 6 | 5 | 12 | 1 | 20 | 1 | 17 | 0 | 0 | 0 | 0 |
|
| 10 | 15 | 20 | 9 | 60 | 3 | 30 | 0 | 0 | 0 | 0 |
|
| 6 | 6 | 12 | 5 | 88 | 5 | 86 | 0 | 0 | 0 | 0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Notes: See Supplementary Tables S2 and S3 for detailed variant and transcript information. CFTR gene analyses in individuals with cystic fibrosis included a total of 22 pathogenic single nucleotide variants and small deletions on a total of 62 alleles; 18 of these variants were covered on the GSA. In addition, there were two single/multiple exon deletions on 4 alleles. GSA analysis correctly identified homozygous and heterozygous genotypes for all variants covered on the array, including (by targeted quantitative analysis) the two single/multiple exon deletions. There were no false negative results for GSA‐covered PVs. Inconclusive results were called for two PVs in two cases; one of them was explained by a large deletion of the whole exon, the other was unexplained. Conventional PAH gene sequencing in individuals with phenylketonuria identified 26 different pathogenic single nucleotide variants including 2 single nucleotide deletions; there were no large deletions or duplications. Three PVs (heterozygous in single cases) were not identified as they are no GSA target variants; an additional PV not covered by the GSA triggered an incorrect positive result for a GSA‐covered different variant at the same nucleotide. Of the remaining 22 variants identified in our cohort and covered by the GSA, 21 were correctly genotyped and one was denoted inconclusive in the single variant‐positive sample. Variant‐negative inconclusive results were mostly caused by the variant probe previously shown to be unreliable, or presence of a different variant in close vicinity. There were no false negative results, i.e., no GSA detectable variants denoted conclusive wild type. The high proportion of diagnostic GSA results in PKU is explained by the inclusion, on the GSA, of 27 of the 29 common PVs in Europe.16 The allele detection rate of 94% (73/78 alleles, see Supplementary Table S2) fits nicely with the predicted GSA detection rate of 93% for PKU alleles, based on the combined frequency of GSA‐covered variants in European PKU patients. In contrast, several relevant cystic fibrosis alleles have so far not been included in the GSA, explaining a somewhat lower allele characterization rate of 91% (60/66 alleles, see Supplementary Table S2) in our cohort. OMIM numbers: CFTR 602421, PAH 612349, ACADM 607008, BTD 609019, CBS 613381, CPT2 600,650, FAH 613871, GAA 606800, GCDH 608801, GALT 606999.
Bold font is used to highlight relevant (summary) information.
GSA results were regarded as diagnostic when variants on both alleles were correctly identified, and there were no incorrect positives; inconclusive results were disregarded.
Both samples with incorrectly called PAH gene variants contained a different pathogenic variant at the same position/region; in one sample the other variant was covered and correctly called by the GSA analysis.
Inconclusive results were associated with presence of that variant in the respective sample (1 variant), presence of another variant at the same position/region (2 variants), wild type sequence at the position/region (2 variants), or poor performance of the variant also in control samples (1 variant).
Copy number assessment was not part of the routine GSA analysis and was performed post hoc only in samples with respective abnormalities identified by massive parallel sequencing.
BRCA1 or BRCA2 pathogenic variants and variant clusters identified by GSA analysis in 181 individuals with suspected inherited breast and ovarian cancer syndrome
| Gene | Variant cluster | Cases | GSA‐called variant | Class. | Confirmed variant | Class. | Comment | |
|---|---|---|---|---|---|---|---|---|
|
| 1 | 1 | c.68_69dup | C5 | c.68_69dupAG | C5 | (2) | Additional variants |
| c.64_65del | C5 | |||||||
| c.65 T > C | C5 | |||||||
| 2 | 1 | c.569_570insAACG | C5 | c.570C > T | C2 | (4) | Incorrect variant | |
| 3 | 1 | c.676del | C5 | c.676delT | C5 | (1) | Correct positive | |
| 4 | 2 | c.843_846delCTCA | C5 | c.843_846delCTCA | C5 | (1) | Correct positive | |
| 5 | 1 | c.1204del | C5 | c.1204delG | C5 | (1) | Correct positive | |
| 6 | 3 | c.1687C > T | C5 | c.1687C > T | C5 | (1) | Correct positive | |
| 7 | 1 | c.3296del | C5 | c.3296delC | C5 | (1) | Correct positive | |
| 8 | 2 | c.3481_3491del | C5 | c.3481_3491del | C5 | (2) | Additional variant | |
| c.3481G > T | C5 | |||||||
| 9 | 2 | c.3511A > T | C5 | c.3511A > T | C5 | (1) | Correct positive | |
| 10 | 8 | c.4183C > T | C5 | c.4183C > T | C5 | (1) | Correct positive | |
| 11 | 1 | c.4837A > G | C1 | c.4837A > G | C1 | (5) | Additional variants | |
| c.4834_4835del | C5 | |||||||
| c.4838_4839insC | C5 | |||||||
| 12 | 1 | c.5057A > G | C5 | c.5057A > G | C5 | (2) | Additional variant | |
| c.5056_5057insC | C5 | |||||||
| 13 | 1 | c.5212G > A | C5 | c.5212G > A | C5 | (2) | Additional variant | |
| c.5213del | C5 | |||||||
|
| 14 | 1 | c.1909 + 1G > A | C5 | c.1909 + 1G > A | C5 | (1) | Correct positive |
| 15 | 2 | c.3808_3809del | C5 | c.3807 T > C | C1 | (4) | Incorrect variant | |
| 16 | 2 | c.4258del | C5 | c.4258G > T | C1 | (4) | Incorrect variant | |
| 17 | 2 | c.4440 T > G | C5 | c.4440 T > G | C5 | (1) | Correct positive | |
| 18 | 2 | c.7544C > T | C1 | c.7544C > T | C1 | (5) | Additional variant | |
| c.7537_7538insA | C4 | |||||||
| 19 | 1 | c.7565C > T | C3 | c.7565C > T | C3 | (5) | Additional variant | |
| c.7565_7568del | C5 | |||||||
| 20 | 1 | c.8536G > T | C5 | c.8535_8538delAGAG | C5 | (3) | Incorrect variant | |
| 21 | 1 | c.8583_8584insT | C5 | c.8585dupT | C5 | (3) | Incorrect variant | |
| 22 | 3 | c.9976A > T | C1 | c.9976A > T | C1 | (5) | Additional variant | |
| c.9981A > T | C3 | |||||||
| Total | 22 | 40 | ||||||
Variant not on GSA. Class. = pathogenicity classification: C1 benign, C2 likely benign, C3 variant of uncertain significance (VUS), C4 likely pathogenic, C5 pathogenic. Comments: (1) = PV correctly identified. (2) = PV correctly identified together with 1–2 wrongly ascertained variants at the same or adjacent positions. (3) = PV incorrectly ascertained as different PV. (4) = non‐pathogenic variant incorrectly ascertained as PV. (5) = non‐pathogenic variant or VUS correctly identified together with 1–2 wrongly ascertained (pathogenic) variants at the same or adjacent positions.
GSA analysis results in 392 individuals with hypercholesterolemia
| Disease genes | Cases | Variants | Variants on GSA | Diagnostic GSA results | Incorrect PV calls | Inconclusive PV calls | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| No. | No. | No. | % | Cases | % | Variants | Cases | Variants | Cases | |
|
| 139 | 60 | 36 | 58 | 93 | 67 | 5 | 6 | 5 | 16 |
|
| 7 | 6 | (4) | (66) | (5) | (71) | (2) | (2) | n.a. | n.a. |
|
| 13 | 1 | 1 | 100 | 13 | 100 | 0 | 0 | 0 | 0 |
|
| 1 | 1 | 1 | 100 | 1 | 100 | 0 | 0 | 0 | 0 |
|
| – | – | – | – | – | – | – | – | – | – |
|
|
|
|
|
|
|
|
|
|
|
|
| Recessive | 1 | 1 (+4) | (4) | (80) | 0 | – | 0 | 0 | 0 | 0 |
| No variants | 231 | – | – | – | – | – | – | – | – | – |
|
|
|
|
|
|
|
|
|
|
|
|
Notes: See Supplementary Table S4 for additional variant information. The GSA‐covered well‐known APOB variant p.Arg3527Gln in 13 individuals and the PCSK9 variant p.Asp129Asn in one individual were correctly genotyped; there were no false positive or inconclusive PV results for these genes. 36 GSA‐covered small LDLR variants (single nucleotide variants or small deletions/insertions) were correctly identified in 93 individuals. 5 PVs in 16 additional cases were called inconclusive; 4 of these variants in 12 cases were present in the sample, as shown by sequencing. These variants thus are not reliably detectable by standard GSA analysis focusing on conclusive genotypes. Five PVs not covered on the GSA gave rise to incorrect positive calls for other PVs at the same positions in six individuals. In total, 93/139 (67%) cases were correctly genotyped with a diagnostic PV. The combination of GSA analysis with targeted sequencing of all PV calls (excluding inconclusive PVs) would have led to correct PV identification in 99 cases.
Bold font is used to highlight relevant (summary) information.
Abbreviations: LDLR del/dup, large whole exon deletions or duplications; 4/6 copy number changes were correctly identified by post hoc quantitative GSA analysis; n.a., not applicable; PV, pathogenic variant.
All incorrect results were due to variants not covered on the GSA that were incorrectly assigned as other (GSA‐covered) variants at the same or adjacent positions. There were no false negative results, i.e., wild type sequence called conclusive at the position of a GSA‐covered pathogenic variant.
Inconclusive GSA results for pathogenic variants were observed in 16 cases. In 12 cases, they were associated with presence of the respective variant (4 different variants), in one case there was WT sequence at the respective position. In three cases with the 4 bp duplication c.1415_1418dup, the GSA analysis gave also inconclusive calls for the adjacent missense variant c.1414G > T not present in the samples.
Recessive disease genes analyzed: LDLRAP1, ABCG5, ABCG8, LIPA. Only one sample showed a homozygous pathogenic variant for LDLRAP1, which was not covered by the GSA. Four other recessive variants (GSA covered) were coincidentally detected on 6 alleles; as these variants were non‐diagnostic, the respective figures are given in parentheses in the table and disregarded in the summary total.