| Literature DB >> 34932189 |
Marzena Wojtaszewska1, Rafał Stępień2, Alicja Woźna3, Maciej Piernik4, Pawel Sztromwasser5, Maciej Dąbrowski6, Michał Gniot7, Sławomir Szymański8, Maciej Socha9,10, Piotr Kasprzak11, Rafał Matkowski11,12, Paweł Zawadzki13.
Abstract
BACKGROUND ANDEntities:
Mesh:
Substances:
Year: 2021 PMID: 34932189 PMCID: PMC8766398 DOI: 10.1007/s40291-021-00571-1
Source DB: PubMed Journal: Mol Diagn Ther ISSN: 1177-1062 Impact factor: 4.074
Fig. 1Summary of the in-house pipeline used for data extraction and processing
Fig. 2Samples and data qualified for the study
Fig. 3Training set cross-validation accuracy comparison between three features used to determine human epidermal growth factor receptor 2 (HER2) amplification status in whole genome sequencing (WGS) data. Because of a class imbalance, the plot also includes a reference classifier assigning all samples to the majority class (DummyMostFrequent), which serves as a simple baseline
Analytical validation of the whole genome sequencing ploidy-corrected ERBB2 CN on internal hold-out and the external dataset
| Dataset | No. of samples | TP | TN | FP | FN | Sensitivity % (95% CI) | Specificity % (95% CI) | Accuracy (%) | PPV (%) | NPV (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| Holdout | 263 | 31 | 226 | 3 | 3 | 91.18% (76.32–98.14%) | 98.69% (96.22–99.73%) | 97.56 | 92.47 | 98.45 |
| External (Sanger) | 551 | 62 | 463 | 19 | 7 | 89.86% (80.21–95.82%) | 96.06% (93.91–97.61%) | 95.13 | 80.09 | 98.17 |
TP true positive, TN true negative, FP false positive, FN false negative, PPV positive predictive value, NPV negative predictive value
Fig. 4Principal component analysis of the dataset with six features: purity, ploidy, ERBB2 CN, CEP17 CN, ERBB2 CN/CEP17 CN ratio, and ploidy-corrected ERBB2 CN
Fig. 5Wilcoxon test comparison of means between distributions of accuracies in: A high vs low coverage data (threshold ×49), B high vs low ploidy data (threshold 3), and C high and low purity data (threshold 0.6)
| Whole genome sequencing is a reliable method for clinical human epidermal growth factor receptor 2 status assessment. |
| The ploidy-corrected copy number value is the most accurate biomarker for genetic testing with high concordance to gold standard immunohistochemistry and fluorescence in situ hybridization. |
| Short-read whole genome sequencing for human epidermal growth factor receptor 2 assessment is consistent across different platforms and wet lab protocols. |