| Literature DB >> 33510858 |
Tom Rabinowitz1, Shira Deri-Rozov1, Noam Shomron1.
Abstract
The technology of noninvasive prenatal testing (NIPT) enables risk-free detection of genetic conditions in the fetus, by analysis of cell-free DNA (cfDNA) in maternal blood. For chromosomal abnormalities, NIPT often effectively replaces invasive tests (e.g. amniocentesis), although it is considered as screening rather than diagnostics. Most recently, the NIPT has been applied to genome-wide, comprehensive genotyping of the fetus using cfDNA, i.e. identifying all its genetic variants and mutations. Previously, we suggested that NIPD should be treated as a special case of variant calling, and presented Hoobari, the first software tool for noninvasive fetal variant calling. Using a unique pipeline, we were able to comprehensively decipher the inheritance of SNPs and indels. A few caveats still exist in this pipeline. Performance was lower for indels and biparental loci (i.e. where both parents carry the same mutation), and performance was not uniform across the genome. Here we utilized standardized methods for benchmarking of variant calling pipelines and applied them to noninvasive fetal variant calling. By using the best performing pipeline and by focusing on coding regions, we showed that noninvasive fetal genotyping greatly improves performance, particularly in indels and biparental loci. These results emphasize the importance of using widely accepted concepts to describe the challenge of genome-wide NIPT of point mutations; and demonstrate a benchmarking process for the first time in this field. This study brings genome-wide and complete NIPD closer to the clinic; while potentially alleviating uncertainty and anxiety during pregnancy, and promoting informed choices among families and physicians.Entities:
Keywords: NIPD; NIPT; Noninvasive prenatal diagnosis; Variant calling; cell-free DNA; cfDNA
Year: 2020 PMID: 33510858 PMCID: PMC7809098 DOI: 10.1016/j.csbj.2020.12.032
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Experimental workflow for comparing fetal genotyping pipelines. This is the workflow for comparing pipelines for noninvasive fetal variant callers. Twelve pipelines were tested, based on possible combinations of two read aligners, three post-alignment approaches, and two variant calling programs. Each pipeline begins with raw sequence FASTQ files of the parents, the true fetal sample, and cfDNA. Alignment is performed using either BWA-MEM or Bowtie2 (Blue). Duplicate reads are removed using Samblaster. Non-exonic regions are sliced out using BEDTools pairToBed, by keeping only read pairs that have at least one read covering an exonic region. ABRA and GATK3 IndelRealign are compared against avoidance of any realignment or reassembly (Green). FreeBayes and GATK4 are used for variant calling of the parents and the true fetal samples, and for preprocessing of the cfDNA sample (Red). Eventually, FreeBayes- or GATK4-comparitble Hoobari is run to call fetal variants using the cfDNA reads and the parental genotypes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
A summary of the samples used in this study.
| Family | Individual | Sample | Depth of coverage | Fetal fraction |
|---|---|---|---|---|
| G1 | Mother | White blood cells | 40 | |
| Plasma (38 weeks2) | 270 | 30.2% | ||
| Father | White blood cells | 45 | ||
| Offspring | Umbilical cord blood | 50 | ||
| G2 | Mother | White blood cells | 40 | |
| Plasma (18 weeks) | 195 | 23.2% | ||
| Father | White blood cells | 60 | ||
| Offspring | Placental tissues | 60 | ||
| G3 | Mother | White blood cells | 38 | |
| Plasma (11 weeks) | 310 | 18.5% | ||
| Father | White blood cells | 41 | ||
| Offspring | Chorionic villus sampling | 38 | ||
Median, on target; 2 Gestational age.
Comparison of noninvasive fetal variant calling pipelines in family G1 – SNPs.
| Pipeline | F1_Score | Recall | Precision | TOTAL | TP | FN | FP | FP.gt | FP.al |
|---|---|---|---|---|---|---|---|---|---|
| bwa-none-freebayes | 0.991090 | 0.986208 | 0.996020 | 22,332 | 22,024 | 308 | 88 | 68 | 0 |
| bwa-abra-freebayes | 0.990910 | 0.986117 | 0.995749 | 22,330 | 22,020 | 310 | 94 | 75 | 0 |
| bwa-gatk-freebayes | |||||||||
| bwa-none-gatk | 0.988222 | 0.981314 | 0.995228 | 22,316 | 21,899 | 417 | 105 | 71 | 0 |
| bwa-abra-gatk | 0.988245 | 0.981447 | 0.995138 | 22,315 | 21,901 | 414 | 107 | 72 | 0 |
| bwa-gatk-gatk | 0.988063 | 0.981397 | 0.994820 | 22,308 | 21,893 | 415 | 114 | 72 | 0 |
| bowtie-none-freebayes | 0.943192 | 0.898413 | 0.992669 | 22,306 | 20,040 | 2266 | 148 | 105 | 0 |
| bowtie-abra-freebayes | 0.989173 | 0.985115 | 0.993265 | 22,305 | 21,973 | 332 | 149 | 106 | 0 |
| bowtie-gatk-freebayes | 0.942454 | 0.897206 | 0.992508 | 22,297 | 20,005 | 2292 | 151 | 101 | 0 |
| bowtie-none-gatk | 0.974657 | 0.956364 | 0.993663 | 22,298 | 21,325 | 973 | 136 | 88 | 0 |
| bowtie-abra-gatk | 0.974633 | 0.956319 | 0.993663 | 22,298 | 21,324 | 974 | 136 | 88 | 0 |
| bowtie-gatk-gatk | 0.974679 | 0.956407 | 0.993663 | 22,297 | 21,325 | 972 | 136 | 87 | 0 |
Fig. 2Comparison of fetal genotyping pipelines stratified by inheritance in family G1.
Comparison of noninvasive fetal variant calling pipelines in family G1 – Indels.
| Pipeline | F1_Score | Recall | Precision | TOTAL | TP | FN | FP | FP.gt | FP.al |
|---|---|---|---|---|---|---|---|---|---|
| bwa-none-freebayes | 0.722148 | 0.656098 | 0.802985 | 410 | 269 | 141 | 66 | 62 | 0 |
| bwa-abra-freebayes | 0.843710 | 0.805825 | 0.885333 | 412 | 332 | 80 | 43 | 41 | 0 |
| bwa-gatk-freebayes | 0.749669 | 0.695332 | 0.813218 | 407 | 283 | 124 | 65 | 58 | 0 |
| bwa-none-gatk | 0.961586 | 0.934940 | 0.989796 | 415 | 388 | 27 | 4 | 4 | 0 |
| bwa-gatk-gatk | 0.962779 | 0.934940 | 0.992327 | 415 | 388 | 27 | 3 | 3 | 0 |
| bowtie-none-freebayes | 0.736698 | 0.658537 | 0.835913 | 410 | 270 | 140 | 53 | 49 | 0 |
| bowtie-abra-freebayes | 0.864385 | 0.827670 | 0.904509 | 412 | 341 | 71 | 36 | 34 | 0 |
| bowtie-gatk-freebayes | 0.805851 | 0.737226 | 0.888563 | 411 | 303 | 108 | 38 | 35 | 0 |
| bowtie-none-gatk | 0.940881 | 0.901205 | 0.984211 | 415 | 374 | 41 | 6 | 6 | 0 |
| bowtie-abra-gatk | 0.940881 | 0.901205 | 0.984211 | 415 | 374 | 41 | 6 | 6 | 0 |
| bowtie-gatk-gatk | 0.944862 | 0.908434 | 0.984334 | 415 | 377 | 38 | 6 | 6 | 0 |
Fig. 3Comparison of fetal genotyping pipelines – transitions/transversions and heterozygous/alternate-homozygous ratios in family G1.
Definitions for TPs, FPs, and FNs by the GA4GH Benchmarking Team.
| Truth (fetal sample) | ||||
|---|---|---|---|---|
| Genotype | homref | het2 | homalt3 | |
| Query (cfDNA) | homref | n/a | FN | FN |
| het | FP | TP | FP.GT | |
| homalt | FP | FP.GT | TP | |
homozygous to the reference allele; 2 heterozygous; 3homozygous to the alternate allele TPs: true positives; FP: false positives; FN: false positives.