| Literature DB >> 24931986 |
Ladislav Rampášek1, Aryan Arbabi1, Michael Brudno2.
Abstract
MOTIVATION: The past several years have seen the development of methodologies to identify genomic variation within a fetus through the non-invasive sequencing of maternal blood plasma. These methods are based on the observation that maternal plasma contains a fraction of DNA (typically 5-15%) originating from the fetus, and such methodologies have already been used for the detection of whole-chromosome events (aneuploidies), and to a more limited extent for smaller (typically several megabases long) copy number variants (CNVs).Entities:
Mesh:
Substances:
Year: 2014 PMID: 24931986 PMCID: PMC4058944 DOI: 10.1093/bioinformatics/btu292
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Distribution of fragments per kilobase of chromosome 1 per million fragments (FPKM) in 1 megabase segments for plasma sample (blue) and maternal sample (red) of the I1 trio
Fig. 2.(a) A scatterplot demonstrating the correlation of WRV between plasma samples of I1 and G1 trios. The shown WRVs were computed for windows of size 1 kb in chromosome 1. (b) Histogram of absolute errors between WRVs from different samples, comparing distribution of absolute error between plasma samples of I1 and G1 trios (red), and between plasma sample and maternal sample of I1 trio (blue). There is a notably heavier tail in case of plasma to maternal sample error distribution, composed of windows with weak WRV correspondence—an artifact of wider coverage distribution in plasma cfDNA sample compared to standard WGS maternal sample (Fig. 1). This artifact causes plasma to maternal sample WRV comparison to have higher mean absolute error (0.000521, compared with 0.000347 for plasma I1 to plasma G1) even though they are from the same trio
Fig. 3.HMM used for CNV inference. (a) High-level architecture of the HMM with five sets of states corresponding to five types of fetal inheritance. Note, we do not allow two CNVs to be adjacent; thus, switching between two CNVs always has to go through a normal inheritance state. Edges in (a) represent edges coming in/out of all states between two sets of states. (b–d) Correspond to the diagram of states of the HMM within the normal inheritance, maternal duplication and maternal deletion states of (a). Paternal duplications/deletions are analogous to (c) and (d). Inner edges in (b–d) serve to model errors in phasing or recombination events
Summary of mother–father–child trio I1 sequencing data [Courtesy of Kitzman ]
| Individual | Sample | DOC |
|---|---|---|
| Mother (I1-M) | Plasma (5 ml, gestational age 18.5 weeks) | 78 |
| Whole blood (<1 ml) | 32 | |
| Father (I1-P) | Saliva | 39 |
| Child (I1-C) | Cord blood at delivery | 40 |
Summary of recall on test set composed of 360 in silico simulated CNVs in I1 maternal plasma samples with 13 and 10% fetal admixture ratio
Note: The ‘ratios only’ column corresponds to the method that only uses allelic ratios, but not the coverage prior. In such cases both the precision and recall are mostly dominated by the model combining both signals. (We write ‘NA’ in a precision field if no call of such CNV category was predicted by the model).
Summary of results obtained by an HMM using only WRV signal
Note: The same test set composed of 360 in silico simulated CNVs was used as in Table 2. We ran the model with 100, and 300 kb bin sizes. (We write ‘NA’ in a precision field if no call of such CNV category was predicted by the model).
In silico recall and number of CNVs of various sizes generated in a genome-wide run
Note: For each CNV size, we also show (in parenthesis) the number of calls that are from at least 50% overlapped by CNVnator (Abyzov ) calls on the fetal, maternal and paternal genomes, respectively.