| Literature DB >> 33081811 |
Juozas Gordevičius1, Milda Narmontė1, Povilas Gibas1, Kotryna Kvederavičiūtė1, Vita Tomkutė1, Priit Paluoja2,3, Kaarel Krjutškov2,4, Andres Salumets2,4,5,6, Edita Kriukienė7,8.
Abstract
BACKGROUND: Massively parallel sequencing of maternal cell-free DNA (cfDNA) is widely used to test fetal genetic abnormalities in non-invasive prenatal testing (NIPT). However, sequencing-based approaches are still of high cost. Building upon previous knowledge that placenta, the main source of fetal circulating DNA, is hypomethylated in comparison to maternal tissue counterparts of cfDNA, we propose that targeting either unmodified or 5-hydroxymethylated CG sites specifically enriches fetal genetic material and reduces numbers of required analytical sequencing reads thereby decreasing cost of a test.Entities:
Keywords: 5-Hydroxymethylcytosine; Cell-free DNA; Covalent labeling; DNA modification; Down syndrome; Fetal trisomy; NIPT
Year: 2020 PMID: 33081811 PMCID: PMC7574562 DOI: 10.1186/s13148-020-00938-x
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1uTOP-seq and hmTOP-seq analysis of CV and cfDNA samples reveals a contribution of CV tissue to maternal cfDNA. a Total CG-coverage in cfDNA and CV tissue sample groups for the uCG and 5hmCG signals. Total log2 transformed sequencing coverage of autosomes was computed for each sample and ANOVA was used to test for differences in distributions across sample groups. b CG-fraction in cfDNA and CV tissue sample groups for the uCG and 5hmCG data. Fraction of CGs covered by at least one read to the total number of CGs was computed for each sample and ANOVA was used to test for differences in distributions across sample groups. c Non-metric multidimensional scaling plots of genome-wide signal distribution using two dimensions for uCG and 5hmCG data (using samples after the outlier removal). d Odds ratio (OR) for enrichment of uCG and 5hmCG data across genomic elements (p < 1.6 × 10–10). The genome was divided into 1,000-bp windows and total coverage per sample was averaged across sample groups for each window. Fisher’s exact test was used to test whether the windows with the 10% of the strongest signal are enriched with particular genomic elements. CGI, CG island; lincRNA, long intergenic non-coding RNA
Fig. 2Correlation of the reference fetal fraction and SeqFF prediction from a uTOP-seq and b hmTOP-seq data indicates the enrichment of cffDNA in maternal cfDNA mixture. Dashed line indicates linear regression. hmTOP-seq samples of shallow sequencing (on average 2.5 million (M) raw reads) are indicated with red circles and were not used in the estimation of Pearson correlation
Fig. 3Detection of fetal trisomy T21 using CG-fraction. a Effect of fetal fraction on the Z-score indicating fetal karyotype. Normalized CG-fractions in chromosome 21 were used to compute Z-scores. 100% diagnostic accuracy of fetal trisomy T21 independent of fetal fraction was achieved. Dashed lines represent logistic regression fits. hmTOP-seq samples of shallow sequencing (2.5 million raw reads) are indicated with red circles. b Effect of reduced library size on classification accuracy. 100% diagnostic accuracy is achieved using 3 million or 1 million of processed sequencing reads of uTOP-seq and hmTOP-seq, respectively. Reads were randomly sampled and karyotype determined using leave-one-out cross-validation. In each cross-validation loop a logistic regression model was built with the Z-scores computed from the normalized CG-fraction within chromosome 21. Error bars indicate the standard deviation from mean AUC across 30 sampling iterations
Fig. 4Analysis of fetal-specific and T21-specific differentially modified regions. a Venn diagrams indicating the overlap of the pregnancy-specific and CV-specific DMRs that constitute the placenta-specific DMR sets for uCG and 5hmCG data. b Pearson correlation of the modification differences observed in the placenta-specific DMRs. Pregnancy-specific changes on the x-axis indicate modification differences between NPCs and cfDNA samples of pregnant women. CV-specific changes on the y-axis indicate modification differences between NPCs and CVS. c Enrichment of genomic elements for the placenta-specific and T21-specific DMRs using Fisher’s exact test. CGI, CG island; lincRNA, long intergenic non-coding RNA. d T21-specific DMRs exhibit higher CG-coverage and CG-fraction differences than non-differentially modified regions. logFC represents a log fold-change difference between T21-diagnosed pregnancies and healthy pregnancies
Fig. 5Detection of T21-specific CG-DMRs by qPCR. Relative quantification of individual T21-specific a uCG-DMRs and b 5hmCG-DMRs located on chromosome 21 using qPCR for replicated cfDNA samples of women pregnant with healthy or T21-diagnosed fetuses. The y-axis indicates the threshold cycle values (CT) calculated in qPCR for the regions selected from Table S4 whose genome coordinates are shown above the graphs. Numerical values of CT inversely correlate to the abundance of the DMR region, indicating higher abundance of the region in the cfDNA samples of pregnant females carrying a T21-diagnosed fetus. Only samples which showed CT values above the set threshold (CT values ≤ 34) were included in the graphs. P-values indicated above were calculated using t-test