| Literature DB >> 26019723 |
Akash Kumar1, Allison Ryan2, Jacob O Kitzman1, Nina Wemmer2, Matthew W Snyder1, Styrmir Sigurjonsson2, Choli Lee1, Milena Banjevic2, Paul W Zarutskie3, Alexandra P Lewis1, Jay Shendure1, Matthew Rabinowitz2.
Abstract
BACKGROUND: Preimplantation genetic diagnosis (PGD) enables profiling of embryos for genetic disorders prior to implantation. The majority of PGD testing is restricted in the scope of variants assayed or by the availability of extended family members. While recent advances in single cell sequencing show promise, they remain limited by bias in DNA amplification and the rapid turnaround time (<36 h) required for fresh embryo transfer. Here, we describe and validate a method for inferring the inherited whole genome sequence of an embryo for preimplantation genetic diagnosis (PGD).Entities:
Year: 2015 PMID: 26019723 PMCID: PMC4445980 DOI: 10.1186/s13073-015-0160-4
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Figure 1Experimental approach. (a) Parental genomes were haplotype-resolved using a combination of direct molecular methods and population-based phasing. Embryo biopsies were genotyped using a rapid SNP-array protocol and processed using information derived from parents and sibling embryos to infer a sparse set of ‘Parental Support’ (PS) genotypes. (b) Prediction of the embryo genome. Informative sites from PS embryo genotyping were combined with haplotype-resolved parental genome sequences to predict the whole genome sequence of the embryo.
Figure 2Results of embryo predictions. (a) Predicted transmission of parental haplotypes to the embryo after rapid array-based genotyping with ‘Parental Support’. A hidden Markov model incorporating parental phase and SNP array measurements of an embryo was used to obtain the probability of each parental haplotype being transmitted into each embryo. These states and state probabilities were used to determine which parental haplotypes composed the embryo genome and to define the location of meiotic recombination events. The example shown here is colored according to grandparental haplotype after incorporating SNP array measurements from grandparents within predictions. M: Maternal; P: Paternal. (b) Phasing of HBA1/HBA2 deletion in the embryo’s father via dilution pool sequencing. The deletion was identified using whole genome sequencing (WGS). The HBA1/HBA2 containing haplotype was determined by identifying multiple dilution pools that overlapped the deleted region (in blue). Transmission of this haplotype was predicted by comparing informative PS embryo genotyping sites (blue letters) with corresponding sites within parental haplotypes.
Summary of haplotype sources and the number of variants phased/predicted
|
|
|
|
|
| |
|---|---|---|---|---|---|
| ‘Parental Support’ embryo genotypes | Phased | 75,869 | 73,622 | 3.9% | 3.9% |
| Phased and predicted | 73,003 | 71,362 | 3.8% | 3.8% | |
| (+) Phasing by dilution pool sequencing | Phased | 1,829,314 | 1,751,004 | 94.2% | 92.4% |
| Phased and predicted | 1,672,932 | 1,589,836 | 86.2% | 83.9% | |
| (+) Phasing by population reference panel | Phased | 1,840,200 | 1,895,730 | 97.1% | 97.6% |
| Phased and predicted | 1,732,641 | 1,775,614 | 91.4% | 91.5% | |
| (+) 10× sequence from grandparents | Phased | 1,922,421 | 1,875,363 | 99.0% | 98.9% |
| Phased and predicted | 1,880,092 | 1,847,297 | 96.8% | 97.5% |
Sites determined by comparison with Illumina trio sequencing (including the offspring) to have poor genotype quality scores or genotypes that violated Mendelian inheritance were discarded for the purpose of evaluating accuracy.
Prediction accuracy for embryo genotypes
|
|
|
|
|
|
|---|---|---|---|---|
| Father | Heterozygous, predicted | Homozygous | 1,138,851 | 99.5% |
| Heterozygous, predicted | 561,745 | 99.5% | ||
| Heterozygous, unpredicted | 32,045 | NA | ||
| Heterozygous, unpredicted | All | 112,699 | NA | |
| Mother | Heterozygous, predicted | Homozygous | 1,179,791 | 99.5% |
| Heterozygous, unpredicted | 34,078 | NA | ||
| Heterozygous, unpredicted | Homozygous | 117,858 | NA | |
| Heterozygous, unpredicted | 16,018 | NA |
Transmission predictions for this analysis made use of PS embryo genotypes as well as molecular haplotyping and population-phasing of the parental genomes, but did not make use of grandparental genotypes. Accuracy defined as the percentage of transmitted alleles correctly predicted, out of all predicted sites. Sites determined by comparison with Illumina trio sequencing (including the offspring) to have poor genotype quality scores or genotypes that violated Mendelian inheritance were discarded for the purpose of evaluating accuracy. A total of 312,698 sites were omitted due to unsuccessful phasing in either parent (sum of categories above with ‘NA’ in accuracy column).
NA: not applicable.