| Literature DB >> 32930094 |
Charles L Bormann1,2, Manoj Kumar Kanakasabapathy3, Prudhvi Thirumalaraju3, Raghav Gupta3, Rohan Pooniwala3, Hemanth Kandula3, Eduardo Hariton1, Irene Souter1,2, Irene Dimitriadis1,2, Leslie B Ramirez4, Carol L Curchoe5,6, Jason Swain6, Lynn M Boehnlein7, Hadi Shafiee2,3.
Abstract
Deep learning in in vitro fertilization is currently being evaluated in the development of assistive tools for the determination of transfer order and implantation potential using time-lapse data collected through expensive imaging hardware. Assistive tools and algorithms that can work with static images, however, can help in improving the access to care by enabling their use with images acquired from traditional microscopes that are available to virtually all fertility centers. Here, we evaluated the use of a deep convolutional neural network (CNN), trained using single timepoint images of embryos collected at 113 hr post-insemination, in embryo selection amongst 97 clinical patient cohorts (742 embryos) and observed an accuracy of 90% in choosing the highest quality embryo available. Furthermore, a CNN trained to assess an embryo's implantation potential directly using a set of 97 euploid embryos capable of implantation outperformed 15 trained embryologists (75.26% vs. 67.35%, p<0.0001) from five different fertility centers.Entities:
Keywords: blastocysts; convolutional neural networks; embryology; euploid embryos; human; human embryos; in - vitro fertilization; medicine
Mesh:
Year: 2020 PMID: 32930094 PMCID: PMC7527234 DOI: 10.7554/eLife.55301
Source DB: PubMed Journal: Elife ISSN: 2050-084X Impact factor: 8.140
Figure 1.Classification and selection of embryos at 113 hpi.
The schematic shows neural networks that classify, and rank order embryos based on their morphological quality (network A) and classify embryos based on the implantation potential (network B). The two networks share a common Xception architecture but the classification layers are specific to each task. Network A also uses a genetic algorithm that helps in generating embryo scores by using the softmax output of the network with weights generated by the algorithm during training. Embryo(s) with the highest scores are evaluated for single embryo and double embryo transfer scenarios using the retrospective test set. The implantation potential is given by the softmax output of the neural network.
Figure 2—figure supplement 1.Confusion matrix of the network in classifying embryos based on their morphological quality.
The matrix provides the network’s confusion between the five training classes. The dotted lines represent the separation between non-blastocysts (classes 1 and 2) and blastocysts (classes 3, 4, and 5). The reported accuracy is the binary classification performance accuracy of the CNN in differentiating between the two inference classes (non-blastocysts and blastocysts).
Figure 2.Classification and selection of embryos at 113 hpi.
(A) The performance in single embryo selections by embryologists and the algorithm in selecting blastocysts using embryo morphologies obtained at 113 hpi from 97 patient cohorts. (B) The performance in double embryo selections by the two groups in selecting blastocysts (n = 97 patient cohorts). (C) The performance in single embryo selections by the two groups in selecting the highest quality blastocysts (n = 97 patient cohorts). (D) The performance of the two groups in selecting the highest quality blastocysts when two selections were provided (n = 97 patient cohorts).
The matrix provides the network’s confusion between the five training classes. The dotted lines represent the separation between non-blastocysts (classes 1 and 2) and blastocysts (classes 3, 4, and 5). The reported accuracy is the binary classification performance accuracy of the CNN in differentiating between the two inference classes (non-blastocysts and blastocysts).
Figure 3.Performance in identifying embryos based on implantation outcomes.
(A) The performance of the neural network system in identifying embryos that implanted compared to the baseline historical implantation for the image set (n = 29). The error-bar represents the Clopper-Pearson exact binomial 95% confidence interval. (B) The performance of the neural network system in identifying euploid embryos that implanted compared to the performance of 15 embryologists in identifying implanting embryos (n = 97). The error-bar represents the 95% confidence interval of the embryologists’ performance in identifying implanting embryos.
The scatter plot illustrates the implantation potential of the euploid embryos evaluated in this study as measured by the neural network (n = 97). The ground truth represents actual clinical transfer outcomes.
Figure 3—figure supplement 1.Implantation potential and the relative implantation rates using the euploid embryo test set.
The scatter plot illustrates the implantation potential of the euploid embryos evaluated in this study as measured by the neural network (n = 97). The ground truth represents actual clinical transfer outcomes.