1
|
AlSaad R, Abusarhan L, Odeh N, Abd-alrazaq A, Choucair F, Zegour R, Ahmed A, Aziz S, Sheikh J. Deep learning applications for human embryo assessment using time-lapse imaging: scoping review. FRONTIERS IN REPRODUCTIVE HEALTH 2025; 7:1549642. [PMID: 40264925 PMCID: PMC12011738 DOI: 10.3389/frph.2025.1549642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2024] [Accepted: 03/13/2025] [Indexed: 04/24/2025] Open
Abstract
Background The integration of deep learning (DL) and time-lapse imaging technologies offers new possibilities for improving embryo assessment and selection in clinical in vitro Fertilization (IVF). Objectives This scoping review aims to explore the range of deep learning model applications in the evaluation and selection of embryos monitored through time-lapse imaging systems. Methods A total of 6 electronic databases (Scopus, MEDLINE, EMBASE, ACM Digital Library, IEEE Xplore, and Google Scholar) were searched for peer-reviewed literature published before May 2024. We adhered to the PRISMA guidelines for reporting scoping reviews. Results Out of the 773 articles reviewed, 77 met the inclusion criteria. Over the past four years, the use of DL in embryo analysis has increased rapidly. The primary applications of DL in the reviewed studies included predicting embryo development and quality (61%, n = 47) and forecasting clinical outcomes, such as pregnancy and implantation (35%, n = 27). The number of embryos involved in the studies exhibited significant variation, with a mean of 10,485 (SD = 35,593) and a range from 20 to 249,635 embryos. A variety of data types have been used, namely images of blastocyst-stage embryos (47%, n = 36), followed by combined images of cleavage and blastocyst stages (23%, n = 18). Most of the studies did not provide maternal age details (82%, n = 63). Convolutional neural networks (CNNs) were the predominant deep learning architecture used, accounting for 81% (n = 62) of the studies. All studies utilized time-lapse video images (100%) as training data, while some also incorporated demographics, clinical and reproductive histories, and IVF cycle parameters. Most studies utilized accuracy as the discriminative measure (58%, n = 45). Conclusion Our results highlight the diverse applications and potential of deep learning in clinical IVF and suggest directions for future advancements in embryo evaluation and selection techniques.
Collapse
Affiliation(s)
- Rawan AlSaad
- AI Center for Precision Health, Weill Cornell Medicine-Qatar, Doha, Qatar
| | - Leen Abusarhan
- Faculty of Medicine, Hashemite University, Zarqa, Jordan
| | - Nour Odeh
- Faculty of Medicine, Hashemite University, Zarqa, Jordan
| | - Alaa Abd-alrazaq
- AI Center for Precision Health, Weill Cornell Medicine-Qatar, Doha, Qatar
| | - Fadi Choucair
- Reproductive Medicine Unit, Sidra Medicine, Doha, Qatar
| | - Rachida Zegour
- Faculty of Exact Sciences, University of Bejaia, Bejaia, Algeria
| | - Arfan Ahmed
- AI Center for Precision Health, Weill Cornell Medicine-Qatar, Doha, Qatar
| | - Sarah Aziz
- AI Center for Precision Health, Weill Cornell Medicine-Qatar, Doha, Qatar
| | - Javaid Sheikh
- AI Center for Precision Health, Weill Cornell Medicine-Qatar, Doha, Qatar
| |
Collapse
|
2
|
Conversa L, Bori L, Insua F, Marqueño S, Cobo A, Meseguer M. Testing an artificial intelligence algorithm to predict fetal heartbeat of vitrified-warmed blastocysts from a single image: predictive ability in different settings. Hum Reprod 2024; 39:2240-2248. [PMID: 39173597 DOI: 10.1093/humrep/deae178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 07/01/2024] [Indexed: 08/24/2024] Open
Abstract
STUDY QUESTION Could an artificial intelligence (AI) algorithm predict fetal heartbeat from images of vitrified-warmed embryos? SUMMARY ANSWER Applying AI to vitrified-warmed blastocysts may help predict which ones will result in implantation failure early enough to thaw another. WHAT IS KNOWN ALREADY The application of AI in the field of embryology has already proven effective in assessing the quality of fresh embryos. Therefore, it could also be useful to predict the outcome of frozen embryo transfers, some of which do not recover their pre-vitrification volume, collapse, or degenerate after warming without prior evidence. STUDY DESIGN, SIZE, DURATION This retrospective cohort study included 1109 embryos from 792 patients. Of these, 568 were vitrified blastocysts cultured in time-lapse systems in the period between warming and transfer, from February 2022 to July 2023. The other 541 were fresh-transferred blastocysts serving as controls. PARTICIPANTS/MATERIALS, SETTING, METHODS Four types of time-lapse images were collected: last frame of development of 541 fresh-transferred blastocysts (FTi), last frame of 467 blastocysts to be vitrified (PVi), first frame post-warming of 568 vitrified embryos (PW1i), and last frame post-warming of 568 vitrified embryos (PW2i). After providing the images to the AI algorithm, the returned scores were compared with the conventional morphology and fetal heartbeat outcomes of the transferred embryos (n = 1098). The contribution of the AI score to fetal heartbeat was analyzed by multivariate logistic regression in different patient populations, and the predictive ability of the models was measured by calculating the area under the receiver-operating characteristic curve (ROC-AUC). MAIN RESULTS AND THE ROLE OF CHANCE Fetal heartbeat rate was related to AI score from FTi (P < 0.001), PW1i (P < 0.05), and PW2i (P < 0.001) images. The contribution of AI score to fetal heartbeat was significant in the oocyte donation program for PW2i (odds ratio (OR)=1.13; 95% CI [1.04-1.23]; P < 0.01), and in cycles with autologous oocytes for PW1i (OR = 1.18; 95% CI [1.01-1.38]; P < 0.05) and PW2i (OR = 1.15; 95% CI [1.02-1.30]; P < 0.05), but was not significantly associated with fetal heartbeat in genetically analyzed embryos. AI scores from the four groups of images varied according to morphological category (P < 0.001). The PW2i score differed in collapsed, non-re-expanded, or non-viable embryos compared to normal/viable embryos (P < 0.001). The predictability of the AI score was optimal at a post-warming incubation time of 3.3-4 h (AUC = 0.673). LIMITATIONS, REASONS FOR CAUTION The algorithm was designed to assess fresh embryos prior to vitrification, but not thawed ones, so this study should be considered an external trial. WIDER IMPLICATIONS OF THE FINDINGS The application of predictive software in the management of frozen embryo transfers may be a useful tool for embryologists, reducing the cancellation rates of cycles in which the blastocyst does not recover from vitrification. Specifically, the algorithm tested in this research could be used to evaluate thawed embryos both in clinics with time-lapse systems and in those with conventional incubators only, as just a single photo is required. STUDY FUNDING/COMPETING INTERESTS This study was supported by the Regional Ministry of Innovation, Universities, Science and Digital Society of the Valencian Community (CIACIF/2021/019) and by Instituto de Salud Carlos III (PI21/00283), and co-funded by European Union (ERDF, 'A way to make Europe'). M.M. received personal fees in the last 5 years as honoraria for lectures from Merck, Vitrolife, MSD, Ferring, AIVF, Theramex, Gedeon Richter, Genea Biomedx, and Life Whisperer. There are no other competing interests. TRIAL REGISTRATION NUMBER N/A.
Collapse
Affiliation(s)
- L Conversa
- IVIRMA Global Research Alliance-IVI Valencia, IVF Laboratory, Valencia, Spain
- IVIRMA Global Research Alliance-IVI Foundation, Health Research Institute La Fe, Reproductive Medicine, Valencia, Spain
| | - L Bori
- IVIRMA Global Research Alliance-IVI Valencia, IVF Laboratory, Valencia, Spain
- IVIRMA Global Research Alliance-IVI Foundation, Health Research Institute La Fe, Reproductive Medicine, Valencia, Spain
| | - F Insua
- IVIRMA Global Research Alliance-IVI Valencia, IVF Laboratory, Valencia, Spain
| | - S Marqueño
- IVIRMA Global Research Alliance-IVI Valencia, IVF Laboratory, Valencia, Spain
| | - A Cobo
- IVIRMA Global Research Alliance-IVI Valencia, IVF Laboratory, Valencia, Spain
- IVIRMA Global Research Alliance-IVI Foundation, Health Research Institute La Fe, Reproductive Medicine, Valencia, Spain
| | - M Meseguer
- IVIRMA Global Research Alliance-IVI Valencia, IVF Laboratory, Valencia, Spain
- IVIRMA Global Research Alliance-IVI Foundation, Health Research Institute La Fe, Reproductive Medicine, Valencia, Spain
| |
Collapse
|
3
|
Goswami N, Winston N, Choi W, Lai NZE, Arcanjo RB, Chen X, Sobh N, Nowak RA, Anastasio MA, Popescu G. EVATOM: an optical, label-free, machine learning assisted embryo health assessment tool. Commun Biol 2024; 7:268. [PMID: 38443460 PMCID: PMC10915136 DOI: 10.1038/s42003-024-05960-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 02/22/2024] [Indexed: 03/07/2024] Open
Abstract
The combination of a good quality embryo and proper maternal health factors promise higher chances of a successful in vitro fertilization (IVF) procedure leading to clinical pregnancy and live birth. Of these two factors, selection of a good embryo is a controllable aspect. The current gold standard in clinical practice is visual assessment of an embryo based on its morphological appearance by trained embryologists. More recently, machine learning has been incorporated into embryo selection "packages". Here, we report EVATOM: a machine-learning assisted embryo health assessment tool utilizing an optical quantitative phase imaging technique called artificial confocal microscopy (ACM). We present a label-free nucleus detection method with, to the best of our knowledge, novel quantitative embryo health biomarkers. Two viability assessment models are presented for grading embryos into two classes: healthy/intermediate (H/I) or sick (S) class. The models achieve a weighted F1 score of 1.0 and 0.99 respectively on the in-distribution test set of 72 fixed embryos and a weighted F1 score of 0.9 and 0.95 respectively on the out-of-distribution test dataset of 19 time-instances from 8 live embryos.
Collapse
Affiliation(s)
- Neha Goswami
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA.
- Beckman Institute of Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA.
| | - Nicola Winston
- Division of Reproductive Endocrinology and Infertility, Department of Obstetrics and Gynecology, University of Illinois at Chicago College of Medicine, Chicago, IL, 60612, USA
| | - Wonho Choi
- Department of Animal Sciences, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
| | - Nastasia Z E Lai
- Department of Animal Sciences, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
| | - Rachel B Arcanjo
- Department of Animal Sciences, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Department of Animal Science, University of California, Davis, CA, 95616, USA
| | - Xi Chen
- Beckman Institute of Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- School of Applied and Engineering Physics, Cornell University, Ithaca, NY, 14850, USA
| | - Nahil Sobh
- NCSA Center for Artificial Intelligence Innovation, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
| | - Romana A Nowak
- Department of Animal Sciences, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA.
| | - Mark A Anastasio
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA.
- Beckman Institute of Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA.
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA.
| | - Gabriel Popescu
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Beckman Institute of Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
| |
Collapse
|
4
|
Mensing LC, Eliasen TU, Johansen MN, Berntsen J, Montag M, Iversen LH, Gabrielsen A. Using blastocyst re-expansion rate for deciding when to warm a new blastocyst for single vitrified-warmed blastocyst transfer. Reprod Biomed Online 2023; 47:103378. [PMID: 37862858 DOI: 10.1016/j.rbmo.2023.103378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 08/09/2023] [Accepted: 08/29/2023] [Indexed: 10/22/2023]
Abstract
RESEARCH QUESTION Can predictive post-warm parameters that support the decision to transfer a warmed blastocyst or to warm another blastocyst be identified in women with multiple frozen-vitrified blastocysts? DESIGN Retrospective single-centre observational cohort analysis. A total of 1092 single vitrified-warmed blastocyst transfers (SVBT) with known Gardner score, maternal age and live birth were used to develop live birth prediction models based on logistic regression, including post-warm re-expansion parameters. Time-lapse incubation was used for pre-vitrification and post-warm embryo culture. A dataset of 558 SVBT with the same inclusion criteria was used to validate the model, but with known clinical pregnancy outcome instead of live birth outcome. RESULTS Three different logistic regression models were developed for predicting live birth based on post-warm blastocyst re-expansion. Different post-warm assessment times indicated that a 2-h post-warm culture period was optimal for live birth prediction (model 1). Adjusting for pre-vitrification Gardner score (model 2) and in combination with maternal age (model 3) further increased predictability (area under the curve [AUC] = 0.623, 0.633, 0.666, respectively). Model validation gave an AUC of 0.617, 0.609 and 0.624, respectively. The false negative rate and true negative rate for model 3 were 2.0 and 10.1 in the development dataset and 3.5 and 8.0 in the validation dataset. CONCLUSIONS Clinical application of a simple model based on 2 h of post-warm re-expansion data, pre-vitrification Gardner score and maternal age can support a standardized approach for deciding if warming another blastocyst may increase the likelihood of live birth in SVBT.
Collapse
|
5
|
Berman A, Anteby R, Efros O, Klang E, Soffer S. Deep learning for embryo evaluation using time-lapse: a systematic review of diagnostic test accuracy. Am J Obstet Gynecol 2023; 229:490-501. [PMID: 37116822 DOI: 10.1016/j.ajog.2023.04.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 03/28/2023] [Accepted: 04/19/2023] [Indexed: 04/30/2023]
Abstract
OBJECTIVE This study aimed to investigate the accuracy of convolutional neural network models in the assessment of embryos using time-lapse monitoring. DATA SOURCES A systematic search was conducted in PubMed and Web of Science databases from January 2016 to December 2022. The search strategy was carried out by using key words and MeSH (Medical Subject Headings) terms. STUDY ELIGIBILITY CRITERIA Studies were included if they reported the accuracy of convolutional neural network models for embryo evaluation using time-lapse monitoring. The review was registered with PROSPERO (International Prospective Register of Systematic Reviews; identification number CRD42021275916). METHODS Two reviewer authors independently screened results using the Covidence systematic review software. The full-text articles were reviewed when studies met the inclusion criteria or in any uncertainty. Nonconsensus was resolved by a third reviewer. Risk of bias and applicability were evaluated using the QUADAS-2 tool and the modified Joanna Briggs Institute or JBI checklist. RESULTS Following a systematic search of the literature, 22 studies were identified as eligible for inclusion. All studies were retrospective. A total of 522,516 images of 222,998 embryos were analyzed. Three main outcomes were evaluated: successful in vitro fertilization, blastocyst stage classification, and blastocyst quality. Most studies reported >80% accuracy, and embryologists were outperformed in some. Ten studies had a high risk of bias, mostly because of patient bias. CONCLUSION The application of artificial intelligence in time-lapse monitoring has the potential to provide more efficient, accurate, and objective embryo evaluation. Models that examined blastocyst stage classification showed the best predictions. Models that predicted live birth had a low risk of bias, used the largest databases, and had external validation, which heightens their relevance to clinical application. Our systematic review is limited by the high heterogeneity among the included studies. Researchers should share databases and standardize reporting.
Collapse
Affiliation(s)
- Aya Berman
- Faculty of Health Sciences, Ben-Gurion University of the Negev, Be'er Sheva, Israel.
| | - Roi Anteby
- Department of Surgery and Transplantation B, Chaim Sheba Medical Center, Tel Hashomer, Israel; Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Orly Efros
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel; National Hemophilia Center and Institute of Thrombosis & Hemostasis, Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Eyal Klang
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel; Institute for Health Care Delivery Science, Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY; Deep Vision Lab, Chaim Sheba Medical Center, Ramat Gan, Israel; Division of Diagnostic Imaging, Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Shelly Soffer
- Deep Vision Lab, Chaim Sheba Medical Center, Ramat Gan, Israel; Internal Medicine B, Assuta Medical Center, Ashdod, Israel; Ben-Gurion University of the Negev, Be'er Sheva, Israel
| |
Collapse
|
6
|
Goswami N, Winston N, Choi W, Lai NZE, Arcanjo RB, Chen X, Sobh N, Nowak RA, Anastasio MA, Popescu G. Machine learning assisted health viability assay for mouse embryos with artificial confocal microscopy (ACM). BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.30.550591. [PMID: 37547014 PMCID: PMC10402120 DOI: 10.1101/2023.07.30.550591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
The combination of a good quality embryo and proper maternal health factors promise higher chances of a successful in vitro fertilization (IVF) procedure leading to clinical pregnancy and live birth. Of these two factors, selection of a good embryo is a controllable aspect. The current gold standard in clinical practice is visual assessment of an embryo based on its morphological appearance by trained embryologists. More recently, machine learning has been incorporated into embryo selection "packages". Here, we report a machine-learning assisted embryo health assessment tool utilizing a quantitative phase imaging technique called artificial confocal microscopy (ACM). We present a label-free nucleus detection method with novel quantitative embryo health biomarkers. Two viability assessment models are presented for grading embryos into two classes: healthy/intermediate (H/I) or sick (S) class. The models achieve a weighted F1 score of 1.0 and 0.99 respectively on the in-distribution test set of 72 fixed embryos and a weighted F1 score of 0.9 and 0.95 respectively on the out-of-distribution test dataset of 19 time-instances from 8 live embryos.
Collapse
|