1
|
Lee T, Natalwala J, Chapple V, Liu Y. A brief history of artificial intelligence embryo selection: from black-box to glass-box. Hum Reprod 2024; 39:285-292. [PMID: 38061074 PMCID: PMC11016335 DOI: 10.1093/humrep/dead254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Revised: 11/21/2023] [Indexed: 02/02/2024] Open
Abstract
With the exponential growth of computing power and accumulation of embryo image data in recent years, artificial intelligence (AI) is starting to be utilized in embryo selection in IVF. Amongst different AI technologies, machine learning (ML) has the potential to reduce operator-related subjectivity in embryo selection while saving labor time on this task. However, as modern deep learning (DL) techniques, a subcategory of ML, are increasingly used, its integrated black-box attracts growing concern owing to the well-recognized issues regarding lack of interpretability. Currently, there is a lack of randomized controlled trials to confirm the effectiveness of such black-box models. Recently, emerging evidence has shown underperformance of black-box models compared to the more interpretable traditional ML models in embryo selection. Meanwhile, glass-box AI, such as interpretable ML, is being increasingly promoted across a wide range of fields and is supported by its ethical advantages and technical feasibility. In this review, we propose a novel classification system for traditional and AI-driven systems from an embryology standpoint, defining different morphology-based selection approaches with an emphasis on subjectivity, explainability, and interpretability.
Collapse
Affiliation(s)
- Tammy Lee
- Fertility North, Joondalup Private Hospital, Joondalup, Western Australia, Australia
- School of Human Sciences, University of Western Australia, Crawley, Western Australia, Australia
| | - Jay Natalwala
- Fertility North, Joondalup Private Hospital, Joondalup, Western Australia, Australia
| | - Vincent Chapple
- Fertility North, Joondalup Private Hospital, Joondalup, Western Australia, Australia
| | - Yanhe Liu
- Fertility North, Joondalup Private Hospital, Joondalup, Western Australia, Australia
- School of Human Sciences, University of Western Australia, Crawley, Western Australia, Australia
- Faculty of Health Sciences and Medicine, Bond University, Robina, Queensland, Australia
- School of Medical and Health Sciences, Edith Cowan University, Joondalup, Western Australia, Australia
| |
Collapse
|
2
|
Duval A, Nogueira D, Dissler N, Maskani Filali M, Delestro Matos F, Chansel-Debordeaux L, Ferrer-Buitrago M, Ferrer E, Antequera V, Ruiz-Jorro M, Papaxanthos A, Ouchchane H, Keppi B, Prima PY, Regnier-Vigouroux G, Trebesses L, Geoffroy-Siraudin C, Zaragoza S, Scalici E, Sanguinet P, Cassagnard N, Ozanon C, De La Fuente A, Gómez E, Gervoise Boyer M, Boyer P, Ricciarelli E, Pollet-Villard X, Boussommier-Calleja A. A hybrid artificial intelligence model leverages multi-centric clinical data to improve fetal heart rate pregnancy prediction across time-lapse systems. Hum Reprod 2023; 38:596-608. [PMID: 36763673 PMCID: PMC10068266 DOI: 10.1093/humrep/dead023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 01/10/2023] [Indexed: 02/12/2023] Open
Abstract
STUDY QUESTION Can artificial intelligence (AI) algorithms developed to assist embryologists in evaluating embryo morphokinetics be enriched with multi-centric clinical data to better predict clinical pregnancy outcome? SUMMARY ANSWER Training algorithms on multi-centric clinical data significantly increased AUC compared to algorithms that only analyzed the time-lapse system (TLS) videos. WHAT IS KNOWN ALREADY Several AI-based algorithms have been developed to predict pregnancy, most of them based only on analysis of the time-lapse recording of embryo development. It remains unclear, however, whether considering numerous clinical features can improve the predictive performances of time-lapse based embryo evaluation. STUDY DESIGN, SIZE, DURATION A dataset of 9986 embryos (95.60% known clinical pregnancy outcome, 32.47% frozen transfers) from 5226 patients from 14 European fertility centers (in two countries) recorded with three different TLS was used to train and validate the algorithms. A total of 31 clinical factors were collected. A separate test set (447 videos) was used to compare performances between embryologists and the algorithm. PARTICIPANTS/MATERIALS, SETTING, METHODS Clinical pregnancy (defined as a pregnancy leading to a fetal heartbeat) outcome was first predicted using a 3D convolutional neural network that analyzed videos of the embryonic development up to 2 or 3 days of development (33% of the database) or up to 5 or 6 days of development (67% of the database). The output video score was then fed as input alongside clinical features to a gradient boosting algorithm that generated a second score corresponding to the hybrid model. AUC was computed across 7-fold of the validation dataset for both models. These predictions were compared to those of 13 senior embryologists made on the test dataset. MAIN RESULTS AND THE ROLE OF CHANCE The average AUC of the hybrid model across all 7-fold was significantly higher than that of the video model (0.727 versus 0.684, respectively, P = 0.015; Wilcoxon test). A SHapley Additive exPlanations (SHAP) analysis of the hybrid model showed that the six first most important features to predict pregnancy were morphokinetics of the embryo (video score), oocyte age, total gonadotrophin dose intake, number of embryos generated, number of oocytes retrieved, and endometrium thickness. The hybrid model was shown to be superior to embryologists with respect to different metrics, including the balanced accuracy (P ≤ 0.003; Wilcoxon test). The likelihood of pregnancy was linearly linked to the hybrid score, with increasing odds ratio (maximum P-value = 0.001), demonstrating the ranking capacity of the model. Training individual hybrid models did not improve predictive performance. A clinic hold-out experiment was conducted and resulted in AUCs ranging between 0.63 and 0.73. Performance of the hybrid model did not vary between TLS or between subgroups of embryos transferred at different days of embryonic development. The hybrid model did fare better for patients older than 35 years (P < 0.001; Mann-Whitney test), and for fresh transfers (P < 0.001; Mann-Whitney test). LIMITATIONS, REASONS FOR CAUTION Participant centers were located in two countries, thus limiting the generalization of our conclusion to wider subpopulations of patients. Not all clinical features were available for all embryos, thus limiting the performances of the hybrid model in some instances. WIDER IMPLICATIONS OF THE FINDINGS Our study suggests that considering clinical data improves pregnancy predictive performances and that there is no need to retrain algorithms at the clinic level unless they follow strikingly different practices. This study characterizes a versatile AI algorithm with similar performance on different time-lapse microscopes and on embryos transferred at different development stages. It can also help with patients of different ages and protocols used but with varying performances, presumably because the task of predicting fetal heartbeat becomes more or less hard depending on the clinical context. This AI model can be made widely available and can help embryologists in a wide range of clinical scenarios to standardize their practices. STUDY FUNDING/COMPETING INTEREST(S) Funding for the study was provided by ImVitro with grant funding received in part from BPIFrance (Bourse French Tech Emergence (DOS0106572/00), Paris Innovation Amorçage (DOS0132841/00), and Aide au Développement DeepTech (DOS0152872/00)). A.B.-C. is a co-owner of, and holds stocks in, ImVitro SAS. A.B.-C. and F.D.M. hold a patent for 'Devices and processes for machine learning prediction of in vitro fertilization' (EP20305914.2). A.D., N.D., M.M.F., and F.D.M. are or have been employees of ImVitro and have been granted stock options. X.P.-V. has been paid as a consultant to ImVitro and has been granted stocks options of ImVitro. L.C.-D. and C.G.-S. have undertaken paid consultancy for ImVitro SAS. The remaining authors have no conflicts to declare. TRIAL REGISTRATION NUMBER N/A.
Collapse
Affiliation(s)
| | - D Nogueira
- INOVIE Fertilité, Institut de Fertilité La Croix Du Sud, Toulouse, France
- Art Fertility Clinics, IVF laboratory, Abu Dhabi, United Arab Emirate
| | | | | | | | - L Chansel-Debordeaux
- Service de la biologie et de la reproduction et CECOS, CHU Bordeaux Groupe Hospitalier Pellegrin, Bordeaux, France
| | - M Ferrer-Buitrago
- Crea Centro Médico de Fertilidad y Reproducción Asistida, Valencia, Spain
| | - E Ferrer
- Crea Centro Médico de Fertilidad y Reproducción Asistida, Valencia, Spain
| | - V Antequera
- Crea Centro Médico de Fertilidad y Reproducción Asistida, Valencia, Spain
| | - M Ruiz-Jorro
- Crea Centro Médico de Fertilidad y Reproducción Asistida, Valencia, Spain
| | - A Papaxanthos
- Service de la biologie et de la reproduction et CECOS, CHU Bordeaux Groupe Hospitalier Pellegrin, Bordeaux, France
| | - H Ouchchane
- INOVIE Fertilité, Gen-Bio, Clermont-Ferrand, France
| | - B Keppi
- INOVIE Fertilité, Gen-Bio, Clermont-Ferrand, France
| | - P-Y Prima
- Laboratoire FIV PMAtlantique - Clinique Santé Atlantique, Nantes, France
| | | | | | - C Geoffroy-Siraudin
- Hopital Saint Joseph, Service Médicine et Biologie de la Reproduction, Marseille, France
| | - S Zaragoza
- INOVIE Fertilité, Bioaxiome, Avignon, France
| | - E Scalici
- INOVIE Fertilité, Bioaxiome, Avignon, France
| | - P Sanguinet
- INOVIE Fertilité, LaboSud, Montpellier, France
| | - N Cassagnard
- INOVIE Fertilité, Institut de Fertilité La Croix Du Sud, Toulouse, France
| | - C Ozanon
- Clinique Hôtel Privé Natecia, Centre Assistance Médicale à la Procréation, Lyon, France
| | | | - E Gómez
- Next Fertility, Murcia, Spain
| | - M Gervoise Boyer
- Hopital Saint Joseph, Service Médicine et Biologie de la Reproduction, Marseille, France
| | - P Boyer
- Hopital Saint Joseph, Service Médicine et Biologie de la Reproduction, Marseille, France
| | | | - X Pollet-Villard
- Nataliance, Centre Assistance Médicale à la Procréation, Saran, France
| | | |
Collapse
|
3
|
Berntsen J, Rimestad J, Lassen JT, Tran D, Kragh MF. Robust and generalizable embryo selection based on artificial intelligence and time-lapse image sequences. PLoS One 2022; 17:e0262661. [PMID: 35108306 PMCID: PMC8809568 DOI: 10.1371/journal.pone.0262661] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 01/03/2022] [Indexed: 01/31/2023] Open
Abstract
Assessing and selecting the most viable embryos for transfer is an essential part of in vitro fertilization (IVF). In recent years, several approaches have been made to improve and automate the procedure using artificial intelligence (AI) and deep learning. Based on images of embryos with known implantation data (KID), AI models have been trained to automatically score embryos related to their chance of achieving a successful implantation. However, as of now, only limited research has been conducted to evaluate how embryo selection models generalize to new clinics and how they perform in subgroup analyses across various conditions. In this paper, we investigate how a deep learning-based embryo selection model using only time-lapse image sequences performs across different patient ages and clinical conditions, and how it correlates with traditional morphokinetic parameters. The model was trained and evaluated based on a large dataset from 18 IVF centers consisting of 115,832 embryos, of which 14,644 embryos were transferred KID embryos. In an independent test set, the AI model sorted KID embryos with an area under the curve (AUC) of a receiver operating characteristic curve of 0.67 and all embryos with an AUC of 0.95. A clinic hold-out test showed that the model generalized to new clinics with an AUC range of 0.60–0.75 for KID embryos. Across different subgroups of age, insemination method, incubation time, and transfer protocol, the AUC ranged between 0.63 and 0.69. Furthermore, model predictions correlated positively with blastocyst grading and negatively with direct cleavages. The fully automated iDAScore v1.0 model was shown to perform at least as good as a state-of-the-art manual embryo selection model. Moreover, full automatization of embryo scoring implies fewer manual evaluations and eliminates biases due to inter- and intraobserver variation.
Collapse
Affiliation(s)
| | | | | | - Dang Tran
- Harrison AI, Sydney, New South Wales, Australia
| | - Mikkel Fly Kragh
- Vitrolife A/S, Aarhus, Denmark
- Department of Electrical and Computer Engineering, Aarhus University, Aarhus, Denmark
| |
Collapse
|
5
|
Zaninovic N, Rosenwaks Z. Artificial intelligence in human in vitro fertilization and embryology. Fertil Steril 2021; 114:914-920. [PMID: 33160513 DOI: 10.1016/j.fertnstert.2020.09.157] [Citation(s) in RCA: 62] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Accepted: 09/23/2020] [Indexed: 02/06/2023]
Abstract
Embryo evaluation and selection embody the aggregate manifestation of the entire in vitro fertilization (IVF) process. It aims to choose the "best" embryos from the larger cohort of fertilized oocytes, the majority of which will be determined to be not viable either as a result of abnormal development or due to chromosomal imbalances. Indeed, it is generally acknowledged that even after embryo selection based on morphology, time-lapse microscopic photography, or embryo biopsy with preimplantation genetic testing, implantation rates in the human are difficult to predict. Our pursuit of enhancing embryo evaluation and selection, as well as increasing live birth rates, will require the adoption of novel technologies. Recently, several artificial intelligence (AI)-based methods have emerged as objective, standardized, and efficient tools for evaluating human embryos. Moreover, AI-based methods can be implemented for other clinical aspects of IVF, such as assessing patient reproductive potential and individualizing gonadotropin stimulation protocols. As AI has the capability to analyze "big" data, the ultimate goal will be to apply AI tools to the analysis of all embryological, clinical, and genetic data in an effort to provide patient-tailored treatments. In this chapter, we present an overview of existing AI technologies in reproductive medicine and envision their potential future applications in the field.
Collapse
Affiliation(s)
- Nikica Zaninovic
- The Ronald O. Perelman and Claudia Cohen Center for Reproductive Medicine, Weill Cornell Medicine, New York, New York.
| | - Zev Rosenwaks
- The Ronald O. Perelman and Claudia Cohen Center for Reproductive Medicine, Weill Cornell Medicine, New York, New York
| |
Collapse
|