1
|
Zhou G, Lee MC, Wang X, Zhong D, Githeko AK, Yan G. Mapping Potential Malaria Vector Larval Habitats for Larval Source Management in Western Kenya: Introduction to Multimodel Ensembling Approaches. Am J Trop Med Hyg 2024; 110:421-430. [PMID: 38350135 PMCID: PMC10919169 DOI: 10.4269/ajtmh.23-0108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Accepted: 11/03/2023] [Indexed: 02/15/2024] Open
Abstract
Identification and mapping of larval sources are a prerequisite for effective planning and implementing mosquito larval source management (LSM). Ensemble modeling is increasingly used for prediction modeling, but it lacks standard procedures. We proposed a detailed framework to predict potential malaria vector larval habitats by using multimodel ensemble modeling, which includes selection of models, ensembling method, and predictors, evaluation of variable importance, prediction of potential larval habitats, and assessment of prediction uncertainty. The models were built and validated based on multisite, multiyear field observations and climatic/environmental variables. Model performance was tested using independent field observations. Overall, we found that the ensembled model predicted larval habitats with about 20% more accuracy than the average of the individual models ensembled. Key larval habitat predictors in western Kenya were elevation, geomorphon class, and precipitation for the 2 months prior. Additional predictors may be required to increase the predictive accuracy of the larva-positive habitats. This is the first study to provide a detailed framework for the process of multimodel ensemble modeling for malaria vector habitats. Mapping of potential habitats will be helpful in LSM planning.
Collapse
Affiliation(s)
- Guofa Zhou
- Program in Public Health, University of California, Irvine, California
| | - Ming-Chieh Lee
- Program in Public Health, University of California, Irvine, California
| | - Xiaoming Wang
- Program in Public Health, University of California, Irvine, California
| | - Daibin Zhong
- Program in Public Health, University of California, Irvine, California
| | - Andrew K. Githeko
- Centre for Global Health Research, Kenya Medical Research Institute, Kisumu, Kenya
| | - Guiyun Yan
- Program in Public Health, University of California, Irvine, California
| |
Collapse
|
2
|
Simoulin A, Thiebaut N, Neuberger K, Ibnouhsein I, Brunel N, Viné R, Bousquet N, Latapy J, Reix N, Molière S, Lodi M, Mathelin C. From free-text electronic health records to structured cohorts: Onconum, an innovative methodology for real-world data mining in breast cancer. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 240:107693. [PMID: 37453367 DOI: 10.1016/j.cmpb.2023.107693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 05/25/2023] [Accepted: 06/23/2023] [Indexed: 07/18/2023]
Abstract
PURPOSE A considerable amount of valuable information is present in electronic health records (EHRs) however it remains inaccessible because it is embedded into unstructured narrative documents that cannot be easily analyzed. We wanted to develop and evaluate a methodology able to extract and structure information from electronic health records in breast cancer. METHODS We developed a software platform called Onconum (ClinicalTrials.gov Identifier: NCT02810093) which uses a hybrid method relying on machine learning approaches and rule-based lexical methods. It is based on natural language processing techniques that allows a targeted analysis of free-text medical data related to breast cancer, independently of any pre-existing dictionary, in a French context (available in N files). We then evaluated it on a validation cohort called Senometry. FINDINGS Senometry cohort included 9,599 patients with breast cancer (both invasive and in situ), treated between 2000 and 2017 in the breast cancer unit of Strasbourg University Hospitals. Extraction rates ranged from 45 to 100%, depending on the type of each parameter. Precision of extracted information was 68%-94% compared to a structured cohort, and 89%-98% compared to manually structured databases and it retrieved more rare occurrences compared to another database search engine (+17%). INTERPRETATION This innovative method can accurately structure relevant medical information embedded in EHRs in the context of breast cancer. Missing data handling is the main limitation of this method however multiple sources can be incorporated to reduce this limit. Nevertheless, this methodology does not need neither pre-existing dictionaries nor manually annotated corpora. It can therefore be easily implemented in non-English-speaking countries and in other diseases outside breast cancer, and it allows prospective inclusion of new patients.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Nicolas Bousquet
- Quantmetry, 52 rue d'Anjou, 75008 Paris, France; Sorbonne University, 4 place Jussieu, 75005 Paris, France
| | | | - Nathalie Reix
- ICube UMR 7537, Strasbourg University / CNRS, Fédération de Médecine Translationnelle de Strasbourg, 67200 Strasbourg, France; Biochemistry and Molecular Biology Laboratory, Strasbourg University Hospitals, 1 place de l'Hôpital, 67091 Strasbourg, France
| | - Sébastien Molière
- Radiology Department, Strasbourg University Hospitals, 1 avenue Molière, 67098 Strasbourg, France
| | - Massimo Lodi
- Institut de cancérologie Strasbourg Europe (ICANS), 17 avenue Albert Calmette, 67033 Strasbourg Cedex, France; Department of Functional Genomics and Cancer, Institut de Génétique et de Biologie Moléculaire et Cellulaire, CNRS UMR 7104, INSERM U964, Strasbourg University, Illkirch, France; Strasbourg University Hospitals, 1 place de l'Hôpital, 67091 Strasbourg, France.
| | - Carole Mathelin
- Institut de cancérologie Strasbourg Europe (ICANS), 17 avenue Albert Calmette, 67033 Strasbourg Cedex, France; Department of Functional Genomics and Cancer, Institut de Génétique et de Biologie Moléculaire et Cellulaire, CNRS UMR 7104, INSERM U964, Strasbourg University, Illkirch, France; Strasbourg University Hospitals, 1 place de l'Hôpital, 67091 Strasbourg, France.
| |
Collapse
|
3
|
Tao X, Jiang M, Liu Y, Hu Q, Zhu B, Hu J, Guo W, Wu X, Xiong Y, Shi X, Zhang X, Han X, Li W, Tong R, Long E. Predicting three-month fasting blood glucose and glycated hemoglobin changes in patients with type 2 diabetes mellitus based on multiple machine learning algorithms. Sci Rep 2023; 13:16437. [PMID: 37777593 PMCID: PMC10543442 DOI: 10.1038/s41598-023-43240-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 09/21/2023] [Indexed: 10/02/2023] Open
Abstract
Fasting blood glucose (FBG) and glycosylated hemoglobin (HbA1c) are key indicators reflecting blood glucose control in type 2 diabetes mellitus (T2DM) patients. The purpose of this study is to establish a predictive model for blood glucose changes in T2DM patients after 3 months of treatment, achieving personalized treatment.A retrospective study was conducted on type 2 diabetes mellitus real-world medical data from 4 cities in Sichuan Province, China from January 2015 to December 2020. After data preprocessing, data inputting, data sampling, and feature screening, 16 kinds of machine learning methods were used to construct prediction models, and 5 prediction models with the best prediction performance were screened respectively. A total of 100,000 cases were included to establish the FBG model, and 2,169 cases were established to establish the HbA1c model. The best prediction model both of FBG and HbA1c finally obtained are realized by ensemble learning and modified random forest inputting, the AUC values are 0.819 and 0.970, respectively. The most important indicators of the FBG and HbA1c prediction model were FBG and HbA1c. Medication compliance, follow-up outcome, dietary habits, BMI, and waist circumference also had a greater impact on FBG levels. The prediction accuracy of the models of the two blood glucose control indicators is high and has certain clinical applicability.HbA1c and FBG are mutually important predictors, and there is a close relationship between them.
Collapse
Affiliation(s)
- Xue Tao
- Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China
| | - Min Jiang
- West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, 610044, Sichuan, China
| | - Yumeng Liu
- Department of Pharmacy, Daping Hospital, Army Medical University, Chongqing, 400042, China
| | - Qi Hu
- Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu, 610072, Sichuan, China
| | - Baoqiang Zhu
- School of Pharmacy, Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Jiaqiang Hu
- Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China
| | - Wenmei Guo
- Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China
| | - Xingwei Wu
- Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China
| | - Yu Xiong
- Institute of Materia Medica, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, 100050, China
| | - Xia Shi
- Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China
| | - Xueli Zhang
- Sichuan Provincial Health Information Center, Chengdu, 610015, Sichuan, China
| | - Xu Han
- Sichuan Provincial Health Information Center, Chengdu, 610015, Sichuan, China
| | - Wenyuan Li
- Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China
| | - Rongsheng Tong
- Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China
| | - Enwu Long
- Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China.
| |
Collapse
|
4
|
Feng Z, Shen Z, Li H, Li S. e-TSN: an interactive visual exploration platform for target-disease knowledge mapping from literature. Brief Bioinform 2022; 23:bbac465. [PMID: 36347537 PMCID: PMC9677481 DOI: 10.1093/bib/bbac465] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 09/20/2022] [Accepted: 09/27/2022] [Indexed: 11/10/2022] Open
Abstract
Target discovery and identification processes are driven by the increasing amount of biomedical data. The vast numbers of unstructured texts of biomedical publications provide a rich source of knowledge for drug target discovery research and demand the development of specific algorithms or tools to facilitate finding disease genes and proteins. Text mining is a method that can automatically mine helpful information related to drug target discovery from massive biomedical literature. However, there is a substantial lag between biomedical publications and the subsequent abstraction of information extracted by text mining to databases. The knowledge graph is introduced to integrate heterogeneous biomedical data. Here, we describe e-TSN (Target significance and novelty explorer, http://www.lilab-ecust.cn/etsn/), a knowledge visualization web server integrating the largest database of associations between targets and diseases from the full scientific literature by constructing significance and novelty scoring methods based on bibliometric statistics. The platform aims to visualize target-disease knowledge graphs to assist in prioritizing candidate disease-related proteins. Approved drugs and associated bioactivities for each interested target are also provided to facilitate the visualization of drug-target relationships. In summary, e-TSN is a fast and customizable visualization resource for investigating and analyzing the intricate target-disease networks, which could help researchers understand the mechanisms underlying complex disease phenotypes and improve the drug discovery and development efficiency, especially for the unexpected outbreak of infectious disease pandemics like COVID-19.
Collapse
Affiliation(s)
- Ziyan Feng
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Zihao Shen
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Honglin Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
- Innovation Center for AI and Drug Discovery, East China Normal University, Shanghai 200062, China
- Lingang Laboratory, Shanghai 200031, China
| | - Shiliang Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
- Innovation Center for AI and Drug Discovery, East China Normal University, Shanghai 200062, China
| |
Collapse
|
5
|
吴 妍, 何 白, 沈 敏, 杨 艳, 金 玉, 张 青, 杨 军, 李 姝. [Characteristics of wideband tympanometry in patients with Ménière's disease based on neutral network]. LIN CHUANG ER BI YAN HOU TOU JING WAI KE ZA ZHI = JOURNAL OF CLINICAL OTORHINOLARYNGOLOGY, HEAD, AND NECK SURGERY 2022; 36:685-690. [PMID: 36036069 PMCID: PMC10127621 DOI: 10.13201/j.issn.2096-7993.2022.09.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Indexed: 06/13/2023]
Abstract
Objective:To construct a prediction model for Ménière's disease based on neural network and evaluate its prediction ability. Methods:Sixty-four patients with Ménière's disease underwent gadolinium enhanced magnetic resonance imaging of inner ear which showed endolymphatic hydrops. Meanwhile, 40 healthy adults were enrolled as controls. The database of wideband tympanometry of patients and control subjects was analyzed, and the neural network model was established by MATLAB 2021a software. The prediction ability of the model was evaluated by accuracy, positive predictive value, negative predictive value, the Youden index, sensitivity, specificity, receiver operating characteristic curve and area under curve (AUC). Results:A feedforward network model was built with a single hidden layer to predict Ménière's disease with wideband tympanometry. There were 104 features in the input layer, 13 neuron nodes in the hidden layer and 1 output neuron in the output layer. The accuracy of the model was 83.2%, the positive predictive value was 80.7%, the negative predictive value was 84.3%, the sensitivity was 76.5%, the specificity was 83.7%, the Youden index was 0.602, and the AUC was 0.855. Conclusion:Based on neural network, the prediction model of Ménière's disease with high accuracy was constructed according to the results of wideband tympanometry, which provided reference for the diagnose of Ménière's disease.
Collapse
Affiliation(s)
- 妍 吴
- 上海交通大学医学院附属新华医院耳鼻咽喉-头颈外科 上海交通大学医学院耳科学研究所 上海耳鼻疾病转化医学重点实验室(上海,200092)Department of Otorhinolaryngology-Head & Neck Surgery, Xinhua Hospital, Shanghai Jiaotong University School of Medicine; Shanghai Jiaotong University School of Medicine Ear Institute; Shanghai Key Laboratory of Translational Medicine on Ear and Nose diseases, Shanghai, 200092, China
| | - 白慧 何
- 上海交通大学医学院附属新华医院耳鼻咽喉-头颈外科 上海交通大学医学院耳科学研究所 上海耳鼻疾病转化医学重点实验室(上海,200092)Department of Otorhinolaryngology-Head & Neck Surgery, Xinhua Hospital, Shanghai Jiaotong University School of Medicine; Shanghai Jiaotong University School of Medicine Ear Institute; Shanghai Key Laboratory of Translational Medicine on Ear and Nose diseases, Shanghai, 200092, China
| | - 敏 沈
- 上海交通大学医学院附属新华医院耳鼻咽喉-头颈外科 上海交通大学医学院耳科学研究所 上海耳鼻疾病转化医学重点实验室(上海,200092)Department of Otorhinolaryngology-Head & Neck Surgery, Xinhua Hospital, Shanghai Jiaotong University School of Medicine; Shanghai Jiaotong University School of Medicine Ear Institute; Shanghai Key Laboratory of Translational Medicine on Ear and Nose diseases, Shanghai, 200092, China
| | - 艳 杨
- 辽宁省医疗器械检验检测院Liaoning Medical Instrument Inspection and Testing Institute
| | - 玉莲 金
- 上海交通大学医学院附属新华医院耳鼻咽喉-头颈外科 上海交通大学医学院耳科学研究所 上海耳鼻疾病转化医学重点实验室(上海,200092)Department of Otorhinolaryngology-Head & Neck Surgery, Xinhua Hospital, Shanghai Jiaotong University School of Medicine; Shanghai Jiaotong University School of Medicine Ear Institute; Shanghai Key Laboratory of Translational Medicine on Ear and Nose diseases, Shanghai, 200092, China
| | - 青 张
- 上海交通大学医学院附属新华医院耳鼻咽喉-头颈外科 上海交通大学医学院耳科学研究所 上海耳鼻疾病转化医学重点实验室(上海,200092)Department of Otorhinolaryngology-Head & Neck Surgery, Xinhua Hospital, Shanghai Jiaotong University School of Medicine; Shanghai Jiaotong University School of Medicine Ear Institute; Shanghai Key Laboratory of Translational Medicine on Ear and Nose diseases, Shanghai, 200092, China
| | - 军 杨
- 上海交通大学医学院附属新华医院耳鼻咽喉-头颈外科 上海交通大学医学院耳科学研究所 上海耳鼻疾病转化医学重点实验室(上海,200092)Department of Otorhinolaryngology-Head & Neck Surgery, Xinhua Hospital, Shanghai Jiaotong University School of Medicine; Shanghai Jiaotong University School of Medicine Ear Institute; Shanghai Key Laboratory of Translational Medicine on Ear and Nose diseases, Shanghai, 200092, China
| | - 姝娜 李
- 上海交通大学医学院附属新华医院耳鼻咽喉-头颈外科 上海交通大学医学院耳科学研究所 上海耳鼻疾病转化医学重点实验室(上海,200092)Department of Otorhinolaryngology-Head & Neck Surgery, Xinhua Hospital, Shanghai Jiaotong University School of Medicine; Shanghai Jiaotong University School of Medicine Ear Institute; Shanghai Key Laboratory of Translational Medicine on Ear and Nose diseases, Shanghai, 200092, China
| |
Collapse
|