1
|
Landschaft A, Antweiler D, Mackay S, Kugler S, Rüping S, Wrobel S, Höres T, Allende-Cid H. Implementation and evaluation of an additional GPT-4-based reviewer in PRISMA-based medical systematic literature reviews. Int J Med Inform 2024; 189:105531. [PMID: 38943806 DOI: 10.1016/j.ijmedinf.2024.105531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 04/26/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]
Abstract
BACKGROUND PRISMA-based literature reviews require meticulous scrutiny of extensive textual data by multiple reviewers, which is associated with considerable human effort. OBJECTIVE To evaluate feasibility and reliability of using GPT-4 API as a complementary reviewer in systematic literature reviews based on the PRISMA framework. METHODOLOGY A systematic literature review on the role of natural language processing and Large Language Models (LLMs) in automatic patient-trial matching was conducted using human reviewers and an AI-based reviewer (GPT-4 API). A RAG methodology with LangChain integration was used to process full-text articles. Agreement levels between two human reviewers and GPT-4 API for abstract screening and between a single reviewer and GPT-4 API for full-text parameter extraction were evaluated. RESULTS An almost perfect GPT-human reviewer agreement in the abstract screening process (Cohen's kappa > 0.9) and a lower agreement in the full-text parameter extraction were observed. CONCLUSION As GPT-4 has performed on a par with human reviewers in abstract screening, we conclude that GPT-4 has an exciting potential of being used as a main screening tool for systematic literature reviews, replacing at least one of the human reviewers.
Collapse
Affiliation(s)
- Assaf Landschaft
- Boston Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA; Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme (IAIS), Sankt Augustin, Germany.
| | - Dario Antweiler
- Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme (IAIS), Sankt Augustin, Germany
| | - Sina Mackay
- Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme (IAIS), Sankt Augustin, Germany
| | - Sabine Kugler
- Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme (IAIS), Sankt Augustin, Germany
| | - Stefan Rüping
- Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme (IAIS), Sankt Augustin, Germany
| | - Stefan Wrobel
- Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme (IAIS), Sankt Augustin, Germany
| | - Timm Höres
- Fraunhofer-Institut für Translationale Medizin und Pharmakologie (ITMP), Frankfurt am Main, Germany
| | - Hector Allende-Cid
- Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme (IAIS), Sankt Augustin, Germany
| |
Collapse
|
2
|
Al Jaberi SM, Patel A, AL-Masri AN. Object tracking and detection techniques under GANN threats: A systemic review. Appl Soft Comput 2023. [DOI: 10.1016/j.asoc.2023.110224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]
|
3
|
Kim JY, Cho SB. Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending. Sci Prog 2022; 105:368504221124004. [PMID: 36112937 PMCID: PMC10450474 DOI: 10.1177/00368504221124004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In peer-to-peer (P2P) social lending, it is important to predict the repayment of borrowers. P2P lending data are generated in real-time, but most of them are pending to decide the repayment because the deadline is not yet expired. Adding the unexpired data with appropriate labels into the training set could improve the performance of a prediction model, but the pseudo-labels cannot be certainly precise. In this paper, we propose an ensemble classifier composed of diverse convolutional neural networks (CNNs) of GoogLeNet, ResNet and DenseNet for the repayment prediction in social lending with the pseudo-labels approximated by an uncertainty handling scheme. The additional data labeled by Dempster-Shafer fusion of two semi-supervised learning methods boost up training of various models of CNNs, which are combined by weighted voting. A diversity measure is applied to constructing a pool of different models of CNNs that extract the effective features in the social lending data with labeling noise and predict the borrower's loan status. The experiment with the real dataset of 855,502 cases from Lending Club confirms that the diverse ensemble combined with weighted voting achieves the highest performance and outperforms conventional methods.
Collapse
Affiliation(s)
- Ji-Yoon Kim
- Department of Computer Science, Yonsei University, Seoul, Korea
| | - Sung-Bae Cho
- Department of Computer Science, Yonsei University, Seoul, Korea
| |
Collapse
|
4
|
Vehicle Type Classification using Hybrid Features and a Deep Neural Network. INTERNATIONAL JOURNAL OF APPLIED METAHEURISTIC COMPUTING 2022. [DOI: 10.4018/ijamc.292518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Currently, considerable research has been done in vehicle type classification, especially due to the success of deep learning in many image classification problems. In this research, a system incorporating hybrid features is proposed to improve the performance of vehicle type classification. The feature vectors are extracted from the pre-processed images using Gabor features, a histogram of oriented gradients and a local optimal oriented pattern. The hybrid set of features contains complementary information that could help discriminate between the classes better, further, an ant colony optimizer is utilized to reduce the dimension of the extracted feature vectors. Finally, a deep neural network is used to classify the types of vehicles in the images. The proposed approach was tested on the MIO vision traffic camera dataset and another more challenging real-world dataset consisting of videos of multiple lanes of a toll plaza. The proposed model showed an improvement in accuracy ranging from 0.28% to 8.68% in the MIO TCD dataset when compared to well-known neural network architectures.
Collapse
|
5
|
Gao W, Wu M, Lam SK, Xia Q, Zou J. Decoupled self-supervised label augmentation for fully-supervised image classification. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107605] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
6
|
Qiao S, Han N, Huang J, Yue K, Mao R, Shu H, He Q, Wu X. A Dynamic Convolutional Neural Network Based Shared-Bike Demand Forecasting Model. ACM T INTEL SYST TEC 2021. [DOI: 10.1145/3447988] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Bike-sharing systems are becoming popular and generate a large volume of trajectory data. In a bike-sharing system, users can borrow and return bikes at different stations. In particular, a bike-sharing system will be affected by weather, the time period, and other dynamic factors, which challenges the scheduling of shared bikes. In this article, a new shared-bike demand forecasting model based on dynamic convolutional neural networks, called
SDF
, is proposed to predict the demand of shared bikes. SDF chooses the most relevant weather features from real weather data by using the Pearson correlation coefficient and transforms them into a two-dimensional dynamic feature matrix, taking into account the states of stations from historical data. The feature information in the matrix is extracted, learned, and trained with a newly proposed dynamic convolutional neural network to predict the demand of shared bikes in a dynamical and intelligent fashion. The phase of parameter update is optimized from three aspects: the loss function, optimization algorithm, and learning rate. Then, an accurate shared-bike demand forecasting model is designed based on the basic idea of minimizing the loss value. By comparing with classical machine learning models, the weight sharing strategy employed by SDF reduces the complexity of the network. It allows a high prediction accuracy to be achieved within a relatively short period of time. Extensive experiments are conducted on real-world bike-sharing datasets to evaluate SDF. The results show that SDF significantly outperforms classical machine learning models in prediction accuracy and efficiency.
Collapse
Affiliation(s)
- Shaojie Qiao
- Chengdu University of Information Technology, Chengdu, Sichuan, China
| | - Nan Han
- Chengdu University of Information Technology, Chengdu, Sichuan, China
| | | | - Kun Yue
- Yunnan University, Kunming, Yunnan, China
| | - Rui Mao
- Guangdong Province Key Laboratory of Popular High Performance Computers, and Guangdong Province Engineering Center of China-Made High Performance Data Computing System, Shenzhen, Guangzhou, China
| | - Hongping Shu
- Software Automatic Generation and Intelligent Service Key Laboratory of Sichuan Province, Chengdu, Sichuan, China
| | - Qiang He
- Swinburne University of Technology, Melbourne, Australia
| | - Xindong Wu
- Hefei University of Technology, Hefei, Anhui, China
| |
Collapse
|
7
|
Melanoma Recognition by Fusing Convolutional Blocks and Dynamic Routing between Capsules. Cancers (Basel) 2021; 13:cancers13194974. [PMID: 34638456 PMCID: PMC8508435 DOI: 10.3390/cancers13194974] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Revised: 09/28/2021] [Accepted: 09/29/2021] [Indexed: 11/17/2022] Open
Abstract
Simple Summary The early treatment of skin cancer can effectively reduce mortality rates. Recently, automatic melanoma diagnosis from skin images has gained attention, which was mainly encouraged by the well-known challenge developed by the International Skin Imaging Collaboration project. The majority of contestant submitted Convolutional Neural Network based solutions. However, this type of model presents disadvantages. As a consequence, Dynamic Routing between Capsules has been proposed to overcome such limitations. The aim of our proposal was to assess the advantages of combining both architectures. An extensive experimental study showed the proposal significantly outperformed state-of-the-art models, achieving 166% higher predictive performance compared to ResNet in non-dermoscopic images. In addition, the pixels activated during prediction were shown, which allows to assess the rationale to give such a conclusion. Finally, more research should be conducted in order to demonstrate the potential of this neural network architecture in other areas. Abstract Skin cancer is one of the most common types of cancers in the world, with melanoma being the most lethal form. Automatic melanoma diagnosis from skin images has recently gained attention within the machine learning community, due to the complexity involved. In the past few years, convolutional neural network models have been commonly used to approach this issue. This type of model, however, presents disadvantages that sometimes hamper its application in real-world situations, e.g., the construction of transformation-invariant models and their inability to consider spatial hierarchies between entities within an image. Recently, Dynamic Routing between Capsules architecture (CapsNet) has been proposed to overcome such limitations. This work is aimed at proposing a new architecture which combines convolutional blocks with a customized CapsNet architecture, allowing for the extraction of richer abstract features. This architecture uses high-quality 299×299×3 skin lesion images, and a hyper-tuning of the main parameters is performed in order to ensure effective learning under limited training data. An extensive experimental study on eleven image datasets was conducted where the proposal significantly outperformed several state-of-the-art models. Finally, predictions made by the model were validated through the application of two modern model-agnostic interpretation tools.
Collapse
|
8
|
Chandrasekar KS, Geetha P. A new formation of supervised dimensionality reduction method for moving vehicle classification. Neural Comput Appl 2021. [DOI: 10.1007/s00521-020-05524-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
9
|
Gnip P, Vokorokos L, Drotár P. Selective oversampling approach for strongly imbalanced data. PeerJ Comput Sci 2021; 7:e604. [PMID: 34239981 PMCID: PMC8237317 DOI: 10.7717/peerj-cs.604] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 05/31/2021] [Indexed: 06/03/2023]
Abstract
Challenges posed by imbalanced data are encountered in many real-world applications. One of the possible approaches to improve the classifier performance on imbalanced data is oversampling. In this paper, we propose the new selective oversampling approach (SOA) that first isolates the most representative samples from minority classes by using an outlier detection technique and then utilizes these samples for synthetic oversampling. We show that the proposed approach improves the performance of two state-of-the-art oversampling methods, namely, the synthetic minority oversampling technique and adaptive synthetic sampling. The prediction performance is evaluated on four synthetic datasets and four real-world datasets, and the proposed SOA methods always achieved the same or better performance than other considered existing oversampling methods.
Collapse
Affiliation(s)
- Peter Gnip
- Department of Computers and Informatics, Technical University of Košice, Slovak Republic
| | - Liberios Vokorokos
- Department of Computers and Informatics, Technical University of Košice, Slovak Republic
| | - Peter Drotár
- Department of Computers and Informatics, Technical University of Košice, Slovak Republic
| |
Collapse
|
10
|
Accelerating the experimental responses on cell behaviors: a long-term prediction of cell trajectories using Social Generative Adversarial Network. Sci Rep 2020; 10:15635. [PMID: 32973301 PMCID: PMC7519062 DOI: 10.1038/s41598-020-72605-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Accepted: 07/17/2020] [Indexed: 01/04/2023] Open
Abstract
The incremented uptake provided by time-lapse microscopy in Organ-on-a-Chip (OoC) devices allowed increased attention to the dynamics of the co-cultured systems. However, the amount of information stored in long-time experiments may constitute a serious bottleneck of the experimental pipeline. Forward long-term prediction of cell trajectories may reduce the spatial–temporal burden of video sequences storage. Cell trajectory prediction becomes crucial especially to increase the trustworthiness in software tools designed to conduct a massive analysis of cell behavior under chemical stimuli. To address this task, we transpose here the exploitation of the presence of “social forces” from the human to the cellular level for motion prediction at microscale by adapting the potential of Social Generative Adversarial Network predictors to cell motility. To demonstrate the effectiveness of the approach, we consider here two case studies: one related to PC-3 prostate cancer cells cultured in 2D Petri dishes under control and treated conditions and one related to an OoC experiment of tumor-immune interaction in fibrosarcoma cells. The goodness of the proposed strategy has been verified by successfully comparing the distributions of common descriptors (kinematic descriptors and mean interaction time for the two scenarios respectively) from the trajectories obtained by video analysis and the predicted counterparts.
Collapse
|
11
|
Deep Learning in the Biomedical Applications: Recent and Future Status. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9081526] [Citation(s) in RCA: 76] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Deep neural networks represent, nowadays, the most effective machine learning technology in biomedical domain. In this domain, the different areas of interest concern the Omics (study of the genome—genomics—and proteins—transcriptomics, proteomics, and metabolomics), bioimaging (study of biological cell and tissue), medical imaging (study of the human organs by creating visual representations), BBMI (study of the brain and body machine interface) and public and medical health management (PmHM). This paper reviews the major deep learning concepts pertinent to such biomedical applications. Concise overviews are provided for the Omics and the BBMI. We end our analysis with a critical discussion, interpretation and relevant open challenges.
Collapse
|