1
|
Li F, Zhang J, Li K, Peng Y, Zhang H, Xu Y, Yu Y, Zhang Y, Liu Z, Wang Y, Huang L, Zhou F. GANSamples-ac4C: Enhancing ac4C site prediction via generative adversarial networks and transfer learning. Anal Biochem 2024; 689:115495. [PMID: 38431142 DOI: 10.1016/j.ab.2024.115495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 02/18/2024] [Accepted: 02/22/2024] [Indexed: 03/05/2024]
Abstract
RNA modification, N4-acetylcytidine (ac4C), is enzymatically catalyzed by N-acetyltransferase 10 (NAT10) and plays an essential role across tRNA, rRNA, and mRNA. It influences various cellular functions, including mRNA stability and rRNA biosynthesis. Wet-lab detection of ac4C modification sites is highly resource-intensive and costly. Therefore, various machine learning and deep learning techniques have been employed for computational detection of ac4C modification sites. The known ac4C modification sites are limited for training an accurate and stable prediction model. This study introduces GANSamples-ac4C, a novel framework that synergizes transfer learning and generative adversarial network (GAN) to generate synthetic RNA sequences to train a better ac4C modification site prediction model. Comparative analysis reveals that GANSamples-ac4C outperforms existing state-of-the-art methods in identifying ac4C sites. Moreover, our result underscores the potential of synthetic data in mitigating the issue of data scarcity for biological sequence prediction tasks. Another major advantage of GANSamples-ac4C is its interpretable decision logic. Multi-faceted interpretability analyses detect key regions in the ac4C sequences influencing the discriminating decision between positive and negative samples, a pronounced enrichment of G in this region, and ac4C-associated motifs. These findings may offer novel insights for ac4C research. The GANSamples-ac4C framework and its source code are publicly accessible at http://www.healthinformaticslab.org/supp/.
Collapse
Affiliation(s)
- Fei Li
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, and College of Computer Science and Technology, Jilin University, Changchun, Jilin, 130012, China
| | - Jiale Zhang
- College of Software, Jilin University, Changchun, Jilin, 130012, China
| | - Kewei Li
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, and College of Computer Science and Technology, Jilin University, Changchun, Jilin, 130012, China.
| | - Yu Peng
- College of Software, Jilin University, Changchun, Jilin, 130012, China
| | - Haotian Zhang
- College of Software, Jilin University, Changchun, Jilin, 130012, China
| | - Yiping Xu
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, and College of Computer Science and Technology, Jilin University, Changchun, Jilin, 130012, China
| | - Yue Yu
- College of Software, Jilin University, Changchun, Jilin, 130012, China
| | - Yuteng Zhang
- College of Software, Jilin University, Changchun, Jilin, 130012, China
| | - Zewen Liu
- College of Software, Jilin University, Changchun, Jilin, 130012, China
| | - Ying Wang
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, and College of Computer Science and Technology, Jilin University, Changchun, Jilin, 130012, China
| | - Lan Huang
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, and College of Computer Science and Technology, Jilin University, Changchun, Jilin, 130012, China
| | - Fengfeng Zhou
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, and College of Computer Science and Technology, Jilin University, Changchun, Jilin, 130012, China; School of Biology and Engineering, Guizhou Medical University, Guiyang, 550025, Guizhou, China.
| |
Collapse
|
2
|
Nia AF, Tang V, Talou GM, Billinghurst M. Synthesizing affective neurophysiological signals using generative models: A review paper. J Neurosci Methods 2024; 406:110129. [PMID: 38614286 DOI: 10.1016/j.jneumeth.2024.110129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 01/04/2024] [Accepted: 04/03/2024] [Indexed: 04/15/2024]
Abstract
The integration of emotional intelligence in machines is an important step in advancing human-computer interaction. This demands the development of reliable end-to-end emotion recognition systems. However, the scarcity of public affective datasets presents a challenge. In this literature review, we emphasize the use of generative models to address this issue in neurophysiological signals, particularly Electroencephalogram (EEG) and Functional Near-Infrared Spectroscopy (fNIRS). We provide a comprehensive analysis of different generative models used in the field, examining their input formulation, deployment strategies, and methodologies for evaluating the quality of synthesized data. This review serves as a comprehensive overview, offering insights into the advantages, challenges, and promising future directions in the application of generative models in emotion recognition systems. Through this review, we aim to facilitate the progression of neurophysiological data augmentation, thereby supporting the development of more efficient and reliable emotion recognition systems.
Collapse
Affiliation(s)
- Alireza F Nia
- Auckland Bioengineering Institute, 70 Symonds Street, Auckland, 1010, New Zealand.
| | - Vanessa Tang
- Auckland Bioengineering Institute, 70 Symonds Street, Auckland, 1010, New Zealand
| | - Gonzalo Maso Talou
- Auckland Bioengineering Institute, 70 Symonds Street, Auckland, 1010, New Zealand
| | - Mark Billinghurst
- Auckland Bioengineering Institute, 70 Symonds Street, Auckland, 1010, New Zealand
| |
Collapse
|
3
|
Ahmadian M, Bodalal Z, van der Hulst HJ, Vens C, Karssemakers LHE, Bogveradze N, Castagnoli F, Landolfi F, Hong EK, Gennaro N, Pizzi AD, Beets-Tan RGH, van den Brekel MWM, Castelijns JA. Overcoming data scarcity in radiomics/radiogenomics using synthetic radiomic features. Comput Biol Med 2024; 174:108389. [PMID: 38593640 DOI: 10.1016/j.compbiomed.2024.108389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 03/11/2024] [Accepted: 03/25/2024] [Indexed: 04/11/2024]
Abstract
PURPOSE To evaluate the potential of synthetic radiomic data generation in addressing data scarcity in radiomics/radiogenomics models. METHODS This study was conducted on a retrospectively collected cohort of 386 colorectal cancer patients (n = 2570 lesions) for whom matched contrast-enhanced CT images and gene TP53 mutational status were available. The full cohort data was divided into a training cohort (n = 2055 lesions) and an independent and fixed test set (n = 515 lesions). Differently sized training sets were subsampled from the training cohort to measure the impact of sample size on model performance and assess the added value of synthetic radiomic augmentation at different sizes. Five different tabular synthetic data generation models were used to generate synthetic radiomic data based on "real-world" radiomics data extracted from this cohort. The quality and reproducibility of the generated synthetic radiomic data were assessed. Synthetic radiomics were then combined with "real-world" radiomic training data to evaluate their impact on the predictive model's performance. RESULTS A prediction model was generated using only "real-world" radiomic data, revealing the impact of data scarcity in this particular data set through a lack of predictive performance at low training sample numbers (n = 200, 400, 1000 lesions with average AUC = 0.52, 0.53, and 0.56 respectively, compared to 0.64 when using 2055 training lesions). Synthetic tabular data generation models created reproducible synthetic radiomic data with properties highly similar to "real-world" data (for n = 1000 lesions, average Chi-square = 0.932, average basic statistical correlation = 0.844). The integration of synthetic radiomic data consistently enhanced the performance of predictive models trained with small sample size sets (AUC enhanced by 9.6%, 11.3%, and 16.7% for models trained on n_samples = 200, 400, and 1000 lesions, respectively). In contrast, synthetic data generated from randomised/noisy radiomic data failed to enhance predictive performance underlining the requirement of true signal data to do so. CONCLUSION Synthetic radiomic data, when combined with real radiomics, could enhance the performance of predictive models. Tabular synthetic data generation might help to overcome limitations in medical AI stemming from data scarcity.
Collapse
Affiliation(s)
- Milad Ahmadian
- Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; Amsterdam Center for Language and Communication, University of Amsterdam, Amsterdam, the Netherlands.
| | - Zuhir Bodalal
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; GROW School for Oncology and Developmental Biology, Maastricht University, Maastricht, the Netherlands
| | - Hedda J van der Hulst
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; GROW School for Oncology and Developmental Biology, Maastricht University, Maastricht, the Netherlands
| | - Conchita Vens
- Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; School of Cancer Science, University of Glasgow, Glasgow, Scotland, UK
| | - Luc H E Karssemakers
- Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands
| | - Nino Bogveradze
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; GROW School for Oncology and Developmental Biology, Maastricht University, Maastricht, the Netherlands; Department of Radiology, American Hospital Tbilisi, Tbilisi, Georgia
| | - Francesca Castagnoli
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; Department of Radiology, Royal Marsden Hospital, London, UK; Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, UK
| | - Federica Landolfi
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; Radiology Unit, Sant'Andrea Hospital, Sapienza University of Rome, Rome, Italy
| | - Eun Kyoung Hong
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; GROW School for Oncology and Developmental Biology, Maastricht University, Maastricht, the Netherlands; Seoul National University Hospital, Seoul, South Korea
| | - Nicolo Gennaro
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; Department of Radiology, Northwestern University, Chicago, USA
| | - Andrea Delli Pizzi
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; ITAB - Institute for Advanced Biomedical Technologies, G. d'Annunzio University, Chieti, Italy; Department of Innovative Technologies in Medicine and Dentistry, G. D'Annunzio University, Chieti, Italy
| | - Regina G H Beets-Tan
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; GROW School for Oncology and Developmental Biology, Maastricht University, Maastricht, the Netherlands; Institute of Regional Health Research, University of Southern Denmark, Odense, Denmark
| | - Michiel W M van den Brekel
- Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands; Amsterdam Center for Language and Communication, University of Amsterdam, Amsterdam, the Netherlands.
| | - Jonas A Castelijns
- Department of Radiology, The Netherlands Cancer Institute/Antoni van Leeuwenhoek Hospital, Amsterdam, the Netherlands
| |
Collapse
|
4
|
Monachino G, Zanchi B, Fiorillo L, Conte G, Auricchio A, Tzovara A, Faraci FD. Deep Generative Models: The winning key for large and easily accessible ECG datasets? Comput Biol Med 2023; 167:107655. [PMID: 37976830 DOI: 10.1016/j.compbiomed.2023.107655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 10/04/2023] [Accepted: 10/31/2023] [Indexed: 11/19/2023]
Abstract
Large high-quality datasets are essential for building powerful artificial intelligence (AI) algorithms capable of supporting advancement in cardiac clinical research. However, researchers working with electrocardiogram (ECG) signals struggle to get access and/or to build one. The aim of the present work is to shed light on a potential solution to address the lack of large and easily accessible ECG datasets. Firstly, the main causes of such a lack are identified and examined. Afterward, the potentials and limitations of cardiac data generation via deep generative models (DGMs) are deeply analyzed. These very promising algorithms have been found capable not only of generating large quantities of ECG signals but also of supporting data anonymization processes, to simplify data sharing while respecting patients' privacy. Their application could help research progress and cooperation in the name of open science. However several aspects, such as a standardized synthetic data quality evaluation and algorithm stability, need to be further explored.
Collapse
Affiliation(s)
- Giuliana Monachino
- Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland; Institute of Informatics, University of Bern, Neubrückstrasse 10, Bern 3012, Switzerland.
| | - Beatrice Zanchi
- Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland; Department of Quantitative Biomedicine, University of Zurich, Schmelzbergstrasse 26, Zurich 8091, Switzerland
| | - Luigi Fiorillo
- Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland
| | - Giulio Conte
- Division of Cardiology, Fondazione Cardiocentro Ticino, Via Tesserete 48, Lugano 6900, Switzerland; Centre for Computational Medicine in Cardiology, Faculty of Informatics, Università della Svizzera Italiana, Via la Santa 1, Lugano 6900, Switzerland
| | - Angelo Auricchio
- Division of Cardiology, Fondazione Cardiocentro Ticino, Via Tesserete 48, Lugano 6900, Switzerland; Centre for Computational Medicine in Cardiology, Faculty of Informatics, Università della Svizzera Italiana, Via la Santa 1, Lugano 6900, Switzerland
| | - Athina Tzovara
- Institute of Informatics, University of Bern, Neubrückstrasse 10, Bern 3012, Switzerland; Sleep Wake Epilepsy Center | NeuroTec, Department of Neurology, Inselspital, Bern University Hospital, University of Bern, Freiburgstrasse 16, Bern 3010, Switzerland
| | - Francesca Dalia Faraci
- Institute of Digital Technologies for Personalized Healthcare - MeDiTech, Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Via la Santa 1, Lugano 6900, Switzerland
| |
Collapse
|
5
|
Bargagna F, De Santi LA, Martini N, Genovesi D, Favilli B, Vergaro G, Emdin M, Giorgetti A, Positano V, Santarelli MF. Bayesian Convolutional Neural Networks in Medical Imaging Classification: A Promising Solution for Deep Learning Limits in Data Scarcity Scenarios. J Digit Imaging 2023; 36:2567-2577. [PMID: 37787869 PMCID: PMC10584795 DOI: 10.1007/s10278-023-00897-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 08/04/2023] [Accepted: 08/07/2023] [Indexed: 10/04/2023] Open
Abstract
Deep neural networks (DNNs) have already impacted the field of medicine in data analysis, classification, and image processing. Unfortunately, their performance is drastically reduced when datasets are scarce in nature (e.g., rare diseases or early-research data). In such scenarios, DNNs display poor capacity for generalization and often lead to highly biased estimates and silent failures. Moreover, deterministic systems cannot provide epistemic uncertainty, a key component to asserting the model's reliability. In this work, we developed a probabilistic system for classification as a framework for addressing the aforementioned criticalities. Specifically, we implemented a Bayesian convolutional neural network (BCNN) for the classification of cardiac amyloidosis (CA) subtypes. We prepared four different CNNs: base-deterministic, dropout-deterministic, dropout-Bayesian, and Bayesian. We then trained them on a dataset of 1107 PET images from 47 CA and control patients (data scarcity scenario). The Bayesian model achieved performances (78.28 (1.99) % test accuracy) comparable to the base-deterministic, dropout-deterministic, and dropout-Bayesian ones, while showing strongly increased "Out of Distribution" input detection (validation-test accuracy mismatch reduction). Additionally, both the dropout-Bayesian and the Bayesian models enriched the classification through confidence estimates, while reducing the criticalities of the dropout-deterministic and base-deterministic approaches. This in turn increased the model's reliability, also providing much needed insights into the network's estimates. The obtained results suggest that a Bayesian CNN can be a promising solution for addressing the challenges posed by data scarcity in medical imaging classification tasks.
Collapse
Affiliation(s)
- Filippo Bargagna
- University of Pisa, Pisa, Italy.
- Fondazione G. Monasterio CNR - Regione Toscana, Pisa, Italy.
| | - Lisa Anita De Santi
- University of Pisa, Pisa, Italy
- Fondazione G. Monasterio CNR - Regione Toscana, Pisa, Italy
| | - Nicola Martini
- Fondazione G. Monasterio CNR - Regione Toscana, Pisa, Italy
| | - Dario Genovesi
- Fondazione G. Monasterio CNR - Regione Toscana, Pisa, Italy
| | | | | | - Michele Emdin
- Scuola Universitaria Superiore 'S. Anna', Pisa, Italy
| | | | | | | |
Collapse
|
6
|
Bertels D, De Meester J, Dirckx G, Willems P. Estimation of the impact of combined sewer overflows on surface water quality in a sparsely monitored area. Water Res 2023; 244:120498. [PMID: 37639989 DOI: 10.1016/j.watres.2023.120498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 07/24/2023] [Accepted: 08/17/2023] [Indexed: 08/31/2023]
Abstract
Combined sewer overflows (CSOs) can have a severe negative, local impact on surface water systems. To assure good ecological surface water quality and drinking water production that meets the demands, the impact of sewer system overflows on the surrounding water bodies for current and future climate conditions needs to be assessed. Typically, integrated, detailed hydrological and hydrodynamic water quantity and quality models are used for this purpose, but often data and computational resource requirements limit their applicability. Therefore, an alternative computationally efficient, integrated water quantity and quality model of sewer systems and their receiving surface waters is proposed to assess the impact of CSOs on surface water quality in a sparsely observed area. A conceptual model approach to estimate CSO discharges is combined with an empirical model for estimating CSO pollutant concentrations based on waste water treatment plant influent observations. Both methods are compared with observations and independent results of established reference methods as to evaluate their performance. The methodology is demonstrated by modelling the current impact of CSOs on the water abstraction area of a major drinking water production centre in Flanders, Belgium. It is concluded that the proposed conceptual models achieve similar results for daily WWTP effluent and CSO frequency, whereby the accumulated CSO volume is similar to more detailed full hydrodynamic models. Further, the estimated pollutant concentrations correspond with another dataset based on high resolution sampled overflows. As a result, the proposed computational efficient method can give insights in the impact of CSOs on the water quality at a catchment level and can be used for planning monitoring campaigns or performing analyses of e.g. the current and future water availability for a data scarce areas.
Collapse
Affiliation(s)
- Daan Bertels
- KU Leuven, Department of Civil Engineering, Hydraulics and Geotechnics Section, Kasteelpark Arenberg 40 - box 2448, Leuven 3001, Belgium.
| | - Joke De Meester
- KU Leuven, Department of Civil Engineering, Hydraulics and Geotechnics Section, Kasteelpark Arenberg 40 - box 2448, Leuven 3001, Belgium
| | - Geert Dirckx
- Aquafin NV, R & D, Dijkstraat 8, Aartselaar 2630, Belgium
| | - Patrick Willems
- KU Leuven, Department of Civil Engineering, Hydraulics and Geotechnics Section, Kasteelpark Arenberg 40 - box 2448, Leuven 3001, Belgium
| |
Collapse
|
7
|
Mdegela L, De Bock Y, Luhanga E, Leo J, Mannens E. Monitoring Kikuletwa river levels in northern Tanzania: A data set unlocking insights for effective flood early warning systems. Data Brief 2023; 49:109395. [PMID: 37496522 PMCID: PMC10365974 DOI: 10.1016/j.dib.2023.109395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 06/21/2023] [Accepted: 07/06/2023] [Indexed: 07/28/2023] Open
Abstract
Floods are a recurring natural disaster that pose significant risks to communities and infrastructure. The lack of reliable and accurate data on river systems in developing countries has hindered the development of effective flood early warning systems. This paper presents a data set collected using ultrasonic distance sensors installed at two locations along the Kikuletwa River in the Pangani River Basin, Northern Tanzania. The dataset consists of hourly measurements of river water levels, providing a high-resolution time series that can be used to study trends in water level changes and to develop more accurate flood early warning systems. The Kikuletwa River dataset has significant potential applications for flood management, including the calibration and validation of hydrological models, the identification of critical thresholds for flood warning, and the evaluation of flood forecasting techniques. The dataset can also be used to study the hydrological processes in the basin, such as the relationship between rainfall and river discharge, and to develop more efficient and effective flood management strategies. The ultrasonic distance sensors were configured to record river level data at hourly intervals, providing a continuous time series of river levels. The data was subjected to quality control procedures to ensure accuracy and consistency, and missing or erroneous data was corrected or removed where necessary.
Collapse
Affiliation(s)
- Lawrence Mdegela
- University of Antwerp – imec IDLab, Department of Computer Science Sint-Pietersvliet 7, 2000 Antwerp, Belgium
- The Nelson Mandela African Institution of Science and Technology, P.O Box 447, Arusha, Tanzania
| | - Yorick De Bock
- University of Antwerp – imec IDLab, Department of Computer Science Sint-Pietersvliet 7, 2000 Antwerp, Belgium
| | - Edith Luhanga
- Carnegie Mellon African University, P.O Box 6150, Kigali, Rwanda
| | - Judith Leo
- The Nelson Mandela African Institution of Science and Technology, P.O Box 447, Arusha, Tanzania
| | - Erik Mannens
- University of Antwerp – imec IDLab, Department of Computer Science Sint-Pietersvliet 7, 2000 Antwerp, Belgium
| |
Collapse
|
8
|
Lee CY, Yang SH. Graph Spatio-Temporal Networks for Manufacturing Sales Forecast and Prevention Policies in Pandemic Era. Comput Ind Eng 2023; 182:109413. [PMID: 38620105 PMCID: PMC10299845 DOI: 10.1016/j.cie.2023.109413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 06/23/2023] [Accepted: 06/26/2023] [Indexed: 04/17/2024]
Abstract
Worldwide manufacturing industries are significantly affected by COVID-19 pandemic because of their production characteristics with low-cost country sourcing, globalization, and inventory level. To analyze the correlated time series, spatial-temporal model becomes more attractive, and the graph convolution network (GCN) is also commonly used to provide more information to the nodes and its neighbors in the graph. Recently, attention-adjusted graph spatio-temporal network (AGSTN) was proposed to address the problem of pre-defined graph in GCN by combining multi-graph convolution and attention adjustment to learn spatial and temporal correlations over time. However, AGSTN may show potential problem with limited small non-sensor data; particularly, convergence issue. This study proposes several variants of AGSTN and applies them to non-sensor data. We suggest data augmentation and regularization techniques such as edge selection, time series decomposition, prevention policies to improve AGSTN. An empirical study of worldwide manufacturing industries in pandemic era was conducted to validate the proposed variants. The results show that the proposed variants significantly improve the prediction performance at least around 20% on mean squared error (MSE) and convergence problem.
Collapse
Affiliation(s)
- Chia-Yen Lee
- Department of Information Management, National Taiwan University, Taipei 106, Taiwan
| | - Shu-Huei Yang
- Institute of Manufacturing Information and Systems, National Cheng Kung University, Tainan City 701, Taiwan
| |
Collapse
|
9
|
Rödiger T, Geyer S, Odeh T, Siebert C. Data scarce modelling the impact of present and future groundwater development on Jordan multiaquifer groundwater resources. Sci Total Environ 2023; 870:161729. [PMID: 36682544 DOI: 10.1016/j.scitotenv.2023.161729] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 01/16/2023] [Accepted: 01/16/2023] [Indexed: 06/17/2023]
Abstract
Rapidly growing demands and climate change stresses water resources worldwide and leads to highly competitive situations between the environment and socio-economic development of a region, calling for a smart and modelling driven water resources management. However, data scarcity often prevents the realisation of a comprehensive, nation-wide resources model, which provides reliable and spatially discretized results of water resources development. We present a workflow approach to set up a large-scale multi-aquifer model, overcoming data shortage by stepwise calibration and integrating hydrological and numerical groundwater flow modelling into a coupled system. The study aims to develop such a system to assess how groundwater resources react on anthropogenic impacts on the example of the Kingdom of Jordan, one of the water poorest countries on globe. Simulated heads reliably resembled the monitored ones in >70 % of the observation wells. That makes us confident, the model represents all the states well from 1970, prior to the intense development of the country until 2015. The water balance shows an annual deficit of 1.16 million cubic meter (MCM) due to an overdraft. The discharge to the Dead Sea increased from 564 MCM/yr to 696 MCM/yr over the time period. Regional drawdowns of >250 m and groundwater depression with an extension of approx.100 km are observable in both large aquifer complexes. Most severe areas in the upper calcareous aquifer are located in the north of Amman and practically in all urban and agricultural agglomerations across the country. Groundwater tables in the deeper sandstone aquifer are particularly affected in the south as well as in the wider vicinity of the Dead Sea as consequence of its continuous dropping. Simulations of the future development of the groundwater tables indicate a severe deterioration of the situation with further declines in groundwater levels of up to 70 m.
Collapse
Affiliation(s)
- T Rödiger
- Helmholtz-Centre for Environmental Research UFZ, Dept. Catchment Hydrology, Theodor-Lieser-Str. 4, 06120 Halle, Germany.
| | - S Geyer
- Helmholtz-Centre for Environmental Research UFZ, Dept. Catchment Hydrology, Theodor-Lieser-Str. 4, 06120 Halle, Germany
| | - T Odeh
- The Hashemite University, Prince Al Hassan bin Talal College for Natural Resources and Environment, Dept. of Water Management and Environment, Zarqa, Jordan
| | - C Siebert
- Helmholtz-Centre for Environmental Research UFZ, Dept. Catchment Hydrology, Theodor-Lieser-Str. 4, 06120 Halle, Germany
| |
Collapse
|
10
|
Chen M, Janssen ABG, de Klein JJM, Du X, Lei Q, Li Y, Zhang T, Pei W, Kroeze C, Liu H. Comparing critical source areas for the sediment and nutrients of calibrated and uncalibrated models in a plateau watershed in southwest China. J Environ Manage 2023; 326:116712. [PMID: 36402022 DOI: 10.1016/j.jenvman.2022.116712] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 10/24/2022] [Accepted: 11/03/2022] [Indexed: 06/16/2023]
Abstract
Controlling non-point source pollution is often difficult and costly. Therefore, focusing on areas that contribute the most, so-called critical source areas (CSAs), can have economic and ecological benefits. CSAs are often determined using a modelling approach, yet it has proved difficult to calibrate the models in regions with limited data availability. Since identifying CSAs is based on the relative contributions of sub-basins to the total load, it has been suggested that uncalibrated models could be used to identify CSAs to overcome data scarcity issues. Here, we use the SWAT model to study the extent to which an uncalibrated model can be applied to determine CSAs. We classify and rank sub-basins to identify CSAs for sediment, total nitrogen (TN), and total phosphorus (TP) in the Fengyu River Watershed (China) with and without model calibration. The results show high similarity (81%-93%) between the identified sediment and TP CSA number and locations before and after calibration both on the yearly and seasonal scale. For TN alone, the results show moderate similarity on the yearly scale (73%). This may be because, in our study area, TN is determined more by groundwater flow after calibration than by surface water flow. We conclude that CSA identification with the uncalibrated model for TP is always good because its CSA number and locations changed least, and for sediment, it is generally satisfactory. The use of the uncalibrated model for TN is acceptable, as its CSA locations did not change after calibration; however, the TN CSA number changed by over 60% compared to the figures before calibration on both yearly and seasonal scales. Therefore, we advise using an uncalibrated model to identify CSAs for TN only if water yield composition changes are expected to be limited. This study shows that CSAs can be identified based on relative loading estimates with uncalibrated models in data-deficient regions.
Collapse
Affiliation(s)
- Meijun Chen
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences/Key Laboratory of Nonpoint Source Pollution Control, Ministry of Agriculture and Rural Affairs, Beijing, 100081, China; Water Systems and Global Change Group, Department of Environmental Sciences, Wageningen University and Research, PO Box 47, 6700AA Wageningen, the Netherlands; Aquatic Ecology and Water Quality Management Group, Department of Environmental Sciences, Wageningen University and Research, PO Box, 47, 6700AA, Wageningen, the Netherlands.
| | - Annette B G Janssen
- Water Systems and Global Change Group, Department of Environmental Sciences, Wageningen University and Research, PO Box 47, 6700AA Wageningen, the Netherlands
| | - Jeroen J M de Klein
- Aquatic Ecology and Water Quality Management Group, Department of Environmental Sciences, Wageningen University and Research, PO Box, 47, 6700AA, Wageningen, the Netherlands
| | - Xinzhong Du
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences/Key Laboratory of Nonpoint Source Pollution Control, Ministry of Agriculture and Rural Affairs, Beijing, 100081, China.
| | - Qiuliang Lei
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences/Key Laboratory of Nonpoint Source Pollution Control, Ministry of Agriculture and Rural Affairs, Beijing, 100081, China
| | - Ying Li
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, CAS, Beijing 100101, PR China
| | - Tianpeng Zhang
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences/Key Laboratory of Nonpoint Source Pollution Control, Ministry of Agriculture and Rural Affairs, Beijing, 100081, China
| | - Wei Pei
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences/Key Laboratory of Nonpoint Source Pollution Control, Ministry of Agriculture and Rural Affairs, Beijing, 100081, China
| | - Carolien Kroeze
- Water Systems and Global Change Group, Department of Environmental Sciences, Wageningen University and Research, PO Box 47, 6700AA Wageningen, the Netherlands
| | - Hongbin Liu
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences/Key Laboratory of Nonpoint Source Pollution Control, Ministry of Agriculture and Rural Affairs, Beijing, 100081, China.
| |
Collapse
|
11
|
Arshad M, Qureshi M, Inam O, Omer H. Transfer learning in deep neural network-based receiver coil sensitivity map estimation. MAGMA 2021; 34:717-728. [PMID: 33772694 DOI: 10.1007/s10334-021-00919-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2020] [Revised: 03/07/2021] [Accepted: 03/09/2021] [Indexed: 11/30/2022]
Abstract
INTRODUCTION The success of parallel Magnetic Resonance Imaging algorithms like SENSitivity Encoding (SENSE) depends on an accurate estimation of the receiver coil sensitivity maps. Deep learning-based receiver coil sensitivity map estimation depends upon the size of training dataset and generalization capabilities of the trained neural network. When there is a mismatch between the training and testing datasets, retraining of the neural networks is required from a scratch which is costly and time consuming. MATERIALS AND METHODS A transfer learning approach, i.e., end-to-end fine-tuning is proposed to address the data scarcity and generalization problems of deep learning-based receiver coil sensitivity map estimation. First, generalization capabilities of a pre-trained U-Net (initially trained on 1.5T receiver coil sensitivity maps) are thoroughly assessed for 3T receiver coil sensitivity map estimation. Later, end-to-end fine-tuning is performed on the pre-trained U-Net to estimate the 3T receiver coil sensitivity maps. RESULT AND CONCLUSION Peak Signal-to-Noise Ratio, Root Mean Square Error and central line profiles (of the SENSE reconstructed images) show a successful SENSE reconstruction by utilizing the receiver coil sensitivity maps estimated by the proposed method.
Collapse
Affiliation(s)
- Madiha Arshad
- Medical Image Processing Research Group (MIPRG), Department of Electrical and Computer Engineering, COMSATS University, Islamabad, Pakistan.
| | - Mahmood Qureshi
- Medical Image Processing Research Group (MIPRG), Department of Electrical and Computer Engineering, COMSATS University, Islamabad, Pakistan
| | - Omair Inam
- Medical Image Processing Research Group (MIPRG), Department of Electrical and Computer Engineering, COMSATS University, Islamabad, Pakistan
| | - Hammad Omer
- Medical Image Processing Research Group (MIPRG), Department of Electrical and Computer Engineering, COMSATS University, Islamabad, Pakistan
| |
Collapse
|
12
|
Arshad M, Qureshi M, Inam O, Omer H. Transfer learning in deep neural network based under-sampled MR image reconstruction. Magn Reson Imaging 2020; 76:96-107. [PMID: 32980504 DOI: 10.1016/j.mri.2020.09.018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Revised: 08/10/2020] [Accepted: 09/08/2020] [Indexed: 01/27/2023]
Abstract
In Magnetic Resonance Imaging (MRI), the success of deep learning-based under-sampled MR image reconstruction depends on: (i) size of the training dataset, (ii) generalization capabilities of the trained neural network. Whenever there is a mismatch between the training and testing data, there is a need to retrain the neural network from scratch with thousands of MR images obtained using the same protocol. This may not be possible in MRI as it is costly and time consuming to acquire data. In this research, a transfer learning approach i.e. end-to-end fine tuning is proposed for U-Net to address the data scarcity and generalization problems of deep learning-based MR image reconstruction. First the generalization capabilities of a pre-trained U-Net (initially trained on the human brain images of 1.5 T scanner) are assessed for: (a) MR images acquired from MRI scanners of different magnetic field strengths, (b) MR images of different anatomies and (c) MR images under-sampled by different acceleration factors. Later, end-to-end fine tuning of the pre-trained U-Net is proposed for the reconstruction of the above-mentioned MR images (i.e. (a), (b) and (c)). The results show successful reconstructions obtained from the proposed method as reflected by the Structural SIMilarity index, Root Mean Square Error, Peak Signal-to-Noise Ratio and central line profile of the reconstructed images.
Collapse
Affiliation(s)
- Madiha Arshad
- Medical Image Processing Research Group (MIPRG), Department of Electrical and Computer Engineering, COMSATS University Islamabad, Pakistan
| | - Mahmood Qureshi
- Medical Image Processing Research Group (MIPRG), Department of Electrical and Computer Engineering, COMSATS University Islamabad, Pakistan
| | - Omair Inam
- Medical Image Processing Research Group (MIPRG), Department of Electrical and Computer Engineering, COMSATS University Islamabad, Pakistan
| | - Hammad Omer
- Medical Image Processing Research Group (MIPRG), Department of Electrical and Computer Engineering, COMSATS University Islamabad, Pakistan.
| |
Collapse
|
13
|
Borzooei S, Amerlinck Y, Panepinto D, Abolfathi S, Nopens I, Scibilia G, Meucci L, Zanetti MC. Energy optimization of a wastewater treatment plant based on energy audit data: small investment with high return. Environ Sci Pollut Res Int 2020; 27:17972-17985. [PMID: 32170609 DOI: 10.1007/s11356-020-08277-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Accepted: 02/28/2020] [Indexed: 06/10/2023]
Abstract
Ambitious energy targets in the 2020 European climate and energy package have encouraged many stakeholders to explore and implement measures improving the energy efficiency of water and wastewater treatment facilities. Model-based process optimization can improve the energy efficiency of wastewater treatment plants (WWTP) with modest investment and a short payback period. However, such methods are not widely practiced due to the labor-intensive workload required for monitoring and data collection processes. This study offers a multi-step simulation-based methodology to evaluate and optimize the energy consumption of the largest Italian WWTP using limited, preliminary energy audit data. An integrated modeling platform linking wastewater treatment processes, energy demand, and production sub-models is developed. The model is calibrated using a stepwise procedure based on available data. Further, a scenario-based optimization approach is proposed to obtain the non-dominated and optimized performance of the WWTP. The results confirmed that up to 5000 MWh annual energy saving in addition to improved effluent quality could be achieved in the studied case through operational changes only.
Collapse
Affiliation(s)
- Sina Borzooei
- Department of Civil and Environmental Engineering, Politecnico di Torino, Corso Duca degli Abruzzi, 10129, Torino, Italy.
| | - Youri Amerlinck
- Department of Data Analysis and Mathematical Modelling, BIOMATH, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000, Ghent, Belgium
| | - Deborah Panepinto
- Department of Civil and Environmental Engineering, Politecnico di Torino, Corso Duca degli Abruzzi, 10129, Torino, Italy
| | - Soroush Abolfathi
- Warwick Water Research Group, School of Engineering, The University of Warwick, Coventry, CV4 7AL, UK
| | - Ingmar Nopens
- Department of Data Analysis and Mathematical Modelling, BIOMATH, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000, Ghent, Belgium
| | - Gerardo Scibilia
- SMAT (Società Metropolitana Acque Torino) Research Center, Corso Unità d'Italia 235/3, 10127, Torino, Italy
| | - Lorenza Meucci
- SMAT (Società Metropolitana Acque Torino) Research Center, Corso Unità d'Italia 235/3, 10127, Torino, Italy
| | - Maria Chiara Zanetti
- Department of Civil and Environmental Engineering, Politecnico di Torino, Corso Duca degli Abruzzi, 10129, Torino, Italy
| |
Collapse
|
14
|
Duhalde DJ, Arumí JL, Oyarzún RA, Rivera DA. Fuzzy-based assessment of groundwater intrinsic vulnerability of a volcanic aquifer in the Chilean Andean Valley. Environ Monit Assess 2018; 190:390. [PMID: 29892906 DOI: 10.1007/s10661-018-6758-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Accepted: 05/31/2018] [Indexed: 06/08/2023]
Abstract
A fuzzy logic approach has been proposed to face the uncertainty caused by sparse data in the assessment of the intrinsic vulnerability of a groundwater system with parametric methods in Las Trancas Valley, Andean Mountain, south-central Chile, a popular touristic place in Chile, but lacking of a centralized drinking and sewage water public systems; this situation is a potentially source of groundwater pollution. Based on DRASTIC, GOD, and EKv and the expert knowledge of the study area, the Mamdani fuzzy approach was generated and the spatial data were processed by ArcGIS. The groundwater system exhibited areas with high, medium, and low intrinsic vulnerability indices. The fuzzy approach results were compared with traditional methods results, which, in general, have shown a good spatial agreement even though significant changes were also identified in the spatial distribution of the indices. The Mamdani logic approach has shown to be a useful and practical tool to assess the intrinsic vulnerability of an aquifer under sparse data conditions.
Collapse
Affiliation(s)
- Denisse J Duhalde
- Departamento Ingeniería de Minas, Universidad de La Serena, Benavente 980, La Serena, Chile.
| | - José L Arumí
- Universidad de Concepción, Chillán, Bio Bio, Chile
| | - Ricardo A Oyarzún
- Departamento Ingeniería de Minas, Universidad de La Serena, Benavente 980, La Serena, Chile
- CEAZA, La Serena, Chile
| | | |
Collapse
|
15
|
Gampe D, Nikulin G, Ludwig R. Using an ensemble of regional climate models to assess climate change impacts on water scarcity in European river basins. Sci Total Environ 2016; 573:1503-1518. [PMID: 27539825 DOI: 10.1016/j.scitotenv.2016.08.053] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Revised: 08/06/2016] [Accepted: 08/07/2016] [Indexed: 06/06/2023]
Abstract
Climate change will likely increase pressure on the water balances of Mediterranean basins due to decreasing precipitation and rising temperatures. To overcome the issue of data scarcity the hydrological relevant variables total runoff, surface evaporation, precipitation and air temperature are taken from climate model simulations. The ensemble applied in this study consists of 22 simulations, derived from different combinations of four General Circulation Models (GCMs) forcing different Regional Climate Models (RCMs) and two Representative Concentration Pathways (RCPs) at ~12km horizontal resolution provided through the EURO-CORDEX initiative. Four river basins (Adige, Ebro, Evrotas and Sava) are selected and climate change signals for the future period 2035-2065 as compared to the reference period 1981-2010 are investigated. Decreased runoff and evaporation indicate increased water scarcity over the Ebro and the Evrotas, as well as the southern parts of the Adige and the Sava, resulting from a temperature increase of 1-3° and precipitation decrease of up to 30%. Most severe changes are projected for the summer months indicating further pressure on the river basins already at least partly characterized by flow intermittency. The widely used Falkenmark indicator is presented and confirms this tendency and shows the necessity for spatially distributed analysis and high resolution projections. Related uncertainties are addressed by the means of a variance decomposition and model agreement to determine the robustness of the projections. The study highlights the importance of high resolution climate projections and represents a feasible approach to assess climate impacts on water scarcity also in regions that suffer from data scarcity.
Collapse
Affiliation(s)
- David Gampe
- Department of Geography, Ludwig-Maximilians-Universität, Munich, Germany.
| | - Grigory Nikulin
- Rossby Centre, Swedish Meteorological and Hydrological Institute (SMHI), Norrkopping, Sweden
| | - Ralf Ludwig
- Department of Geography, Ludwig-Maximilians-Universität, Munich, Germany
| |
Collapse
|