1
|
Wang W, Che S, Liu W, Tuo Y, Du Y, Zhang Z. A lightweight large receptive field network LrfSR for image super-resolution. Sci Rep 2025; 15:12535. [PMID: 40216924 PMCID: PMC11992099 DOI: 10.1038/s41598-025-96796-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2024] [Accepted: 03/31/2025] [Indexed: 04/14/2025] Open
Abstract
Deep convolutional neural networks have demonstrated excellent performance in the field of single-image super-resolution (SISR) reconstruction. However, existing methods often suffer from issues such as large number of parameters, intensive computation, and high latency, which limit the application of deep convolutional neural networks on devices with low computational resources. To solve these problems, this paper proposes a lightweight large receptive field network for image super-resolution (LrfSR). The innovations of this paper mainly include the following aspects. Firstly, we design an information distillation module based on large receptive field (LrfDM). The module achieves large receptive field by dilated convolution, and the enlarged receptive field facilitates the network to capture more pixel-to-pixel relationships and fuse multi-scale information in the feature distillation stage. This design effectively extracts the high-frequency features of the image, which can be demonstrated by the feature map. Secondly, a more efficient attention mechanism is introduced into the network, designed as ECCA and SESA, respectively, which achieves an improvement in super-resolution image quality with fewer network parameters. Experiments on Set5, Set14, B100, Urban100 and Manga109 datasets show that the LrfSR model achieves 4-fold super-resolution Peak Signal-to-Noise Ratio (PSNR) values of 32.23 dB, 28.65 dB, 27.59 dB, 26.36 dB and 30.53 dB, which is better than the existing model LKDN etc. Meanwhile, both qualitative and quantitative experimental results show that the LrfSR model explores the potential of large receptive fields in lightweight image super-resolution networks and successfully achieves a balance between high-quality image reconstruction and limited resources. The code and models are available at https://github.com/wanqin557/LrfSR .
Collapse
Affiliation(s)
- Wanqin Wang
- College of Computer Science and Mathematics, Central South University of Forestry and Technology, Changsha, 410004, China
| | - Shengbing Che
- College of Computer Science and Mathematics, Central South University of Forestry and Technology, Changsha, 410004, China.
| | - Wenxin Liu
- College of Computer Science and Mathematics, Central South University of Forestry and Technology, Changsha, 410004, China
| | - Yangzhuo Tuo
- College of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha, 410004, China
| | - Yafei Du
- College of Computer Science and Mathematics, Central South University of Forestry and Technology, Changsha, 410004, China
| | - Zixuan Zhang
- College of Computer Science and Mathematics, Central South University of Forestry and Technology, Changsha, 410004, China
| |
Collapse
|
2
|
Li C, Yang D, Yao S, Wang S, Wu Y, Zhang L, Li Q, Cho KIK, Seitz-Holland J, Ning L, Legarreta JH, Rathi Y, Westin CF, O'Donnell LJ, Sochen NA, Pasternak O, Zhang F. DDEvENet: Evidence-based ensemble learning for uncertainty-aware brain parcellation using diffusion MRI. Comput Med Imaging Graph 2025; 120:102489. [PMID: 39787735 PMCID: PMC11792617 DOI: 10.1016/j.compmedimag.2024.102489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Revised: 12/04/2024] [Accepted: 12/30/2024] [Indexed: 01/12/2025]
Abstract
In this study, we developed an Evidential Ensemble Neural Network based on Deep learning and Diffusion MRI, namely DDEvENet, for anatomical brain parcellation. The key innovation of DDEvENet is the design of an evidential deep learning framework to quantify predictive uncertainty at each voxel during a single inference. To do so, we design an evidence-based ensemble learning framework for uncertainty-aware parcellation to leverage the multiple dMRI parameters derived from diffusion MRI. Using DDEvENet, we obtained accurate parcellation and uncertainty estimates across different datasets from healthy and clinical populations and with different imaging acquisitions. The overall network includes five parallel subnetworks, where each is dedicated to learning the FreeSurfer parcellation for a certain diffusion MRI parameter. An evidence-based ensemble methodology is then proposed to fuse the individual outputs. We perform experimental evaluations on large-scale datasets from multiple imaging sources, including high-quality diffusion MRI data from healthy adults and clinically diffusion MRI data from participants with various brain diseases (schizophrenia, bipolar disorder, attention-deficit/hyperactivity disorder, Parkinson's disease, cerebral small vessel disease, and neurosurgical patients with brain tumors). Compared to several state-of-the-art methods, our experimental results demonstrate highly improved parcellation accuracy across the multiple testing datasets despite the differences in dMRI acquisition protocols and health conditions. Furthermore, thanks to the uncertainty estimation, our DDEvENet approach demonstrates a good ability to detect abnormal brain regions in patients with lesions that are consistent with expert-drawn results, enhancing the interpretability and reliability of the segmentation results.
Collapse
Affiliation(s)
- Chenjun Li
- University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| | - Dian Yang
- University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| | - Shun Yao
- The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Shuyue Wang
- The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Ye Wu
- Nanjing University of Science and Technology, Nanjing, Jiangsu, China
| | - Le Zhang
- University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| | - Qiannuo Li
- East China University of Science and Technology, Shanghai, China
| | | | | | | | | | | | | | | | - Nir A Sochen
- School of Mathematical Sciences, University of Tel Aviv, Tel Aviv, Israel
| | | | - Fan Zhang
- University of Electronic Science and Technology of China, Chengdu, Sichuan, China.
| |
Collapse
|
3
|
Chen F, Su B, Jia Z. TUH-NAS: A Triple-Unit NAS Network for Hyperspectral Image Classification. SENSORS (BASEL, SWITZERLAND) 2024; 24:7834. [PMID: 39686370 DOI: 10.3390/s24237834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2024] [Revised: 11/26/2024] [Accepted: 12/05/2024] [Indexed: 12/18/2024]
Abstract
Over the last few years, neural architecture search (NAS) technology has achieved good results in hyperspectral image classification. Nevertheless, existing NAS-based classification methods have not specifically focused on the complex connection between spectral and spatial data. Strengthening the integration of spatial and spectral features is crucial to boosting the overall classification efficacy of hyperspectral images. In this paper, a triple-unit hyperspectral NAS network (TUH-NAS) aimed at hyperspectral image classification is introduced, where the fusion unit emphasizes the enhancement of the intrinsic relationship between spatial and spectral information. We designed a new hyperspectral image attention mechanism module to increase the focus on critical regions and enhance sensitivity to priority areas. We also adopted a composite loss function to enhance the model's focus on hard-to-classify samples. Experimental evaluations on three publicly accessible hyperspectral datasets demonstrated that, despite utilizing a limited number of samples, TUH-NAS outperforms existing NAS classification methods in recognizing object boundaries.
Collapse
Affiliation(s)
- Feng Chen
- School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454003, China
| | - Baishun Su
- School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454003, China
| | - Zongpu Jia
- School of Software, Henan Polytechnic University, Jiaozuo 454000, China
| |
Collapse
|
4
|
Giansanti D. Bridging the Gap: Exploring Opportunities, Challenges, and Problems in Integrating Assistive Technologies, Robotics, and Automated Machines into the Health Domain. Healthcare (Basel) 2023; 11:2462. [PMID: 37685498 PMCID: PMC10487463 DOI: 10.3390/healthcare11172462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 08/11/2023] [Accepted: 08/31/2023] [Indexed: 09/10/2023] Open
Abstract
The field of healthcare is continually evolving and advancing due to new technologies and innovations [...].
Collapse
Affiliation(s)
- Daniele Giansanti
- National Centre for Innovative Technologies in Public Health, Italian National Institute of Health, 00161 Rome, Italy
| |
Collapse
|
5
|
Zaharia C, Popescu V, Sandu F. Hardware-Software Partitioning for Real-Time Object Detection Using Dynamic Parameter Optimization. SENSORS (BASEL, SWITZERLAND) 2023; 23:4894. [PMID: 37430806 DOI: 10.3390/s23104894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 05/12/2023] [Accepted: 05/16/2023] [Indexed: 07/12/2023]
Abstract
Computer vision algorithms implementations, especially for real-time applications, are present in a variety of devices that we are currently using (from smartphones or automotive applications to monitoring/security applications) and pose specific challenges, memory bandwidth or energy consumption (e.g., for mobility) being the most notable ones. This paper aims at providing a solution to improve the overall quality of real-time object detection computer vision algorithms using a hybrid hardware-software implementation. To this end, we explore the methods for a proper allocation of algorithm components towards hardware (as IP Cores) and the interfacing between hardware and software. Addressing specific design constraints, the relationship between the above components allows embedded artificial intelligence to select the operating hardware blocks (IP cores)-in the configuration phase-and to dynamically change the parameters of the aggregated hardware resources-in the instantiation phase, similar to the concretization of a class into a software object. The conclusions show the benefits of using hybrid hardware-software implementations, as well as major gains from using IP Cores, managed by artificial intelligence, for an object detection use-case, implemented on a FPGA demonstrator built around a Xilinx Zynq-7000 SoC Mini-ITX sub-system.
Collapse
Affiliation(s)
- Corneliu Zaharia
- Department of Electronics and Computers, Transilvania University, Bdul Eroilor 29, 500068 Brașov, Romania
| | - Vlad Popescu
- Department of Electronics and Computers, Transilvania University, Bdul Eroilor 29, 500068 Brașov, Romania
| | - Florin Sandu
- Department of Electronics and Computers, Transilvania University, Bdul Eroilor 29, 500068 Brașov, Romania
| |
Collapse
|
6
|
Karras A, Karras C, Schizas N, Avlonitis M, Sioutas S. AutoML with Bayesian Optimizations for Big Data Management. INFORMATION 2023. [DOI: 10.3390/info14040223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2023] Open
Abstract
The field of automated machine learning (AutoML) has gained significant attention in recent years due to its ability to automate the process of building and optimizing machine learning models. However, the increasing amount of big data being generated has presented new challenges for AutoML systems in terms of big data management. In this paper, we introduce Fabolas and learning curve extrapolation as two methods for accelerating hyperparameter optimization. Four methods for quickening training were presented including Bag of Little Bootstraps, k-means clustering for Support Vector Machines, subsample size selection for gradient descent, and subsampling for logistic regression. Additionally, we also discuss the use of Markov Chain Monte Carlo (MCMC) methods and other stochastic optimization techniques to improve the efficiency of AutoML systems in managing big data. These methods enhance various facets of the training process, making it feasible to combine them in diverse ways to gain further speedups. We review several combinations that have potential and provide a comprehensive understanding of the current state of AutoML and its potential for managing big data in various industries. Furthermore, we also mention the importance of parallel computing and distributed systems to improve the scalability of the AutoML systems while working with big data.
Collapse
Affiliation(s)
- Aristeidis Karras
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece
| | - Christos Karras
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece
| | - Nikolaos Schizas
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece
| | - Markos Avlonitis
- Department of Informatics, Ionian University, 49100 Kerkira, Greece
| | - Spyros Sioutas
- Computer Engineering and Informatics Department, University of Patras, 26504 Patras, Greece
| |
Collapse
|