1
|
Yang W, Qin Y, Wang K, Hu Y, Huang R, Chen Y. A discrete convolutional network for entity relation extraction. Neural Netw 2025; 184:107117. [PMID: 39798351 DOI: 10.1016/j.neunet.2024.107117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 11/02/2024] [Accepted: 12/31/2024] [Indexed: 01/15/2025]
Abstract
Relation extraction independently verifies all entity pairs in a sentence to identify predefined relationships between named entities. Because these entity pairs share the same contextual features of a sentence, they lead to a complicated semantic structure. To distinguish semantic expressions between relation instances, manually designed rules or elaborate deep architectures are usually applied to learn task-relevant representations. In this paper, a discrete convolutional network is proposed to incorporate discrete linguistic interactions and deep feature weighting. This network applies a discretization strategy to fix parameters of convolutional kernels into ternary values. Then, these discretized kernels are used to learn discrete semantic structures from vectorized token representations. Our approach leverages the ability of discrete CNNs to capture discrete linguistic patterns of a sentence, thereby maintaining model expressiveness and improving performance in the relation extraction task. Furthermore, our method has the advantages of reducing the overfitting problem caused by depending on prior knowledge and decreasing the computational complexity by reducing the number of trainable parameters. Our model is evaluated on five widely used benchmark datasets. It achieves state-of-the-art performance, outperforming all compared related works. Experimental results also demonstrate that, compared with traditional CNN networks, it achieves an average improvement of 14.66% in F1-score and accelerates training by an average of 17.46%, highlighting the efficiency and effectiveness of our model in the relation extraction task.
Collapse
Affiliation(s)
- Weizhe Yang
- State Key Laboratory of Public Big Data, Guizhou University, 550025, China; Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Guizhou University, 550025, China.
| | - Yongbin Qin
- State Key Laboratory of Public Big Data, Guizhou University, 550025, China; Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Guizhou University, 550025, China; College of Computer Science and Technology, Guizhou University, 550025, China.
| | - Kai Wang
- Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Guizhou University, 550025, China; College of Computer Science and Technology, Guizhou University, 550025, China.
| | - Ying Hu
- Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Guizhou University, 550025, China; College of Computer Science and Technology, Guizhou University, 550025, China.
| | - Ruizhang Huang
- State Key Laboratory of Public Big Data, Guizhou University, 550025, China; Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Guizhou University, 550025, China; College of Computer Science and Technology, Guizhou University, 550025, China.
| | - Yanping Chen
- State Key Laboratory of Public Big Data, Guizhou University, 550025, China; Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Guizhou University, 550025, China; College of Computer Science and Technology, Guizhou University, 550025, China.
| |
Collapse
|
2
|
Su Y, Seng KP, Ang LM, Smith J. Binary Neural Networks in FPGAs: Architectures, Tool Flows and Hardware Comparisons. SENSORS (BASEL, SWITZERLAND) 2023; 23:9254. [PMID: 38005640 PMCID: PMC10675041 DOI: 10.3390/s23229254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Revised: 10/14/2023] [Accepted: 11/13/2023] [Indexed: 11/26/2023]
Abstract
Binary neural networks (BNNs) are variations of artificial/deep neural network (ANN/DNN) architectures that constrain the real values of weights to the binary set of numbers {-1,1}. By using binary values, BNNs can convert matrix multiplications into bitwise operations, which accelerates both training and inference and reduces hardware complexity and model sizes for implementation. Compared to traditional deep learning architectures, BNNs are a good choice for implementation in resource-constrained devices like FPGAs and ASICs. However, BNNs have the disadvantage of reduced performance and accuracy because of the tradeoff due to binarization. Over the years, this has attracted the attention of the research community to overcome the performance gap of BNNs, and several architectures have been proposed. In this paper, we provide a comprehensive review of BNNs for implementation in FPGA hardware. The survey covers different aspects, such as BNN architectures and variants, design and tool flows for FPGAs, and various applications for BNNs. The final part of the paper gives some benchmark works and design tools for implementing BNNs in FPGAs based on established datasets used by the research community.
Collapse
Affiliation(s)
- Yuanxin Su
- School of AI and Advanced Computing, Xi’an Jiaotong Liverpool University, Suzhou 215000, China;
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3GJ, UK;
| | - Kah Phooi Seng
- School of AI and Advanced Computing, Xi’an Jiaotong Liverpool University, Suzhou 215000, China;
- School of Computer Science, Queensland University of Technology, Brisbane City, QLD 4000, Australia;
- School of Science Technology and Engineering, University of the Sunshine Coast, Sippy Downs, QLD 4556, Australia
| | - Li Minn Ang
- School of Computer Science, Queensland University of Technology, Brisbane City, QLD 4000, Australia;
| | - Jeremy Smith
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3GJ, UK;
| |
Collapse
|
3
|
Yuan C, Agaian SS. A comprehensive review of Binary Neural Network. Artif Intell Rev 2023. [DOI: 10.1007/s10462-023-10464-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/07/2023]
|
4
|
Yan Z, Chen Y, Song J, Zhu J. Multimodal feature fusion based on object relation for video captioning. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY 2022. [DOI: 10.1049/cit2.12071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Zhiwen Yan
- School of Computing Science South China Normal University Guangzhou China
| | - Ying Chen
- School of Computing Science South China Normal University Guangzhou China
| | - Jinlong Song
- School of Computing Science South China Normal University Guangzhou China
| | - Jia Zhu
- Key Laboratory of Intelligent Education Technology and Application of Zhejiang Province Zhejiang Normal University Jinhua China
| |
Collapse
|
5
|
Zhang C, Liu X. Feature Extraction of Ancient Chinese Characters Based on Deep Convolution Neural Network and Big Data Analysis. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:2491116. [PMID: 34504520 PMCID: PMC8423538 DOI: 10.1155/2021/2491116] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 08/14/2021] [Accepted: 08/18/2021] [Indexed: 11/17/2022]
Abstract
In recent years, deep learning has made good progress and has been applied to face recognition, video monitoring, image processing, and other fields. In this big data background, deep convolution neural network has also received more and more attention. In order to extract the ancient Chinese characters effectively, the paper will discuss the structure model, pool process, and network training of deep convolution neural network and compare the algorithm with the traditional machine learning algorithm. The results show that the accuracy and recall rate of the Chinese characters in the plaque of Ming Dynasty can reach the peak, 81.38% and 81.31%, respectively. When the number of training samples increases to 50, the recognition rate of MFA is 99.72%, which is much higher than other algorithms. This shows that the algorithm based on deep convolution neural network and big data analysis has excellent performance and can effectively identify the Chinese characters under different dynasties, different sample sizes, and different interference factors, which can provide a powerful reference for the extraction of ancient Chinese characters.
Collapse
Affiliation(s)
- Cheng Zhang
- College of Literature and Journalism, Chengdu University, Chengdu 610106, Sichuan, China
| | - Xingjun Liu
- School of Humanities and Communication, Sanya University, Sanya 572022, Hainan, China
| |
Collapse
|
6
|
Medhat F, Chesmore D, Robinson J. Masked Conditional Neural Networks for sound classification. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106073] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
7
|
|
8
|
Cao D, Chen Z, Gao L. An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES 2020. [DOI: 10.1186/s13673-020-00219-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Abstract
Object detection methods aim to identify all target objects in the target image and determine the categories and position information in order to achieve machine vision understanding. Numerous approaches have been proposed to solve this problem, mainly inspired by methods of computer vision and deep learning. However, existing approaches always perform poorly for the detection of small, dense objects, and even fail to detect objects with random geometric transformations. In this study, we compare and analyse mainstream object detection algorithms and propose a multi-scaled deformable convolutional object detection network to deal with the challenges faced by current methods. Our analysis demonstrates a strong performance on par, or even better, than state of the art methods. We use deep convolutional networks to obtain multi-scaled features, and add deformable convolutional structures to overcome geometric transformations. We then fuse the multi-scaled features by up sampling, in order to implement the final object recognition and region regress. Experiments prove that our suggested framework improves the accuracy of detecting small target objects with geometric deformation, showing significant improvements in the trade-off between accuracy and speed.
Collapse
|
9
|
Rana N, Latiff MSA, Abdulhamid SM, Chiroma H. Whale optimization algorithm: a systematic review of contemporary applications, modifications and developments. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-04849-z] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
|
10
|
|
11
|
Ali N, Zafar B, Iqbal MK, Sajid M, Younis MY, Dar SH, Mahmood MT, Lee IH. Modeling global geometric spatial information for rotation invariant classification of satellite images. PLoS One 2019; 14:e0219833. [PMID: 31323065 PMCID: PMC6641163 DOI: 10.1371/journal.pone.0219833] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Accepted: 07/02/2019] [Indexed: 11/28/2022] Open
Abstract
The classification of high-resolution satellite images is an open research problem for computer vision research community. In last few decades, the Bag of Visual Word (BoVW) model has been used for the classification of satellite images. In BoVW model, an orderless histogram of visual words without any spatial information is used as image signature. The performance of BoVW model suffers due to this orderless nature and addition of spatial clues are reported beneficial for scene and geographical classification of images. Most of the image representations that can compute image spatial information as are not invariant to rotations. A rotation invariant image representation is considered as one of the main requirement for satellite image classification. This paper presents a novel approach that computes the spatial clues for the histograms of BoVW model that is robust to the image rotations. The spatial clues are calculated by computing the histograms of orthogonal vectors. This is achieved by calculating the magnitude of orthogonal vectors between Pairs of Identical Visual Words (PIVW) relative to the geometric center of an image. The comparative analysis is performed with recently proposed research to obtain the best spatial feature representation for the satellite imagery. We evaluated the proposed research for image classification using three standard image benchmarks of remote sensing. The results and comparisons conducted to evaluate this research show that the proposed approach performs better in terms of classification accuracy for a variety of datasets based on satellite images.
Collapse
Affiliation(s)
- Nouman Ali
- Department of Software Engineering, Mirpur University of Science & Technology (MUST), Mirpur AJK, Pakistan
| | - Bushra Zafar
- Department of Computer Science, National Textile University, Faisalabad, Pakistan
- Department of Computer Science, Government College University, Faisalabad, Pakistan
| | | | - Muhammad Sajid
- Department of Electrical Engineering, Mirpur University of Science & Technology (MUST), Mirpur AJK, Pakistan
| | - Muhammad Yamin Younis
- Department of Mechanical Engineering, Mirpur University of Science & Technology (MUST), Mirpur AJK, Pakistan
| | - Saadat Hanif Dar
- Department of Software Engineering, Mirpur University of Science & Technology (MUST), Mirpur AJK, Pakistan
| | - Muhammad Tariq Mahmood
- School of Computer Science and Engineering, Korea University of Technology and Education, Cheonan, South Korea
| | - Ik Hyun Lee
- Department of Mechatronics, Korea Polytechnic University, Siheung-si, Gyeonggi-do, South Korea
- * E-mail:
| |
Collapse
|
12
|
Fernandez-Lozano C, Carballal A, Machado P, Santos A, Romero J. Visual complexity modelling based on image features fusion of multiple kernels. PeerJ 2019; 7:e7075. [PMID: 31346494 PMCID: PMC6642794 DOI: 10.7717/peerj.7075] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2018] [Accepted: 05/04/2019] [Indexed: 01/28/2023] Open
Abstract
Humans' perception of visual complexity is often regarded as one of the key principles of aesthetic order, and is intimately related to the physiological, neurological and, possibly, psychological characteristics of the human mind. For these reasons, creating accurate computational models of visual complexity is a demanding task. Building upon on previous work in the field (Forsythe et al., 2011; Machado et al., 2015) we explore the use of Machine Learning techniques to create computational models of visual complexity. For that purpose, we use a dataset composed of 800 visual stimuli divided into five categories, describing each stimulus by 329 features based on edge detection, compression error and Zipf's law. In an initial stage, a comparative analysis of representative state-of-the-art Machine Learning approaches is performed. Subsequently, we conduct an exhaustive outlier analysis. We analyze the impact of removing the extreme outliers, concluding that Feature Selection Multiple Kernel Learning obtains the best results, yielding an average correlation to humans' perception of complexity of 0.71 with only twenty-two features. These results outperform the current state-of-the-art, showing the potential of this technique for regression.
Collapse
Affiliation(s)
- Carlos Fernandez-Lozano
- Computer Science Department, Faculty of Computer Science, University of A Coruña, A Coruña, Spain
| | - Adrian Carballal
- Computer Science Department, Faculty of Computer Science, University of A Coruña, A Coruña, Spain
| | - Penousal Machado
- CISUC, Department of Informatics Engineering, University of Coimbra, Coimbra, Portugal
| | - Antonino Santos
- Computer Science Department, Faculty of Computer Science, University of A Coruña, A Coruña, Spain
| | - Juan Romero
- Computer Science Department, Faculty of Computer Science, University of A Coruña, A Coruña, Spain
| |
Collapse
|