1
|
Zou F, Zhou Z, Cai Q, Guo F, Zhang X. An Expressway ETC Missing Data Restoration Model Considering Multi-Attribute Features. SENSORS (BASEL, SWITZERLAND) 2023; 23:8745. [PMID: 37960444 PMCID: PMC10647695 DOI: 10.3390/s23218745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/16/2023] [Accepted: 10/23/2023] [Indexed: 11/15/2023]
Abstract
Electronic toll collection (ETC) data mining has become one of the hotspots in the research of intelligent expressway extension applications. Ensuring the integrity of ETC data stands as a critical measure in upholding data quality. ETC data are typical structured data, and although deep learning holds great potential in the ETC data restoration field, its applications in structured data are still in the early stages. To address these issues, we propose an expressway ETC missing transaction data restoration model considering multi-attribute features (MAF). Initially, we employ an entity embedding neural network (EENN) to automatically learn the representation of categorical features in multi-dimensional space, subsequently obtaining embedding vectors from networks that have been adequately trained. Then, we use long short-term memory (LSTM) neural networks to extract the changing patterns of vehicle speeds across several continuous sections. Ultimately, we merge the processed features with other features as input, using a three-layer multilayer perceptron (MLP) to complete the ETC data restoration. To validate the effectiveness of the proposed method, we conducted extensive tests using real ETC datasets and compared it with methods commonly used for structured data restoration. The experimental results demonstrate that the proposed method significantly outperforms others in restoration accuracy on two different datasets. Specifically, our sample data size reached around 400,000 entries. Compared to the currently best method, our method improved the restoration accuracy by 19.06% on non-holiday ETC datasets. The MAE and RMSE values reached optimal levels of 12.394 and 23.815, respectively. The fitting degree of the model to the dataset also reached its peak (R2 = 0.993). Meanwhile, the restoration stability of our method on holiday datasets increased by 5.82%. An ablation experiment showed that the EENN and LSTM modules contributed 7.60% and 9% to the restoration accuracy, as well as 4.68% and 7.29% to the restoration stability. This study indicates that the proposed method not only significantly improves the quality of ETC data but also meets the timeliness requirements of big data mining analysis.
Collapse
Affiliation(s)
- Fumin Zou
- Fujian Key Laboratory for Automotive Electronics and Electric Drive, Fujian University of Technology, Fuzhou 350118, China; (F.Z.); (Z.Z.); (F.G.); (X.Z.)
| | - Zhaoyi Zhou
- Fujian Key Laboratory for Automotive Electronics and Electric Drive, Fujian University of Technology, Fuzhou 350118, China; (F.Z.); (Z.Z.); (F.G.); (X.Z.)
| | - Qiqin Cai
- Fujian Key Laboratory for Automotive Electronics and Electric Drive, Fujian University of Technology, Fuzhou 350118, China; (F.Z.); (Z.Z.); (F.G.); (X.Z.)
- School of Mechanical Engineering and Automation, Huaqiao University, Xiamen 361021, China
| | - Feng Guo
- Fujian Key Laboratory for Automotive Electronics and Electric Drive, Fujian University of Technology, Fuzhou 350118, China; (F.Z.); (Z.Z.); (F.G.); (X.Z.)
- College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China
| | - Xinyi Zhang
- Fujian Key Laboratory for Automotive Electronics and Electric Drive, Fujian University of Technology, Fuzhou 350118, China; (F.Z.); (Z.Z.); (F.G.); (X.Z.)
| |
Collapse
|
2
|
Dynamic adaptive generative adversarial networks with multi-view temporal factorizations for hybrid recovery of missing traffic data. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-08064-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
3
|
Li Y, Du M, He S. Attention-Based Sequence-to-Sequence Model for Time Series Imputation. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1798. [PMID: 36554203 PMCID: PMC9778091 DOI: 10.3390/e24121798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 11/22/2022] [Accepted: 12/07/2022] [Indexed: 06/17/2023]
Abstract
Time series data are usually characterized by having missing values, high dimensionality, and large data volume. To solve the problem of high-dimensional time series with missing values, this paper proposes an attention-based sequence-to-sequence model to imputation missing values in time series (ASSM), which is a sequence-to-sequence model based on the combination of feature learning and data computation. The model consists of two parts, encoder and decoder. The encoder part is a BIGRU recurrent neural network and incorporates a self-attentive mechanism to make the model more capable of handling long-range time series; The decoder part is a GRU recurrent neural network and incorporates a cross-attentive mechanism into associate with the encoder part. The relationship weights between the generated sequences in the decoder part and the known sequences in the encoder part are calculated to achieve the purpose of focusing on the sequences with a high degree of correlation. In this paper, we conduct comparison experiments with four evaluation metrics and six models on four real datasets. The experimental results show that the model proposed in this paper outperforms the six comparative missing value interpolation algorithms.
Collapse
|
4
|
Missing Values Imputation Using Fuzzy K-Top Matching Value. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2022. [DOI: 10.1016/j.jksuci.2022.12.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
5
|
Li H, Cao Q, Bai Q, Li Z, Hu H. Multistate time series imputation using generative adversarial network with applications to traffic data. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07961-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
6
|
|
7
|
Zhong L, Chang Y, Wang F, Gao S. Distributed Missing Values Imputation Schemes for Plant-Wide Industrial Process Using Variational Bayesian Principal Component Analysis. Ind Eng Chem Res 2021. [DOI: 10.1021/acs.iecr.1c03860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Linsheng Zhong
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
| | - Yuqing Chang
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
- State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, China
| | - Fuli Wang
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
- State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, China
| | - Shihong Gao
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
- School of Automation and Software Engineering, Shanxi University, Taiyuan 030006, China
| |
Collapse
|
8
|
Lin JF, Li XY, Wang J, Wang LX, Hu XX, Liu JX. Study of Building Safety Monitoring by Using Cost-Effective MEMS Accelerometers for Rapid After-Earthquake Assessment with Missing Data. SENSORS 2021; 21:s21217327. [PMID: 34770632 PMCID: PMC8587995 DOI: 10.3390/s21217327] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 10/11/2021] [Accepted: 10/29/2021] [Indexed: 12/02/2022]
Abstract
Suffering from structural deterioration and natural disasters, the resilience of civil structures in the face of extreme loadings inevitably drops, which may lead to catastrophic structural failure and presents great threats to public safety. Earthquake-induced extreme loading is one of the major reasons behind the structural failure of buildings. However, many buildings in earthquake-prone areas of China lack safety monitoring, and prevalent structural health monitoring systems are generally very expensive and complicated for extensive applications. To facilitate cost-effective building-safety monitoring, this study investigates a method using cost-effective MEMS accelerometers for buildings’ rapid after-earthquake assessment. First, a parameter analysis of a cost-effective MEMS sensor is conducted to confirm its suitability for building-safety monitoring. Second, different from the existing investigations that tend to use a simplified building model or small-scaled frame structure excited by strong motions in laboratories, this study selects an in-service public building located in a typical earthquake-prone area after an analysis of earthquake risk in China. The building is instrumented with the selected cost-effective MEMS accelerometers, characterized by a low noise level and the capability to capture low-frequency small-amplitude dynamic responses. Furthermore, a rapid after-earthquake assessment scheme is proposed, which systematically includes fast missing data reconstruction, displacement response estimation based on an acceleration response integral, and safety assessment based on the maximum displacement and maximum inter-story drift ratio. Finally, the proposed method is successfully applied to a building-safety assessment by using earthquake-induced building responses suffering from missing data. This study is conducive to the extensive engineering application of MEMS-based cost-effective building monitoring and rapid after-earthquake assessment.
Collapse
Affiliation(s)
- Jian-Fu Lin
- Center of Safety Monitoring of Engineering Structures, Shenzhen Academy of Disaster Prevention and Reduction, China Earthquake Administration, Shenzhen 518003, China; (J.-F.L.); (L.-X.W.); (J.-X.L.)
| | - Xue-Yan Li
- MOE Key Laboratory of Disaster Forecast and Control in Engineering, School of Mechanics and Building Engineering, Jinan University, Guangzhou 510632, China;
| | - Junfang Wang
- MOE Key Laboratory for Resilient Infrastructures of Coastal Cities, College of Civil and Transportation Engineering, Shenzhen University, Shenzhen 518060, China
- Correspondence:
| | - Li-Xin Wang
- Center of Safety Monitoring of Engineering Structures, Shenzhen Academy of Disaster Prevention and Reduction, China Earthquake Administration, Shenzhen 518003, China; (J.-F.L.); (L.-X.W.); (J.-X.L.)
| | - Xing-Xing Hu
- Institute of Geophysics, China Earthquake Administration, Beijing 100081, China;
| | - Jun-Xiang Liu
- Center of Safety Monitoring of Engineering Structures, Shenzhen Academy of Disaster Prevention and Reduction, China Earthquake Administration, Shenzhen 518003, China; (J.-F.L.); (L.-X.W.); (J.-X.L.)
| |
Collapse
|
9
|
Alamoodi A, Zaidan B, Zaidan A, Albahri O, Chen J, Chyad M, Garfan S, Aleesa A. Machine learning-based imputation soft computing approach for large missing scale and non-reference data imputation. CHAOS, SOLITONS & FRACTALS 2021; 151:111236. [DOI: 10.1016/j.chaos.2021.111236] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2025]
|
10
|
|
11
|
|
12
|
Hasan MK, Alam MA, Roy S, Dutta A, Jawad MT, Das S. Missing value imputation affects the performance of machine learning: A review and analysis of the literature (2010–2021). INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100799] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
|
13
|
Sefidian AM, Daneshpour N. Estimating missing data using novel correlation maximization based methods. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106249] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
14
|
|
15
|
Qin M, Du Z, Zhang F, Liu R. A matrix completion-based multiview learning method for imputing missing values in buoy monitoring data. Inf Sci (N Y) 2019. [DOI: 10.1016/j.ins.2019.02.057] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
16
|
|
17
|
Chen X, Chen C, Cai Y, Wang H, Ye Q. Kernel Sparse Representation with Hybrid Regularization for On-Road Traffic Sensor Data Imputation. SENSORS 2018; 18:s18092884. [PMID: 30200348 PMCID: PMC6163639 DOI: 10.3390/s18092884] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Revised: 08/28/2018] [Accepted: 08/29/2018] [Indexed: 11/16/2022]
Abstract
The problem of missing values (MVs) in traffic sensor data analysis is universal in current intelligent transportation systems because of various reasons, such as sensor malfunction, transmission failure, etc. Accurate imputation of MVs is the foundation of subsequent data analysis tasks since most analysis algorithms need complete data as input. In this work, a novel MVs imputation approach termed as kernel sparse representation with elastic net regularization (KSR-EN) is developed for reconstructing MVs to facilitate analysis with traffic sensor data. The idea is to represent each sample as a linear combination of other samples due to inherent spatiotemporal correlation, as well as periodicity of daily traffic flow. To discover few yet correlated samples and make full use of the valuable information, a combination of l1-norm and l2-norm is employed to penalize the combination coefficients. Moreover, the linear representation among samples is extended to nonlinear representation by mapping input data space into high-dimensional feature space, which further enhances the recovery performance of our proposed approach. An efficient iterative algorithm is developed for solving KSR-EN model. The proposed method is verified on both an artificially simulated dataset and a public road network traffic sensor data. The results demonstrate the effectiveness of the proposed approach in terms of MVs imputation.
Collapse
Affiliation(s)
- Xiaobo Chen
- Automotive Engineering Research Institute, Jiangsu University, Zhenjiang 212013, China.
- School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang 212013, China.
| | - Cheng Chen
- School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang 212013, China.
| | - Yingfeng Cai
- Automotive Engineering Research Institute, Jiangsu University, Zhenjiang 212013, China.
| | - Hai Wang
- School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang 212013, China.
| | - Qiaolin Ye
- College of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, China.
| |
Collapse
|
18
|
Chen X, Cai Y, Ye Q, Chen L, Li Z. Graph regularized local self-representation for missing value imputation with applications to on-road traffic sensor data. Neurocomputing 2018. [DOI: 10.1016/j.neucom.2018.04.029] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
19
|
Jiao Y, Zhang Y, Wang Y, Wang B, Jin J, Wang X. A Novel Multilayer Correlation Maximization Model for Improving CCA-Based Frequency Recognition in SSVEP Brain-Computer Interface. Int J Neural Syst 2017; 28:1750039. [PMID: 28982285 DOI: 10.1142/s0129065717500393] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Multiset canonical correlation analysis (MsetCCA) has been successfully applied to optimize the reference signals by extracting common features from multiple sets of electroencephalogram (EEG) for steady-state visual evoked potential (SSVEP) recognition in brain-computer interface application. To avoid extracting the possible noise components as common features, this study proposes a sophisticated extension of MsetCCA, called multilayer correlation maximization (MCM) model for further improving SSVEP recognition accuracy. MCM combines advantages of both CCA and MsetCCA by carrying out three layers of correlation maximization processes. The first layer is to extract the stimulus frequency-related information in using CCA between EEG samples and sine-cosine reference signals. The second layer is to learn reference signals by extracting the common features with MsetCCA. The third layer is to re-optimize the reference signals set in using CCA with sine-cosine reference signals again. Experimental study is implemented to validate effectiveness of the proposed MCM model in comparison with the standard CCA and MsetCCA algorithms. Superior performance of MCM demonstrates its promising potential for the development of an improved SSVEP-based brain-computer interface.
Collapse
Affiliation(s)
- Yong Jiao
- 1 Key Laboratory for Advanced Control and Optimization for Chemical Processes, East China University of Science and Technology, Shanghai, P. R. China
| | - Yu Zhang
- 1 Key Laboratory for Advanced Control and Optimization for Chemical Processes, East China University of Science and Technology, Shanghai, P. R. China
| | - Yu Wang
- 2 Shanghai Ruanzhong Information Technology Co., Ltd., Shanghai, P. R. China
| | - Bei Wang
- 1 Key Laboratory for Advanced Control and Optimization for Chemical Processes, East China University of Science and Technology, Shanghai, P. R. China
| | - Jing Jin
- 1 Key Laboratory for Advanced Control and Optimization for Chemical Processes, East China University of Science and Technology, Shanghai, P. R. China
| | - Xingyu Wang
- 1 Key Laboratory for Advanced Control and Optimization for Chemical Processes, East China University of Science and Technology, Shanghai, P. R. China
| |
Collapse
|