1
|
Zhang R, Wen X, Cao H, Cui P, Chai H, Hu R, Yu R. High-risk event prone driver identification considering driving behavior temporal covariate shift. Accid Anal Prev 2024; 199:107526. [PMID: 38432064 DOI: 10.1016/j.aap.2024.107526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 02/15/2024] [Accepted: 02/25/2024] [Indexed: 03/05/2024]
Abstract
Drivers who perform frequent high-risk events (e.g., hard braking maneuvers) pose a significant threat to traffic safety. Existing studies commonly estimated high-risk event occurrence probabilities based upon the assumption that data collected from different time periods are independent and identically distributed (referred to as i.i.d. assumption). Such approach ignored the issue of driving behavior temporal covariate shift, where the distributions of driving behavior factors vary over time. To fill the gap, this study targets at obtaining time-invariant driving behavior features and establishing their relationships with high-risk event occurrence probability. Specifically, a generalized modeling framework consisting of distribution characterization (DC) and distribution matching (DM) modules was proposed. The DC module split the whole dataset into several segments with the largest distribution gaps, while the DM module identified time-invariant driving behavior features through learning common knowledge among different segments. Then, gated recurrent unit (GRU) was employed to conduct time-invariant driving behavior feature mining for high-risk event occurrence probability estimation. Moreover, modified loss functions were introduced for imbalanced data learning caused by the rarity of high-risk events. The empirical analyses were conducted utilizing online ride-hailing services data. Experiment results showed that the proposed generalized modeling framework provided a 7.2% higher average precision compared to the traditional i.i.d. assumption based approach. The modified loss functions further improved the model performance by 3.8%. Finally, benefits for the driver management program improvement have been explored by a case study, demonstrating a 33.34% enhancement in the identification precision of high-risk event prone drivers.
Collapse
Affiliation(s)
- Ruici Zhang
- College of Transportation Engineering, Tongji University, Shanghai, 201804, China; The Key Laboratory of Road and Traffic Engineering, Ministry of Education, 4800 Cao'an Road, 201804, Shanghai, China.
| | - Xiang Wen
- Didi Chuxing, Zuanshi Mansion, Zhongguancun Software Park Compound 19, Dongbeiwang Road, 100000, Beijing, China.
| | - Huanqiang Cao
- Didi Chuxing, Zuanshi Mansion, Zhongguancun Software Park Compound 19, Dongbeiwang Road, 100000, Beijing, China.
| | - Pengfei Cui
- Didi Chuxing, Zuanshi Mansion, Zhongguancun Software Park Compound 19, Dongbeiwang Road, 100000, Beijing, China.
| | - Hua Chai
- Didi Chuxing, Zuanshi Mansion, Zhongguancun Software Park Compound 19, Dongbeiwang Road, 100000, Beijing, China.
| | - Runbo Hu
- Didi Chuxing, Zuanshi Mansion, Zhongguancun Software Park Compound 19, Dongbeiwang Road, 100000, Beijing, China.
| | - Rongjie Yu
- College of Transportation Engineering, Tongji University, Shanghai, 201804, China; The Key Laboratory of Road and Traffic Engineering, Ministry of Education, 4800 Cao'an Road, 201804, Shanghai, China.
| |
Collapse
|
2
|
Bento N, Rebelo J, Carreiro AV, Ravache F, Barandas M. Exploring Regularization Methods for Domain Generalization in Accelerometer-Based Human Activity Recognition. Sensors (Basel) 2023; 23:6511. [PMID: 37514805 PMCID: PMC10386236 DOI: 10.3390/s23146511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 07/03/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023]
Abstract
The study of Domain Generalization (DG) has gained considerable momentum in the Machine Learning (ML) field. Human Activity Recognition (HAR) inherently encompasses diverse domains (e.g., users, devices, or datasets), rendering it an ideal testbed for exploring Domain Generalization. Building upon recent work, this paper investigates the application of regularization methods to bridge the generalization gap between traditional models based on handcrafted features and deep neural networks. We apply various regularizers, including sparse training, Mixup, Distributionally Robust Optimization (DRO), and Sharpness-Aware Minimization (SAM), to deep learning models and assess their performance in Out-of-Distribution (OOD) settings across multiple domains using homogenized public datasets. Our results show that Mixup and SAM are the best-performing regularizers. However, they are unable to match the performance of models based on handcrafted features. This suggests that while regularization techniques can improve OOD robustness to some extent, handcrafted features remain superior for domain generalization in HAR tasks.
Collapse
Affiliation(s)
- Nuno Bento
- Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
| | - Joana Rebelo
- Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
| | - André V Carreiro
- Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
| | - François Ravache
- ICOM France, 1 Rue Brindejonc des Moulinais, 31500 Toulouse, France
| | - Marília Barandas
- Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
| |
Collapse
|
3
|
Loewinger G, Patil P, Kishida KT, Parmigiani G. Hierarchical resampling for bagging in multistudy prediction with applications to human neurochemical sensing. Ann Appl Stat 2022; 16:2145-2165. [PMID: 36274786 PMCID: PMC9586160 DOI: 10.1214/21-aoas1574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
We propose the "study strap ensemble", which combines advantages of two common approaches to fitting prediction models when multiple training datasets ("studies") are available: pooling studies and fitting one model versus averaging predictions from multiple models each fit to individual studies. The study strap ensemble fits models to bootstrapped datasets, or "pseudo-studies." These are generated by resampling from multiple studies with a hierarchical resampling scheme that generalizes the randomized cluster bootstrap. The study strap is controlled by a tuning parameter that determines the proportion of observations to draw from each study. When the parameter is set to its lowest value, each pseudo-study is resampled from only a single study. When it is high, the study strap ignores the multi-study structure and generates pseudo-studies by merging the datasets and drawing observations like a standard bootstrap. We empirically show the optimal tuning value often lies in between, and prove that special cases of the study strap draw the merged dataset and the set of original studies as pseudo-studies. We extend the study strap approach with an ensemble weighting scheme that utilizes information in the distribution of the covariates of the test dataset. Our work is motivated by neuroscience experiments using real-time neurochemical sensing during awake behavior in humans. Current techniques to perform this kind of research require measurements from an electrode placed in the brain during awake neurosurgery and rely on prediction models to estimate neurotransmitter concentrations from the electrical measurements recorded by the electrode. These models are trained by combining multiple datasets that are collected in vitro under heterogeneous conditions in order to promote accuracy of the models when applied to data collected in the brain. A prevailing challenge is deciding how to combine studies or ensemble models trained on different studies to enhance model generalizability. Our methods produce marked improvements in simulations and in this application. All methods are available in the studyStrap CRAN package.
Collapse
|
4
|
Zhao X, Sicilia A, Minhas DS, O'Connor EE, Aizenstein HJ, Klunk WE, Tudorascu DL, Hwang SJ. ROBUST WHITE MATTER HYPERINTENSITY SEGMENTATION ON UNSEEN DOMAIN. Proc IEEE Int Symp Biomed Imaging 2021; 2021:1047-1051. [PMID: 34909113 PMCID: PMC8668404 DOI: 10.1109/isbi48211.2021.9434034] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Typical machine learning frameworks heavily rely on an underlying assumption that training and test data follow the same distribution. In medical imaging which increasingly begun acquiring datasets from multiple sites or scanners, this identical distribution assumption often fails to hold due to systematic variability induced by site or scanner dependent factors. Therefore, we cannot simply expect a model trained on a given dataset to consistently work well, or generalize, on a dataset from another distribution. In this work, we address this problem, investigating the application of machine learning models to unseen medical imaging data. Specifically, we consider the challenging case of Domain Generalization (DG) where we train a model without any knowledge about the testing distribution. That is, we train on samples from a set of distributions (sources) and test on samples from a new, unseen distribution (target). We focus on the task of white matter hyperintensity (WMH) prediction using the multi-site WMH Segmentation Challenge dataset and our local in-house dataset. We identify how two mechanically distinct DG approaches, namely domain adversarial learning and mix-up, have theoretical synergy. Then, we show drastic improvements of WMH prediction on an unseen target domain.
Collapse
Affiliation(s)
- Xingchen Zhao
- Department of Computer Science, University of Pittsburgh
| | | | | | - Erin E O'Connor
- Department of Diagnostic Radiology & Nuclear Medicine - University of Maryland, Baltimore
| | | | | | | | - Seong Jae Hwang
- Intelligent Systems Program - University of Pittsburgh
- Department of Computer Science, University of Pittsburgh
| |
Collapse
|