1
|
Zou B, Wu P, Chen J, Luo J, Lei Y, Luo Q, Zhu B, Zhou M. The global burden of cancers attributable to occupational factors, 1990-2021. BMC Cancer 2025; 25:503. [PMID: 40102805 PMCID: PMC11921477 DOI: 10.1186/s12885-025-13914-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Accepted: 03/11/2025] [Indexed: 03/20/2025] Open
Abstract
This study assessed the global cancer burden due to occupational carcinogens (OCs) using data from Global Burden of Disease (GBD) 2021. Mortality and disability-adjusted life years (DALYs) were employed to assess the evolving trend of cancer attributable to occupational risk. The analysis was conducted by age, year, geographical location, and socio-demographic index (SDI). Subsequently, the estimated annual percentage change (EAPC) values were calculated. Globally, asbestos exposure showed the most severe impact on age-standardized death rate (ASDR) and age-standardized DALY rate but decreased significantly. Conversely, diesel engine exhaust exposure increased, with EAPCs of 0.80 for deaths. Trichloroethylene exposure, although low in absolute terms, exhibited the fastest growth with an EAPC of 1.21 in age-standardized DALY rate. Notably, diesel engine exhaust exposure in South Asia and polycyclic aromatic hydrocarbons (PAHs) in Southeast Asia, East Asia, and Oceania increased significantly in age-standardized DALY rate. Regions with low to middle SDI, such as South Asia and sub-Saharan Africa, showed the highest increases in OC-related cancer burdens in age-standardized DALY rate. Lesotho, Kenya, and Egypt exhibited the fastest growth, with EAPCs in age-standardized DALY rate of 3.45, 2.13, and 2.95, respectively. High-income regions like the Netherlands, the United Kingdom, and Italy had the most severe OC-related cancer of ASDR burdens in 2021. OC exposure remains a major contributor to the global cancer burden, especially from asbestos and silica. Exposure to diesel engine exhaust was associated with increased risk of cancers, particularly in low -to -middle SDI regions such as South Asia and sub-Saharan Africa.
Collapse
Affiliation(s)
- Binbin Zou
- Department of Hematology, Hunan Provincial People's Hospital, the First Affiliated Hospital of Hunan Normal University, Changsha, China
| | - Ping Wu
- Department of Pharmacy, Changde Hospital, Xiangya School of Medicine, Central South University, Hunan, 415000, China
| | - Jianjun Chen
- Department of Hematology, Hunan Provincial People's Hospital, the First Affiliated Hospital of Hunan Normal University, Changsha, China
| | - Juan Luo
- Department of Hematology, Hunan Provincial People's Hospital, the First Affiliated Hospital of Hunan Normal University, Changsha, China
| | - Yanjun Lei
- Department of Critical Care Medicine, Hunan Provincial People's Hospital, the First Affiliated Hospital of Hunan Normal University, Changsha, Hunan, 410000, China
| | - Qingqing Luo
- Department of Oncology, Hunan Provincial People's Hospital, Changsha, 410002, China
| | - Biqiong Zhu
- Department of Ultrasound Medicine, Hunan Provincial People's Hospital, the First Affiliated Hospital of Hunan Normal University, Changsha, China
| | - Ming Zhou
- Department of Hematology, Hunan Provincial People's Hospital, the First Affiliated Hospital of Hunan Normal University, Changsha, China.
| |
Collapse
|
2
|
Shi Y, Chen C, Huang Y, Xu Y, Xu D, Shen H, Ye X, Jin J, Tong H, Yu Y, Tang X, Li A, Cui D, Xie W. Global disease burden and trends of leukemia attributable to occupational risk from 1990 to 2019: An observational trend study. Front Public Health 2022; 10:1015861. [PMID: 36452945 PMCID: PMC9703980 DOI: 10.3389/fpubh.2022.1015861] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 10/10/2022] [Indexed: 11/16/2022] Open
Abstract
Background Leukemia caused by occupational risk is a problem that needs more attention and remains to be solved urgently, especially for acute lymphoid leukemia (ALL), acute myeloid leukemia (AML), and chronic lymphoid leukemia (CLL). However, there is a paucity of literature on this issue. We aimed to assess the global burden and trends of leukemia attributable to occupational risk from 1990 to 2019. Methods This observational trend study was based on the Global Burden of Disease (GBD) 2019 database, the global deaths, and disability-adjusted life years (DALYs), which were calculated to quantify the changing trend of leukemia attributable to occupational risk, were analyzed by age, year, geographical location, and socio-demographic index (SDI), and the corresponding estimated annual percentage change (EAPC) values were calculated. Results Global age-standardized DALYs and death rates of leukemia attributable to occupational risk presented significantly decline trends with EAPC [-0.38% (95% CI: -0.58 to -0.18%) for DALYs and -0.30% (95% CI: -0.45 to -0.146%) for death]. However, it was significantly increased in people aged 65-69 years [0.42% (95% CI: 0.30-0.55%) for DALYs and 0.38% (95% CI: 0.26-0.51%) for death]. At the same time, the age-standardized DALYs and death rates of ALL, AML, and CLL were presented a significantly increased trend with EAPCs [0.78% (95% CI: 0.65-0.91%), 0.87% (95% CI: 0.81-0.93%), and 0.66% (95% CI: 0.51-0.81%) for DALYs, respectively, and 0.75% (95% CI: 0.68-0.82%), 0.96% (95% CI: 0.91-1.01%), and 0.55% (95% CI: 0.43-0.68%) for death], respectively. The ALL, AML, and CLL were shown an upward trend in almost all age groups. Conclusion We observed a substantial reduction in leukemia due to occupational risks between 1990 and 2019. However, the people aged 65-69 years and burdens of ALL, AML, and CLL had a significantly increased trend in almost all age groups. Thus, there remains an urgent need to accelerate efforts to reduce leukemia attributable to occupational risk-related death burden in this population and specific causes.
Collapse
Affiliation(s)
- Yuanfei Shi
- Department of Hematology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Can Chen
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Yamei Huang
- Department of Pathology and Pathophysiology, Medical School of Southeast University, Nanjing, China
| | - Yi Xu
- Department of Hematology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Dandan Xu
- Department of Blood Transfusion, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Huafei Shen
- International Health Care Center, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Xiujin Ye
- Department of Hematology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Jie Jin
- Department of Hematology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Hongyan Tong
- Department of Hematology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Yue Yu
- Department of Quantitative Health Science, Mayo Clinic, Rochester, MN, United States
| | - Xinyi Tang
- Mayo Clinic, Rochester, MN, United States
| | - Azhong Li
- Zhejiang Blood Center, Hangzhou, China
| | - Dawei Cui
- Department of Blood Transfusion, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Wanzhuo Xie
- Department of Hematology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| |
Collapse
|
3
|
Hybrid Symmetrical Uncertainty and Reference Set Harmony Search Algorithm for Gene Selection Problem. MATHEMATICS 2022. [DOI: 10.3390/math10030374] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Selecting the most miniature possible set of genes from microarray datasets for clinical diagnosis and prediction is one of the most challenging machine learning tasks. A robust gene selection technique is required to identify the most significant subset of genes by removing spurious or non-predictive genes from the original dataset without sacrificing or reducing classification accuracy. Numerous studies have attempted to address this issue by implementing either a filter or a wrapper. Although the filter approaches are computationally efficient, they are completely independent of the induction algorithm. On the other hand, wrapper approaches outperform filter approaches but are computationally more expensive. Therefore, this study proposes an enhanced gene selection method that uses a hybrid technique that combines the Symmetrical Uncertainty (SU) filter and Reference Set Harmony Search Algorithm (RSHSA) wrapper method, known as SU-RSHSA. The framework to develop the proposed SU-RSHSA includes numerous stages: (1) investigate a novel gene selection method based on the HSA and will then determine appropriate values for the HSA’s parameters, (2) enhance the construction process of the initial harmony memory while satisfying the diversity of the solution by embedding a reference set within the HSA (RSHSA), and (3) investigates the effect of integrating Symmetrical Uncertainty (SU) as a filter and RSHSA as a wrapper (SU-RSHSA) to maximize classification accuracy by leveraging their respective advantages. The results demonstrate that the SU-RSHSA outperforms the original HSA and SU-HSA in terms of classification accuracy, a small number of selected relevant genes, and reduced computational time. More importantly, the proposed SU-RSHSA gene selection method effectively generates a small subset of salient genes with high classification accuracy.
Collapse
|
4
|
Wang Y, Liu B, Ma Z, Wong KC, Li X. Nature-Inspired Multiobjective Cancer Subtype Diagnosis. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE-JTEHM 2019; 7:4300112. [PMID: 30915261 PMCID: PMC6433215 DOI: 10.1109/jtehm.2019.2891746] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 12/03/2018] [Accepted: 01/03/2019] [Indexed: 11/18/2022]
Abstract
Cancer gene expression data is of great importance in cancer subtype diagnosis and drug discovery. Many computational methods have been proposed to classify subtypes using those data. However, most of the previous computational methods suffer from poor interpretability, experimental noises, and low diagnostic quality. To address those problems, multiobjective ensemble cuckoo search based on decomposition (MOECSA) is proposed to optimize those four objectives simultaneously including the number of features, the accuracy, and two entropy-based measures: the relevance and the redundancy, classifying the cancer gene expression data with high predictive power for different cardinality levels under multiple objectives. A novel binary encoding is proposed to choose gene subsets from the cancer gene expression data for calculating four objective functions. Furthermore, an effective ensemble mechanism blended in the cuckoo search algorithm framework is applied to balance the convergence speed and population diversity in MOECSA. To demonstrate the effectiveness and efficiency of the proposed algorithm, experiments on thirty-five benchmark cancer gene expression datasets, four independent disease datasets, and one sequencing-based dataset are carried out to compare MOECSA with the six state-of-the-art multiobjective evolutionary algorithms and seven traditional classification algorithms. The experimental results in different perspectives demonstrate that MOECSA has better diagnosis performance than others at multiple levels.
Collapse
Affiliation(s)
- Yunhe Wang
- School of Information Science and TechnologyNortheast Normal UniversityChangchun130117China
| | - Bo Liu
- School of Physical EducationNortheast Normal UniversityChangchun130117China
| | - Zhiqiang Ma
- School of Information Science and TechnologyNortheast Normal UniversityChangchun130117China
| | - Ka-Chun Wong
- Department of Computer ScienceCity University of Hong KongHong Kong
| | - Xiangtao Li
- School of Information Science and TechnologyNortheast Normal UniversityChangchun130117China
| |
Collapse
|
5
|
Lim J, Wang B, Lim JS. A hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system. Technol Health Care 2017; 24 Suppl 2:S601-5. [PMID: 27163323 DOI: 10.3233/thc-161187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Finding the minimum number of appropriate biomarkers for specific targets such as a lung cancer has been a challenging issue in bioinformatics. We propose a hierarchical two-phase framework for selecting appropriate biomarkers that extracts candidate biomarkers from the cancer microarray datasets and then selects the minimum number of appropriate biomarkers from the extracted candidate biomarkers datasets with a specific neuro-fuzzy algorithm, which is called a neural network with weighted fuzzy membership function (NEWFM). In this context, as the first phase, the proposed framework is to extract candidate biomarkers by using a Bhattacharyya distance method that measures the similarity of two discrete probability distributions. Finally, the proposed framework is able to reduce the cost of finding biomarkers by not receiving medical supplements and improve the accuracy of the biomarkers in specific cancer target datasets.
Collapse
|
6
|
Alkuhlani A, Nassef M, Farag I. Multistage feature selection approach for high-dimensional cancer data. Soft comput 2016. [DOI: 10.1007/s00500-016-2439-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
7
|
Liu B, Tian M, Zhang C, Li X. Discrete Biogeography Based Optimization for Feature Selection in Molecular Signatures. Mol Inform 2015; 34:197-215. [DOI: 10.1002/minf.201400065] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2014] [Accepted: 01/21/2015] [Indexed: 11/10/2022]
|
8
|
|
9
|
Li X, Yin M. Multiobjective Binary Biogeography Based Optimization for Feature Selection Using Gene Expression Data. IEEE Trans Nanobioscience 2013; 12:343-53. [DOI: 10.1109/tnb.2013.2294716] [Citation(s) in RCA: 103] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
10
|
Sun X, Liu Y, Wei D, Xu M, Chen H, Han J. Selection of interdependent genes via dynamic relevance analysis for cancer diagnosis. J Biomed Inform 2013; 46:252-8. [DOI: 10.1016/j.jbi.2012.10.004] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2012] [Revised: 10/01/2012] [Accepted: 10/03/2012] [Indexed: 11/16/2022]
|
11
|
Ngo-Ye TL, Sinha AP. Analyzing Online Review Helpfulness Using a Regressional ReliefF-Enhanced Text Mining Method. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS 2012. [DOI: 10.1145/2229156.2229158] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Within the emerging context of Web 2.0 social media, online customer reviews are playing an increasingly important role in disseminating information, facilitating trust, and promoting commerce in the e-marketplace. The sheer volume of customer reviews on the web produces information overload for readers. Developing a system that can automatically identify the most helpful reviews would be valuable to businesses that are interested in gathering informative and meaningful customer feedback. Because the target variable---review helpfulness---is continuous, common feature selection techniques from text classification cannot be applied. In this article, we propose and investigate a text mining model, enhanced using the Regressional ReliefF (RReliefF) feature selection method, for predicting the helpfulness of online reviews from Amazon.com. We find that RReliefF significantly outperforms two popular dimension reduction methods. This study is the first to investigate and compare different dimension reduction techniques in the context of applying text regression for predicting online review helpfulness. Another contribution is that our analysis of the keywords selected by RReliefF reveals meaningful feature groupings.
Collapse
|
12
|
Mohamad MS, Omatu S, Deris S, Yoshioka M. A modified binary particle swarm optimization for selecting the small subset of informative genes from gene expression data. ACTA ACUST UNITED AC 2011; 15:813-22. [PMID: 21914573 DOI: 10.1109/titb.2011.2167756] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Gene expression data are expected to be of significant help in the development of efficient cancer diagnoses and classification platforms. In order to select a small subset of informative genes from the data for cancer classification, recently, many researchers are analyzing gene expression data using various computational intelligence methods. However, due to the small number of samples compared to the huge number of genes (high dimension), irrelevant genes, and noisy genes, many of the computational methods face difficulties to select the small subset. Thus, we propose an improved (modified) binary particle swarm optimization to select the small subset of informative genes that is relevant for the cancer classification. In this proposed method, we introduce particles' speed for giving the rate at which a particle changes its position, and we propose a rule for updating particle's positions. By performing experiments on ten different gene expression datasets, we have found that the performance of the proposed method is superior to other previous related works, including the conventional version of binary particle swarm optimization (BPSO) in terms of classification accuracy and the number of selected genes. The proposed method also produces lower running times compared to BPSO.
Collapse
Affiliation(s)
- Mohd Saberi Mohamad
- Faculty of Computer Science and Information Systems, Universiti Teknologi Malaysia, Skudai, Johore, Malaysia.
| | | | | | | |
Collapse
|
13
|
|
14
|
Maji P, Paul S. Rough set based maximum relevance-maximum significance criterion and Gene selection from microarray data. Int J Approx Reason 2011. [DOI: 10.1016/j.ijar.2010.09.006] [Citation(s) in RCA: 104] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
15
|
Hu Q, Pan W, An S, Ma P, Wei J. An efficient gene selection technique for cancer recognition based on neighborhood mutual information. INT J MACH LEARN CYB 2010. [DOI: 10.1007/s13042-010-0008-6] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
16
|
Zhang Y, Dang Y, Chen H, Thurmond M, Larson C. Automatic online news monitoring and classification for syndromic surveillance. DECISION SUPPORT SYSTEMS 2009; 47:508-517. [PMID: 32287567 PMCID: PMC7114309 DOI: 10.1016/j.dss.2009.04.016] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2008] [Revised: 02/06/2009] [Accepted: 04/25/2009] [Indexed: 05/26/2023]
Abstract
Syndromic surveillance can play an important role in protecting the public's health against infectious diseases. Infectious disease outbreaks can have a devastating effect on society as well as the economy, and global awareness is therefore critical to protecting against major outbreaks. By monitoring online news sources and developing an accurate news classification system for syndromic surveillance, public health personnel can be apprised of outbreaks and potential outbreak situations. In this study, we have developed a framework for automatic online news monitoring and classification for syndromic surveillance. The framework is unique and none of the techniques adopted in this study have been previously used in the context of syndromic surveillance on infectious diseases. In recent classification experiments, we compared the performance of different feature subsets on different machine learning algorithms. The results showed that the combined feature subsets including Bag of Words, Noun Phrases, and Named Entities features outperformed the Bag of Words feature subsets. Furthermore, feature selection improved the performance of feature subsets in online news classification. The highest classification performance was achieved when using SVM upon the selected combination feature subset.
Collapse
Affiliation(s)
- Yulei Zhang
- Artificial Intelligence Lab, Department of Management Information Systems, Eller College of Management, University of Arizona, Tucson, AZ 85721, USA
| | - Yan Dang
- Artificial Intelligence Lab, Department of Management Information Systems, Eller College of Management, University of Arizona, Tucson, AZ 85721, USA
| | - Hsinchun Chen
- Artificial Intelligence Lab, Department of Management Information Systems, Eller College of Management, University of Arizona, Tucson, AZ 85721, USA
| | - Mark Thurmond
- FMD Lab, Center for Animal Disease Modeling and Surveillance (CADMS), University of California, Davis, CA 95616, USA
| | - Cathy Larson
- Artificial Intelligence Lab, Department of Management Information Systems, Eller College of Management, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
17
|
Maji P. $f$-Information Measures for Efficient Selection of Discriminative Genes From Microarray Data. IEEE Trans Biomed Eng 2009; 56:1063-9. [DOI: 10.1109/tbme.2008.2004502] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
18
|
Huerta EB, Duval B, Hao JK. Fuzzy logic for elimination of redundant information of microarray data. GENOMICS PROTEOMICS & BIOINFORMATICS 2009; 6:61-73. [PMID: 18973862 PMCID: PMC5054105 DOI: 10.1016/s1672-0229(08)60021-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Gene subset selection is essential for classification and analysis of microarray data. However, gene selection is known to be a very difficult task since gene expression data not only have high dimensionalities, but also contain redundant information and noises. To cope with these difficulties, this paper introduces a fuzzy logic based pre-processing approach composed of two main steps. First, we use fuzzy inference rules to transform the gene expression levels of a given dataset into fuzzy values. Then we apply a similarity relation to these fuzzy values to define fuzzy equivalence groups, each group containing strongly similar genes. Dimension reduction is achieved by considering for each group of similar genes a single representative based on mutual information. To assess the usefulness of this approach, extensive experimentations were carried out on three well-known public datasets with a combined classification model using three statistic filters and three classifiers.
Collapse
|
19
|
Abstract
The Internet is frequently used as a medium for exchange of information and opinions, as well as propaganda dissemination. In this study the use of sentiment analysis methodologies is proposed for classification of Web forum opinions in multiple languages. The utility of stylistic and syntactic features is evaluated for sentiment classification of English and Arabic content. Specific feature extraction components are integrated to account for the linguistic characteristics of Arabic. The entropy weighted genetic algorithm (EWGA) is also developed, which is a hybridized genetic algorithm that incorporates the information-gain heuristic for feature selection. EWGA is designed to improve performance and get a better assessment of key features. The proposed features and techniques are evaluated on a benchmark movie review dataset and U.S. and Middle Eastern Web forum postings. The experimental results using EWGA with SVM indicate high performance levels, with accuracies of over 91% on the benchmark dataset as well as the U.S. and Middle Eastern forums. Stylistic features significantly enhanced performance across all testbeds while EWGA also outperformed other feature selection methods, indicating the utility of these features and techniques for document-level classification of sentiments.
Collapse
|