Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

2378
(from Reference Citation Analysis)

Article PDFs (712)

Cited by > 0 (1580)

Searched Name

Data Mining

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Rezapour M, Yazdinejad M, Rajabi Kouchi F, Habibi Baghi M, Khorrami Z, Khavanin Zadeh M, Pourbaghi E, Rezapour H. Text mining of hypertension researches in the west Asia region: a 12-year trend analysis. Ren Fail 2024;46:2337285. [PMID: 38616180 PMCID: PMC11018045 DOI: 10.1080/0886022x.2024.2337285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 03/27/2024] [Indexed: 04/16/2024] Open

Labbo MS, Qu L, Xu C, Bai W, Ayele Atumo E, Jiang X. Understanding risky driving behaviors among young novice drivers in Nigeria: A latent class analysis coupled with association rule mining approach. Accid Anal Prev 2024;200:107557. [PMID: 38537532 DOI: 10.1016/j.aap.2024.107557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 02/22/2024] [Accepted: 03/21/2024] [Indexed: 04/14/2024]

Abstract

Traffic crashes are significant public health concern in Nigeria, particularly among young drivers. The study aims to explore the underlying pattern of risky driving behaviors and the associations with demographic factors among young drivers in Nigeria. A combined approach of Latent Class Analysis (LCA) and Association Rule Mining is applied to the dataset comprising responses from 684 young drivers who complete the "Behavior of Young Novice Drivers Scale" (BYND) questionnaires. The LCA identifies four distinct classes of drivers based on the risky behavior profiles: Reckless-Speedsters, Cautious Drivers, Distracted Multitaskers, and Emotion-impacted Drivers. Association rule mining further connects these driver classes to demographic and driving history variables, uncovering intriguing insights. Reckless-Speedsters predominantly consist of young males who engage in riskier driving behaviors, including exceeding speed limits and disregarding traffic rules. Conversely, Cautious Drivers, also predominantly young males, exhibit a safer driving profile marked by rule adherence and a notably lower crash rate. Distracted Multitaskers, sharing a demographic profile with Cautious Drivers, diverge significantly due to their higher crash involvement, hinting at a propensity for distracted driving practices. Lastly, Emotion-Impacted Drivers, primarily comprising young employed males, display behaviors influenced by emotions, shorter driving distances, and prior unsupervised driving experience. Most of the behaviors are attributed to inadequate traffic control, absence of traffic signs in most of the roads, preferential treatment, and lack of strict law enforcement in the country. The findings hold substantial implications for road safety interventions in Nigeria, urging targeted approaches to address the unique challenges presented by each driver class. With acknowledging the study limitations and advocating for future research in objective measures and emotion-behavior interactions, the comprehensive approach provides a robust foundation for enhancing road safety in the Nigerian context.

Collapse

Ju W, Fang Z, Gu Y, Liu Z, Long Q, Qiao Z, Qin Y, Shen J, Sun F, Xiao Z, Yang J, Yuan J, Zhao Y, Wang Y, Luo X, Zhang M. A Comprehensive Survey on Deep Graph Representation Learning. Neural Netw 2024;173:106207. [PMID: 38442651 DOI: 10.1016/j.neunet.2024.106207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 01/23/2024] [Accepted: 02/21/2024] [Indexed: 03/07/2024]

Abstract

Graph representation learning aims to effectively encode high-dimensional sparse graph-structured data into low-dimensional dense vectors, which is a fundamental task that has been widely studied in a range of fields, including machine learning and data mining. Classic graph embedding methods follow the basic idea that the embedding vectors of interconnected nodes in the graph can still maintain a relatively close distance, thereby preserving the structural information between the nodes in the graph. However, this is sub-optimal due to: (i) traditional methods have limited model capacity which limits the learning performance; (ii) existing techniques typically rely on unsupervised learning strategies and fail to couple with the latest learning paradigms; (iii) representation learning and downstream tasks are dependent on each other which should be jointly enhanced. With the remarkable success of deep learning, deep graph representation learning has shown great potential and advantages over shallow (traditional) methods, there exist a large number of deep graph representation learning techniques have been proposed in the past decade, especially graph neural networks. In this survey, we conduct a comprehensive survey on current deep graph representation learning algorithms by proposing a new taxonomy of existing state-of-the-art literature. Specifically, we systematically summarize the essential components of graph representation learning and categorize existing approaches by the ways of graph neural network architectures and the most recent advanced learning paradigms. Moreover, this survey also provides the practical and promising applications of deep graph representation learning. Last but not least, we state new perspectives and suggest challenging directions which deserve further investigations in the future.

Collapse

Affiliation(s)

Wei Ju School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China
Zheng Fang School of Intelligence Science and Technology, Peking University, Beijing, 100871, China
Yiyang Gu School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China
Zequn Liu School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China
Qingqing Long Computer Network Information Center, Chinese Academy of Sciences, Beijing, 100086, China
Ziyue Qiao Artificial Intelligence Thrust, The Hong Kong University of Science and Technology, Guangzhou, 511453, China
Yifang Qin School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China
Jianhao Shen School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China
Fang Sun Department of Computer Science, University of California, Los Angeles, 90095, USA
Zhiping Xiao Department of Computer Science, University of California, Los Angeles, 90095, USA
Junwei Yang School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China
Jingyang Yuan School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China
Yusheng Zhao School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China
Yifan Wang School of Information Technology & Management, University of International Business and Economics, Beijing, 100029, China
Xiao Luo Department of Computer Science, University of California, Los Angeles, 90095, USA.
Ming Zhang School of Computer Science, National Key Laboratory for Multimedia Information Processing, Peking University, Beijing, 100871, China.

Collapse

Tamakloe R, Zhang K, Hossain A, Kim I, Park SH. Critical risk factors associated with fatal/severe crash outcomes in personal mobility device rider at-fault crashes: A two-step inter-cluster rule mining technique. Accid Anal Prev 2024;199:107527. [PMID: 38428242 DOI: 10.1016/j.aap.2024.107527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/28/2024] [Accepted: 02/25/2024] [Indexed: 03/03/2024]

Abstract

Personal Mobility Devices (PMDs) have witnessed an extraordinary surge in popularity, emerging as a favored mode of urban transportation. This has sparked significant safety concerns, paralleled by a stark increase in PMD-involved crashes. Research indicates that PMD user behavior, especially in urban areas, is crucial in these crashes, underscoring the need for an extensive investigation into key factors, particularly those causing fatal/severe outcomes. Remarkably, there exists a noticeable gap in the research concerning the analysis of determinants behind fatal/severe PMD crashes, specifically in PMD rider-at-fault collisions. This study addresses this gap by identifying uniform groups of PMD rider-at-fault crashes and investigating cluster-specific key factor associations and determinants of fatal/severe crash outcomes using Seoul's PMD rider-at-fault crash data from 2017 to 2021. A comprehensive two-step framework, integrating Cluster Correspondence Analysis (CCA) and Association Rules Mining (ARM) techniques is employed to segment PMD rider-at-fault crash data into homogeneous groups, revealing unique risk factor patterns within each cluster and further exploring the combination of factors associated with fatal/severe PMD rider-at-fault crash outcomes. CCA revealed three distinct groups: PMD-vehicle, PMD-pedestrian, and single-PMD crashes. From the ARM, it was found that fatal/severe crashes were linked to dry road conditions, male PMD users, and weekdays, irrespective of the cluster. Whereas speeding violations and side collisions were associated with fatal/severe PMD-vehicle rider-at-fault crashes, traffic control violations were related to fatal/severe PMD-pedestrian rider-at-fault crashes at pedestrian crossings. Unsafe riding practices predominantly caused single-PMD crashes during daytime hours. From the findings, engineering improvements, awareness campaigns, education, and law enforcement actions are recommended. The new insights gleaned from this research provide a foundation for informed decision-making and the implementation of policies designed to enhance PMD safety.

Collapse

Tang XE, Lu T, Zhou YC, Zhan MJ, Chen W, Peng Z, Liu JH, Gui YF, Deng ZH, Fan F. Adult age estimation from the sternum using maximum intensity projection images of CT and data mining in a Chinese population. Int J Legal Med 2024;138:961-970. [PMID: 38240839 DOI: 10.1007/s00414-024-03161-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 01/08/2024] [Indexed: 04/11/2024]

Clarke DJB, Marino GB, Deng EZ, Xie Z, Evangelista JE, Ma'ayan A. Rummagene: massive mining of gene sets from supporting materials of biomedical research publications. Commun Biol 2024;7:482. [PMID: 38643247 PMCID: PMC11032387 DOI: 10.1038/s42003-024-06177-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 04/10/2024] [Indexed: 04/22/2024] Open

Masoumi S, Amirkhani H, Sadeghian N, Shahraz S. Natural language processing (NLP) to facilitate abstract review in medical research: the application of BioBERT to exploring the 20-year use of NLP in medical research. Syst Rev 2024;13:107. [PMID: 38622611 PMCID: PMC11020656 DOI: 10.1186/s13643-024-02470-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/23/2022] [Accepted: 01/28/2024] [Indexed: 04/17/2024] Open

Abstract

BACKGROUND

Abstract review is a time and labor-consuming step in the systematic and scoping literature review in medicine. Text mining methods, typically natural language processing (NLP), may efficiently replace manual abstract screening. This study applies NLP to a deliberately selected literature review problem, the trend of using NLP in medical research, to demonstrate the performance of this automated abstract review model.

METHODS

Scanning PubMed, Embase, PsycINFO, and CINAHL databases, we identified 22,294 with a final selection of 12,817 English abstracts published between 2000 and 2021. We invented a manual classification of medical fields, three variables, i.e., the context of use (COU), text source (TS), and primary research field (PRF). A training dataset was developed after reviewing 485 abstracts. We used a language model called Bidirectional Encoder Representations from Transformers to classify the abstracts. To evaluate the performance of the trained models, we report a micro f1-score and accuracy.

RESULTS

The trained models' micro f1-score for classifying abstracts, into three variables were 77.35% for COU, 76.24% for TS, and 85.64% for PRF. The average annual growth rate (AAGR) of the publications was 20.99% between 2000 and 2020 (72.01 articles (95% CI: 56.80-78.30) yearly increase), with 81.76% of the abstracts published between 2010 and 2020. Studies on neoplasms constituted 27.66% of the entire corpus with an AAGR of 42.41%, followed by studies on mental conditions (AAGR = 39.28%). While electronic health or medical records comprised the highest proportion of text sources (57.12%), omics databases had the highest growth among all text sources with an AAGR of 65.08%. The most common NLP application was clinical decision support (25.45%).

CONCLUSIONS

BioBERT showed an acceptable performance in the abstract review. If future research shows the high performance of this language model, it can reliably replace manual abstract reviews.

Collapse

Lei B, Mahajan A, Mallick B. Identifying and overcoming COVID-19 vaccination impediments using Bayesian data mining techniques. Sci Rep 2024;14:8595. [PMID: 38615084 PMCID: PMC11016065 DOI: 10.1038/s41598-024-58902-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 04/04/2024] [Indexed: 04/15/2024] Open

Li D, Deng Y, Liu L, Wang J, Huang Z, Zhang X. Analysis of heavy metal and polycyclic aromatic hydrocarbon pollution characteristics of a typical metal rolling industrial site based on data mining. Environ Geochem Health 2024;46:146. [PMID: 38578375 DOI: 10.1007/s10653-024-01928-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 02/20/2024] [Indexed: 04/06/2024]

Nishimura Y, Matsumoto S, Sasaki T, Kubo T. Impacts of workplace verbal aggression classified via text mining on workers' mental health. Occup Med (Lond) 2024;74:186-192. [PMID: 38346110 PMCID: PMC10990467 DOI: 10.1093/occmed/kqae009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2024] Open

Zhang JM, Wang Y, Mouton M, Zhang J, Shi M. Public Discourse, User Reactions, and Conspiracy Theories on the X Platform About HIV Vaccines: Data Mining and Content Analysis. J Med Internet Res 2024;26:e53375. [PMID: 38568723 PMCID: PMC11024739 DOI: 10.2196/53375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 11/08/2023] [Accepted: 02/28/2024] [Indexed: 04/05/2024] Open

Abstract

BACKGROUND

The initiation of clinical trials for messenger RNA (mRNA) HIV vaccines in early 2022 revived public discussion on HIV vaccines after 3 decades of unsuccessful research. These trials followed the success of mRNA technology in COVID-19 vaccines but unfolded amid intense vaccine debates during the COVID-19 pandemic. It is crucial to gain insights into public discourse and reactions about potential new vaccines, and social media platforms such as X (formerly known as Twitter) provide important channels.

OBJECTIVE

Drawing from infodemiology and infoveillance research, this study investigated the patterns of public discourse and message-level drivers of user reactions on X regarding HIV vaccines by analyzing posts using machine learning algorithms. We examined how users used different post types to contribute to topics and valence and how these topics and valence influenced like and repost counts. In addition, the study identified salient aspects of HIV vaccines related to COVID-19 and prominent anti-HIV vaccine conspiracy theories through manual coding.

METHODS

We collected 36,424 English-language original posts about HIV vaccines on the X platform from January 1, 2022, to December 31, 2022. We used topic modeling and sentiment analysis to uncover latent topics and valence, which were subsequently analyzed across post types in cross-tabulation analyses and integrated into linear regression models to predict user reactions, specifically likes and reposts. Furthermore, we manually coded the 1000 most engaged posts about HIV and COVID-19 to uncover salient aspects of HIV vaccines related to COVID-19 and the 1000 most engaged negative posts to identify prominent anti-HIV vaccine conspiracy theories.

RESULTS

Topic modeling revealed 3 topics: HIV and COVID-19, mRNA HIV vaccine trials, and HIV vaccine and immunity. HIV and COVID-19 underscored the connections between HIV vaccines and COVID-19 vaccines, as evidenced by subtopics about their reciprocal impact on development and various comparisons. The overall valence of the posts was marginally positive. Compared to self-composed posts initiating new conversations, there was a higher proportion of HIV and COVID-19-related and negative posts among quote posts and replies, which contribute to existing conversations. The topic of mRNA HIV vaccine trials, most evident in self-composed posts, increased repost counts. Positive valence increased like and repost counts. Prominent anti-HIV vaccine conspiracy theories often falsely linked HIV vaccines to concurrent COVID-19 and other HIV-related events.

CONCLUSIONS

The results highlight COVID-19 as a significant context for public discourse and reactions regarding HIV vaccines from both positive and negative perspectives. The success of mRNA COVID-19 vaccines shed a positive light on HIV vaccines. However, COVID-19 also situated HIV vaccines in a negative context, as observed in some anti-HIV vaccine conspiracy theories misleadingly connecting HIV vaccines with COVID-19. These findings have implications for public health communication strategies concerning HIV vaccines.

Collapse

Bellomo RK, Zavalis EA, Ioannidis JPA. Assessment of transparency indicators in space medicine. PLoS One 2024;19:e0300701. [PMID: 38564591 PMCID: PMC10986997 DOI: 10.1371/journal.pone.0300701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 03/04/2024] [Indexed: 04/04/2024] Open

Kirchhof B. 170 years of data-mining: history and future. Graefes Arch Clin Exp Ophthalmol 2024;262:1013-1014. [PMID: 38231246 PMCID: PMC10995019 DOI: 10.1007/s00417-023-06359-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 12/20/2023] [Accepted: 12/28/2023] [Indexed: 01/18/2024] Open

Paul J, Jacob J, Mahmud M, Vaka M, Krishnan SG, Arifutzzaman A, Thesiya D, Xiong T, Kadirgama K, Selvaraj J. A data mining approach to analyze the role of biomacromolecules-based nanocomposites in sustainable packaging. Int J Biol Macromol 2024;265:130850. [PMID: 38492706 DOI: 10.1016/j.ijbiomac.2024.130850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 03/09/2024] [Accepted: 03/11/2024] [Indexed: 03/18/2024]

Mateu-Sanz M, Fuenteslópez CV, Uribe-Gomez J, Haugen HJ, Pandit A, Ginebra MP, Hakimi O, Krallinger M, Samara A. Redefining biomaterial biocompatibility: challenges for artificial intelligence and text mining. Trends Biotechnol 2024;42:402-417. [PMID: 37858386 DOI: 10.1016/j.tibtech.2023.09.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 09/25/2023] [Accepted: 09/26/2023] [Indexed: 10/21/2023]

Li JJ, Chen L, Zhao Y, Yang XQ, Hu FB, Wang L. Data mining and safety analysis of traditional immunosuppressive drugs: a pharmacovigilance investigation based on the FAERS database. Expert Opin Drug Saf 2024;23:513-525. [PMID: 38533933 DOI: 10.1080/14740338.2024.2327503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 10/13/2023] [Indexed: 03/28/2024]

Limsomwong P, Ingviya T, Fumaneeshoat O. Identifying cancer patients who received palliative care using the SPICT-LIS in medical records: a rule-based algorithm and text-mining technique. BMC Palliat Care 2024;23:83. [PMID: 38556869 PMCID: PMC10983682 DOI: 10.1186/s12904-024-01419-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 03/25/2024] [Indexed: 04/02/2024] Open

Abstract

BACKGROUND

Due to limited numbers of palliative care specialists and/or resources, accessing palliative care remains limited in many low and middle-income countries. Data science methods, such as rule-based algorithms and text mining, have potential to improve palliative care by facilitating analysis of electronic healthcare records. This study aimed to develop and evaluate a rule-based algorithm for identifying cancer patients who may benefit from palliative care based on the Thai version of the Supportive and Palliative Care Indicators for a Low-Income Setting (SPICT-LIS) criteria.

METHODS

The medical records of 14,363 cancer patients aged 18 years and older, diagnosed between 2016 and 2020 at Songklanagarind Hospital, were analyzed. Two rule-based algorithms, strict and relaxed, were designed to identify key SPICT-LIS indicators in the electronic medical records using tokenization and sentiment analysis. The inter-rater reliability between these two algorithms and palliative care physicians was assessed using percentage agreement and Cohen's kappa coefficient. Additionally, factors associated with patients might be given palliative care as they will benefit from it were examined.

RESULTS

The strict rule-based algorithm demonstrated a high degree of accuracy, with 95% agreement and Cohen's kappa coefficient of 0.83. In contrast, the relaxed rule-based algorithm demonstrated a lower agreement (71% agreement and Cohen's kappa of 0.16). Advanced-stage cancer with symptoms such as pain, dyspnea, edema, delirium, xerostomia, and anorexia were identified as significant predictors of potentially benefiting from palliative care.

CONCLUSION

The integration of rule-based algorithms with electronic medical records offers a promising method for enhancing the timely and accurate identification of patients with cancer might benefit from palliative care.

Collapse

Zhao K, Ebrahimie E, Mohammadi-Dehcheshmeh M, Lewsey MG, Zheng L, Hoogenraad NJ. Transcriptomic signature of cancer cachexia by integration of machine learning, literature mining and meta-analysis. Comput Biol Med 2024;172:108233. [PMID: 38452471 DOI: 10.1016/j.compbiomed.2024.108233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 01/23/2024] [Accepted: 02/25/2024] [Indexed: 03/09/2024]

Zhang S, Wang Y, Qi Z, Tong S, Zhu D. Data mining and analysis of adverse event signals associated with teprotumumab using the Food and Drug Administration adverse event reporting system database. Int J Clin Pharm 2024;46:471-479. [PMID: 38245664 DOI: 10.1007/s11096-023-01676-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 11/20/2023] [Indexed: 01/22/2024]

Abstract

BACKGROUND

Teprotumumab was approved by the US Food and Drug Administration (FDA) for the treatment of thyroid eye disease in 2020. However, its adverse events (AEs) have not been investigated in real-world settings.

AIM

This study aimed to detect and evaluate AEs associated with teprotumumab in the real-world setting by conducting a pharmacovigilance analysis of the FDA Adverse Event Reporting System (FAERS) database.

METHOD

Reporting odds ratio (ROR) was used to detect risk signals from the data from January 2020 to March 2023 in the FAERS database.

RESULTS

A total of 3,707,269 cases were retrieved, of which 1542 were related to teprotumumab. The FAERS analysis identified 99 teprotumumab-related AE signals in 14 System Organ Classes (SOCs). The most frequent AEs were muscle spasms (n = 287), fatigue (n = 174), blood glucose increase (n = 121), alopecia (n = 120), nausea (n = 118), hyperacusis (n = 117), and headache (n = 117). The AEs with strongest signal strengths were autophony (ROR = 14,475.49), deafness permanent (ROR = 1853.35), gingival recession (ROR = 190.74), deafness neurosensory (ROR = 129.89), nail growth abnormal (ROR = 103.67), onychoclasis (ROR = 73.58), ear discomfort (ROR = 72.88), and deafness bilateral (ROR = 62.46). Eleven positive AE signals were found at the standardized MedDRA queries (SMQs) level, of which the top five SMQs were hyperglycemia/new-onset diabetes mellitus, hearing impairment, gastrointestinal nonspecific symptoms and therapeutic procedures, noninfectious diarrhea, and hypertension. Age significantly increased the risk of hearing impairment.

CONCLUSION

This study identified potential new and unexpected AE signals of teprotumumab. Our findings emphasize the importance of pharmacovigilance analysis in the real world to identify and manage AEs effectively, ultimately improving patient safety during teprotumumab treatment.

Collapse

Li T, Hu K, Ye L, Ma J, Huang L, Guo C, Huang X, Jiang J, Xie X, Guo C, He Q. Association of Antipsychotic Drugs with Venous Thromboembolism: Data Mining of Food and Drug Administration Adverse Event Reporting System and Mendelian Randomization Analysis. J Atheroscler Thromb 2024;31:396-418. [PMID: 38030236 PMCID: PMC10999720 DOI: 10.5551/jat.64461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 09/25/2023] [Indexed: 12/01/2023] Open

Wermers Z, Yoo S, Radenbaugh B, Douglass A, Biesecker LG, Johnston JJ. Comparison of literature mining tools for variant classification: Through the lens of 50 RYR1 variants. Genet Med 2024;26:101083. [PMID: 38281099 DOI: 10.1016/j.gim.2024.101083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 01/19/2024] [Accepted: 01/22/2024] [Indexed: 01/29/2024] Open

Rocha HAL, Solha EZM, Furtado V, Justino FL, Barreto LAL, da Silva RG, de Oliveira ÍM, Bates DW, de Góes Cavalcanti LP, Lima Neto AS, de Oliveira EA. COVID-19 outbreaks surveillance through text mining applied to electronic health records. BMC Infect Dis 2024;24:359. [PMID: 38549109 PMCID: PMC10976796 DOI: 10.1186/s12879-024-09250-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 03/24/2024] [Indexed: 04/01/2024] Open

Lu S, Yang J, Gu Y, He D, Wu H, Sun W, Xu D, Li C, Guo C. Advances in Machine Learning Processing of Big Data from Disease Diagnosis Sensors. ACS Sens 2024;9:1134-1148. [PMID: 38363978 DOI: 10.1021/acssensors.3c02670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2024]

Huang L, Fan Y, Lin R, Zhao Y, Mo Y, Luo S, Li Z. Investigating acupoint selection and combinations of acupuncture for primary idiopathic tinnitus using data mining. Medicine (Baltimore) 2024;103:e37107. [PMID: 38518013 PMCID: PMC10956944 DOI: 10.1097/md.0000000000037107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Accepted: 01/08/2024] [Indexed: 03/24/2024] Open

Abstract

BACKGROUND

Acupuncture is widely used in the treatment of tinnitus worldwide because of its good efficacy and safety. However, the criteria for selecting acupoint prescriptions and combinations have not been summarized. Therefore, data mining was used herein to determine the treatment principles and the most effective acupoint selection for the treatment of idiopathic tinnitus.

METHODS

The clinical research literature of acupuncture in the treatment of idiopathic tinnitus from the establishment of the database to September 1, 2023 in China National Knowledge Infrastructure, China Medical Journal Full-text Database, PubMed, Embase, Cochrane Library and Web of Science databases was retrieved and extracted. Microsoft Excel 2016 was used to establish the acupoint prescription database and the frequency statistics of acupoints, meridians and specific acupoints were carried out. IBM SPSS Statistics 25.0 software was used for cluster analysis of acupoints, and IBM SPSS Modeler18.0 software was used for association rule analysis of acupoints.

RESULTS

A total of 112 articles were included, involving 221 acupuncture prescriptions, including 99 acupoints, with a total frequency of 1786 times. The 5 most frequently used acupoints were Tinggong (SI19), Tinghui (GB2), Yifeng (TE17), Ermen (TE21), and Zhongzhu (TE3). The commonly used meridians were Sanjiao meridian of hand-shaoyang, Gallbladder meridian of foot-shaoyang and Small intestine meridian of hand-taiyang. The specific points are mostly Crossing point, Five-shu point and Yuan-primary point. The core acupoint combination of association rules was Ermen (TE21)-Tinggong (SI19)-Tinghui (GB2)-Yifeng (TE17), and 3 effective clustering groups were obtained by cluster analysis of high-frequency acupoints.

CONCLUSION

In this study, the published literature on acupuncture treatment of idiopathic tinnitus was analyzed by data mining, and the relationship between acupoints was explored, which provided a more wise choice for clinical acupuncture treatment of idiopathic tinnitus.

Collapse

Karystianis G, Lukmanjaya W, Buchan I, Simpson P, Ginnivan N, Nenadic G, Butler T. An analysis of published study designs in PubMed prisoner health abstracts from 1963 to 2023: a text mining study. BMC Med Res Methodol 2024;24:68. [PMID: 38494501 PMCID: PMC10944606 DOI: 10.1186/s12874-024-02186-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 02/20/2024] [Indexed: 03/19/2024] Open

Abstract

BACKGROUND

The challenging nature of studies with incarcerated populations and other offender groups can impede the conduct of research, particularly that involving complex study designs such as randomised control trials and clinical interventions. Providing an overview of study designs employed in this area can offer insights into this issue and how research quality may impact on health and justice outcomes.

METHODS

We used a rule-based approach to extract study designs from a sample of 34,481 PubMed abstracts related to epidemiological criminology published between 1963 and 2023. The results were compared against an accepted hierarchy of scientific evidence.

RESULTS

We evaluated our method in a random sample of 100 PubMed abstracts. An F1-Score of 92.2% was returned. Of 34,481 study abstracts, almost 40.0% (13,671) had an extracted study design. The most common study design was observational (37.3%; 5101) while experimental research in the form of trials (randomised, non-randomised) was present in 16.9% (2319). Mapped against the current hierarchy of scientific evidence, 13.7% (1874) of extracted study designs could not be categorised. Among the remaining studies, most were observational (17.2%; 2343) followed by systematic reviews (10.5%; 1432) with randomised controlled trials accounting for 8.7% (1196) of studies and meta-analysis for 1.4% (190) of studies.

CONCLUSIONS

It is possible to extract epidemiological study designs from a large-scale PubMed sample computationally. However, the number of trials, systematic reviews, and meta-analysis is relatively small - just 1 in 5 articles. Despite an increase over time in the total number of articles, study design details in the abstracts were missing. Epidemiological criminology still lacks the experimental evidence needed to address the health needs of the marginalized and isolated population that is prisoners and offenders.

Collapse

Liu C, Sun K, Zhou Q, Duan Y, Shu J, Kan H, Gu Z, Hu J. CPMI-ChatGLM: parameter-efficient fine-tuning ChatGLM with Chinese patent medicine instructions. Sci Rep 2024;14:6403. [PMID: 38493251 PMCID: PMC10944515 DOI: 10.1038/s41598-024-56874-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 03/12/2024] [Indexed: 03/18/2024] Open

Huang L, Shi F, Hu D, Kang D. Analysis of research topics and trends in investigator-initiated research/trials (IIRs/IITs): A topic modeling study. Medicine (Baltimore) 2024;103:e37375. [PMID: 38457583 PMCID: PMC10919521 DOI: 10.1097/md.0000000000037375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/26/2023] [Accepted: 02/05/2024] [Indexed: 03/10/2024] Open

Abstract

BACKGROUND

With the exponential growth of publications in the field of investigator-initiated research/trials (IIRs/IITs), it has become necessary to employ text mining and bibliometric analysis as tools for gaining deeper insights into this area of study. By using these methods, researchers can effectively identify and analyze research topics within the field.

METHODS

This study retrieved relevant publications from the Web of Science Core Collection and conducted bioinformatics analysis. The latent Dirichlet allocation model, which is based on machine learning, was utilized to identify subfield research topics.

RESULTS

A total of 4315 articles related to IIRs/IITs were obtained from the Web of Science Core Collection. After excluding duplicates and articles with missing abstracts, a final dataset of 3333 articles was included for bibliometric analysis. The number of publications showed a steady increase over time, particularly since 2000. The United States, Germany, the United Kingdom, the Netherlands, Canada, Denmark, Japan, Switzerland, and France emerged as the most productive countries in terms of IIRs/IITs. The citation analysis revealed intriguing trends, with certain highly cited articles showing a significant increase in citation frequency in recent years. A model with 45 topics was deemed the best fit for characterizing the extensively researched fields within IIRs/IITs. Our analysis revealed 10 top topics that have garnered significant attention, spanning domains such as community health, cancer treatment, brain development and disease mechanisms, nursing research, and stem cell therapy. These top topics offer researchers valuable directions for further investigation and innovation. Additionally, we identified 12 hot topics, which represent the most cutting-edge and highly regarded research areas within the field.

CONCLUSION

This study contributes to a comprehensive understanding of the current research landscape and provides valuable insights for researchers working in this domain.

Collapse

Saheb T. Mapping Ethical Artificial Intelligence Policy Landscape: A Mixed Method Analysis. Sci Eng Ethics 2024;30:9. [PMID: 38451328 PMCID: PMC10920462 DOI: 10.1007/s11948-024-00472-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 01/30/2024] [Indexed: 03/08/2024]

Pei Y, O'Brien KH. Use of Social Media Data Mining to Examine Needs, Concerns, and Experiences of People With Traumatic Brain Injury. Am J Speech Lang Pathol 2024;33:831-847. [PMID: 38147471 DOI: 10.1044/2023_ajslp-23-00297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]

Abstract

PURPOSE

Given the limited availability of topic-specific resources, many people turn to anonymous social media platforms such as Reddit to seek information and connect to others with similar experiences and needs. Mining of such data can therefore identify unmet needs within the community and allow speech-language pathologists to incorporate clients' real-life insights into clinical practices.

METHOD

A mixed-method analysis was performed on 3,648 traumatic brain injury (TBI) subreddit posts created between 2013 and 2021. Sentiment analysis was used to determine the sentiment expressed in each post; topic modeling and qualitative content analysis were used to uncover the main topics discussed across posts. Subgroup analyses were conducted based on injury severity, chronicity, and whether the post was authored by a person with TBI or a close other.

RESULTS

There was no significant difference between the number of posts with positive sentiment and the number of posts with negative sentiment. Comparisons between subgroups showed significantly higher positive sentiment in posts by or about people with moderate-to-severe TBI (compared to mild TBI) and who were more than 1 month postinjury (compared to less than 1 month). Posts by close others had significantly higher positive sentiment than posts by people with TBI. Topic modeling identified three meta-themes: Recovery, Symptoms, and Medical Care. Qualitative content analysis further revealed that returning to productivity and life as well as sharing recovery tips were the primary focus under the Recovery theme. Symptom-related posts often discussed symptom management and validation of experiences. The Medical Care theme encompassed concerns regarding diagnosis, medication, and treatment.

CONCLUSIONS

Concerns and needs shift over time following TBI, and they extend beyond health and functioning to participation in meaningful daily activities. The findings can inform the development of tailored educational resources and rehabilitative approaches, facilitating recovery and community building for individuals with TBI.

SUPPLEMENTAL MATERIAL

https://doi.org/10.23641/asha.24881340.

Collapse

Park YJ, Yang GJ, Sohn CB, Park SJ. GPDminer: a tool for extracting named entities and analyzing relations in biological literature. BMC Bioinformatics 2024;25:101. [PMID: 38448845 PMCID: PMC10916184 DOI: 10.1186/s12859-024-05710-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Accepted: 02/19/2024] [Indexed: 03/08/2024] Open

Wu X, Wen Q, Zhu J. Association rule mining with a special rule coding and dynamic genetic algorithm for air quality impact factors in Beijing, China. PLoS One 2024;19:e0299865. [PMID: 38437225 PMCID: PMC10911623 DOI: 10.1371/journal.pone.0299865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 02/16/2024] [Indexed: 03/06/2024] Open

Abstract

Understanding air quality requires a comprehensive understanding of its various factors. Most of the association rule techniques focuses on high frequency terms, ignoring the potential importance of low- frequency terms and causing unnecessary storage space waste. Therefore, a dynamic genetic association rule mining algorithm is proposed in this paper, which combines the improved dynamic genetic algorithm with the association rule mining algorithm to realize the importance mining of low- frequency terms. Firstly, in the chromosome coding phase of genetic algorithm, an innovative multi-information coding strategy is proposed, which selectively stores similar values of different levels in one storage unit. It avoids storing all the values at once and facilitates efficient mining of valid rules later. Secondly, by weighting the evaluation indicators such as support, confidence and promotion in association rule mining, a new evaluation index is formed, avoiding the need to set a minimum threshold for high-interest rules. Finally, in order to improve the mining performance of the rules, the dynamic crossover rate and mutation rate are set to improve the search efficiency of the algorithm. In the experimental stage, this paper adopts the 2016 annual air quality data set of Beijing to verify the effectiveness of the unit point multi-information coding strategy in reducing the rule storage air, the effectiveness of mining the rules formed by the low frequency item set, and the effectiveness of combining the rule mining algorithm with the swarm intelligence optimization algorithm in terms of search time and convergence. In the experimental stage, this paper adopts the 2016 annual air quality data set of Beijing to verify the effectiveness of the above three aspects. The unit point multi-information coding strategy reduced the rule space storage consumption by 50%, the new evaluation index can mine more interesting rules whose interest level can be up to 90%, while mining the rules formed by the lower frequency terms, and in terms of search time, we reduced it about 20% compared with some meta-heuristic algorithms, while improving convergence.

Collapse

Grotenhuis Z, Mosteiro PJ, Leeuwenberg AM. Modest performance of text mining to extract health outcomes may be almost sufficient for high-quality prognostic model development. Comput Biol Med 2024;170:108014. [PMID: 38301515 DOI: 10.1016/j.compbiomed.2024.108014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 01/03/2024] [Accepted: 01/19/2024] [Indexed: 02/03/2024]

Abstract

BACKGROUND

Across medicine, prognostic models are used to estimate patient risk of certain future health outcomes (e.g., cardiovascular or mortality risk). To develop (or train) prognostic models, historic patient-level training data is needed containing both the predictive factors (i.e., features) and the relevant health outcomes (i.e., labels). Sometimes, when the health outcomes are not recorded in structured data, these are first extracted from textual notes using text mining techniques. Because there exist many studies utilizing text mining to obtain outcome data for prognostic model development, our aim is to study the impact of the text mining quality on downstream prognostic model performance.

METHODS

We conducted a simulation study charting the relationship between text mining quality and prognostic model performance using an illustrative case study about in-hospital mortality prediction in intensive care unit patients. We repeatedly developed and evaluated a prognostic model for in-hospital mortality, using outcome data extracted by multiple text mining models of varying quality.

RESULTS

Interestingly, we found in our case study that a relatively low-quality text mining model (F1 score ≈ 0.50) could already be used to train a prognostic model with quite good discrimination (area under the receiver operating characteristic curve of around 0.80). The calibration of the risks estimated by the prognostic model seemed unreliable across the majority of settings, even when text mining models were of relatively high quality (F1 ≈ 0.80).

DISCUSSION

Developing prognostic models on text-extracted outcomes using imperfect text mining models seems promising. However, it is likely that prognostic models developed using this approach may not produce well-calibrated risk estimates, and require recalibration in (possibly a smaller amount of) manually extracted outcome data.

Collapse

Kung JY, Ly K, Shiri A. Text mining applications to support health library practice: A case study on marijuana legalization Twitter analytics. Health Info Libr J 2024;41:53-63. [PMID: 36598110 DOI: 10.1111/hir.12473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 11/29/2022] [Accepted: 12/14/2022] [Indexed: 01/05/2023]

Lawal O, Ochei LC. Lichen - air quality association rule mining for urban environments in the tropics. Int J Environ Health Res 2024;34:1713-1724. [PMID: 37489590 DOI: 10.1080/09603123.2023.2239716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Accepted: 07/18/2023] [Indexed: 07/26/2023]

Xiong J, Liu X, Li Z, Xiao H, Wang G, Niu Z, Fei C, Zhong F, Wang G, Zhang W, Fu Z, Liu Z, Chen K, Jiang H, Zheng M. αExtractor: a system for automatic extraction of chemical information from biomedical literature. Sci China Life Sci 2024;67:618-621. [PMID: 37758905 DOI: 10.1007/s11427-023-2388-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 06/07/2023] [Indexed: 09/29/2023]

Affiliation(s)

Jiacheng Xiong Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China University of Chinese Academy of Sciences, Beijing, 100049, China
Xiaohong Liu AI Department, Suzhou Alphama Biotechnology Co., Ltd., Suzhou, 215125, China
Zhaojun Li AI Department, Suzhou Alphama Biotechnology Co., Ltd., Suzhou, 215125, China College of Computer and Information Engineering, Dezhou University, Dezhou, 253023, China
Hongzhong Xiao AI Department, Suzhou Alphama Biotechnology Co., Ltd., Suzhou, 215125, China
Guangchao Wang College of Computer and Information Engineering, Dezhou University, Dezhou, 253023, China
Zhenjiang Niu AI Department, Suzhou Alphama Biotechnology Co., Ltd., Suzhou, 215125, China
Chaoyuan Fei AI Department, Suzhou Alphama Biotechnology Co., Ltd., Suzhou, 215125, China
Feisheng Zhong Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China University of Chinese Academy of Sciences, Beijing, 100049, China
Gang Wang Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China University of Chinese Academy of Sciences, Beijing, 100049, China
Wei Zhang Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China University of Chinese Academy of Sciences, Beijing, 100049, China
Zunyun Fu Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China University of Chinese Academy of Sciences, Beijing, 100049, China
Zhiguo Liu AI Department, Suzhou Alphama Biotechnology Co., Ltd., Suzhou, 215125, China
Kaixian Chen Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China University of Chinese Academy of Sciences, Beijing, 100049, China
Hualiang Jiang Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China. University of Chinese Academy of Sciences, Beijing, 100049, China.
Mingyue Zheng Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China. University of Chinese Academy of Sciences, Beijing, 100049, China.

Collapse

Wang L, Wang Y, Zhao Q. Data mining and analysis of the adverse events derived signals of 4 gadolinium-based contrast agents based on the US Food and drug administration adverse event reporting system. Expert Opin Drug Saf 2024;23:339-352. [PMID: 37837355 DOI: 10.1080/14740338.2023.2271834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 10/13/2023] [Indexed: 10/16/2023]

Hellali R, Chelly Dagdia Z, Ktaish A, Zeitouni K, Annane D. Corticosteroid sensitivity detection in sepsis patients using a personalized data mining approach: A clinical investigation. Comput Methods Programs Biomed 2024;245:108017. [PMID: 38241801 DOI: 10.1016/j.cmpb.2024.108017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 10/29/2023] [Accepted: 01/09/2024] [Indexed: 01/21/2024]

Mol MJ, Belfi B, Bakk Z. Unravelling the skills of data scientists: A text mining analysis of Dutch university master programs in data science and artificial intelligence. PLoS One 2024;19:e0299327. [PMID: 38422040 PMCID: PMC10903789 DOI: 10.1371/journal.pone.0299327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 02/07/2024] [Indexed: 03/02/2024] Open

Yu Y, Hu G, Yang X, Yin Y, Tong K, Yu R. A strategic study of acupuncture for diabetic kidney disease based on meta-analysis and data mining. Front Endocrinol (Lausanne) 2024;15:1273265. [PMID: 38469137 PMCID: PMC10925656 DOI: 10.3389/fendo.2024.1273265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/05/2023] [Accepted: 01/22/2024] [Indexed: 03/13/2024] Open

Abstract

Objective

The specific benefit and selection of acupoints in acupuncture for diabetic kidney disease (DKD) remains controversial. This study aims to explore the specific benefits and acupoints selection of acupuncture for DKD through meta-analysis and data mining.

Methods

Clinical trials of acupuncture for DKD were searched in eight common databases. Meta-analysis was used to evaluate its efficacy and safety, and data mining was used to explore its acupoints selection.

Results

Meta-analysis displayed that compared with the conventional drug group, the combined acupuncture group significantly increased the clinical effective rate (risk ratio [RR] 1.35, 95% confidence interval [CI] 1.20 to 1.51, P < 0.00001) and high-density lipoprotein cholesterol (mean difference [MD] 0.36, 95% CI 0.27 to 0.46, P < 0.00001), significantly reduced the urinary albumin (MD -0.39, 95% CI -0.42 to -0.36, P < 0.00001), urinary microalbumin (MD -32.63, 95% CI -42.47 to -22.79, P < 0.00001), urine β2-microglobulin (MD -0.45, 95% CI -0.66 to -0.24, P < 0.0001), serum creatinine (MD -15.36, 95% CI -21.69 to -9.03, P < 0.00001), glycated hemoglobin A1c (MD -0.69, 95% CI -1.18 to -0.19, P = 0.006), fasting blood glucose (MD -0.86, 95% CI -0.90 to -0.82, P < 0.00001), 2h postprandial plasma glucose (MD -0.87, 95% CI -0.92 to -0.82, P < 0.00001), total cholesterol (MD -1.23, 95% CI -2.05 to -0.40, P = 0.003), triglyceride (MD -0.69, 95% CI -1.23 to -0.15, P = 0.01), while adverse events were comparable. Data mining revealed that CV12, SP8, SP10, ST36, SP6, BL20, BL23, and SP9 were the core acupoints for DKD treated by acupuncture.

Conclusion

Acupuncture improved clinical symptoms, renal function indices such as uALB, umALB, uβ2-MG, and SCR, as well as blood glucose and blood lipid in patients with DKD, and has a favorable safety profile. CV12, SP8, SP10, ST36, SP6, BL20, BL23, and SP9 are the core acupoints for acupuncture in DKD, and this program is expected to become a supplementary treatment for DKD.

Collapse

He X, Zhang H, Huang J, Zhao D, Li Y, Nie R, Liu X. [Research on fault diagnosis of patient monitor based on text mining]. Sheng Wu Yi Xue Gong Cheng Xue Za Zhi 2024;41:168-176. [PMID: 38403618 PMCID: PMC10894744 DOI: 10.7507/1001-5515.202306017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]

He YJ, Fan YS, Miao FR, Zhao XY, Zhang FZ, He C, Zhang H. Acupoint selection rules of acupuncture and moxibustion in treating neurogenic bladder based on data mining. Zhen Ci Yan Jiu 2024;49:198-207. [PMID: 38413042 DOI: 10.13702/j.1000-0607.20230018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]

Mughaz D, HaCohen-Kerner Y, Gabbay D. Extraction of time-related expressions using text mining with application to Hebrew. PLoS One 2024;19:e0293196. [PMID: 38394097 PMCID: PMC10889890 DOI: 10.1371/journal.pone.0293196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 10/08/2023] [Indexed: 02/25/2024] Open

Qiao H, Chen Y, Qian C, Guo Y. Clinical data mining: challenges, opportunities, and recommendations for translational applications. J Transl Med 2024;22:185. [PMID: 38378565 PMCID: PMC10880222 DOI: 10.1186/s12967-024-05005-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 02/18/2024] [Indexed: 02/22/2024] Open

Chandrasekaran R, Konaraddi K, Sharma SS, Moustakas E. Text-Mining and Video Analytics of COVID-19 Narratives Shared by Patients on YouTube. J Med Syst 2024;48:21. [PMID: 38358554 DOI: 10.1007/s10916-024-02047-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Accepted: 02/13/2024] [Indexed: 02/16/2024]

Hong S, Wang T, Fu X, Li G. Research on quantitative evaluation of digital economy policy in China based on the PMC index model. PLoS One 2024;19:e0298312. [PMID: 38359065 PMCID: PMC10868804 DOI: 10.1371/journal.pone.0298312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 01/21/2024] [Indexed: 02/17/2024] Open

Valdez D, Mena-Meléndez L, Crawford BL, Jozkowski KN. Analyzing Reddit Forums Specific to Abortion That Yield Diverse Dialogues Pertaining to Medical Information Seeking and Personal Worldviews: Data Mining and Natural Language Processing Comparative Study. J Med Internet Res 2024;26:e47408. [PMID: 38354044 PMCID: PMC10902765 DOI: 10.2196/47408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Revised: 09/27/2023] [Accepted: 12/20/2023] [Indexed: 02/16/2024] Open

Abstract

BACKGROUND

Attitudes toward abortion have historically been characterized via dichotomized labels, yet research suggests that these labels do not appropriately encapsulate beliefs on abortion. Rather, contexts, circumstances, and lived experiences often shape views on abortion into more nuanced and complex perspectives. Qualitative data have also been shown to underpin belief systems regarding abortion. Social media, as a form of qualitative data, could reveal how attitudes toward abortion are communicated publicly in web-based spaces. Furthermore, in some cases, social media can also be leveraged to seek health information.

OBJECTIVE

This study applies natural language processing and social media mining to analyze Reddit (Reddit, Inc) forums specific to abortion, including r/Abortion (the largest subreddit about abortion) and r/AbortionDebate (a subreddit designed to discuss and debate worldviews on abortion). Our analytical pipeline intends to identify potential themes within the data and the affect from each post.

METHODS

We applied a neural network-based topic modeling pipeline (BERTopic) to uncover themes in the r/Abortion (n=2151) and r/AbortionDebate (n=2815) subreddits. After deriving the optimal number of topics per subreddit using an iterative coherence score calculation, we performed a sentiment analysis using the Valence Aware Dictionary and Sentiment Reasoner to assess positive, neutral, and negative affect and an emotion analysis using the Text2Emotion lexicon to identify potential emotionality per post. Differences in affect and emotion by subreddit were compared.

RESULTS

The iterative coherence score calculation revealed 10 topics for both r/Abortion (coherence=0.42) and r/AbortionDebate (coherence=0.35). Topics in the r/Abortion subreddit primarily centered on information sharing or offering a source of social support; in contrast, topics in the r/AbortionDebate subreddit centered on contextualizing shifting or evolving views on abortion across various ethical, moral, and legal domains. The average compound Valence Aware Dictionary and Sentiment Reasoner scores for the r/Abortion and r/AbortionDebate subreddits were 0.01 (SD 0.44) and -0.06 (SD 0.41), respectively. Emotionality scores were consistent across the r/Abortion and r/AbortionDebate subreddits; however, r/Abortion had a marginally higher average fear score of 0.36 (SD 0.39).

CONCLUSIONS

Our findings suggest that people posting on abortion forums on Reddit are willing to share their beliefs, which manifested in diverse ways, such as sharing abortion stories including how their worldview changed, which critiques the value of dichotomized abortion identity labels, and information seeking. Notably, the style of discourse varied significantly by subreddit. r/Abortion was principally leveraged as an information and outreach source; r/AbortionDebate largely centered on debating across various legal, ethical, and moral abortion domains. Collectively, our findings suggest that abortion remains an opaque yet politically charged issue for people and that social media can be leveraged to understand views and circumstances surrounding abortion.

Collapse

Li S, Wang J. Exploration of the methods and rules of syndrome/pattern differentiation and treatment of headache from the acupuncture-moxibustion prescriptions of ancient literature based on the data mining technology. Zhongguo Zhen Jiu 2024;44:224-230. [PMID: 38373772 DOI: 10.13703/j.0255-2930.20230629-k0001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]

Abstract

The study aims to identifying and exploring the methods and rules of the syndrome/pattern differentiation and treatment of headache through collating acupuncture-moxibustion prescriptions recorded earliest in ancient literature. Using Excel2016 software, the structural data table was prepared with "name of disease", "location of disease", "etiology and pathogenesis", "complicated symptoms", "sites for acupuncture and moxibustion" and "techniques of acupuncture and moxibustion" included. The normative approach was conduced on "name of disease", "etiology and pathogenesis", "complicated symptoms" and "nomenclature of acupoint". Using conventional literature statistical method, combined with Apriori algorithm of association rule, the implicit multi-dimensional correlation rules were explored among various elements of syndrome/pattern differentiation of headache and corresponding therapeutic methods. Based on the findings of the study, the regularity was distinct regarding the treatment at "distal acupoints along the affected meridian and the local acupoints at the affected area" after identifying the location of headache; the strong association was presented between "etiology and pathogenesis" and "acupoint selection", and between "etiology and pathogenesis" and "therapeutic methods", including 9 and 12 rules, respectively. Guanyuan (CV 4) selected in treatment of headache was associated with kidney deficiency, the combination of Zhongwan (CV 12) and Zusanli (ST 36) was with phlegm, Fengfu (GV 16), Fengchi (GB 20), Xinghui (GV 22) and Baihui (GV 20) was with wind, and Hegu (LI 4) was with cold. Moxibustion was dominant in treatment if headache was caused by pathogenic cold or related to deficiency syndrome; acupuncture was used specially for the case caused by phlegm, or interaction of wind and phlegm or wind and heat. For heat syndrome, either acupuncture or moxibustion was applicable, in general, acupuncture was more commonly used in comparison with moxibustion for headache. There were 6 association rules regarding the acupoint selection and the techniques of acupuncture and moxibustion. Moxibustion was generally applied to Xinghui (GV 22), Shangxing (GV 23) and Baihui (GV 20) ; and acupuncture was to Fengfu (GV 16), Hegu (LI 4) and Zusanli (ST 36). There were few association rules between the complicated symptoms and acupoint selection. Among nearly 100 complications, there were only 3 feature associations. Zhongwan (CV 12) was selected for the case with poor appetite, Chengjiang (CV 24) was with neck stiffness, and Fengchic (GB 20) combined with Fenglong (ST 40) or Jiexi (ST 41) was used if vertigo was present. In the ancient time, regarding the treatment of headache, acupuncture and moxibustion are delivered based on the three aspects, i.e. the location of illness, the etiology and pathogenesis, and the complicated symptoms. For acupoint selection, in line with the courses of affected meridians, the adjacent and distal acupoints are combined according to the location of headache. The acupoint prescription is composed in terms of the etiology and pathogenesis. The techniques of acupuncture and moxibustion are optimized in consideration of the sites where acupuncture and moxibustion are operated.

Collapse

Guérin J, Nahid A, Tassy L, Deloger M, Bocquet F, Thézenas S, Desandes E, Le Deley MC, Durando X, Jaffré A, Es-Saad I, Crochet H, Le Morvan M, Lion F, Raimbourg J, Khay O, Craynest F, Giro A, Laizet Y, Bertaut A, Joly F, Livartowski A, Heudel P. Consore: A Powerful Federated Data Mining Tool Driving a French Research Network to Accelerate Cancer Research. Int J Environ Res Public Health 2024;21:189. [PMID: 38397680 PMCID: PMC10887639 DOI: 10.3390/ijerph21020189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/28/2024] [Accepted: 01/31/2024] [Indexed: 02/25/2024]

Affiliation(s)

Julien Guérin Institut Curie, 75005 Paris, France;
Amine Nahid Coexya, 69370 Saint-Didier-au-Mont-d’Or, France; (A.N.); (F.J.)
Louis Tassy Institut Paoli-Calmettes, 13009 Marseille, France; (L.T.); (M.L.M.)
Marc Deloger Gustave Roussy, 94805 Villejuif, France; (M.D.); (F.L.)
François Bocquet Data Factory & Analytics Department, Institut de Cancérologie de l’Ouest, 44805 Nantes-Angers, France (J.R.)
Simon Thézenas Institut Régional du Cancer de Montpellier, 34090 Montpellier, France;
Emmanuel Desandes Institut de Cancérologie de Lorraine, 54519 Nancy, France; (E.D.); (O.K.)
Marie-Cécile Le Deley Centre Oscar Lambret, 59000 Lille, France; (M.-C.L.D.); (F.C.)
Xavier Durando Centre Jean Perrin, 63011 Clermont Ferrand, France; (X.D.); (A.G.)
Anne Jaffré Institut Bergonié, 33076 Bordeaux, France; (A.J.); (Y.L.)
Ikram Es-Saad Centre Georges Francois Leclerc, 21000 Dijon, France; (I.E.-S.); (A.B.)
Hugo Crochet Centre Léon Bérard, 69008 Lyon, France (P.H.)
Marie Le Morvan Institut Paoli-Calmettes, 13009 Marseille, France; (L.T.); (M.L.M.)
François Lion Gustave Roussy, 94805 Villejuif, France; (M.D.); (F.L.)
Judith Raimbourg Data Factory & Analytics Department, Institut de Cancérologie de l’Ouest, 44805 Nantes-Angers, France (J.R.)
Oussama Khay Institut de Cancérologie de Lorraine, 54519 Nancy, France; (E.D.); (O.K.)
Franck Craynest Centre Oscar Lambret, 59000 Lille, France; (M.-C.L.D.); (F.C.)
Alexia Giro Centre Jean Perrin, 63011 Clermont Ferrand, France; (X.D.); (A.G.)
Yec’han Laizet Institut Bergonié, 33076 Bordeaux, France; (A.J.); (Y.L.)
Aurélie Bertaut Centre Georges Francois Leclerc, 21000 Dijon, France; (I.E.-S.); (A.B.)
Frederik Joly Coexya, 69370 Saint-Didier-au-Mont-d’Or, France; (A.N.); (F.J.)
Alain Livartowski Institut Curie, 75005 Paris, France;
Pierre Heudel Centre Léon Bérard, 69008 Lyon, France (P.H.)

Collapse

Fins IS, Davies H, Farrell S, Torres JR, Pinchbeck G, Radford AD, Noble P. Evaluating ChatGPT text mining of clinical records for companion animal obesity monitoring. Vet Rec 2024;194:e3669. [PMID: 38058223 PMCID: PMC10952314 DOI: 10.1002/vetr.3669] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 09/25/2023] [Accepted: 11/07/2023] [Indexed: 12/08/2023]

Kilicoglu H, Ensan F, McInnes B, Wang LL. Semantics-enabled biomedical literature analytics. J Biomed Inform 2024;150:104588. [PMID: 38244957 DOI: 10.1016/j.jbi.2024.104588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 01/10/2024] [Indexed: 01/22/2024]