1
|
Atwal K. Artificial intelligence in clinical nutrition and dietetics: A brief overview of current evidence. Nutr Clin Pract 2024; 39:736-742. [PMID: 38591653 DOI: 10.1002/ncp.11150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 03/07/2024] [Accepted: 03/13/2024] [Indexed: 04/10/2024] Open
Abstract
The rapid surge in artificial intelligence (AI) has dominated technological innovation in today's society. As experts begin to understand the potential, a spectrum of opportunities could yield a remarkable revolution. The upsurge in healthcare could transform clinical interventions and outcomes, but it risks dehumanization and increased unethical practices. The field of clinical nutrition and dietetics is no exception. This article finds a multitude of developments underway, which include the use of AI for malnutrition screening; predicting clinical outcomes, such as disease onset, and clinical risks, such as drug interactions; aiding interventions, such as estimating nutrient intake; applying precision nutrition, such as measuring postprandial glycemic response; and supporting workflow through chatbots trained on natural language models. Although the opportunity and scalability of AI is incalculably attractive, especially in the face of poor healthcare resources, the threat cannot be ignored. The risk of malpractice and lack of accountability are some of the main concerns. As such, the healthcare professional's responsibility remains paramount. The data used to train AI models could be biased, which could risk the quality of care to vulnerable or minority patient groups. Standardized AI-development protocols, benchmarked to care recommendations, with rigorous large-scale validation are required to maximize application among different settings. AI could overturn the healthcare landscape, and this article skims the surface of its potential in clinical nutrition and dietetics.
Collapse
Affiliation(s)
- Kiranjit Atwal
- Department of Nutritional Sciences, King's College London, London, UK
- School of Health Professions, University of Plymouth, Plymouth, UK
| |
Collapse
|
2
|
Lan Y, Xu X, Guo Z, Sun L, Lai J, Li J. Ifood: Development and usability study of a social media-based applet for dietary monitoring. Digit Health 2023; 9:20552076231210707. [PMID: 37915791 PMCID: PMC10617295 DOI: 10.1177/20552076231210707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/12/2023] [Indexed: 11/03/2023] Open
Abstract
Background Dietary monitoring is critical to maintaining human health. Social media platforms are widely used for daily recording and communication for individuals' diets and activities. The textual content shared on social media offers valuable resources for dietary monitoring. Objective This study aims to describe the development of iFood, an applet providing personal dietary monitoring based on social media content, and validate its usability, which will enable efficient personal dietary monitoring. Methods The process of the development and validation of iFood is divided into four steps: Diet datasets construction, diet record and analysis, diet monitoring applet design, and diet monitoring applet usability assessment. The diet datasets were constructed with the data collected from Weibo, Meishijie, and diet guidelines, which will be used as the basic knowledge for further model training in the phase of diet record and analysis. Then, the friendly user interface was designed to link users with backend functions. Finally, the applet was deployed as a WeChat applet and 10 users from the Beijing Union Medical College have been recruited to validate the usability of iFood. Results Three dietary datasets, including User Visual-Textual Dataset, Dietary Information Expansion Dataset, and Diet Recipe Dataset have been constructed. The performance of 4 models for recognizing diet and fusing unimodality data was 40.43%(dictionary-based model), 18.45%(rule-based model), 59.95%(Inception-ResNet-v2), and 51.38% (K-nearest neighbor), respectively. Furthermore, we have designed a user-friendly interface for the iFood applet and conducted a usability assessment, which resulted in an above-average usability score. Conclusions iFood is effective for managing individual dietary behaviors through its seamless integration with social media data. This study suggests that future products could utilize social media data to promote healthy lifestyles.
Collapse
Affiliation(s)
- Yushan Lan
- Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Xiaowei Xu
- Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Zhen Guo
- Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Lianglong Sun
- Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Jianqiang Lai
- National Institute for Nutrition and Health, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Jiao Li
- Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| |
Collapse
|
3
|
Khan YF, Kaushik B, Chowdhary CL, Srivastava G. Ensemble Model for Diagnostic Classification of Alzheimer's Disease Based on Brain Anatomical Magnetic Resonance Imaging. Diagnostics (Basel) 2022; 12:diagnostics12123193. [PMID: 36553199 PMCID: PMC9777931 DOI: 10.3390/diagnostics12123193] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 11/08/2022] [Accepted: 11/15/2022] [Indexed: 12/24/2022] Open
Abstract
Alzheimer's is one of the fast-growing diseases among people worldwide leading to brain atrophy. Neuroimaging reveals extensive information about the brain's anatomy and enables the identification of diagnostic features. Artificial intelligence (AI) in neuroimaging has the potential to significantly enhance the treatment process for Alzheimer's disease (AD). The objective of this study is two-fold: (1) to compare existing Machine Learning (ML) algorithms for the classification of AD. (2) To propose an effective ensemble-based model for the same and to perform its comparative analysis. In this study, data from the Alzheimer's Diseases Neuroimaging Initiative (ADNI), an online repository, is utilized for experimentation consisting of 2125 neuroimages of Alzheimer's disease (n = 975), mild cognitive impairment (n = 538) and cognitive normal (n = 612). For classification, the framework incorporates a Decision Tree (DT), Random Forest (RF), Naïve Bayes (NB), and K-Nearest Neighbor (K-NN) followed by some variations of Support Vector Machine (SVM), such as SVM (RBF kernel), SVM (Polynomial Kernel), and SVM (Sigmoid kernel), as well as Gradient Boost (GB), Extreme Gradient Boosting (XGB) and Multi-layer Perceptron Neural Network (MLP-NN). Afterwards, an Ensemble Based Generic Kernel is presented where Master-Slave architecture is combined to attain better performance. The proposed model is an ensemble of Extreme Gradient Boosting, Decision Tree and SVM_Polynomial kernel (XGB + DT + SVM). At last, the proposed method is evaluated using cross-validation using statistical techniques along with other ML models. The presented ensemble model (XGB + DT + SVM) outperformed existing state-of-the-art algorithms with an accuracy of 89.77%. The efficiency of all the models was optimized using Grid-based tuning, and the results obtained after such process showed significant improvement. XGB + DT + SVM with optimized parameters outperformed all other models with an efficiency of 95.75%. The implication of the proposed ensemble-based learning approach clearly shows the best results compared to other ML models. This experimental comparative analysis improved understanding of the above-defined methods and enhanced their scope and significance in the early detection of Alzheimer's disease.
Collapse
Affiliation(s)
| | - Baijnath Kaushik
- School of CSE, Shri Mata Vaishno Devi University, Katra 182320, India
| | - Chiranji Lal Chowdhary
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, India
- Correspondence:
| | - Gautam Srivastava
- Department of Mathematics and Computer Science, Brandon University, Brandon, MB R7A 6A9, Canada
- Research Centre for Interneural Computing, China Medical University, Taichung 40402, Taiwan
- Department of Computer Science and Math, Lebanese American University, Beirut 1102, Lebanon
| |
Collapse
|
4
|
Li J, Lan Y, Xu X, Guo Z, Sun L, Lai J. iFood, a Social Media-based Applet for Dietary Management: Development and Usability Study (Preprint). JMIR Form Res 2022. [DOI: 10.2196/44826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/31/2023] Open
|
5
|
Baxi MK, Philip J, Mago V. Resilience of political leaders and healthcare organizations during COVID-19. PeerJ Comput Sci 2022; 8:e1121. [PMID: 36262139 PMCID: PMC9575867 DOI: 10.7717/peerj-cs.1121] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 09/07/2022] [Indexed: 06/16/2023]
Abstract
This study assesses the online societal association of leaders and healthcare organizations from the top-10 COVID-19 resilient nations through public engagement, sentiment strength, and inclusivity and diversity strength. After analyzing 173,071 Tweets authored by the leaders and health organizations, our findings indicate that United Arab Emirate's Prime Minister had the highest online societal association (normalized online societal association: 1.000) followed by the leaders of Canada and Turkey (normalized online societal association: 0.068 and 0.033, respectively); and among the healthcare organizations, the Public Health Agency of Canada was the most impactful (normalized online societal association: 1.000) followed by the healthcare agencies of Turkey and Spain (normalized online societal association: 0.632 and 0.094 respectively). In comparison to healthcare organizations, the leaders displayed a strong awareness of individual factors and generalized their Tweets to a broader audience. The findings also suggest that users prefer accessing social media platforms for information during health emergencies and that leaders and healthcare institutions should realize the potential to use them effectively.
Collapse
Affiliation(s)
- Manmeet Kaur Baxi
- Department of Computer Science, Lakehead University, Thunder Bay, Ontario, Canada
| | - Joshua Philip
- Superior Collegiate and Vocational Institute, Thunder Bay, Ontario, Canada
| | - Vijay Mago
- Department of Computer Science, Lakehead University, Thunder Bay, Ontario, Canada
| |
Collapse
|
6
|
Russo S, Bonassi S. Prospects and Pitfalls of Machine Learning in Nutritional Epidemiology. Nutrients 2022; 14:1705. [PMID: 35565673 PMCID: PMC9105182 DOI: 10.3390/nu14091705] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 04/13/2022] [Accepted: 04/14/2022] [Indexed: 02/06/2023] Open
Abstract
Nutritional epidemiology employs observational data to discover associations between diet and disease risk. However, existing analytic methods of dietary data are often sub-optimal, with limited incorporation and analysis of the correlations between the studied variables and nonlinear behaviours in the data. Machine learning (ML) is an area of artificial intelligence that has the potential to improve modelling of nonlinear associations and confounding which are found in nutritional data. These opportunities notwithstanding, the applications of ML in nutritional epidemiology must be approached cautiously to safeguard the scientific quality of the results and provide accurate interpretations. Given the complex scenario around ML, judicious application of such tools is necessary to offer nutritional epidemiology a novel analytical resource for dietary measurement and assessment and a tool to model the complexity of dietary intake and its relation to health. This work describes the applications of ML in nutritional epidemiology and provides guidelines to avoid common pitfalls encountered in applying predictive statistical models to nutritional data. Furthermore, it helps unfamiliar readers better assess the significance of their results and provides new possible future directions in the field of ML in nutritional epidemiology.
Collapse
Affiliation(s)
- Stefania Russo
- EcoVision Lab, Photogrammetry and Remote Sensing Group, ETH Zürich, 8092 Zurich, Switzerland
| | - Stefano Bonassi
- Department of Human Sciences and Quality of Life Promotion, San Raffaele University, 00166 Rome, Italy;
- Unit of Clinical and Molecular Epidemiology, IRCCS San Raffaele Roma, 00163 Rome, Italy
| |
Collapse
|
7
|
Côté M, Lamarche B. Artificial intelligence in nutrition research: perspectives on current and future applications. Appl Physiol Nutr Metab 2021; 47:1-8. [PMID: 34525321 DOI: 10.1139/apnm-2021-0448] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Artificial intelligence (AI) is a rapidly evolving area that offers unparalleled opportunities of progress and applications in many healthcare fields. In this review, we provide an overview of the main and latest applications of AI in nutrition research and identify gaps to address to potentialize this emerging field. AI algorithms may help better understand and predict the complex and non-linear interactions between nutrition-related data and health outcomes, particularly when large amounts of data need to be structured and integrated, such as in metabolomics. AI-based approaches, including image recognition, may also improve dietary assessment by maximizing efficiency and addressing systematic and random errors associated with self-reported measurements of dietary intakes. Finally, AI applications can extract, structure and analyze large amounts of data from social media platforms to better understand dietary behaviours and perceptions among the population. In summary, AI-based approaches will likely improve and advance nutrition research as well as help explore new applications. However, further research is needed to identify areas where AI does deliver added value compared with traditional approaches, and other areas where AI is simply not likely to advance the field. Novelty: Artificial intelligence offers unparalleled opportunities of progress and applications in nutrition. There remain gaps to address to potentialize this emerging field.
Collapse
Affiliation(s)
- Mélina Côté
- Centre de recherche Nutrition, santé et société (NUTRISS), INAF, Université Laval, Québec, QC, Canada
- School of Nutrition, Université Laval, Québec, QC, Canada
| | - Benoît Lamarche
- Centre de recherche Nutrition, santé et société (NUTRISS), INAF, Université Laval, Québec, QC, Canada
- School of Nutrition, Université Laval, Québec, QC, Canada
| |
Collapse
|
8
|
Yeung AWK, Kletecka-Pulker M, Eibensteiner F, Plunger P, Völkl-Kernstock S, Willschke H, Atanasov AG. Implications of Twitter in Health-Related Research: A Landscape Analysis of the Scientific Literature. Front Public Health 2021; 9:654481. [PMID: 34307273 PMCID: PMC8299201 DOI: 10.3389/fpubh.2021.654481] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 06/09/2021] [Indexed: 01/07/2023] Open
Abstract
Background: Twitter, representing a big social media network, is broadly used for the communication of health-related information. In this work, we aimed to identify and analyze the scientific literature on Twitter use in context of health by utilizing a bibliometric approach, in order to obtain quantitative information on dominant research topics, trending themes, key publications, scientific institutions, and prolific researchers who contributed to this scientific area. Methods: Web of Science electronic database was searched to identify relevant papers on Twitter and health. Basic bibliographic data was obtained utilizing the "Analyze" function of the database. Full records and cited references were exported to VOSviewer, a dedicated bibliometric software, for further analysis. A term map and a keyword map were synthesized to visualize recurring words within titles, abstracts and keywords. Results: The analysis was based on the data from 2,582 papers. The first papers were published in 2009, and the publication count increased rapidly since 2015. Original articles and reviews were published in a ratio of 10.6:1. The Journal of Medical Internet Research was the top journal, and the United States had contributions to over half (52%) of these publications, being the home-country of eight of the top ten most productive institutions. Keyword analysis identified six topically defined clusters, with professional education in healthcare being the top theme cluster (consisting of 66 keywords). The identified papers often investigated Twitter together with other social media, such as YouTube and Facebook. Conclusions: A great diversity of themes was found in the identified papers, including: professional education in healthcare, big data and sentiment analysis, social marketing and substance use, physical and emotional well-being of young adults, and public health and health communication. Our quantitative analysis outlines Twitter as both, an increasingly popular data source, and a highly versatile tool for health-related research.
Collapse
Affiliation(s)
- Andy Wai Kan Yeung
- Oral and Maxillofacial Radiology, Applied Oral Sciences and Community Dental Care, Faculty of Dentistry, The University of Hong Kong, Hong Kong, China
| | - Maria Kletecka-Pulker
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria.,Institute for Ethics and Law in Medicine, University of Vienna, Vienna, Austria
| | - Fabian Eibensteiner
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria.,Division of Pediatric Nephrology and Gastroenterology, Department of Pediatrics and Adolescent Medicine, Comprehensive Center for Pediatrics, Medical University of Vienna, Vienna, Austria
| | - Petra Plunger
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria
| | - Sabine Völkl-Kernstock
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria
| | - Harald Willschke
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria.,Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Medical University Vienna, Vienna, Austria
| | - Atanas G Atanasov
- Ludwig Boltzmann Institute for Digital Health and Patient Safety, Medical University of Vienna, Vienna, Austria.,Institute of Genetics and Animal Biotechnology of the Polish Academy of Sciences, Magdalenka, Poland.,Institute of Neurobiology, Bulgarian Academy of Sciences, Sofia, Bulgaria.,Department of Pharmaceutical Sciences, University of Vienna, Vienna, Austria
| |
Collapse
|
9
|
Morgenstern JD, Rosella LC, Costa AP, de Souza RJ, Anderson LN. Perspective: Big Data and Machine Learning Could Help Advance Nutritional Epidemiology. Adv Nutr 2021; 12:621-631. [PMID: 33606879 PMCID: PMC8166570 DOI: 10.1093/advances/nmaa183] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 11/04/2020] [Accepted: 12/29/2020] [Indexed: 01/09/2023] Open
Abstract
The field of nutritional epidemiology faces challenges posed by measurement error, diet as a complex exposure, and residual confounding. The objective of this perspective article is to highlight how developments in big data and machine learning can help address these challenges. New methods of collecting 24-h dietary recalls and recording diet could enable larger samples and more repeated measures to increase statistical power and measurement precision. In addition, use of machine learning to automatically classify pictures of food could become a useful complimentary method to help improve precision and validity of dietary measurements. Diet is complex due to thousands of different foods that are consumed in varying proportions, fluctuating quantities over time, and differing combinations. Current dietary pattern methods may not integrate sufficient dietary variation, and most traditional modeling approaches have limited incorporation of interactions and nonlinearity. Machine learning could help better model diet as a complex exposure with nonadditive and nonlinear associations. Last, novel big data sources could help avoid unmeasured confounding by offering more covariates, including both omics and features derived from unstructured data with machine learning methods. These opportunities notwithstanding, application of big data and machine learning must be approached cautiously to ensure quality of dietary measurements, avoid overfitting, and confirm accurate interpretations. Greater use of machine learning and big data would also require substantial investments in training, collaborations, and computing infrastructure. Overall, we propose that judicious application of big data and machine learning in nutrition science could offer new means of dietary measurement, more tools to model the complexity of diet and its relations with diseases, and additional potential ways of addressing confounding.
Collapse
Affiliation(s)
- Jason D Morgenstern
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada
| | - Laura C Rosella
- Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
- Vector Institute, Toronto, Ontario, Canada
| | - Andrew P Costa
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada
| | - Russell J de Souza
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada
- Population Health Research Institute, Hamilton Health Sciences, Hamilton, Ontario, Canada
| | - Laura N Anderson
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada
| |
Collapse
|
10
|
Zhang G, Yuan J, Yu M, Wu T, Luo X, Chen F. A machine learning method for acute hypotensive episodes prediction using only non-invasive parameters. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2021; 200:105845. [PMID: 33309303 DOI: 10.1016/j.cmpb.2020.105845] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 11/12/2020] [Indexed: 06/12/2023]
Abstract
BACKGROUND AND OBJECTIVES Accurate prediction of acute hypotensive episodes (AHE) is fundamentally important for timely and appropriate clinical decision-making, as it can provide medical professionals with sufficient time to accurately select more efficient therapeutic interventions for each specific condition. However, existing methods are invasive, easily affected by artifacts and can be difficult to acquire in a pre-hospital setting. METHODS In this study, 1055 patients' records were extracted from the Multiparameter Intelligent Monitoring in Intensive Care II database (MIMIC II), comprising of 388 AHE records and 667 non-AHE records. Six commonly used machine learning algorithms were selected and used to develop an AHE prediction model based on features extracted from seven types of non-invasive physiological parameters. RESULTS The optimal observation window and prediction gap were selected as 300 minutes and 60 minutes, respectively. For GBDT, XGB and AdaBoost, the optimal feature subsets contained only 39% of the overall features. An ensemble prediction model was developed using the voting method to achieve a more robust performance with an accuracy (ACC) of 0.822 and area under the receiver operating characteristic curve (AUC) of 0.878. CONCLUSION A novel machine learning method that uses only noninvasive physiological parameters offers a promising solution for easy and prompt AHE prediction in widespread scenario applications, including pre-hospital and in-hospital care.
Collapse
Affiliation(s)
- Guang Zhang
- Institute of Medical Support, Academy of Military Sciences, Tianjin, China
| | - Jing Yuan
- Institute of Medical Support, Academy of Military Sciences, Tianjin, China
| | - Ming Yu
- Institute of Medical Support, Academy of Military Sciences, Tianjin, China
| | - Taihu Wu
- Institute of Medical Support, Academy of Military Sciences, Tianjin, China
| | - Xi Luo
- Institute of Medical Support, Academy of Military Sciences, Tianjin, China; NCO School of Army Medical University, Hebei, China
| | - Feng Chen
- Institute of Medical Support, Academy of Military Sciences, Tianjin, China.
| |
Collapse
|
11
|
Tassone J, Yan P, Simpson M, Mendhe C, Mago V, Choudhury S. Utilizing deep learning and graph mining to identify drug use on Twitter data. BMC Med Inform Decis Mak 2020; 20:304. [PMID: 33380324 PMCID: PMC7772918 DOI: 10.1186/s12911-020-01335-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Accepted: 11/16/2020] [Indexed: 11/26/2022] Open
Abstract
Background The collection and examination of social media has become a useful mechanism for studying the mental activity and behavior tendencies of users. Through the analysis of a collected set of Twitter data, a model will be developed for predicting positively referenced, drug-related tweets. From this, trends and correlations can be determined. Methods Social media data (tweets and attributes) were collected and processed using topic pertaining keywords, such as drug slang and use-conditions (methods of drug consumption). Potential candidates were preprocessed resulting in a dataset of 3,696,150 rows. The predictive classification power of multiple methods was compared including SVM, XGBoost, BERT and CNN-based classifiers. For the latter, a deep learning approach was implemented to screen and analyze the semantic meaning of the tweets. Results To test the predictive capability of the model, SVM and XGBoost were first employed. The results calculated from the models respectively displayed an accuracy of 59.33% and 54.90%, with AUC’s of 0.87 and 0.71. The values show a low predictive capability with little discrimination. Conversely, the CNN-based classifiers presented a significant improvement, between the two models tested. The first was trained with 2661 manually labeled samples, while the other included synthetically generated tweets culminating in 12,142 samples. The accuracy scores were 76.35% and 82.31%, with an AUC of 0.90 and 0.91. Using association rule mining in conjunction with the CNN-based classifier showed a high likelihood for keywords such as “smoke”, “cocaine”, and “marijuana” triggering a drug-positive classification. Conclusion Predictive analysis with a CNN is promising, whereas attribute-based models presented little predictive capability and were not suitable for analyzing text of data. This research found that the commonly mentioned drugs had a level of correspondence with frequently used illicit substances, proving the practical usefulness of this system. Lastly, the synthetically generated set provided increased accuracy scores and improves the predictive capability.
Collapse
Affiliation(s)
- Joseph Tassone
- Department of Computer Science, Lakehead University, 955 Oliver Road, Thunder Bay, P7B 5E1, Canada
| | - Peizhi Yan
- Department of Computer Science, Lakehead University, 955 Oliver Road, Thunder Bay, P7B 5E1, Canada
| | - Mackenzie Simpson
- Department of Computer Science, Lakehead University, 955 Oliver Road, Thunder Bay, P7B 5E1, Canada
| | - Chetan Mendhe
- Department of Computer Science, Lakehead University, 955 Oliver Road, Thunder Bay, P7B 5E1, Canada
| | - Vijay Mago
- Department of Computer Science, Lakehead University, 955 Oliver Road, Thunder Bay, P7B 5E1, Canada.
| | - Salimur Choudhury
- Department of Computer Science, Lakehead University, 955 Oliver Road, Thunder Bay, P7B 5E1, Canada
| |
Collapse
|
12
|
Social media analytics in nutrition research: a rapid review of current usage in investigation of dietary behaviours. Public Health Nutr 2020; 24:1193-1209. [PMID: 33353573 DOI: 10.1017/s1368980020005248] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
OBJECTIVE Social media analytics (SMA) has a track record in business research. The utilisation in nutrition research is unknown, despite social media being populated with real-time eating behaviours. This rapid review aimed to explore the use of SMA in nutrition research with the investigation of dietary behaviours. DESIGN The review was conducted according to rapid review guidelines by WHO and the National Collaborating Centre for Methods and Tools. Five databases of peer-reviewed, English language studies were searched using the keywords 'social media' in combination with 'data analytics' and 'food' or 'nutrition' and screened for those with general population health using SMA on public domain, social media data between 2014 and 2020. RESULTS The review identified 34 studies involving SMA in the investigation of dietary behaviours. Nutrition topics included population nutrition health investigations, alcohol consumption, dieting and eating out of the home behaviours. All studies involved content analysis with evidence of surveillance and engagement. Twitter was predominant with data sets in tens of millions. SMA tools were observed in data discovery, collection and preparation, but less so in data analysis. Approximately, a third of the studies involved interdisciplinary collaborations with health representation and only two studies involved nutrition disciplines. Less than a quarter of studies obtained formal human ethics approval. CONCLUSIONS SMA in nutrition research with the investigation of dietary behaviours is emerging, nevertheless, if consideration is taken with technological capabilities and ethical integrity, the future shows promise at a broad population census level and as a scoping tool or complementary, triangulation instrument.
Collapse
|
13
|
Singh T, Roberts K, Cohen T, Cobb N, Wang J, Fujimoto K, Myneni S. Social Media as a Research Tool (SMaaRT) for Risky Behavior Analytics: Methodological Review. JMIR Public Health Surveill 2020; 6:e21660. [PMID: 33252345 PMCID: PMC7735906 DOI: 10.2196/21660] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Revised: 10/05/2020] [Accepted: 11/06/2020] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Modifiable risky health behaviors, such as tobacco use, excessive alcohol use, being overweight, lack of physical activity, and unhealthy eating habits, are some of the major factors for developing chronic health conditions. Social media platforms have become indispensable means of communication in the digital era. They provide an opportunity for individuals to express themselves, as well as share their health-related concerns with peers and health care providers, with respect to risky behaviors. Such peer interactions can be utilized as valuable data sources to better understand inter-and intrapersonal psychosocial mediators and the mechanisms of social influence that drive behavior change. OBJECTIVE The objective of this review is to summarize computational and quantitative techniques facilitating the analysis of data generated through peer interactions pertaining to risky health behaviors on social media platforms. METHODS We performed a systematic review of the literature in September 2020 by searching three databases-PubMed, Web of Science, and Scopus-using relevant keywords, such as "social media," "online health communities," "machine learning," "data mining," etc. The reporting of the studies was directed by the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Two reviewers independently assessed the eligibility of studies based on the inclusion and exclusion criteria. We extracted the required information from the selected studies. RESULTS The initial search returned a total of 1554 studies, and after careful analysis of titles, abstracts, and full texts, a total of 64 studies were included in this review. We extracted the following key characteristics from all of the studies: social media platform used for conducting the study, risky health behavior studied, the number of posts analyzed, study focus, key methodological functions and tools used for data analysis, evaluation metrics used, and summary of the key findings. The most commonly used social media platform was Twitter, followed by Facebook, QuitNet, and Reddit. The most commonly studied risky health behavior was nicotine use, followed by drug or substance abuse and alcohol use. Various supervised and unsupervised machine learning approaches were used for analyzing textual data generated from online peer interactions. Few studies utilized deep learning methods for analyzing textual data as well as image or video data. Social network analysis was also performed, as reported in some studies. CONCLUSIONS Our review consolidates the methodological underpinnings for analyzing risky health behaviors and has enhanced our understanding of how social media can be leveraged for nuanced behavioral modeling and representation. The knowledge gained from our review can serve as a foundational component for the development of persuasive health communication and effective behavior modification technologies aimed at the individual and population levels.
Collapse
Affiliation(s)
- Tavleen Singh
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, United States
| | - Kirk Roberts
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, United States
| | - Trevor Cohen
- Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, United States
| | - Nathan Cobb
- Georgetown University Medical Center, Washington, DC, United States
| | - Jing Wang
- School of Nursing, The University of Texas Health Science Center, San Antonio, TX, United States
| | - Kayo Fujimoto
- School of Public Health, The University of Texas Health Science Center, Houston, TX, United States
| | - Sahiti Myneni
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, United States
| |
Collapse
|