1
|
Baker JL, Bjerregaard LG. Advancing precision public health for obesity in children. Rev Endocr Metab Disord 2023; 24:1003-1010. [PMID: 37055611 PMCID: PMC10101815 DOI: 10.1007/s11154-023-09802-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/12/2023] [Indexed: 04/15/2023]
Abstract
Worldwide, far too many children and adolescents are living with the disease of obesity. Despite decades of public health initiatives, rates are still rising in many countries. This raises the question of whether precision public health may be a more successful approach to preventing obesity in youth. In this review, the objective was to review the literature on precision public health in the context of childhood obesity prevention and to discuss how precision public health may advance the field of childhood obesity prevention. As precision public health is a concept that is still evolving and not fully identifiable in the literature, a lack of published studies precluded a formal review. Therefore, the approach of using a broad interpretation of precision public health was used and recent advances in childhood obesity research in the areas of surveillance and risk factor identification as well as intervention, evaluation and implementation using selected studies were summarized. Encouragingly, big data from a multitude of designed and organic sources are being used in new and innovative ways to provide more granular surveillance and risk factor identification in obesity in children. Challenges were identified in terms of data access, completeness, and integration, ensuring inclusion of all members of society, ethics, and translation to policy. As precision public health advances, it may yield novel insights that can contribute to strong policies acting in concert that ultimately lead to the prevention of obesity in children.
Collapse
Affiliation(s)
- Jennifer L Baker
- Center for Clinical Research and Prevention, Copenhagen University Hospital- Bispebjerg and Frederiksberg, Frederiksberg, Denmark.
| | - Lise G Bjerregaard
- Center for Clinical Research and Prevention, Copenhagen University Hospital- Bispebjerg and Frederiksberg, Frederiksberg, Denmark
| |
Collapse
|
2
|
Wirtz Baker JM, Pou SA, Niclis C, Haluszka E, Aballay LR. Non-traditional data sources in obesity research: a systematic review of their use in the study of obesogenic environments. Int J Obes (Lond) 2023:10.1038/s41366-023-01331-3. [PMID: 37393408 DOI: 10.1038/s41366-023-01331-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 06/01/2023] [Accepted: 06/21/2023] [Indexed: 07/03/2023]
Abstract
BACKGROUND The complex nature of obesity increasingly requires a comprehensive approach that includes the role of environmental factors. For understanding contextual determinants, the resources provided by technological advances could become a key factor in obesogenic environment research. This study aims to identify different sources of non-traditional data and their applications, considering the domains of obesogenic environments: physical, sociocultural, political and economic. METHODS We conducted a systematic search in PubMed, Scopus and LILACS databases by two independent groups of reviewers, from September to December 2021. We included those studies oriented to adult obesity research using non-traditional data sources, published in the last 5 years in English, Spanish or Portuguese. The overall reporting followed the PRISMA guidelines. RESULTS The initial search yielded 1583 articles, 94 articles were kept for full-text screening, and 53 studies met the eligibility criteria and were included. We extracted information about countries of origin, study design, observation units, obesity-related outcomes, environment variables, and non-traditional data sources used. Our results revealed that most of the studies originated from high-income countries (86.54%) and used geospatial data within a GIS (76.67%), social networks (16.67%), and digital devices (11.66%) as data sources. Geospatial data were the most utilised data source and mainly contributed to the study of the physical domains of obesogenic environments, followed by social networks providing data to the analysis of the sociocultural domain. A gap in the literature exploring the political domain of environments was also evident. CONCLUSION The disparities between countries are noticeable. Geospatial and social network data sources contributed to studying the physical and sociocultural environments, which could be a valuable complement to those traditionally used in obesity research. We propose the use of information available on the Internet, addressed by artificial intelligence-based tools, to increase the knowledge on political and economic dimensions of the obesogenic environment.
Collapse
Affiliation(s)
- Julia Mariel Wirtz Baker
- Health Sciences Research Institute (INICSA), National Council of Scientific and Technical Research (CONICET), Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina
- Human Nutrition Research Centre (CenINH), School of Nutrition, Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina
| | - Sonia Alejandra Pou
- Health Sciences Research Institute (INICSA), National Council of Scientific and Technical Research (CONICET), Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina
- Human Nutrition Research Centre (CenINH), School of Nutrition, Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina
| | - Camila Niclis
- Health Sciences Research Institute (INICSA), National Council of Scientific and Technical Research (CONICET), Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina
- Human Nutrition Research Centre (CenINH), School of Nutrition, Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina
| | - Eugenia Haluszka
- Health Sciences Research Institute (INICSA), National Council of Scientific and Technical Research (CONICET), Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina
- Human Nutrition Research Centre (CenINH), School of Nutrition, Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina
| | - Laura Rosana Aballay
- Human Nutrition Research Centre (CenINH), School of Nutrition, Faculty of Medical Sciences, National University of Córdoba, Bv. De La Reforma, Ciudad Universitaria, Zip Code 5000, Córdoba, Argentina.
| |
Collapse
|
3
|
An R, Shen J, Xiao Y. Applications of Artificial Intelligence to Obesity Research: Scoping Review of Methodologies. J Med Internet Res 2022; 24:e40589. [PMID: 36476515 PMCID: PMC9856437 DOI: 10.2196/40589] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 10/05/2022] [Accepted: 11/01/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Obesity is a leading cause of preventable death worldwide. Artificial intelligence (AI), characterized by machine learning (ML) and deep learning (DL), has become an indispensable tool in obesity research. OBJECTIVE This scoping review aimed to provide researchers and practitioners with an overview of the AI applications to obesity research, familiarize them with popular ML and DL models, and facilitate the adoption of AI applications. METHODS We conducted a scoping review in PubMed and Web of Science on the applications of AI to measure, predict, and treat obesity. We summarized and categorized the AI methodologies used in the hope of identifying synergies, patterns, and trends to inform future investigations. We also provided a high-level, beginner-friendly introduction to the core methodologies to facilitate the dissemination and adoption of various AI techniques. RESULTS We identified 46 studies that used diverse ML and DL models to assess obesity-related outcomes. The studies found AI models helpful in detecting clinically meaningful patterns of obesity or relationships between specific covariates and weight outcomes. The majority (18/22, 82%) of the studies comparing AI models with conventional statistical approaches found that the AI models achieved higher prediction accuracy on test data. Some (5/46, 11%) of the studies comparing the performances of different AI models revealed mixed results, indicating the high contingency of model performance on the data set and task it was applied to. An accelerating trend of adopting state-of-the-art DL models over standard ML models was observed to address challenging computer vision and natural language processing tasks. We concisely introduced the popular ML and DL models and summarized their specific applications in the studies included in the review. CONCLUSIONS This study reviewed AI-related methodologies adopted in the obesity literature, particularly ML and DL models applied to tabular, image, and text data. The review also discussed emerging trends such as multimodal or multitask AI models, synthetic data generation, and human-in-the-loop that may witness increasing applications in obesity research.
Collapse
Affiliation(s)
- Ruopeng An
- Brown School, Washington University in St. Louis, St. Louis, MO, United States
| | - Jing Shen
- Department of Physical Education, China University of Geosciences, Beijing, China
| | - Yunyu Xiao
- Weill Cornell Medical College, Cornell University, Ithaca, NY, United States
| |
Collapse
|
4
|
Soffer S, Zimlichman E, Levin MA, Zebrowski AM, Glicksberg BS, Freeman R, Reich DL, Klang E. Machine learning to predict in-hospital mortality among patients with severe obesity: Proof of concept study. Obes Sci Pract 2022; 8:474-482. [PMID: 35949284 PMCID: PMC9358726 DOI: 10.1002/osp4.571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 09/25/2021] [Accepted: 10/01/2021] [Indexed: 11/22/2022] Open
Abstract
Objectives Hospitalized patients with severe obesity require adapted hospital management. The aim of this study was to evaluate a machine learning model to predict in-hospital mortality among this population. Methods Data of unselected consecutive emergency department admissions of hospitalized patients with severe obesity (BMI ≥ 40 kg/m2) was analyzed. Data was retrieved from five hospitals from the Mount Sinai health system, New York. The study time frame was between January 2011 and December 2019. Data was used to train a gradient-boosting machine learning model to identify in-hospital mortality. The model was trained and evaluated based on the data from four hospitals and externally validated on held-out data from the fifth hospital. Results A total of 14,078 hospital admissions of inpatients with severe obesity were included. The in-hospital mortality rate was 297/14,078 (2.1%). In univariate analysis, albumin (area under the curve [AUC] = 0.77), blood urea nitrogen (AUC = 0.76), acuity level (AUC = 0.73), lactate (AUC = 0.72), and chief complaint (AUC = 0.72) were the best single predictors. For Youden's index, the model had a sensitivity of 0.77 (95% CI: 0.67-0.86) with a false positive rate of 1:9. Conclusion A machine learning model trained on clinical measures provides proof of concept performance in predicting mortality in patients with severe obesity. This implies that such models may help to adopt specific decision support tools for this population.
Collapse
Affiliation(s)
- Shelly Soffer
- Internal Medicine BAssuta Medical CenterAshdodIsrael
- Ben‐Gurion University of the NegevBe’er ShevaIsrael
| | - Eyal Zimlichman
- Hospital ManagementSheba Medical CenterTel HashomerIsrael
- Sackler Medical SchoolTel Aviv UniversityTel AvivIsrael
- Sheba Talpiot Medical Leadership ProgramTel HashomerIsrael
| | - Matthew A. Levin
- Department of Population Health Science and PolicyInstitute for Healthcare Delivery ScienceIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Anesthesiology, Perioperative and Pain MedicineIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Alexis M. Zebrowski
- Department of Emergency MedicineIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Population Health Science and PolicyInstitute for Translational EpidemiologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Benjamin S. Glicksberg
- Hasso Plattner Institute for Digital Health at Mount SinaiIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Robert Freeman
- Department of Population Health Science and PolicyInstitute for Healthcare Delivery ScienceIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - David L. Reich
- Department of Anesthesiology, Perioperative and Pain MedicineIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Eyal Klang
- Sackler Medical SchoolTel Aviv UniversityTel AvivIsrael
- Sheba Talpiot Medical Leadership ProgramTel HashomerIsrael
- Department of Diagnostic ImagingSheba Medical CenterTel HashomerIsrael
| |
Collapse
|
5
|
Food Sales and Adult Weight Status: Results of a Cross-Sectional Study in England. Nutrients 2022; 14:nu14091745. [PMID: 35565710 PMCID: PMC9105113 DOI: 10.3390/nu14091745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 04/05/2022] [Accepted: 04/06/2022] [Indexed: 02/04/2023] Open
Abstract
Ecological studies often use supermarket location as a proxy measure of the food environment. In this study, we used data on sales at a leading mainstream supermarket chain to explore how area-level supermarket use is associated with overweight and obesity in English adults. Sales data were aggregated to local authority level and joined to a national dataset describing self-reported height and weight and fruit and vegetable consumption. Regression models showed a modest association between higher levels of unhealthy food sales relative to health food sales and increased odds of being overweight and obese. Although effect sizes were small, they persisted after adjustment for area-level deprivation. Supermarket sales data provide additional understanding in the study of food environments and their impact on increasing weight status. Future health policies should consider using ‘big data’ combined with other research methods to address the increasing consumption of unhealthy and highly processed foods.
Collapse
|
6
|
Benítez-Andrades JA, Alija-Pérez JM, Vidal ME, Pastor-Vargas R, García-Ordás MT. Traditional Machine Learning Models and Bidirectional Encoder Representations From Transformer (BERT)-Based Automatic Classification of Tweets About Eating Disorders: Algorithm Development and Validation Study. JMIR Med Inform 2022; 10:e34492. [PMID: 35200156 PMCID: PMC8914746 DOI: 10.2196/34492] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 01/07/2022] [Accepted: 02/01/2022] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Eating disorders affect an increasing number of people. Social networks provide information that can help. OBJECTIVE We aimed to find machine learning models capable of efficiently categorizing tweets about eating disorders domain. METHODS We collected tweets related to eating disorders, for 3 consecutive months. After preprocessing, a subset of 2000 tweets was labeled: (1) messages written by people suffering from eating disorders or not, (2) messages promoting suffering from eating disorders or not, (3) informative messages or not, and (4) scientific or nonscientific messages. Traditional machine learning and deep learning models were used to classify tweets. We evaluated accuracy, F1 score, and computational time for each model. RESULTS A total of 1,058,957 tweets related to eating disorders were collected. were obtained in the 4 categorizations, with The bidirectional encoder representations from transformer-based models had the best score among the machine learning and deep learning techniques applied to the 4 categorization tasks (F1 scores 71.1%-86.4%). CONCLUSIONS Bidirectional encoder representations from transformer-based models have better performance, although their computational cost is significantly higher than those of traditional techniques, in classifying eating disorder-related tweets.
Collapse
Affiliation(s)
| | - José-Manuel Alija-Pérez
- SECOMUCI Research Group, Escuela de Ingenierías Industrial e Informática, Universidad de León, León, Spain
| | | | - Rafael Pastor-Vargas
- Communications and Control Systems Department, Spanish National University for Distance Education, Madrid, Spain
| | - María Teresa García-Ordás
- SECOMUCI Research Group, Escuela de Ingenierías Industrial e Informática, Universidad de León, León, Spain
| |
Collapse
|
7
|
Jenneson VL, Pontin F, Greenwood DC, Clarke GP, Morris MA. A systematic review of supermarket automated electronic sales data for population dietary surveillance. Nutr Rev 2021; 80:1711-1722. [PMID: 34757399 PMCID: PMC9086796 DOI: 10.1093/nutrit/nuab089] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Context Most dietary assessment methods are limited by self-report biases, how long they take for participants to complete, and cost of time for dietitians to extract content. Electronically recorded, supermarket-obtained transactions are an objective measure of food purchases, with reduced bias and improved timeliness and scale. Objective The use, breadth, context, and utility of electronic purchase records for dietary research is assessed and discussed in this systematic review. Data sources Four electronic databases (MEDLINE, EMBASE, PsycINFO, Global Health) were searched. Included studies used electronically recorded supermarket transactions to investigate the diet of healthy, free-living adults. Data extraction Searches identified 3422 articles, of which 145 full texts were retrieved and 72 met inclusion criteria. Study quality was assessed using the National Institutes of Health Quality Assessment Tool for Observational Cohort and Cross-Sectional Studies. Data analysis Purchase records were used in observational studies, policy evaluations, and experimental designs. Nutrition outcomes included dietary patterns, nutrients, and food category sales. Transactions were linked to nutrient data from retailers, commercial data sources, and national food composition databases. Conclusion Electronic sales data have the potential to transform dietary assessment and worldwide understanding of dietary behavior. Validation studies are warranted to understand limits to agreement and extrapolation to individual-level diets. Systematic Review Registration PROSPERO registration no. CRD42018103470
Collapse
Affiliation(s)
- Victoria L Jenneson
- V.L. Jenneson, F. Pontin, D.C. Greenwood, and M.A. Morris are with the Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom. V.L. Jenneson, F. Pontin, and G.P. Clarke are with the School of Geography, Faculty of Environment, University of Leeds, Leeds, United Kingdom. D.C. Greenwood and M.A. Morris are with the School of Medicine, Faculty of Medicine and Health, University of Leeds, Leeds, United Kingdom
| | - Francesca Pontin
- V.L. Jenneson, F. Pontin, D.C. Greenwood, and M.A. Morris are with the Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom. V.L. Jenneson, F. Pontin, and G.P. Clarke are with the School of Geography, Faculty of Environment, University of Leeds, Leeds, United Kingdom. D.C. Greenwood and M.A. Morris are with the School of Medicine, Faculty of Medicine and Health, University of Leeds, Leeds, United Kingdom
| | - Darren C Greenwood
- V.L. Jenneson, F. Pontin, D.C. Greenwood, and M.A. Morris are with the Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom. V.L. Jenneson, F. Pontin, and G.P. Clarke are with the School of Geography, Faculty of Environment, University of Leeds, Leeds, United Kingdom. D.C. Greenwood and M.A. Morris are with the School of Medicine, Faculty of Medicine and Health, University of Leeds, Leeds, United Kingdom
| | - Graham P Clarke
- V.L. Jenneson, F. Pontin, D.C. Greenwood, and M.A. Morris are with the Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom. V.L. Jenneson, F. Pontin, and G.P. Clarke are with the School of Geography, Faculty of Environment, University of Leeds, Leeds, United Kingdom. D.C. Greenwood and M.A. Morris are with the School of Medicine, Faculty of Medicine and Health, University of Leeds, Leeds, United Kingdom
| | - Michelle A Morris
- V.L. Jenneson, F. Pontin, D.C. Greenwood, and M.A. Morris are with the Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom. V.L. Jenneson, F. Pontin, and G.P. Clarke are with the School of Geography, Faculty of Environment, University of Leeds, Leeds, United Kingdom. D.C. Greenwood and M.A. Morris are with the School of Medicine, Faculty of Medicine and Health, University of Leeds, Leeds, United Kingdom
| |
Collapse
|
8
|
Shi X, Nikolic G, Epelde G, Arrúe M, Bidaurrazaga Van-Dierdonck J, Bilbao R, De Moor B. An ensemble-based feature selection framework to select risk factors of childhood obesity for policy decision making. BMC Med Inform Decis Mak 2021; 21:222. [PMID: 34289843 PMCID: PMC8293582 DOI: 10.1186/s12911-021-01580-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Accepted: 07/11/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The increasing prevalence of childhood obesity makes it essential to study the risk factors with a sample representative of the population covering more health topics for better preventive policies and interventions. It is aimed to develop an ensemble feature selection framework for large-scale data to identify risk factors of childhood obesity with good interpretability and clinical relevance. METHODS We analyzed the data collected from 426,813 children under 18 during 2000-2019. A BMI above the 90th percentile for the children of the same age and gender was defined as overweight. An ensemble feature selection framework, Bagging-based Feature Selection framework integrating MapReduce (BFSMR), was proposed to identify risk factors. The framework comprises 5 models (filter with mutual information/SVM-RFE/Lasso/Ridge/Random Forest) from filter, wrapper, and embedded feature selection methods. Each feature selection model identified 10 variables based on variable importance. Considering accuracy, F-score, and model characteristics, the models were classified into 3 levels with different weights: Lasso/Ridge, Filter/SVM-RFE, and Random Forest. The voting strategy was applied to aggregate the selected features, with both feature weights and model weights taken into consideration. We compared our voting strategy with another two for selecting top-ranked features in terms of 6 dimensions of interpretability. RESULTS Our method performed the best to select the features with good interpretability and clinical relevance. The top 10 features selected by BFSMR are age, sex, birth year, breastfeeding type, smoking habit and diet-related knowledge of both children and mothers, exercise, and Mother's systolic blood pressure. CONCLUSION Our framework provides a solution for identifying a diverse and interpretable feature set without model bias from large-scale data, which can help identify risk factors of childhood obesity and potentially some other diseases for future interventions or policies.
Collapse
Affiliation(s)
- Xi Shi
- Department of Electrical Engineering (ESAT), Stadius Centre for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Kasteelpark Arenberg 10 - box 2446, 3001, Leuven, Belgium.
| | - Gorana Nikolic
- Department of Electrical Engineering (ESAT), Stadius Centre for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Kasteelpark Arenberg 10 - box 2446, 3001, Leuven, Belgium
| | - Gorka Epelde
- Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Donostia-San Sebastián, Spain.,Biodonostia Health Research Institute, eHealth Group, Donostia-San Sebastián, Spain
| | - Mónica Arrúe
- Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Donostia-San Sebastián, Spain.,Biodonostia Health Research Institute, eHealth Group, Donostia-San Sebastián, Spain
| | | | - Roberto Bilbao
- Basque Foundation for Research and Innovation, Bilbao, Spain
| | - Bart De Moor
- Department of Electrical Engineering (ESAT), Stadius Centre for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Kasteelpark Arenberg 10 - box 2446, 3001, Leuven, Belgium
| |
Collapse
|
9
|
Clarke H, Clark S, Birkin M, Iles-Smith H, Glaser A, Morris MA. Understanding Barriers to Novel Data Linkages: Topic Modeling of the Results of the LifeInfo Survey. J Med Internet Res 2021; 23:e24236. [PMID: 33998998 PMCID: PMC8167605 DOI: 10.2196/24236] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 01/27/2021] [Accepted: 04/12/2021] [Indexed: 12/31/2022] Open
Abstract
Background Novel consumer and lifestyle data, such as those collected by supermarket loyalty cards or mobile phone exercise tracking apps, offer numerous benefits for researchers seeking to understand diet- and exercise-related risk factors for diseases. However, limited research has addressed public attitudes toward linking these data with individual health records for research purposes. Data linkage, combining data from multiple sources, provides the opportunity to enhance preexisting data sets to gain new insights. Objective The aim of this study is to identify key barriers to data linkage and recommend safeguards and procedures that would encourage individuals to share such data for potential future research. Methods The LifeInfo Survey consulted the public on their attitudes toward sharing consumer and lifestyle data for research purposes. Where barriers to data sharing existed, participants provided unstructured survey responses detailing what would make them more likely to share data for linkage with their health records in the future. The topic modeling technique latent Dirichlet allocation was used to analyze these textual responses to uncover common thematic topics within the texts. Results Participants provided responses related to sharing their store loyalty card data (n=2338) and health and fitness app data (n=1531). Key barriers to data sharing identified through topic modeling included data safety and security, personal privacy, requirements of further information, fear of data being accessed by others, problems with data accuracy, not understanding the reason for data linkage, and not using services that produce these data. We provide recommendations for addressing these issues to establish the best practice for future researchers interested in using these data. Conclusions This study formulates a large-scale consultation of public attitudes toward this kind of data linkage, which is an important first step in understanding and addressing barriers to participation in research using novel consumer and lifestyle data.
Collapse
Affiliation(s)
- Holly Clarke
- Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom
| | - Stephen Clark
- Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.,School of Geography, University of Leeds, Leeds, United Kingdom
| | - Mark Birkin
- Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.,School of Geography, University of Leeds, Leeds, United Kingdom
| | - Heather Iles-Smith
- Leeds Teaching Hospitals NHS Trust, Leeds, United Kingdom.,School of Health and Society, University of Salford, Salford, United Kingdom
| | - Adam Glaser
- Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.,Leeds Teaching Hospitals NHS Trust, Leeds, United Kingdom.,Leeds Institute of Medical Research, School of Medicine, University of Leeds, Leeds, United Kingdom
| | - Michelle A Morris
- Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.,Leeds Institute of Medical Research, School of Medicine, University of Leeds, Leeds, United Kingdom
| |
Collapse
|
10
|
Johnstone AM, Brown A. MRC Hot Topic workshop report: Reshaping the food environment – applying interdisciplinary perspectives in appetite research. NUTR BULL 2021. [DOI: 10.1111/nbu.12493] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
| | - Adrian Brown
- Centre for Obesity Research Department of Medicine University College London London UK
- National Institute of Health Research UCLH Biomedical Research Centre London UK
| |
Collapse
|
11
|
Oldroyd RA, Hobbs M, Campbell M, Jenneson V, Marek L, Morris MA, Pontin F, Sturley C, Tomintz M, Wiki J, Birkin M, Kingham S, Wilson M. Progress Towards Using Linked Population-Based Data For Geohealth Research: Comparisons Of Aotearoa New Zealand And The United Kingdom. APPLIED SPATIAL ANALYSIS AND POLICY 2021; 14:1025-1040. [PMID: 33942015 PMCID: PMC8081771 DOI: 10.1007/s12061-021-09381-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 04/20/2021] [Indexed: 06/12/2023]
Abstract
Globally, geospatial concepts are becoming increasingly important in epidemiological and public health research. Individual level linked population-based data afford researchers with opportunities to undertake complex analyses unrivalled by other sources. However, there are significant challenges associated with using such data for impactful geohealth research. Issues range from extracting, linking and anonymising data, to the translation of findings into policy whilst working to often conflicting agendas of government and academia. Innovative organisational partnerships are therefore central to effective data use. To extend and develop existing collaborations between the institutions, in June 2019, authors from the Leeds Institute for Data Analytics and the Alan Turing Institute, London, visited the Geohealth Laboratory based at the University of Canterbury, New Zealand. This paper provides an overview of insight shared during a two-day workshop considering aspects of linked population-based data for impactful geohealth research. Specifically, we discuss both the collaborative partnership between New Zealand's Ministry of Health (MoH) and the University of Canterbury's GeoHealth Lab and novel infrastructure, and commercial partnerships enabled through the Leeds Institute for Data Analytics and the Alan Turing Institute in the UK. We consider the New Zealand Integrated Data Infrastructure as a case study approach to population-based linked health data and compare similar approaches taken by the UK towards integrated data infrastructures, including the ESRC Big Data Network centres, the UK Biobank, and longitudinal cohorts. We reflect on and compare the geohealth landscapes in New Zealand and the UK to set out recommendations and considerations for this rapidly evolving discipline.
Collapse
Affiliation(s)
- R. A. Oldroyd
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- School of Geography, University of Leeds, Leeds, UK
| | - M. Hobbs
- GeoHealth Laboratory, Geospatial Research Institute, University of Canterbury, Christchurch, Canterbury, New Zealand
- Health Sciences, College of Education, Health and Human Development, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - M. Campbell
- GeoHealth Laboratory, Geospatial Research Institute, University of Canterbury, Christchurch, Canterbury, New Zealand
- School of Earth and Environment, College of Science, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - V. Jenneson
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- GeoHealth Laboratory, Geospatial Research Institute, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - L. Marek
- GeoHealth Laboratory, Geospatial Research Institute, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - M. A. Morris
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- School of Medicine, University of Leeds, Leeds, UK
- Alan Turing Institute, London, UK
| | - F. Pontin
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
| | - C. Sturley
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- School of Medicine, University of Leeds, Leeds, UK
| | - M. Tomintz
- GeoHealth Laboratory, Geospatial Research Institute, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - J. Wiki
- GeoHealth Laboratory, Geospatial Research Institute, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - M. Birkin
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Alan Turing Institute, London, UK
| | - S. Kingham
- GeoHealth Laboratory, Geospatial Research Institute, University of Canterbury, Christchurch, Canterbury, New Zealand
- School of Earth and Environment, College of Science, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - M. Wilson
- Geospatial Research Institute, University of Canterbury, Christchurch, Canterbury, New Zealand
| |
Collapse
|
12
|
Ventura V, Cavaliere A, Iannò B. #Socialfood: Virtuous or vicious? A systematic review. Trends Food Sci Technol 2021. [DOI: 10.1016/j.tifs.2021.02.018] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
13
|
Kamel Boulos MN, Koh K. Smart city lifestyle sensing, big data, geo-analytics and intelligence for smarter public health decision-making in overweight, obesity and type 2 diabetes prevention: the research we should be doing. Int J Health Geogr 2021; 20:12. [PMID: 33658039 PMCID: PMC7926080 DOI: 10.1186/s12942-021-00266-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The public health burden caused by overweight, obesity (OO) and type-2 diabetes (T2D) is very significant and continues to rise worldwide. The causation of OO and T2D is complex and highly multifactorial rather than a mere energy intake (food) and expenditure (exercise) imbalance. But previous research into food and physical activity (PA) neighbourhood environments has mainly focused on associating body mass index (BMI) with proximity to stores selling fresh fruits and vegetables or fast food restaurants and takeaways, or with neighbourhood walkability factors and access to green spaces or public gym facilities, making largely naive, crude and inconsistent assumptions and conclusions that are far from the spirit of 'precision and accuracy public health'. Different people and population groups respond differently to the same food and PA environments, due to a myriad of unique individual and population group factors (genetic/epigenetic, metabolic, dietary and lifestyle habits, health literacy profiles, screen viewing times, stress levels, sleep patterns, environmental air and noise pollution levels, etc.) and their complex interplays with each other and with local food and PA settings. Furthermore, the same food store or fast food outlet can often sell or serve both healthy and non-healthy options/portions, so a simple binary classification into 'good' or 'bad' store/outlet should be avoided. Moreover, appropriate physical exercise, whilst essential for good health and disease prevention, is not very effective for weight maintenance or loss (especially when solely relied upon), and cannot offset the effects of a bad diet. The research we should be doing in the third decade of the twenty-first century should use a systems thinking approach, helped by recent advances in sensors, big data and related technologies, to investigate and consider all these factors in our quest to design better targeted and more effective public health interventions for OO and T2D control and prevention.
Collapse
Affiliation(s)
- Maged N. Kamel Boulos
- School of Information Management, Sun Yat-Sen University, East Campus, Guangzhou, 510006 Guangdong China
| | - Keumseok Koh
- Department of Geography, The University of Hong Kong, Pokfulam RD, Hong Kong, China
| |
Collapse
|
14
|
Tran HNQ, McMahon E, Moodie M, Ananthapavan J. A Systematic Review of Economic Evaluations of Health-Promoting Food Retail-Based Interventions. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18031356. [PMID: 33540905 PMCID: PMC7908088 DOI: 10.3390/ijerph18031356] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Revised: 01/28/2021] [Accepted: 01/29/2021] [Indexed: 01/02/2023]
Abstract
BACKGROUND While the number of retail interventions with impacts on diet- and/or health-related outcomes is increasing, the economic evaluation literature is limited. This review investigated (i) the cost-effectiveness of health-promoting food retail interventions and (ii) key assumptions adopted in these evaluations. METHODS A systematic review of published academic studies was undertaken (CRD42020153763). Fourteen databases were searched. Eligible studies were identified, analysed, and reported following the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines. RESULTS Eight studies that evaluated 30 retail interventions were included in the review. Common outcomes reported were cost per healthy food item purchased/served or cost per disability-adjusted life year (DALY) averted. Four studies undertook cost-utility analyses and half of these studies concluded that retail interventions were cost-effective in improving health outcomes. Most studies did not state any assumptions regarding compensatory behaviour (i.e., purchases/consumption of non-intervention foods or food purchases/consumption from non-intervention settings) and presumed that sales data were indicative of consumption. CONCLUSION The cost-effectiveness of retail-based health-promoting interventions is inconclusive. Future health-promoting retail interventions should regularly include an economic evaluation which addresses key assumptions related to compensatory behaviour and the use of sales data as a proxy for consumption.
Collapse
Affiliation(s)
- Huong Ngoc Quynh Tran
- Deakin Health Economics, School of Health and Social Development, Institute for Health Transformation, Deakin University, Geelong, VIC 3217, Australia; (M.M.); (J.A.)
- Global Obesity Centre, Institute for Health Transformation, Deakin University, Geelong, VIC 3217, Australia
- Correspondence: ; Tel.: +613-9244-5578
| | - Emma McMahon
- Wellbeing and Preventable Chronic Disease Division, Menzies School of Health Research, Charles Darwin University, Darwin, NT 0811, Australia;
| | - Marj Moodie
- Deakin Health Economics, School of Health and Social Development, Institute for Health Transformation, Deakin University, Geelong, VIC 3217, Australia; (M.M.); (J.A.)
- Global Obesity Centre, Institute for Health Transformation, Deakin University, Geelong, VIC 3217, Australia
| | - Jaithri Ananthapavan
- Deakin Health Economics, School of Health and Social Development, Institute for Health Transformation, Deakin University, Geelong, VIC 3217, Australia; (M.M.); (J.A.)
- Global Obesity Centre, Institute for Health Transformation, Deakin University, Geelong, VIC 3217, Australia
| |
Collapse
|
15
|
Abstract
Food safety continues to threaten public health. Machine learning holds potential in leveraging large, emerging data sets to improve the safety of the food supply and mitigate the impact of food safety incidents. Foodborne pathogen genomes and novel data streams, including text, transactional, and trade data, have seen emerging applications enabled by a machine learning approach, such as prediction of antibiotic resistance, source attribution of pathogens, and foodborne outbreak detection and risk assessment. In this article, we provide a gentle introduction to machine learning in the context of food safety and an overview of recent developments and applications. With many of these applications still in their nascence, general and domain-specific pitfalls and challenges associated with machine learning have begun to be recognized and addressed, which are critical to prospective use and future deployment of large data sets and their associated machine learning models for food safety applications.
Collapse
Affiliation(s)
- Xiangyu Deng
- Center for Food Safety, University of Georgia, Griffin, Georgia 30223, USA;
| | - Shuhao Cao
- Department of Mathematics and Statistics, Washington University, St. Louis, Missouri 63105, USA;
| | - Abigail L Horn
- Department of Preventive Medicine, University of Southern California, Los Angeles, California 90032, USA;
| |
Collapse
|
16
|
Arribas-Bel D, Green M, Rowe F, Singleton A. Open data products-A framework for creating valuable analysis ready data. JOURNAL OF GEOGRAPHICAL SYSTEMS 2021; 23:497-514. [PMID: 34697537 PMCID: PMC8528182 DOI: 10.1007/s10109-021-00363-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 06/29/2021] [Indexed: 05/05/2023]
Abstract
This paper develops the notion of "open data product". We define an open data product as the open result of the processes through which a variety of data (open and not) are turned into accessible information through a service, infrastructure, analytics or a combination of all of them, where each step of development is designed to promote open principles. Open data products are born out of a (data) need and add value beyond simply publishing existing datasets. We argue that the process of adding value should adhere to the principles of open (geographic) data science, ensuring openness, transparency and reproducibility. We also contend that outreach, in the form of active communication and dissemination through dashboards, software and publication are key to engage end-users and ensure societal impact. Open data products have major benefits. First, they enable insights from highly sensitive, controlled and/or secure data which may not be accessible otherwise. Second, they can expand the use of commercial and administrative data for the public good leveraging on their high temporal frequency and geographic granularity. We also contend that there is a compelling need for open data products as we experience the current data revolution. New, emerging data sources are unprecedented in temporal frequency and geographical resolution, but they are large, unstructured, fragmented and often hard to access due to privacy and confidentiality concerns. By transforming raw (open or "closed") data into ready to use open data products, new dimensions of human geographical processes can be captured and analysed, as we illustrate with existing examples. We conclude by arguing that several parallels exist between the role that open source software played in enabling research on spatial analysis in the 90 s and early 2000s, and the opportunities that open data products offer to unlock the potential of new forms of (geo-)data.
Collapse
Affiliation(s)
- Dani Arribas-Bel
- Geographic Data Science Lab, Department of Geography and Planning, University of Liverpool, Roxby Building, 74, Bedford St S., Liverpool, L69 7ZT UK
| | - Mark Green
- Geographic Data Science Lab, Department of Geography and Planning, University of Liverpool, Roxby Building, 74, Bedford St S., Liverpool, L69 7ZT UK
| | - Francisco Rowe
- Geographic Data Science Lab, Department of Geography and Planning, University of Liverpool, Roxby Building, 74, Bedford St S., Liverpool, L69 7ZT UK
| | - Alex Singleton
- Geographic Data Science Lab, Department of Geography and Planning, University of Liverpool, Roxby Building, 74, Bedford St S., Liverpool, L69 7ZT UK
| |
Collapse
|
17
|
O'Donnell S, Doyle G, O'Malley G, Browne S, O'Connor J, Mars M, Kechadi MTM. Establishing consensus on key public health indicators for the monitoring and evaluating childhood obesity interventions: a Delphi panel study. BMC Public Health 2020; 20:1733. [PMID: 33203390 PMCID: PMC7670696 DOI: 10.1186/s12889-020-09814-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 11/02/2020] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Childhood obesity is influenced by myriad individual, societal and environmental factors that are not typically reflected in current interventions. Socio-ecological conditions evolve and require ongoing monitoring in terms of assessing their influence on child health. The aim of this study was to identify and prioritise indicators deemed relevant by public health authorities for monitoring and evaluating childhood obesity interventions. METHOD A three-round Delphi Panel composed of experts from regions across Europe, with a remit in childhood obesity intervention, were asked to identify indicators that were a priority in their efforts to address childhood obesity in their respective jurisdictions. In Round 1, 16 panellists answered a series of open-ended questions to identify the most relevant indicators concerning the evaluation and subsequent monitoring of interventions addressing childhood obesity, focusing on three main domains: built environments, dietary environments, and health inequalities. In Rounds 2 and 3, panellists rated the importance of each of the identified indicators within these domains, and the responses were then analysed quantitatively. RESULTS Twenty-seven expert panellists were invited to participate in the study. Of these, 16/27 completed round 1 (5 9% response rate), 14/16 completed round 2 (87.5% response rate), and 8/14 completed the third and final round (57% response rate). Consensus (defined as > 70% agreement) was reached on a total of 45 of the 87 indicators (49%) across three primary domains (built and dietary environments and health inequalities), with 100% consensus reached for 5 of these indicators (6%). CONCLUSION Forty-five potential indicators were identified, pertaining primarily to the dietary environment, built environment and health inequalities. These results have important implications more widely for evaluating interventions aimed at childhood obesity reduction and prevention.
Collapse
Affiliation(s)
- Shane O'Donnell
- School of Sociology, University College Dublin, D04 V1W8,, Dublin, Ireland.
| | - Gerardine Doyle
- UCD College of Business and UCD Geary Institute for Public Policy, University College Dublin, Dublin, A94 XF34, Ireland
| | - Grace O'Malley
- School of Physiotherapy, Division of Population Health Sciences, Royal College of Surgeons Ireland, D02 YN77, Dublin, Ireland.,Children's Health Ireland, Temple Street, D01 XD99, Dublin, Ireland
| | - Sarah Browne
- School of Public Health, Physiotherapy & Sports Science, Woodview House, Belfield, University College Dublin, Dublin, 04V1W8, Ireland
| | - James O'Connor
- School of Computer Science, Insight Centre for Data Analytics, University College Dublin, D04 V1W8, Dublin, Ireland
| | - Monica Mars
- Division of Human Nutrition and Health, Wageningen University and Research, PO Box 17, NL-6700, AA, Wageningen, The Netherlands
| | - M-Tahar M Kechadi
- School of Computer Science, Insight Centre for Data Analytics, University College Dublin, D04 V1W8, Dublin, Ireland
| |
Collapse
|
18
|
Youssef A, Keown-Stoneman C, Maunder R, Wnuk S, Wiljer D, Mylopoulos M, Sockalingam S. Differences in physical and mental health-related quality of life outcomes 3 years after bariatric surgery: a group-based trajectory analysis. Surg Obes Relat Dis 2020; 16:1837-1849. [DOI: 10.1016/j.soard.2020.06.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2020] [Revised: 05/27/2020] [Accepted: 06/11/2020] [Indexed: 02/07/2023]
|
19
|
Khan IH, Javaid M. Big Data Applications in Medical Field: A Literature Review. JOURNAL OF INDUSTRIAL INTEGRATION AND MANAGEMENT 2020. [DOI: 10.1142/s242486222030001x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Digital imaging and medical reporting have acquired an essential role in healthcare, but the main challenge is the storage of a high volume of patient data. Although newer technologies are already introduced in the medical sciences to save records size, Big Data provides advancements by storing a large amount of data to improve the efficiency and quality of patient treatment with better care. It provides intelligent automation capabilities to reduce errors than manual inputs. Large numbers of research papers on big data in the medical field are studied and analyzed for their impacts, benefits, and applications. Big data has great potential to support the digitalization of all medical and clinical records and then save the entire data regarding the medical history of an individual or a group. This paper discusses big data usage for various industries and sectors. Finally, 12 significant applications for the medical field by the implementation of big data are identified and studied with a brief description. This technology can be gainfully used to extract useful information from the available data by analyzing and managing them through a combination of hardware and software. With technological advancement, big data provides health-related information for millions of patient-related to life issues such as lab tests reporting, clinical narratives, demographics, prescription, medical diagnosis, and related documentation. Thus, Big Data is essential in developing a better yet efficient analysis and storage healthcare services. The demand for big data applications is increasing due to its capability of handling and analyzing massive data. Not only in the future but even now, Big Data is proving itself as an axiom of storing, developing, analyzing, and providing overall health information to the physicians.
Collapse
Affiliation(s)
- Ibrahim Haleem Khan
- School of Engineering Sciences and Technology, Jamia Hamdard, New Delhi, India
| | - Mohd Javaid
- Department of Mechanical Engineering, Jamia Millia Islamia, New Delhi, India
| |
Collapse
|
20
|
Mirchev M, Mircheva I, Kerekovska A. The Academic Viewpoint on Patient Data Ownership in the Context of Big Data: Scoping Review. J Med Internet Res 2020; 22:e22214. [PMID: 32808934 PMCID: PMC7463395 DOI: 10.2196/22214] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 07/24/2020] [Accepted: 07/26/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The ownership of patient information in the context of big data is a relatively new problem, which is not yet fully recognized by the medical academic community. The problem is interdisciplinary, incorporating legal, ethical, medical, and aspects of information and communication technologies, requiring a sophisticated analysis. However, no previous scoping review has mapped existing studies on the subject. OBJECTIVE This study aims to map and assess published studies on patient data ownership in the context of big data as viewed by the academic community. METHODS A scoping review was conducted based on the 5-stage framework outlined by Arksey and O'Malley and further developed by Levac, Colquhoun, and O'Brien. The organization and reporting of results of the scoping review were conducted according to PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses and its extensions for Scoping Reviews). A systematic and comprehensive search of 4 scientific information databases, PubMed, ScienceDirect, Scopus, and Springer, was performed for studies published between January 2000 and October 2019. Two authors independently assessed the eligibility of the studies and the extracted data. RESULTS The review included 32 eligible articles authored by academicians that correspond to 3 focus areas: problem (ownership), area (health care), and context (big data). Five major aspects were studied: the scientific area of publications, aspects and academicians' perception of ownership in the context of big data, proposed solutions, and practical applications for data ownership issues in the context of big data. The aspects in which publications consider ownership of medical data are not clearly distinguished but can be summarized as ethical, legal, political, and managerial. The ownership of patient data is perceived primarily as a challenge fundamental to conducting medical research, including data sales and sharing, and to a lesser degree as a means of control, problem, threat, and opportunity also in view of medical research. Although numerous solutions falling into 3 categories, technology, law, and policy, were proposed, only 3 real applications were discussed. CONCLUSIONS The issue of ownership of patient information in the context of big data is poorly researched; it is not addressed consistently and in its integrity, and there is no consensus on policy decisions and the necessary legal regulations. Future research should investigate the issue of ownership as a core research question and not as a minor fragment among other topics. More research is needed to increase the body of knowledge regarding the development of adequate policies and relevant legal frameworks in compliance with ethical standards. The combined efforts of multidisciplinary academic teams are needed to overcome existing gaps in the perception of ownership, the aspects of ownership, and the possible solutions to patient data ownership issues in the reality of big data.
Collapse
Affiliation(s)
- Martin Mirchev
- Department of Social Medicine and Healthcare Organization, Faculty of Public Health, Medical University of Varna, Varna, Bulgaria
| | - Iskra Mircheva
- Department of Social Medicine and Healthcare Organization, Faculty of Public Health, Medical University of Varna, Varna, Bulgaria
| | - Albena Kerekovska
- Department of Social Medicine and Healthcare Organization, Faculty of Public Health, Medical University of Varna, Varna, Bulgaria
| |
Collapse
|
21
|
Green MA, Watson AW, Brunstrom JM, Corfe BM, Johnstone AM, Williams EA, Stevenson E. Comparing supermarket loyalty card data with traditional diet survey data for understanding how protein is purchased and consumed in older adults for the UK, 2014-16. Nutr J 2020; 19:83. [PMID: 32791968 PMCID: PMC7427066 DOI: 10.1186/s12937-020-00602-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Accepted: 08/06/2020] [Indexed: 11/10/2022] Open
Abstract
Background Our ability to understand population-level dietary intake patterns is dependent on having access to high quality data. Diet surveys are common diet assessment methods, but can be limited by bias associated with under-reporting. Food purchases tracked using supermarket loyalty card records may supplement traditional surveys, however they are rarely available to academics and policy makers. The aim of our study is to explore population level patterns of protein purchasing and consumption in ageing adults (40 years onwards). Methods We used diet survey data from the National Diet and Nutrition Survey (2014–16) on food consumption, and loyalty card records on food purchases from a major high street supermarket retailer (2016–17) covering the UK. We computed the percentage of total energy derived from protein, protein intake per kg of body mass, and percentage of protein acquired by food type. Results We found that protein consumption (as the percentage of total energy purchased) increased between ages 40–65 years, and declined thereafter. In comparison, protein purchased in supermarkets was roughly 2–2.5 percentage points lower at each year of age. The proportion of adults meeting recommended levels of protein was lowest in age groups 55–69 and 70+. The time of protein consumption was skewed towards evening meals, with low intakes during breakfast or between main meals. Meat, fish and poultry dominated as sources of protein purchased and consumed, although adults also acquired a large share of their protein from dairy and bread, with little from plant protein. Conclusions Our study provides novel insights into how protein is purchased and consumed by ageing adults in the UK. Supermarket loyalty card data can reveal patterns of protein purchasing that when combined with traditional sources of dietary intake may enhance our understanding of dietary behaviours.
Collapse
Affiliation(s)
- Mark A Green
- Geographic Data Science Lab, School of Environmental Sciences, University of Liverpool, Liverpool, UK.
| | - Anthony W Watson
- Human Nutrition Research Centre, Institute of Cellular Medicine, Newcastle University, Newcastle Upon-Tyne, UK
| | | | - Bernard M Corfe
- Department of Oncology & Metabolism, University of Sheffield, Sheffield, UK
| | | | | | - Emma Stevenson
- Human Nutrition Research Centre, Institute of Cellular Medicine, Newcastle University, Newcastle Upon-Tyne, UK
| |
Collapse
|
22
|
Campbell EA, Qian T, Miller JM, Bass EJ, Masino AJ. Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data. Int J Obes (Lond) 2020; 44:1753-1765. [PMID: 32494036 PMCID: PMC7381422 DOI: 10.1038/s41366-020-0614-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Revised: 04/29/2020] [Accepted: 05/20/2020] [Indexed: 11/30/2022]
Abstract
BACKGROUND Electronic health records (EHRs) are potentially important components in addressing pediatric obesity in clinical settings and at the population level. This work aims to identify temporal condition patterns surrounding obesity incidence in a large pediatric population that may inform clinical care and childhood obesity policy and prevention efforts. METHODS EHR data from healthcare visits with an initial record of obesity incidence (index visit) from 2009 through 2016 at the Children's Hospital of Philadelphia, and visits immediately before (pre-index) and after (post-index), were compared with a matched control population of patients with a healthy weight to characterize the prevalence of common diagnoses and condition trajectories. The study population consisted of 49,694 patients with pediatric obesity and their corresponding matched controls. The SPADE algorithm was used to identify common temporal condition patterns in the case population. McNemar's test was used to assess the statistical significance of pattern prevalence differences between the case and control populations. RESULTS SPADE identified 163 condition patterns that were present in at least 1% of cases; 80 were significantly more common among cases and 45 were significantly more common among controls (p < 0.05). Asthma and allergic rhinitis were strongly associated with childhood obesity incidence, particularly during the pre-index and index visits. Seven conditions were commonly diagnosed for cases exclusively during pre-index visits, including ear, nose, and throat disorders and gastroenteritis. CONCLUSIONS The novel application of SPADE on a large retrospective dataset revealed temporally dependent condition associations with obesity incidence. Allergic rhinitis and asthma had a particularly high prevalence during pre-index visits. These conditions, along with those exclusively observed during pre-index visits, may represent signals of future obesity. While causation cannot be inferred from these associations, the temporal condition patterns identified here represent hypotheses that can be investigated to determine causal relationships in future obesity research.
Collapse
Affiliation(s)
- Elizabeth A Campbell
- Department of Information Science, College of Computing and Informatics, Drexel University, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Ting Qian
- Department of Psychology, Princeton University, Princeton, NJ, USA
| | - Jeffrey M Miller
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Ellen J Bass
- Department of Information Science, College of Computing and Informatics, Drexel University, Philadelphia, PA, USA
- Department of Health Systems and Sciences Research, College of Nursing and Health Professions, Philadelphia, PA, USA
| | - Aaron J Masino
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA.
- Department of Anesthesiology and Critical Care, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
23
|
Campbell M, Smith D, Baird J, Vogel C, Moon EG. A critical review of diet-related surveys in England, 1970-2018. ACTA ACUST UNITED AC 2020; 78:66. [PMID: 32699631 PMCID: PMC7370528 DOI: 10.1186/s13690-020-00447-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Accepted: 07/07/2020] [Indexed: 11/30/2022]
Abstract
Background Many diet-related surveys have been conducted in England over the past four to five decades. Yet, diet-related ill-health is estimated to cost the NHS £5.8 billion annually. There has been no recent assessment of the diet-related surveys currently available in England. This paper aims to fill this gap in the literature by providing researchers, especially those interested in conducting secondary (quantitative) research on diet, with a detailed overview of the major repeated cross-sectional and longitudinal surveys conducted in England over the last 48 years (1970–2018). Method A three-stage review process was used to identify and assess surveys and synthesise the information necessary for achieving the paper’s aim. Surveys were identified using the UK Data Service, Cohort and Longitudinal Studies Enhancement Resources (CLOSER), the Medical Research Council (MRC) Cohort Directory and the Consumer Data Research Centre (CDRC) online data repositories/directories. Surveys were summarised to include a brief background, the survey design and methodology used, variables captured, the target population, level of geography covered, the type of dietary assessment method(s) used, primary data users, data accessibility, availability and costs, as well as key survey features and considerations. Results The key considerations identified across the various surveys following the review include: the overall survey design and the different dietary assessment method(s) used in each survey; methodological changes and general inconsistencies in the type and quantity of diet-related questions posed across and within surveys over time; and differences in the level of geography and target groups captured. Conclusion It is highly unlikely that any survey dataset will meet all the needs of researchers. Nevertheless, researchers are encouraged to make good use of the secondary data currently available, in order to conduct the research necessary for the creation of more evidence-based diet-related policies and interventions in England. The review process used in this paper is one that can be easily replicated and one which future studies can use to update and expand upon to assist researchers in identifying the survey(s) most aligned to their research questions.
Collapse
Affiliation(s)
- Monique Campbell
- University of Southampton, School of Geography and Environmental Science, Southampton, UK
| | - Dianna Smith
- University of Southampton, School of Geography and Environmental Science, Southampton, UK
| | - Janis Baird
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, UK
| | - Christina Vogel
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, UK
| | - Emeritus Graham Moon
- University of Southampton, School of Geography and Environmental Science, Southampton, UK
| |
Collapse
|
24
|
Cirillo D, Catuara-Solarz S, Morey C, Guney E, Subirats L, Mellino S, Gigante A, Valencia A, Rementeria MJ, Chadha AS, Mavridis N. Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare. NPJ Digit Med 2020; 3:81. [PMID: 32529043 PMCID: PMC7264169 DOI: 10.1038/s41746-020-0288-5] [Citation(s) in RCA: 186] [Impact Index Per Article: 37.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 04/28/2020] [Indexed: 01/10/2023] Open
Abstract
Precision Medicine implies a deep understanding of inter-individual differences in health and disease that are due to genetic and environmental factors. To acquire such understanding there is a need for the implementation of different types of technologies based on artificial intelligence (AI) that enable the identification of biomedically relevant patterns, facilitating progress towards individually tailored preventative and therapeutic interventions. Despite the significant scientific advances achieved so far, most of the currently used biomedical AI technologies do not account for bias detection. Furthermore, the design of the majority of algorithms ignore the sex and gender dimension and its contribution to health and disease differences among individuals. Failure in accounting for these differences will generate sub-optimal results and produce mistakes as well as discriminatory outcomes. In this review we examine the current sex and gender gaps in a subset of biomedical technologies used in relation to Precision Medicine. In addition, we provide recommendations to optimize their utilization to improve the global health and disease landscape and decrease inequalities.
Collapse
Affiliation(s)
- Davide Cirillo
- Barcelona Supercomputing Center (BSC), C/ Jordi Girona, 29, 08034 Barcelona, Spain
| | - Silvina Catuara-Solarz
- Telefonica Innovation Alpha Health, Torre Telefonica, Plaça d’Ernest Lluch i Martin, 5, 08019 Barcelona, Spain
- The Women’s Brain Project (WBP), Guntershausen, Switzerland
| | - Czuee Morey
- The Women’s Brain Project (WBP), Guntershausen, Switzerland
- Wega Informatik AG, Aeschengraben 20, CH-4051 Basel, Switzerland
| | - Emre Guney
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Research Institute and Pompeu Fabra University, Dr. Aiguader, 88, 08003 Barcelona, Spain
| | - Laia Subirats
- Eurecat - Centre Tecnològic de Catalunya, C/ Bilbao, 72, Edifici A, 08005 Barcelona, Spain
- eHealth Center, Universitat Oberta de Catalunya, Rambla del Poblenou, 156, 08018 Barcelona, Spain
| | - Simona Mellino
- The Women’s Brain Project (WBP), Guntershausen, Switzerland
| | | | - Alfonso Valencia
- Barcelona Supercomputing Center (BSC), C/ Jordi Girona, 29, 08034 Barcelona, Spain
- ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain
| | | | | | - Nikolaos Mavridis
- The Women’s Brain Project (WBP), Guntershausen, Switzerland
- Interactive Robots and Media Laboratory (IRML), Abu Dhabi, United Arab Emirates
| |
Collapse
|
25
|
Wilkins E, Aravani A, Downing A, Drewnowski A, Griffiths C, Zwolinsky S, Birkin M, Alvanides S, Morris MA. Evidence from big data in obesity research: international case studies. Int J Obes (Lond) 2020; 44:1028-1040. [PMID: 31988482 DOI: 10.1038/s41366-020-0532-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Revised: 12/20/2019] [Accepted: 01/07/2020] [Indexed: 12/28/2022]
Abstract
BACKGROUND/OBJECTIVE Obesity is thought to be the product of over 100 different factors, interacting as a complex system over multiple levels. Understanding the drivers of obesity requires considerable data, which are challenging, costly and time-consuming to collect through traditional means. Use of 'big data' presents a potential solution to this challenge. Big data is defined by Delphi consensus as: always digital, has a large sample size, and a large volume or variety or velocity of variables that require additional computing power (Vogel et al. Int J Obes. 2019). 'Additional computing power' introduces the concept of big data analytics. The aim of this paper is to showcase international research case studies presented during a seminar series held by the Economic and Social Research Council (ESRC) Strategic Network for Obesity in the UK. These are intended to provide an in-depth view of how big data can be used in obesity research, and the specific benefits, limitations and challenges encountered. METHODS AND RESULTS Three case studies are presented. The first investigated the influence of the built environment on physical activity. It used spatial data on green spaces and exercise facilities alongside individual-level data on physical activity and swipe card entry to leisure centres, collected as part of a local authority exercise class initiative. The second used a variety of linked electronic health datasets to investigate associations between obesity surgery and the risk of developing cancer. The third used data on tax parcel values alongside data from the Seattle Obesity Study to investigate sociodemographic determinants of obesity in Seattle. CONCLUSIONS The case studies demonstrated how big data could be used to augment traditional data to capture a broader range of variables in the obesity system. They also showed that big data can present improvements over traditional data in relation to size, coverage, temporality, and objectivity of measures. However, the case studies also encountered challenges or limitations; particularly in relation to hidden/unforeseen biases and lack of contextual information. Overall, despite challenges, big data presents a relatively untapped resource that shows promise in helping to understand drivers of obesity.
Collapse
Affiliation(s)
- Emma Wilkins
- Leeds Institute for Data Analytics and School of Medicine, University of Leeds, Leeds, UK
| | - Ariadni Aravani
- Leeds Institute for Data Analytics and School of Medicine, University of Leeds, Leeds, UK
| | - Amy Downing
- Leeds Institute for Data Analytics and School of Medicine, University of Leeds, Leeds, UK
| | - Adam Drewnowski
- Center for Public Health Nutrition, University of Washington, Seattle, WA, USA
| | | | | | - Mark Birkin
- Leeds Institute for Data Analytics and School of Geography, University of Leeds, Leeds, UK
| | - Seraphim Alvanides
- Engineering and Environment, Northumbria University, Newcastle, UK.,GESIS-Leibniz Institute for the Social Sciences, Cologne, Germany
| | - Michelle A Morris
- Leeds Institute for Data Analytics and School of Medicine, University of Leeds, Leeds, UK.
| |
Collapse
|
26
|
The evolution of Health & Place: Text mining papers published between 1995 and 2018. Health Place 2020; 61:102207. [DOI: 10.1016/j.healthplace.2019.102207] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Revised: 09/13/2019] [Accepted: 09/13/2019] [Indexed: 01/26/2023]
|
27
|
Creating a long-term future for big data in obesity research. Int J Obes (Lond) 2019; 43:2587-2592. [PMID: 31641212 DOI: 10.1038/s41366-019-0477-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Revised: 10/02/2019] [Accepted: 10/04/2019] [Indexed: 11/09/2022]
Abstract
Big data are part of the future in obesity research. The ESRC funded Strategic Network for Obesity has together generated a series of papers, published in the International Journal for Obesity illustrating various aspects of their utility, in particular relating to the large social and environmental drivers of obesity. This article is the final part of the series and reflects upon progress to date and identifies four areas that require attention to promote the continued role of big data in research. We additionally include a 'getting started with big data' checklist to encourage more obesity researchers to engage with alternative data resources.
Collapse
|
28
|
Mamiya H, Schmidt AM, Moodie EEM, Ma Y, Buckeridge DL. An Area-Level Indicator of Latent Soda Demand: Spatial Statistical Modeling of Grocery Store Transaction Data to Characterize the Nutritional Landscape in Montreal, Canada. Am J Epidemiol 2019; 188:1713-1722. [PMID: 31063186 DOI: 10.1093/aje/kwz115] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 04/24/2019] [Accepted: 04/29/2019] [Indexed: 12/26/2022] Open
Abstract
Measurement of neighborhood dietary patterns at high spatial resolution allows public health agencies to identify and monitor communities with an elevated risk of nutrition-related chronic diseases. Currently, data on diet are obtained primarily through nutrition surveys, which produce measurements at low spatial resolutions. The availability of store-level grocery transaction data provides an opportunity to refine the measurement of neighborhood dietary patterns. We used these data to develop an indicator of area-level latent demand for soda in the Census Metropolitan Area of Montreal in 2012 by applying a hierarchical Bayesian spatial model to data on soda sales from 1,097 chain retail food outlets. The utility of the indicator of latent soda demand was evaluated by assessing its association with the neighborhood relative risk of prevalent type 2 diabetes mellitus. The indicator improved the fit of the disease-mapping model (deviance information criterion: 2,140 with the indicator and 2,148 without) and enables a novel approach to nutrition surveillance.
Collapse
Affiliation(s)
- Hiroshi Mamiya
- Surveillance Lab, McGill Clinical and Health Informatics, McGill University, Montreal, Quebec, Canada
- Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montreal, Quebec, Canada
| | - Alexandra M Schmidt
- Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montreal, Quebec, Canada
| | - Erica E M Moodie
- Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montreal, Quebec, Canada
| | - Yu Ma
- Desautels Faculty of Management, McGill University, Montreal, Quebec, Canada
| | - David L Buckeridge
- Surveillance Lab, McGill Clinical and Health Informatics, McGill University, Montreal, Quebec, Canada
- Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montreal, Quebec, Canada
| |
Collapse
|
29
|
A Delphi study to build consensus on the definition and use of big data in obesity research. Int J Obes (Lond) 2019; 43:2573-2586. [PMID: 30655580 PMCID: PMC6892733 DOI: 10.1038/s41366-018-0313-9] [Citation(s) in RCA: 184] [Impact Index Per Article: 30.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 11/01/2018] [Accepted: 11/29/2018] [Indexed: 02/08/2023]
Abstract
BACKGROUND 'Big data' has great potential to help address the global health challenge of obesity. However, lack of clarity with regard to the definition of big data and frameworks for effectively using big data in the context of obesity research may be hindering progress. The aim of this study was to establish agreed approaches for the use of big data in obesity-related research. METHODS A Delphi method of consensus development was used, comprising three survey rounds. In Round 1, participants were asked to rate agreement/disagreement with 77 statements across seven domains relating to definitions of, and approaches to, using big data in the context of obesity research. Participants were also asked to contribute further ideas in relation to these topics, which were incorporated as new statements (n = 8) in Round 2. In Rounds 2 and 3 participants re-appraised their ratings in view of the group consensus. RESULTS Ninety-six experts active in obesity-related research were invited to participate. Of these, 36/96 completed Round 1 (37.5% response rate), 29/36 completed Round 2 (80.6% response rate) and 26/29 completed Round 3 (89.7% response rate). Consensus (defined as > 70% agreement) was achieved for 90.6% (n = 77) of statements, with 100% consensus achieved for the Definition of Big Data, Data Governance, and Quality and Inference domains. CONCLUSIONS Experts agreed that big data was more nuanced than the oft-cited definition of 'volume, variety and velocity', and includes quantitative, qualitative, observational or intervention data from a range of sources that have been collected for research or other purposes. Experts repeatedly called for third party action, for example to develop frameworks for reporting and ethics, to clarify data governance requirements, to support training and skill development and to facilitate sharing of big data. Further advocacy will be required to encourage organisations to adopt these roles.
Collapse
|
30
|
Li L, Song Q, Yang X. K-means clustering of overweight and obese population using quantile-transformed metabolic data. Diabetes Metab Syndr Obes 2019; 12:1573-1582. [PMID: 31692562 PMCID: PMC6711566 DOI: 10.2147/dmso.s206640] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/24/2019] [Accepted: 07/09/2019] [Indexed: 12/26/2022] Open
Abstract
OBJECTIVE Use of K-means clustering for big data technology to cluster an overweight and obese population metabolically. METHODS K-means clustering with the help of quantile transformation of attribute values was applied to overcome the impact of the considerable variation in the values of obesity attributes involving outliers and skewed distribution. RESULTS Overall, 447 subjects were categorized into six clusters; metabolically normal, mild, and severe categories. There were clearly separated metabolically normal Cluster 1 and severe Cluster 2, as well as intermediate Cluster 3, 4, and 5 that had profiles of fewer attributes with abnormal values. Cluster 3 was characteristic of sole hypertension. Cluster 3 and 4 exhibited contrasting HDL-C and LDL-C levels despite similarly elevated total cholesterol. Cluster 6 with slightly elevated triglyceride was closest to the normal group. Four- and 10-quantile-transformations yielded consistent clustering results. Compared with the original data, the quantile-transformed data produced more regular and spherical clusters and evenly distributed clusters in terms of object numbers. CONCLUSIONS This big data analysis strategy makes use of quantile-transformation of data to overcome the issue of outliers and the irregular distribution and applies to the analysis of other non-communicable diseases.
Collapse
Affiliation(s)
- Li Li
- Department of Endocrinology and Metabolism, Ningbo First Hospital, Ningbo, People’s Republic of China
| | - Qifa Song
- Department of Microbiology, Ningbo Municipal Centre for Disease Control and Prevention, Ningbo, People’s Republic of China
- Correspondence: Qifa SongDepartment of Microbiology, Ningbo Municipal Centre for Disease Control and Prevention, No. 237, Yongfeng Road, Ningbo, Zhejiang Province315010, People’s Republic of ChinaTel +86 05 748 727 4563Email
| | - Xi Yang
- Department of Endocrinology and Metabolism, Ningbo First Hospital, Ningbo, People’s Republic of China
| |
Collapse
|
31
|
Shelton RC, Lee M, Brotzman LE, Crookes DM, Jandorf L, Erwin D, Gage-Bouchard EA. Use of social network analysis in the development, dissemination, implementation, and sustainability of health behavior interventions for adults: A systematic review. Soc Sci Med 2018; 220:81-101. [PMID: 30412922 DOI: 10.1016/j.socscimed.2018.10.013] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Revised: 10/15/2018] [Accepted: 10/18/2018] [Indexed: 11/30/2022]
Abstract
Interest in conceptualizing, measuring, and applying social network analysis (SNA) in public health has grown tremendously in recent years. While these studies have broadened our understanding of the role that social networks play in health, there has been less research that has investigated the application of SNA to inform health-related interventions. This systematic review aimed to capture the current applied use of SNA in the development, dissemination, implementation, and sustainability of health behavior interventions for adults. We identified 52 articles published between 2004 and 2016. A wide variety of study settings were identified, most commonly in the US context and most often related to sexual health and HIV prevention. We found that 38% of articles explicitly applied SNA to inform some aspect of interventions. Use of SNA to inform intervention development (as opposed to dissemination, implementation, or sustainability) was most common. The majority of articles represented in this review (n = 39) were quantitative studies, and 13 articles included a qualitative component. Partial networks were most represented across articles, and over 100 different networks measures were assessed. The most commonly described measures were network density, size, and degree centrality. Finally, very few articles defined SNA and not all articles using SNA were theoretically-informed. Given the nascent and heterogeneous state of the literature in this area, this is an important time for the field to coalesce on terminology, measures, and theoretical frameworks. We highlight areas for researchers to advance work on the application of SNA in the design, dissemination, implementation and sustainability of behavioral interventions.
Collapse
Affiliation(s)
- Rachel C Shelton
- Columbia University Mailman School of Public Health, Department of Sociomedical Sciences, 722 West 168th Street, New York, NY, 10032, USA.
| | - Matthew Lee
- Columbia University Mailman School of Public Health, Department of Sociomedical Sciences, 722 West 168th Street, New York, NY, 10032, USA.
| | - Laura E Brotzman
- Columbia University Mailman School of Public Health, Department of Sociomedical Sciences, 722 West 168th Street, New York, NY, 10032, USA.
| | - Danielle M Crookes
- Columbia University Mailman School of Public Health, Department of Epidemiology, 722 West 168th Street, New York, NY, 10032, USA.
| | - Lina Jandorf
- Icahn School of Medicine at Mount Sinai, Department of Oncological Sciences, One Gustave L. Levy Place, New York, NY, 10029, USA.
| | - Deborah Erwin
- Roswell Park Comprehensive Cancer Center, Elm and Carlton Streets, Buffalo, NY, 14263, USA.
| | | |
Collapse
|
32
|
Can big data solve a big problem? Reporting the obesity data landscape in line with the Foresight obesity system map. Int J Obes (Lond) 2018; 42:1963-1976. [PMID: 30242238 PMCID: PMC6291418 DOI: 10.1038/s41366-018-0184-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 03/07/2018] [Accepted: 04/09/2018] [Indexed: 11/20/2022]
Abstract
Background Obesity research at a population level is multifaceted and complex. This has been characterised in the UK by the Foresight obesity systems map, identifying over 100 variables, across seven domain areas which are thought to influence energy balance, and subsequent obesity. Availability of data to consider the whole obesity system is traditionally lacking. However, in an era of big data, new possibilities are emerging. Understanding what data are available can be the first challenge, followed by an inconsistency in data reporting to enable adequate use in the obesity context. In this study we map data sources against the Foresight obesity system map domains and nodes and develop a framework to report big data for obesity research. Opportunities and challenges associated with this new data approach to whole systems obesity research are discussed. Methods Expert opinion from the ESRC Strategic Network for Obesity was harnessed in order to develop a data source reporting framework for obesity research. The framework was then tested on a range of data sources. In order to assess availability of data sources relevant to obesity research, a data mapping exercise against the Foresight obesity systems map domains and nodes was carried out. Results A reporting framework was developed to recommend the reporting of key information in line with these headings: Background; Elements; Exemplars; Content; Ownership; Aggregation; Sharing; Temporality (BEE-COAST). The new BEE-COAST framework was successfully applied to eight exemplar data sources from the UK. 80% coverage of the Foresight obesity systems map is possible using a wide range of big data sources. The remaining 20% were primarily biological measurements often captured by more traditional laboratory based research. Conclusions Big data offer great potential across many domains of obesity research and need to be leveraged in conjunction with traditional data for societal benefit and health promotion.
Collapse
|