1
|
Lang O, Yaya-Stupp D, Traynis I, Cole-Lewis H, Bennett CR, Lyles CR, Lau C, Irani M, Semturs C, Webster DR, Corrado GS, Hassidim A, Matias Y, Liu Y, Hammel N, Babenko B. Using generative AI to investigate medical imagery models and datasets. EBioMedicine 2024; 102:105075. [PMID: 38565004 PMCID: PMC10993140 DOI: 10.1016/j.ebiom.2024.105075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 03/05/2024] [Accepted: 03/06/2024] [Indexed: 04/04/2024] Open
Abstract
BACKGROUND AI models have shown promise in performing many medical imaging tasks. However, our ability to explain what signals these models have learned is severely lacking. Explanations are needed in order to increase the trust of doctors in AI-based models, especially in domains where AI prediction capabilities surpass those of humans. Moreover, such explanations could enable novel scientific discovery by uncovering signals in the data that aren't yet known to experts. METHODS In this paper, we present a workflow for generating hypotheses to understand which visual signals in images are correlated with a classification model's predictions for a given task. This approach leverages an automatic visual explanation algorithm followed by interdisciplinary expert review. We propose the following 4 steps: (i) Train a classifier to perform a given task to assess whether the imagery indeed contains signals relevant to the task; (ii) Train a StyleGAN-based image generator with an architecture that enables guidance by the classifier ("StylEx"); (iii) Automatically detect, extract, and visualize the top visual attributes that the classifier is sensitive towards. For visualization, we independently modify each of these attributes to generate counterfactual visualizations for a set of images (i.e., what the image would look like with the attribute increased or decreased); (iv) Formulate hypotheses for the underlying mechanisms, to stimulate future research. Specifically, present the discovered attributes and corresponding counterfactual visualizations to an interdisciplinary panel of experts so that hypotheses can account for social and structural determinants of health (e.g., whether the attributes correspond to known patho-physiological or socio-cultural phenomena, or could be novel discoveries). FINDINGS To demonstrate the broad applicability of our approach, we present results on eight prediction tasks across three medical imaging modalities-retinal fundus photographs, external eye photographs, and chest radiographs. We showcase examples where many of the automatically-learned attributes clearly capture clinically known features (e.g., types of cataract, enlarged heart), and demonstrate automatically-learned confounders that arise from factors beyond physiological mechanisms (e.g., chest X-ray underexposure is correlated with the classifier predicting abnormality, and eye makeup is correlated with the classifier predicting low hemoglobin levels). We further show that our method reveals a number of physiologically plausible, previously-unknown attributes based on the literature (e.g., differences in the fundus associated with self-reported sex, which were previously unknown). INTERPRETATION Our approach enables hypotheses generation via attribute visualizations and has the potential to enable researchers to better understand, improve their assessment, and extract new knowledge from AI-based models, as well as debug and design better datasets. Though not designed to infer causality, importantly, we highlight that attributes generated by our framework can capture phenomena beyond physiology or pathophysiology, reflecting the real world nature of healthcare delivery and socio-cultural factors, and hence interdisciplinary perspectives are critical in these investigations. Finally, we will release code to help researchers train their own StylEx models and analyze their predictive tasks of interest, and use the methodology presented in this paper for responsible interpretation of the revealed attributes. FUNDING Google.
Collapse
Affiliation(s)
| | | | - Ilana Traynis
- Work Done at Google Via Advanced Clinical, Deerfield, IL, USA
| | | | | | - Courtney R Lyles
- Google, Mountain View, CA, USA; University of California San Francisco, Department of Medicine, San Francisco, CA, USA
| | | | | | | | | | | | | | | | - Yun Liu
- Google, Mountain View, CA, USA
| | | | | |
Collapse
|
2
|
Schaekermann M, Spitz T, Pyles M, Cole-Lewis H, Wulczyn E, Pfohl SR, Martin D, Jaroensri R, Keeling G, Liu Y, Farquhar S, Xue Q, Lester J, Hughes C, Strachan P, Tan F, Bui P, Mermel CH, Peng LH, Matias Y, Corrado GS, Webster DR, Virmani S, Semturs C, Liu Y, Horn I, Cameron Chen PH. Health equity assessment of machine learning performance (HEAL): a framework and dermatology AI model case study. EClinicalMedicine 2024; 70:102479. [PMID: 38685924 PMCID: PMC11056401 DOI: 10.1016/j.eclinm.2024.102479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 01/16/2024] [Accepted: 01/25/2024] [Indexed: 05/02/2024] Open
Abstract
Background Artificial intelligence (AI) has repeatedly been shown to encode historical inequities in healthcare. We aimed to develop a framework to quantitatively assess the performance equity of health AI technologies and to illustrate its utility via a case study. Methods Here, we propose a methodology to assess whether health AI technologies prioritise performance for patient populations experiencing worse outcomes, that is complementary to existing fairness metrics. We developed the Health Equity Assessment of machine Learning performance (HEAL) framework designed to quantitatively assess the performance equity of health AI technologies via a four-step interdisciplinary process to understand and quantify domain-specific criteria, and the resulting HEAL metric. As an illustrative case study (analysis conducted between October 2022 and January 2023), we applied the HEAL framework to a dermatology AI model. A set of 5420 teledermatology cases (store-and-forward cases from patients of 20 years or older, submitted from primary care providers in the USA and skin cancer clinics in Australia), enriched for diversity in age, sex and race/ethnicity, was used to retrospectively evaluate the AI model's HEAL metric, defined as the likelihood that the AI model performs better for subpopulations with worse average health outcomes as compared to others. The likelihood that AI performance was anticorrelated to pre-existing health outcomes was estimated using bootstrap methods as the probability that the negated Spearman's rank correlation coefficient (i.e., "R") was greater than zero. Positive values of R suggest that subpopulations with poorer health outcomes have better AI model performance. Thus, the HEAL metric, defined as p (R >0), measures how likely the AI technology is to prioritise performance for subpopulations with worse average health outcomes as compared to others (presented as a percentage below). Health outcomes were quantified as disability-adjusted life years (DALYs) when grouping by sex and age, and years of life lost (YLLs) when grouping by race/ethnicity. AI performance was measured as top-3 agreement with the reference diagnosis from a panel of 3 dermatologists per case. Findings Across all dermatologic conditions, the HEAL metric was 80.5% for prioritizing AI performance of racial/ethnic subpopulations based on YLLs, and 92.1% and 0.0% respectively for prioritizing AI performance of sex and age subpopulations based on DALYs. Certain dermatologic conditions were significantly associated with greater AI model performance compared to a reference category of less common conditions. For skin cancer conditions, the HEAL metric was 73.8% for prioritizing AI performance of age subpopulations based on DALYs. Interpretation Analysis using the proposed HEAL framework showed that the dermatology AI model prioritised performance for race/ethnicity, sex (all conditions) and age (cancer conditions) subpopulations with respect to pre-existing health disparities. More work is needed to investigate ways of promoting equitable AI performance across age for non-cancer conditions and to better understand how AI models can contribute towards improving equity in health outcomes. Funding Google LLC.
Collapse
Affiliation(s)
| | | | - Malcolm Pyles
- Advanced Clinical, Deerfield, IL, USA
- Department of Dermatology, Cleveland Clinic, Cleveland, OH, USA
| | | | | | | | | | | | | | - Yuan Liu
- Google Health, Mountain View, CA, USA
| | | | | | - Jenna Lester
- Advanced Clinical, Deerfield, IL, USA
- Department of Dermatology, University of California, San Francisco, CA, USA
| | | | | | | | - Peggy Bui
- Google Health, Mountain View, CA, USA
| | | | | | | | | | | | | | | | - Yun Liu
- Google Health, Mountain View, CA, USA
| | - Ivor Horn
- Google Health, Mountain View, CA, USA
| | | |
Collapse
|
3
|
Weng WH, Sellergen A, Kiraly AP, D'Amour A, Park J, Pilgrim R, Pfohl S, Lau C, Natarajan V, Azizi S, Karthikesalingam A, Cole-Lewis H, Matias Y, Corrado GS, Webster DR, Shetty S, Prabhakara S, Eswaran K, Celi LAG, Liu Y. An intentional approach to managing bias in general purpose embedding models. Lancet Digit Health 2024; 6:e126-e130. [PMID: 38278614 DOI: 10.1016/s2589-7500(23)00227-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 10/24/2023] [Accepted: 11/02/2023] [Indexed: 01/28/2024]
Abstract
Advances in machine learning for health care have brought concerns about bias from the research community; specifically, the introduction, perpetuation, or exacerbation of care disparities. Reinforcing these concerns is the finding that medical images often reveal signals about sensitive attributes in ways that are hard to pinpoint by both algorithms and people. This finding raises a question about how to best design general purpose pretrained embeddings (GPPEs, defined as embeddings meant to support a broad array of use cases) for building downstream models that are free from particular types of bias. The downstream model should be carefully evaluated for bias, and audited and improved as appropriate. However, in our view, well intentioned attempts to prevent the upstream components-GPPEs-from learning sensitive attributes can have unintended consequences on the downstream models. Despite producing a veneer of technical neutrality, the resultant end-to-end system might still be biased or poorly performing. We present reasons, by building on previously published data, to support the reasoning that GPPEs should ideally contain as much information as the original data contain, and highlight the perils of trying to remove sensitive attributes from a GPPE. We also emphasise that downstream prediction models trained for specific tasks and settings, whether developed using GPPEs or not, should be carefully designed and evaluated to avoid bias that makes models vulnerable to issues such as distributional shift. These evaluations should be done by a diverse team, including social scientists, on a diverse cohort representing the full breadth of the patient population for which the final model is intended.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Leo A G Celi
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Yun Liu
- Google, Mountain View, CA, USA.
| |
Collapse
|
4
|
Arora A, Alderman JE, Palmer J, Ganapathi S, Laws E, McCradden MD, Oakden-Rayner L, Pfohl SR, Ghassemi M, McKay F, Treanor D, Rostamzadeh N, Mateen B, Gath J, Adebajo AO, Kuku S, Matin R, Heller K, Sapey E, Sebire NJ, Cole-Lewis H, Calvert M, Denniston A, Liu X. The value of standards for health datasets in artificial intelligence-based applications. Nat Med 2023; 29:2929-2938. [PMID: 37884627 PMCID: PMC10667100 DOI: 10.1038/s41591-023-02608-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 09/22/2023] [Indexed: 10/28/2023]
Abstract
Artificial intelligence as a medical device is increasingly being applied to healthcare for diagnosis, risk stratification and resource allocation. However, a growing body of evidence has highlighted the risk of algorithmic bias, which may perpetuate existing health inequity. This problem arises in part because of systemic inequalities in dataset curation, unequal opportunity to participate in research and inequalities of access. This study aims to explore existing standards, frameworks and best practices for ensuring adequate data diversity in health datasets. Exploring the body of existing literature and expert views is an important step towards the development of consensus-based guidelines. The study comprises two parts: a systematic review of existing standards, frameworks and best practices for healthcare datasets; and a survey and thematic analysis of stakeholder views of bias, health equity and best practices for artificial intelligence as a medical device. We found that the need for dataset diversity was well described in literature, and experts generally favored the development of a robust set of guidelines, but there were mixed views about how these could be implemented practically. The outputs of this study will be used to inform the development of standards for transparency of data diversity in health datasets (the STANDING Together initiative).
Collapse
Affiliation(s)
- Anmol Arora
- School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Joseph E Alderman
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
| | - Joanne Palmer
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
| | | | - Elinor Laws
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
| | - Melissa D McCradden
- Department of Bioethics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Genetics and Genome Biology, Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
- Dalla Lana School of Public Health, Toronto, Ontario, Canada
| | - Lauren Oakden-Rayner
- The Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia
| | | | - Marzyeh Ghassemi
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- Vector Institute, Toronto, Ontario, Canada
| | - Francis McKay
- The Ethox Centre and the Wellcome Centre for Ethics and Humanities, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Darren Treanor
- Leeds Teaching Hospitals NHS Trust, Leeds, UK
- University of Leeds, Leeds, UK
- Department of Clinical Pathology and Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
- Center for Medical Image Science and Visualization, Linköping University, Linköping, Sweden
| | | | - Bilal Mateen
- Institute for Health Informatics, University College London, London, UK
- Wellcome Trust, London, UK
| | - Jacqui Gath
- Patient and Public Involvement and Engagement (PPIE) Group, STANDING Together, Birmingham, UK
| | - Adewole O Adebajo
- Patient and Public Involvement and Engagement (PPIE) Group, STANDING Together, Birmingham, UK
| | | | - Rubeta Matin
- Oxford University Hospitals NHS Foundation Trust, Oxford, UK
| | | | - Elizabeth Sapey
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
- PIONEER, HDR UK Hub in Acute Care, Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK
| | - Neil J Sebire
- National Institute for Health and Care Research, Great Ormond Street Hospital Biomedical Research Centre, London, UK
- Great Ormond Street Institute of Child Health, University Hospital London, London, UK
| | | | - Melanie Calvert
- National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
- Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
- Centre for Patient Reported Outcomes Research, Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- National Institute for Health and Care Research Applied Research Collaboration West Midlands, University of Birmingham, Birmingham, UK
- National Institute for Health and Care Research Birmingham-Oxford Blood and Transplant Research Unit in Precision Transplant and Cellular Therapeutics, University of Birmingham, Birmingham, UK
- DEMAND Hub, University of Birmingham, Birmingham, UK
- UK SPINE, University of Birmingham, Birmingham, UK
| | - Alastair Denniston
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
- Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
- National Institute for Health and Care Research Biomedical Research Centre, Moorfields Eye Hospital/University College London, London, UK
| | - Xiaoxuan Liu
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK.
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK.
- National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK.
| |
Collapse
|
5
|
Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, Scales N, Tanwani A, Cole-Lewis H, Pfohl S, Payne P, Seneviratne M, Gamble P, Kelly C, Babiker A, Schärli N, Chowdhery A, Mansfield P, Demner-Fushman D, Agüera Y Arcas B, Webster D, Corrado GS, Matias Y, Chou K, Gottweis J, Tomasev N, Liu Y, Rajkomar A, Barral J, Semturs C, Karthikesalingam A, Natarajan V. Large language models encode clinical knowledge. Nature 2023; 620:172-180. [PMID: 37438534 PMCID: PMC10396962 DOI: 10.1038/s41586-023-06291-2] [Citation(s) in RCA: 179] [Impact Index Per Article: 179.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 06/05/2023] [Indexed: 07/14/2023]
Abstract
Large language models (LLMs) have demonstrated impressive capabilities, but the bar for clinical applications is high. Attempts to assess the clinical knowledge of models typically rely on automated evaluations based on limited benchmarks. Here, to address these limitations, we present MultiMedQA, a benchmark combining six existing medical question answering datasets spanning professional medicine, research and consumer queries and a new dataset of medical questions searched online, HealthSearchQA. We propose a human evaluation framework for model answers along multiple axes including factuality, comprehension, reasoning, possible harm and bias. In addition, we evaluate Pathways Language Model1 (PaLM, a 540-billion parameter LLM) and its instruction-tuned variant, Flan-PaLM2 on MultiMedQA. Using a combination of prompting strategies, Flan-PaLM achieves state-of-the-art accuracy on every MultiMedQA multiple-choice dataset (MedQA3, MedMCQA4, PubMedQA5 and Measuring Massive Multitask Language Understanding (MMLU) clinical topics6), including 67.6% accuracy on MedQA (US Medical Licensing Exam-style questions), surpassing the prior state of the art by more than 17%. However, human evaluation reveals key gaps. To resolve this, we introduce instruction prompt tuning, a parameter-efficient approach for aligning LLMs to new domains using a few exemplars. The resulting model, Med-PaLM, performs encouragingly, but remains inferior to clinicians. We show that comprehension, knowledge recall and reasoning improve with model scale and instruction prompt tuning, suggesting the potential utility of LLMs in medicine. Our human evaluations reveal limitations of today's models, reinforcing the importance of both evaluation frameworks and method development in creating safe, helpful LLMs for clinical applications.
Collapse
Affiliation(s)
| | | | - Tao Tu
- Google Research, Mountain View, CA, USA
| | | | - Jason Wei
- Google Research, Mountain View, CA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Yun Liu
- Google Research, Mountain View, CA, USA
| | | | | | | | | | | |
Collapse
|
6
|
Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, Scales N, Tanwani A, Cole-Lewis H, Pfohl S, Payne P, Seneviratne M, Gamble P, Kelly C, Babiker A, Schärli N, Chowdhery A, Mansfield P, Demner-Fushman D, Agüera Y Arcas B, Webster D, Corrado GS, Matias Y, Chou K, Gottweis J, Tomasev N, Liu Y, Rajkomar A, Barral J, Semturs C, Karthikesalingam A, Natarajan V. Publisher Correction: Large language models encode clinical knowledge. Nature 2023; 620:E19. [PMID: 37500979 PMCID: PMC10412443 DOI: 10.1038/s41586-023-06455-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Affiliation(s)
| | | | - Tao Tu
- Google Research, Mountain View, CA, USA
| | | | - Jason Wei
- Google Research, Mountain View, CA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Yun Liu
- Google Research, Mountain View, CA, USA
| | | | | | | | | | | |
Collapse
|
7
|
Sucala M, Ezeanochie NP, Cole-Lewis H, Turgiss J. An iterative, interdisciplinary, collaborative framework for developing and evaluating digital behavior change interventions. Transl Behav Med 2021; 10:1538-1548. [PMID: 31328775 PMCID: PMC7796712 DOI: 10.1093/tbm/ibz109] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The rapid expansion of technology promises to transform the behavior science field by revolutionizing the ways in which individuals can monitor and improve their health behaviors. To fully live into this promise, the behavior science field must address distinct challenges, including: building interventions that are not only scientifically sound but also engaging; using evaluation methods to precisely assess intervention components for intervention optimization; and building personalized interventions that acknowledge and adapt to the dynamic ecosystem of individual and contextual variables that impact behavior change. The purpose of this paper is to provide a framework to address these challenges by leveraging behavior science, human-centered design, and data science expertise throughout the cycle of developing and evaluating digital behavior change interventions (DBCIs). To define this framework, we reviewed current models and practices for intervention development and evaluation, as well as technology industry models for product development. The framework promotes an iterative process, aiming to maximize outcomes by incorporating faster and more frequent testing cycles into the lifecycle of a DBCI. Within the framework provided, we describe each phase, from development to evaluation, to discuss the optimal practices, necessary stakeholders, and proposed evaluation methods. The proposed framework may inform practices in both academia and industry, as well as highlight the need to offer collaborative platforms to ensure successful partnerships that can lead to more effective DBCIs that reach broad and diverse populations.
Collapse
Affiliation(s)
- Madalina Sucala
- Johnson and Johnson Health and Wellness Solutions Inc., New Brunswick, NJ, USA
| | | | - Heather Cole-Lewis
- Johnson and Johnson Health and Wellness Solutions Inc., New Brunswick, NJ, USA
| | - Jennifer Turgiss
- Johnson and Johnson Health and Wellness Solutions Inc., New Brunswick, NJ, USA
| |
Collapse
|
8
|
Sucala M, Cole-Lewis H, Arigo D, Oser M, Goldstein S, Hekler EB, Diefenbach MA. Behavior science in the evolving world of digital health: considerations on anticipated opportunities and challenges. Transl Behav Med 2021; 11:495-503. [PMID: 32320039 PMCID: PMC7963278 DOI: 10.1093/tbm/ibaa034] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Digital health promises to increase intervention reach and effectiveness for a range of behavioral health outcomes. Behavioral scientists have a unique opportunity to infuse their expertise in all phases of a digital health intervention, from design to implementation. The aim of this study was to assess behavioral scientists' interests and needs with respect to digital health endeavors, as well as gather expert insight into the role of behavioral science in the evolution of digital health. The study used a two-phased approach: (a) a survey of behavioral scientists' current needs and interests with respect to digital health endeavors (n = 346); (b) a series of interviews with digital health stakeholders for their expert insight on the evolution of the health field (n = 15). In terms of current needs and interests, the large majority of surveyed behavioral scientists (77%) already participate in digital health projects, and from those who have not done so yet, the majority (65%) reported intending to do so in the future. In terms of the expected evolution of the digital health field, interviewed stakeholders anticipated a number of changes, from overall landscape changes through evolving models of reimbursement to more significant oversight and regulations. These findings provide a timely insight into behavioral scientists' current needs, barriers, and attitudes toward the use of technology in health care and public health. Results might also highlight the areas where behavioral scientists can leverage their expertise to both enhance digital health's potential to improve health, as well as to prevent the potential unintended consequences that can emerge from scaling the use of technology in health care.
Collapse
Affiliation(s)
- Madalina Sucala
- Health and Wellness Solutions, Johnson and Johnson Inc., New Brunswick, NJ, USA
| | - Heather Cole-Lewis
- Health and Wellness Solutions, Johnson and Johnson Inc., New Brunswick, NJ, USA
| | - Danielle Arigo
- Department of Psychology, Rowan University, Glassboro, NJ, USA
| | - Megan Oser
- Mahana Therapeutics, San Francisco, CA, USA
| | - Stephanie Goldstein
- Department of Psychiatry and Human Behavior, The Warren Alpert Medical School of Brown University and The Miriam Hospital/Weight Control and Diabetes Research Center, Providence, RI, USA
| | - Eric B Hekler
- Department of Family Medicine and Public Health, University of California, San Diego (UCSD), Center for Wireless and Population Health Systems, Qualcomm Institute at UCSD, San Diego, CA, USA
| | - Michael A Diefenbach
- Departments of Medicine and Urology, Northwell Health and the School of Medicine at Hofstra/Northwell, East Garden City, NY, USA
| |
Collapse
|
9
|
Cole-Lewis H, Ezeanochie N, Turgiss J. Understanding Health Behavior Technology Engagement: Pathway to Measuring Digital Behavior Change Interventions. JMIR Form Res 2019; 3:e14052. [PMID: 31603427 PMCID: PMC6813486 DOI: 10.2196/14052] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Revised: 07/23/2019] [Accepted: 08/14/2019] [Indexed: 11/13/2022] Open
Abstract
Researchers and practitioners of digital behavior change interventions (DBCI) use varying and, often, incongruent definitions of the term "engagement," thus leading to a lack of precision in DBCI measurement and evaluation. The objective of this paper is to propose discrete definitions for various types of user engagement and to explain why precision in the measurement of these engagement types is integral to ensuring the intervention is effective for health behavior modulation. Additionally, this paper presents a framework and practical steps for how engagement can be measured in practice and used to inform DBCI design and evaluation. The key purpose of a DBCI is to influence change in a target health behavior of a user, which may ultimately improve a health outcome. Using available literature and practice-based knowledge of DBCI, the framework conceptualizes two primary categories of engagement that must be measured in DBCI. The categories are health behavior engagement, referred to as "Big E," and DBCI engagement, referred to as "Little e." DBCI engagement is further bifurcated into two subclasses: (1) user interactions with features of the intervention designed to encourage frequency of use (ie, simple login, games, and social interactions) and make the user experience appealing, and (2) user interactions with behavior change intervention components (ie, behavior change techniques), which influence determinants of health behavior and subsequently influence health behavior. Achievement of Big E in an intervention delivered via digital means is contingent upon Little e. If users do not interact with DBCI features and enjoy the user experience, exposure to behavior change intervention components will be limited and less likely to influence the behavioral determinants that lead to health behavior engagement (Big E). Big E is also dependent upon the quality and relevance of the behavior change intervention components within the solution. Therefore, the combination of user interactions and behavior change intervention components creates Little e, which is, in turn, designed to improve Big E. The proposed framework includes a model to support measurement of DBCI that describes categories of engagement and details how features of Little e produce Big E. This framework can be applied to DBCI to support various health behaviors and outcomes and can be utilized to identify gaps in intervention efficacy and effectiveness.
Collapse
Affiliation(s)
- Heather Cole-Lewis
- Johnson and Johnson Health and Wellness Solutions, Inc, New Brunswick, NJ, United States
| | - Nnamdi Ezeanochie
- Johnson and Johnson Health and Wellness Solutions, Inc, New Brunswick, NJ, United States
| | - Jennifer Turgiss
- Johnson and Johnson Health and Wellness Solutions, Inc, New Brunswick, NJ, United States
| |
Collapse
|
10
|
Sarma S, Nemser B, Cole-Lewis H, Kaonga N, Negin J, Namakula P, Ohemeng-Dapaah S, Kanter AS. Effectiveness of SMS Technology on Timely Community Health Worker Follow-Up for Childhood Malnutrition: A Retrospective Cohort Study in sub-Saharan Africa. Glob Health Sci Pract 2018; 6:345-355. [PMID: 29959274 PMCID: PMC6024632 DOI: 10.9745/ghsp-d-16-00290] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2016] [Accepted: 05/08/2018] [Indexed: 11/15/2022]
Abstract
BACKGROUND The Millennium Villages Project facilitated technology-based health interventions in rural under-resourced areas of sub-Saharan Africa. Our study examined whether data entry using SMS compared with paper forms by community health workers (CHWs) led to higher proportion of timely follow-up visits for malnutrition screening in under-5 children in Ghana, Rwanda, Senegal, and Uganda. METHODS Children under 5 years were screened for malnutrition every 90 days by CHWs using mid-upper arm circumference (MUAC) readings. CHWs used either SMS texts or paper forms to enter MUAC data. Reminder texts were sent at 15 days before follow-up was needed. Chi-square tests assessed proportion of timely follow-up visits within 90 days between SMS and paper groups. Logistic regression analysis was conducted in a step-wise multivariate model. Post-hoc power calculations were conducted to verify strength of associations. RESULTS SMS data entry was associated with a higher proportion of timely malnutrition follow-up visits compared with paper forms across all sites. The association was strongest with consistent SMS use over consecutive visits. SMS use at the first of 2 consecutive visits was most effective, highlighting the importance of SMS reminder alerts. CONCLUSIONS SMS technology with reminders increased timely CHW malnutrition screening visits for under-5 children in Ghana, Rwanda, Senegal, and Uganda, highlighting the importance of such technology for improving health worker behavior in low-resource settings.
Collapse
Affiliation(s)
- Shohinee Sarma
- Mailman School of Public Health, Columbia University, New York, NY, USA. Now with McMaster University, Hamilton, Canada.
| | - Bennett Nemser
- Millennium Villages Project, Earth Institute, Columbia University. Now with UNICEF, New York, NY, USA. Now with University of the Western Cape, Cape Town, South Africa
| | - Heather Cole-Lewis
- Yale University School of Epidemiology and Public Health, New Haven, CT, USA. Now with Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - Nadi Kaonga
- Tufts University School of Medicine, Boston, MA, USA
| | - Joel Negin
- Sydney School of Public Health, University of Sydney, Sydney, Australia
| | - Patricia Namakula
- Millennium Villages Project, Earth Institute, Columbia University. Now with Columbia Global Centers Africa, Nairobi, Kenya
| | - Seth Ohemeng-Dapaah
- Millennium Villages Project, Earth Institute, Columbia University. Now with Millennium Promise, Dakar, Senegal
| | - Andrew S Kanter
- Millennium Villages Project, Earth Institute, Columbia University, New York, NY, USA. Now with Departments of Biomedical Informatics and Epidemiology, Columbia University, New York, NY, USA
| |
Collapse
|
11
|
Pugatch J, Grenen E, Surla S, Schwarz M, Cole-Lewis H. Information Architecture of Web-Based Interventions to Improve Health Outcomes: Systematic Review. J Med Internet Res 2018; 20:e97. [PMID: 29563076 PMCID: PMC5978245 DOI: 10.2196/jmir.7867] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2017] [Revised: 12/06/2017] [Accepted: 01/03/2018] [Indexed: 11/13/2022] Open
Abstract
Background The rise in usage of and access to new technologies in recent years has led to a growth
in digital health behavior change interventions. As the shift to digital platforms
continues to grow, it is increasingly important to consider how the field of information
architecture (IA) can inform the development of digital health interventions. IA is the
way in which digital content is organized and displayed, which strongly impacts users’
ability to find and use content. While many information architecture best practices
exist, there is a lack of empirical evidence on the role it plays in influencing
behavior change and health outcomes. Objective Our aim was to conduct a systematic review synthesizing the existing literature on
website information architecture and its effect on health outcomes, behavioral outcomes,
and website engagement. Methods To identify all existing information architecture and health behavior literature, we
searched articles published in English in the following databases (no date restrictions
imposed): ACM Digital Library, CINAHL, Cochrane Library, Google Scholar, Ebsco, and
PubMed. The search terms used included information terms (eg, information architecture,
interaction design, persuasive design), behavior terms (eg, health behavior, behavioral
intervention, ehealth), and health terms (eg, smoking, physical activity, diabetes). The
search results were reviewed to determine if they met the inclusion and exclusion
criteria created to identify empirical research that studied the effect of IA on health
outcomes, behavioral outcomes, or website engagement. Articles that met inclusion
criteria were assessed for study quality. Then, data from the articles were extracted
using a priori categories established by 3 reviewers. However, the limited health
outcome data gathered from the studies precluded a meta-analysis. Results The initial literature search yielded 685 results, which was narrowed down to three
publications that examined the effect of information architecture on health outcomes,
behavioral outcomes, or website engagement. One publication studied the isolated impact
of information architecture on outcomes of interest (ie, website use and engagement;
health-related knowledge, attitudes, and beliefs; and health behaviors), while the other
two publications studied the impact of information architecture, website features (eg,
interactivity, email prompts, and forums), and tailored content on these outcomes. The
paper that investigated IA exclusively found that a tunnel IA improved site engagement
and behavior knowledge, but it decreased users’ perceived efficiency. The first study
that did not isolate IA found that the enhanced site condition improved site usage but
not the amount of content viewed. The second study that did not isolate IA found that a
tailored site condition improved site usage, behavior knowledge, and some behavior
outcomes. Conclusions No clear conclusion can be made about the relationship between IA and health outcomes,
given limited evidence in the peer-reviewed literature connecting IA to behavioral
outcomes and website engagement. Only one study reviewed solely manipulated IA, and we
therefore recommend improving the scientific evidence base such that additional
empirical studies investigate the impact of IA in isolation. Moreover, information from
the gray literature and expert opinion might be identified and added to the evidence
base, in order to lay the groundwork for hypothesis generation to improve empirical
evidence on information architecture and health and behavior outcomes.
Collapse
Affiliation(s)
| | | | | | | | - Heather Cole-Lewis
- Johnson & Johnson Health and Wellness Solutions, Inc, New Brunswick, NJ, United States
| |
Collapse
|
12
|
Bedrosian R, Hawrilenko M, Cole-Lewis H. Trajectories of Depressive Symptoms Among Web-Based Health Risk Assessment Participants. J Med Internet Res 2017; 19:e96. [PMID: 28363881 PMCID: PMC5392210 DOI: 10.2196/jmir.6480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2016] [Revised: 12/01/2016] [Accepted: 12/19/2016] [Indexed: 11/13/2022] Open
Abstract
Background Health risk assessments (HRAs), which often screen for depressive symptoms, are administered to millions of employees and health plan members each year. HRA data provide an opportunity to examine longitudinal trends in depressive symptomatology, as researchers have done previously with other populations. Objective The primary research questions were: (1) Can we observe longitudinal trajectories in HRA populations like those observed in other study samples? (2) Do HRA variables, which primarily reflect modifiable health risks, help us to identify predictors associated with these trajectories? (3) Can we make meaningful recommendations for population health management, applicable to HRA participants, based on predictors we identify? Methods This study used growth mixture modeling (GMM) to examine longitudinal trends in depressive symptomatology among 22,963 participants in a Web-based HRA used by US employers and health plans. The HRA assessed modifiable health risks and variables such as stress, sleep, and quality of life. Results Five classes were identified: A “minimal depression” class (63.91%, 14,676/22,963) whose scores were consistently low across time, a “low risk” class (19.89%, 4568/22,963) whose condition remained subthreshold, a “deteriorating” class (3.15%, 705/22,963) who began at subthreshold but approached severe depression by the end of the study, a “chronic” class (4.71%, 1081/22,963) who remained highly depressed over time, and a “remitting” class (8.42%, 1933/22,963) who had moderate depression to start, but crossed into minimal depression by the end. Among those with subthreshold symptoms, individuals who were male (P<.001) and older (P=.01) were less likely to show symptom deterioration, whereas current depression treatment (P<.001) and surprisingly, higher sleep quality (P<.001) were associated with increased probability of membership in the “deteriorating” class as compared with “low risk.” Among participants with greater symptomatology to start, those in the “severe” class tended to be younger than the “remitting” class (P<.001). Lower baseline sleep quality (P<.001), quality of life (P<.001), stress level (P<.001), and current treatment involvement (P<.001) were all predictive of membership in the “severe” class. Conclusions The trajectories identified were consistent with trends in previous research. The results identified some key predictors: we discuss those that mirror prior studies and offer some hypotheses as to why others did not. The finding that 1 in 5 HRA participants with subthreshold symptoms deteriorated to the point of clinical distress during succeeding years underscores the need to learn more about such individuals. We offer additional recommendations for follow-up research, which should be designed to reflect changes in health plan demographics and HRA delivery platforms. In addition to utilizing additional variables such as cognitive style to refine predictive models, future research could also begin to test the impact of more aggressive outreach strategies aimed at participants who are likely to deteriorate or remain significantly depressed over time.
Collapse
Affiliation(s)
- Richard Bedrosian
- Johnson & Johnson Health & Wellness Solutions, Johnson & Johnson, New Brunswick, NJ, United States
| | - Matt Hawrilenko
- Clark University, Department of Psychology, Worcester, MA, United States
| | - Heather Cole-Lewis
- Johnson & Johnson Health & Wellness Solutions, Johnson & Johnson, New Brunswick, NJ, United States
| |
Collapse
|
13
|
Augustson E, Cole-Lewis H, Sanders A, Schwarz M, Geng Y, Coa K, Hunt Y. Analysing user-reported data for enhancement of SmokefreeTXT: a national text message smoking cessation intervention. Tob Control 2016; 26:683-689. [PMID: 27852892 DOI: 10.1136/tobaccocontrol-2016-052945] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Revised: 10/12/2016] [Accepted: 10/26/2016] [Indexed: 11/03/2022]
Abstract
OBJECTIVE This observational study highlights key insights related to participant engagement and cessation among adults who voluntarily subscribed to the nationwide US-based SmokefreeTXT program, a 42-day mobile phone text message smoking cessation program. METHODS Point prevalence abstinence rates were calculated for subscribers who initiated treatment in the program (n=18 080). The primary outcomes for this study were treatment completion and point prevalence abstinence rate at the end of the 42-day treatment. Secondary outcomes were point prevalence abstinence rates at 7 days postquit, 3 months post-treatment and 6 months post-treatment, as well as response rates to point prevalence abstinence assessments. RESULTS Over half the sample completed the 42-day treatment (n=9686). The end-of-treatment point prevalence abstinence for subscribers who initiated treatment was 7.2%. Among those who completed the entire 42 days of treatment, the end-of-treatment point prevalence abstinence was 12.9%. For subscribers who completed treatment, point prevalence abstinence results varied: 7 days postquit (23.7%), 3 months post-treatment (7.3%) and 6 months post-treatment (3.7%). Response rates for abstinence assessment messages ranged from 4.36% to 34.48%. CONCLUSIONS Findings from this study illuminate the need to more deeply understand reasons for subscriber non-response and opt out and, in turn, improve program engagement and our ability to increase the likelihood for participants to stop smoking and measure long-term outcomes. Patterns of opt out for the program mirror the relapse curve generally observed for smoking cessation, thus highlighting time points at which to increase efforts to retain participants and provide additional support or incentives.
Collapse
Affiliation(s)
- Erik Augustson
- Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland, USA
| | | | | | | | | | | | - Yvonne Hunt
- Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland, USA
| |
Collapse
|
14
|
Cole-Lewis H, Perotte A, Galica K, Dreyer L, Griffith C, Schwarz M, Yun C, Patrick H, Coa K, Augustson E. Social Network Behavior and Engagement Within a Smoking Cessation Facebook Page. J Med Internet Res 2016; 18:e205. [PMID: 27485315 PMCID: PMC4987490 DOI: 10.2196/jmir.5574] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Revised: 06/23/2016] [Accepted: 06/26/2016] [Indexed: 11/13/2022] Open
Abstract
Background Social media platforms are increasingly being used to support individuals in behavior change attempts, including smoking cessation. Examining the interactions of participants in health-related social media groups can help inform our understanding of how these groups can best be leveraged to facilitate behavior change. Objective The aim of this study was to analyze patterns of participation, self-reported smoking cessation length, and interactions within the National Cancer Institutes’ Facebook community for smoking cessation support. Methods Our sample consisted of approximately 4243 individuals who interacted (eg, posted, commented) on the public Smokefree Women Facebook page during the time of data collection. In Phase 1, social network visualizations and centrality measures were used to evaluate network structure and engagement. In Phase 2, an inductive, thematic qualitative content analysis was conducted with a subsample of 500 individuals, and correlational analysis was used to determine how participant engagement was associated with self-reported session length. Results Between February 2013 and March 2014, there were 875 posts and 4088 comments from approximately 4243 participants. Social network visualizations revealed the moderator’s role in keeping the community together and distributing the most active participants. Correlation analyses suggest that engagement in the network was significantly inversely associated with cessation status (Spearman correlation coefficient = −0.14, P=.03, N=243). The content analysis of 1698 posts from 500 randomly selected participants identified the most frequent interactions in the community as providing support (43%, n=721) and announcing number of days smoke free (41%, n=689). Conclusions These findings highlight the importance of the moderator for network engagement and provide helpful insights into the patterns and types of interactions participants are engaging in. This study adds knowledge of how the social network of a smoking cessation community behaves within the confines of a Facebook group.
Collapse
|
15
|
Graham AL, Carpenter KM, Cha S, Cole S, Jacobs MA, Raskob M, Cole-Lewis H. Systematic review and meta-analysis of Internet interventions for smoking cessation among adults. Subst Abuse Rehabil 2016; 7:55-69. [PMID: 27274333 PMCID: PMC4876804 DOI: 10.2147/sar.s101660] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Background The aim of this systematic review was to determine the effectiveness of Internet interventions in promoting smoking cessation among adult tobacco users relative to other forms of intervention recommended in treatment guidelines. Methods This review followed Cochrane Collaboration guidelines for systematic reviews. Combinations of “Internet,” “web-based,” and “smoking cessation intervention” and related keywords were used in both automated and manual searches. We included randomized trials published from January 1990 through to April 2015. A modified version of the Cochrane risk of bias assessment tool was used. We calculated risk ratios (RRs) for each study. Meta-analysis was conducted using random-effects method to pool RRs. Presentation of results follows the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Results Forty randomized trials involving 98,530 participants were included. Most trials had a low risk of bias in most domains. Pooled results comparing Internet interventions to assessment-only/waitlist control were significant (RR 1.60, 95% confidence interval [CI] 1.15–2.21, I2=51.7%; four studies). Pooled results of largely static Internet interventions compared to print materials were not significant (RR 0.83, 95% CI 0.63–1.10, I2=0%; two studies), whereas comparisons of interactive Internet interventions to print materials were significant (RR 2.10, 95% CI 1.25–3.52, I2=41.6%; two studies). No significant effects were observed in pooled results of Internet interventions compared to face-to-face counseling (RR 1.35, 95% CI 0.97–1.87, I2=0%; four studies) or to telephone counseling (RR 0.95, 95% CI 0.79–1.13, I2=0%; two studies). The majority of trials compared different Internet interventions; pooled results from 15 such trials (24 comparisons) found a significant effect in favor of experimental Internet interventions (RR 1.16, 95% CI 1.03–1.31, I2=76.7%). Conclusion Internet interventions are superior to other broad reach cessation interventions (ie, print materials), equivalent to other currently recommended treatment modes (telephone and in-person counseling), and they have an important role to play in the arsenal of tobacco-dependence treatments.
Collapse
Affiliation(s)
- Amanda L Graham
- Schroeder Institute for Tobacco Research and Policy Studies, Truth Initiative, Washington, DC, USA; Department of Oncology, Georgetown University Medical Center/Cancer Prevention and Control Program, Lombardi Comprehensive Cancer Center, Washington, DC, USA
| | | | - Sarah Cha
- Schroeder Institute for Tobacco Research and Policy Studies, Truth Initiative, Washington, DC, USA
| | - Sam Cole
- Alere Wellbeing, Seattle, WA, USA
| | - Megan A Jacobs
- Schroeder Institute for Tobacco Research and Policy Studies, Truth Initiative, Washington, DC, USA
| | | | - Heather Cole-Lewis
- Johnson & Johnson Health and Wellness Solutions, Inc., New Brunswick, NJ, USA; ICF International, Rockville, MD, USA
| |
Collapse
|
16
|
Abstract
The aim of this systematic review of reviews is to identify mobile text-messaging interventions designed for health improvement and behavior change and to derive recommendations for practice. We have compiled and reviewed existing systematic research reviews and meta-analyses to organize and summarize the text-messaging intervention evidence base, identify best-practice recommendations based on findings from multiple reviews, and explore implications for future research. Our review found that the majority of published text-messaging interventions were effective when addressing diabetes self-management, weight loss, physical activity, smoking cessation, and medication adherence for antiretroviral therapy. However, we found limited evidence across the population of studies and reviews to inform recommended intervention characteristics. Although strong evidence supports the value of integrating text-messaging interventions into public health practice, additional research is needed to establish longer-term intervention effects, identify recommended intervention characteristics, and explore issues of cost-effectiveness.
Collapse
Affiliation(s)
- Amanda K Hall
- Department of Biomedical Informatics and Medical Education, University of Washington, School of Medicine, Seattle, Washington 98105;
| | | | | |
Collapse
|
17
|
Mamykina L, Heitkemper EM, Smaldone AM, Kukafka R, Cole-Lewis H, Davidson PG, Mynatt ED, Tobin JN, Cassells A, Goodman C, Hripcsak G. Structured scaffolding for reflection and problem solving in diabetes self-management: qualitative study of mobile diabetes detective. J Am Med Inform Assoc 2016; 23:129-36. [PMID: 26769910 DOI: 10.1093/jamia/ocv169] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2015] [Accepted: 10/13/2015] [Indexed: 11/13/2022] Open
Abstract
OBJECTIVE To investigate subjective experiences and patterns of engagement with a novel electronic tool for facilitating reflection and problem solving for individuals with type 2 diabetes, Mobile Diabetes Detective (MoDD). METHODS In this qualitative study, researchers conducted semi-structured interviews with individuals from economically disadvantaged communities and ethnic minorities who are participating in a randomized controlled trial of MoDD. The transcripts of the interviews were analyzed using inductive thematic analysis; usage logs were analyzed to determine how actively the study participants used MoDD. RESULTS Fifteen participants in the MoDD randomized controlled trial were recruited for the qualitative interviews. Usage log analysis showed that, on average, during the 4 weeks of the study, the study participants logged into MoDD twice per week, reported 120 blood glucose readings, and set two behavioral goals. The qualitative interviews suggested that individuals used MoDD to follow the steps of the problem-solving process, from identifying problematic blood glucose patterns, to exploring behavioral triggers contributing to these patterns, to selecting alternative behaviors, to implementing these behaviors while monitoring for improvements in glycemic control. DISCUSSION This qualitative study suggested that informatics interventions for reflection and problem solving can provide structured scaffolding for facilitating these processes by guiding users through the different steps of the problem-solving process and by providing them with context-sensitive evidence and practice-based knowledge related to diabetes self-management on each of those steps. CONCLUSION This qualitative study suggested that MoDD was perceived as a useful tool in engaging individuals in self-monitoring, reflection, and problem solving.
Collapse
Affiliation(s)
- Lena Mamykina
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | | | | | - Rita Kukafka
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - Heather Cole-Lewis
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | | | | | - Jonathan N Tobin
- The Rockefeller University Center for Clinical and Translational Science, New York, NY USA Department of Epidemiology and Public Health, Albert Einstein College of Medicine of Yeshiva University/Montefiore Medical Center, New York, NY USA
| | | | | | - George Hripcsak
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| |
Collapse
|
18
|
Cole-Lewis H, Pugatch J, Sanders A, Varghese A, Posada S, Yun C, Schwarz M, Augustson E. Social Listening: A Content Analysis of E-Cigarette Discussions on Twitter. J Med Internet Res 2015; 17:e243. [PMID: 26508089 PMCID: PMC4642379 DOI: 10.2196/jmir.4969] [Citation(s) in RCA: 94] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Revised: 09/22/2015] [Accepted: 09/23/2015] [Indexed: 11/29/2022] Open
Abstract
Background Electronic cigarette (e-cigarette) use has increased in the United States, leading to active debate in the public health sphere regarding e-cigarette use and regulation. To better understand trends in e-cigarette attitudes and behaviors, public health and communication professionals can turn to the dialogue taking place on popular social media platforms such as Twitter. Objective The objective of this study was to conduct a content analysis to identify key conversation trends and patterns over time using historical Twitter data. Methods A 5-category content analysis was conducted on a random sample of tweets chosen from all publicly available tweets sent between May 1, 2013, and April 30, 2014, that matched strategic keywords related to e-cigarettes. Relevant tweets were isolated from the random sample of approximately 10,000 tweets and classified according to sentiment, user description, genre, and theme. Descriptive analyses including univariate and bivariate associations, as well as correlation analyses were performed on all categories in order to identify patterns and trends. Results The analysis revealed an increase in e-cigarette–related tweets from May 2013 through April 2014, with tweets generally being positive; 71% of the sample tweets were classified as having a positive sentiment. The top two user categories were everyday people (65%) and individuals who are part of the e-cigarette community movement (16%). These two user groups were responsible for a majority of informational (79%) and news tweets (75%), compared to reputable news sources and foundations or organizations, which combined provided 5% of informational tweets and 12% of news tweets. Personal opinion (28%), marketing (21%), and first person e-cigarette use or intent (20%) were the three most common genres of tweets, which tended to have a positive sentiment. Marketing was the most common theme (26%), and policy and government was the second most common theme (20%), with 86% of these tweets coming from everyday people and the e-cigarette community movement combined, compared to 5% of policy and government tweets coming from government, reputable news sources, and foundations or organizations combined. Conclusions Everyday people and the e-cigarette community are dominant forces across several genres and themes, warranting continued monitoring to understand trends and their implications regarding public opinion, e-cigarette use, and smoking cessation. Analyzing social media trends is a meaningful way to inform public health practitioners of current sentiments regarding e-cigarettes, and this study contributes a replicable methodology.
Collapse
|
19
|
Cole-Lewis H, Varghese A, Sanders A, Schwarz M, Pugatch J, Augustson E. Assessing Electronic Cigarette-Related Tweets for Sentiment and Content Using Supervised Machine Learning. J Med Internet Res 2015; 17:e208. [PMID: 26307512 PMCID: PMC4642404 DOI: 10.2196/jmir.4392] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2015] [Revised: 05/21/2015] [Accepted: 06/12/2015] [Indexed: 11/23/2022] Open
Abstract
Background Electronic cigarettes (e-cigarettes) continue to be a growing topic among social media users, especially on Twitter. The ability to analyze conversations about e-cigarettes in real-time can provide important insight into trends in the public’s knowledge, attitudes, and beliefs surrounding e-cigarettes, and subsequently guide public health interventions. Objective Our aim was to establish a supervised machine learning algorithm to build predictive classification models that assess Twitter data for a range of factors related to e-cigarettes. Methods Manual content analysis was conducted for 17,098 tweets. These tweets were coded for five categories: e-cigarette relevance, sentiment, user description, genre, and theme. Machine learning classification models were then built for each of these five categories, and word groupings (n-grams) were used to define the feature space for each classifier. Results Predictive performance scores for classification models indicated that the models correctly labeled the tweets with the appropriate variables between 68.40% and 99.34% of the time, and the percentage of maximum possible improvement over a random baseline that was achieved by the classification models ranged from 41.59% to 80.62%. Classifiers with the highest performance scores that also achieved the highest percentage of the maximum possible improvement over a random baseline were Policy/Government (performance: 0.94; % improvement: 80.62%), Relevance (performance: 0.94; % improvement: 75.26%), Ad or Promotion (performance: 0.89; % improvement: 72.69%), and Marketing (performance: 0.91; % improvement: 72.56%). The most appropriate word-grouping unit (n-gram) was 1 for the majority of classifiers. Performance continued to marginally increase with the size of the training dataset of manually annotated data, but eventually leveled off. Even at low dataset sizes of 4000 observations, performance characteristics were fairly sound. Conclusions Social media outlets like Twitter can uncover real-time snapshots of personal sentiment, knowledge, attitudes, and behavior that are not as accessible, at this scale, through any other offline platform. Using the vast data available through social media presents an opportunity for social science and public health methodologies to utilize computational methodologies to enhance and extend research and practice. This study was successful in automating a complex five-category manual content analysis of e-cigarette-related content on Twitter using machine learning techniques. The study details machine learning model specifications that provided the best accuracy for data related to e-cigarettes, as well as a replicable methodology to allow extension of these methods to additional topics.
Collapse
|
20
|
Sipsma HL, Jones KL, Cole-Lewis H. Breastfeeding among adolescent mothers: a systematic review of interventions from high-income countries. J Hum Lact 2015; 31:221-9; quiz 321-2. [PMID: 25480018 DOI: 10.1177/0890334414561264] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Accepted: 11/03/2014] [Indexed: 11/17/2022]
Abstract
Despite growing evidence of the benefits of breastfeeding, rates of breastfeeding remain disproportionately low among adolescent mothers compared with older mothers in the United States. Current interventions primarily target adult women, and little evidence is available for breastfeeding promotion among young women. Accordingly, we aim to review interventions designed to improve breastfeeding rates among adolescents to make recommendations for future research and practice. We searched MEDLINE and PsycINFO for articles that evaluated interventions aiming to improve rates of breastfeeding initiation, duration, or exclusivity among adolescents. Inclusion criteria included interventions targeting pregnant or postpartum adolescents (mean/median age <22 years) that were conducted in high-income settings. Six interventions met our inclusion criteria; of these, 4 interventions aimed to increase breastfeeding initiation, 5 aimed to increase breastfeeding duration, and 4 aimed to increase breastfeeding exclusivity. Interventions included school-based programs, home visits, and telephone support that were implemented by a combination of peer counselors, nurse clinicians, doulas, and lactation consultants. Only 1 intervention, a combination of education and counseling provided by a lactation consultant-peer counselor team, significantly improved both breastfeeding initiation and duration. Other results were mixed, and studies were subject to several methodological limitations. We recommend that more interventions should be developed and evaluated. In addition, interventions should be less resource intensive, be more theoretically driven, and specifically include mothers and partners of adolescents to successfully promote breastfeeding among adolescent mothers.
Collapse
Affiliation(s)
- Heather L Sipsma
- Department of Women, Children and Family Health Science, University of Illinois at Chicago College of Nursing, Chicago, IL, USA
| | - Krista L Jones
- Department of Health Systems Sciences, University of Illinois at Chicago College of Nursing, Chicago, IL, USA
| | - Heather Cole-Lewis
- Department of Biomedical Informatics, Columbia University Medical Center, New York, NY, USA ICF International, Rockville, MD, USA
| |
Collapse
|
21
|
Mechael P, Nemser B, Cosmaciuc R, Cole-Lewis H, Ohemeng-Dapaah S, Dusabe S, Kaonga NN, Namakula P, Shemsanga M, Burbach R, Kanter AS. Capitalizing on the characteristics of mHealth to evaluate its impact. J Health Commun 2012; 17 Suppl 1:62-66. [PMID: 22548600 DOI: 10.1080/10810730.2012.679847] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The field of mHealth has made significant advances in a short period of time, demanding a more thorough and scientific approach to understanding and evaluating its progress. A recent review of mHealth literature identified two primary research needs in order for mHealth to strengthen health systems and promote healthy behaviors, namely health outcomes and cost-benefits (Mechael et al., 2010 ). In direct response to the gaps identified in mHealth research, the aim of this paper is to present the study design and highlight key observations and next steps from an evaluation of the mHealth activities within the electronic health (eHealth) architecture implemented by the Millennium Villages Project (MVP) by leveraging data generated through mobile technology itself alongside complementary qualitative research and costing assessments. The study, funded by the International Development and Research Centre (IDRC) as part of the Open Architecture Standards and Information Systems research project (OASIS II) (Sinha, 2009 ), is being implemented on data generated by 14 MVP sites in 10 Sub-Saharan African countries including more in-depth research in Ghana, Rwanda, Tanzania, and Uganda. Specific components of the study include rigorous quantitative case-control analyses and other epidemiological approaches (such as survival analysis) supplemented by in-depth qualitative interviews spread out over 18 months, as well as a costing study to assess the impact of mHealth on health outcomes, service delivery, and efficiency.
Collapse
|
22
|
Webster TR, Mantopoulos J, Jackson E, Cole-Lewis H, Kidane L, Kebede S, Abebe Y, Lawson R, Bradley EH. A brief questionnaire for assessing patient healthcare experiences in low-income settings. Int J Qual Health Care 2011; 23:258-68. [DOI: 10.1093/intqhc/mzr019] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
23
|
Abstract
Mobile phone text messaging is a potentially powerful tool for behavior change because it is widely available, inexpensive, and instant. This systematic review provides an overview of behavior change interventions for disease management and prevention delivered through text messaging. Evidence on behavior change and clinical outcomes was compiled from randomized or quasi-experimental controlled trials of text message interventions published in peer-reviewed journals by June 2009. Only those interventions using text message as the primary mode of communication were included. Study quality was assessed by using a standardized measure. Seventeen articles representing 12 studies (5 disease prevention and 7 disease management) were included. Intervention length ranged from 3 months to 12 months, none had long-term follow-up, and message frequency varied. Of 9 sufficiently powered studies, 8 found evidence to support text messaging as a tool for behavior change. Effects exist across age, minority status, and nationality. Nine countries are represented in this review, but it is problematic that only one is a developing country, given potential benefits of such a widely accessible, relatively inexpensive tool for health behavior change. Methodological issues and gaps in the literature are highlighted, and recommendations for future studies are provided.
Collapse
Affiliation(s)
- Heather Cole-Lewis
- Yale University School of Epidemiology and Public Health, PO Box 208034, New Haven, CT 06520-8034, USA.
| | | |
Collapse
|
24
|
Abstract
OBJECTIVES Strong evidence exists to support an intergenerational cycle of adolescent fatherhood, yet such a cycle has not been studied. We examined whether paternal adolescent fatherhood (i.e., father of study participant was age 19 years or younger when his first child was born) and other factors derived from the ecological systems theory predicted participant adolescent fatherhood. METHODS Data included 1496 young males who were interviewed annually from the National Longitudinal Survey of Youth 1997. Cox regression survival analysis was used to determine the effect of paternal adolescent fatherhood on participant adolescent fatherhood. RESULTS Sons of adolescent fathers were 1.8 times more likely to become adolescent fathers than were sons of older fathers, after other risk factors were accounted for. Additionally, factors from each ecological domain-individual (delinquency), family (maternal education), peer (early adolescent dating), and environment (race/ethnicity, physical risk environment)-were independent predictors of adolescent fatherhood. CONCLUSIONS These findings support the need for pregnancy prevention interventions specifically designed for young males who may be at high risk for continuing this cycle. Interventions that address multiple levels of risk will likely be most successful at reducing pregnancies among partners of young men.
Collapse
Affiliation(s)
- Heather Sipsma
- Yale School of Public Health, New Haven, CT 06520-8034, USA
| | | | | | | |
Collapse
|
25
|
Mmbando BP, Cole-Lewis H, Sembuche S, Kamugisha ML, Theander T, Lusingu JPA, Lemnge MM. Risk factors for low birth-weight in areas with varying malaria transmission in Korogwe, Tanzania: implications for malaria control. ACTA ACUST UNITED AC 2009; 10:137-43. [PMID: 19024338 DOI: 10.4314/thrb.v10i3.14353] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Low birth weight (LBW) is a risk factor for infant mortality, morbidity, growth retardation, poor cognitive development, and chronic diseases. Maternal exposure to diseases such as malaria, HIV, and syphilis has been shown to have a significant impact on birth weight (BW). This study was aimed at determining whether there was a difference in rates of LBW in areas of varying malaria transmission intensity in Korogwe, Tanzania. Retrospective data for one year (June 2004-May 2005) in three maternal and child health (MCH) clinics in the district were analysed. Villages were stratified into three strata: lowlands-semi urban (average altitude of 320m), lowlands-rural (below 600m) and highlands (> or =600m). There was a significant decreasing trend of rate of LBW from rural lowlands to highlands (chi2trend = 7.335, P=0.007). Adjusting for covariates, women in parity-two were at reduced risk of delivering LBW babies compared to first parity women (OR=0.44, 95% CI 0.19-0.98, P=0.045). Similarly, the risk of LBW was higher in women who had delayed MCH gestational booking and in women who conceived during high malaria transmission seasons. There was high degree of preference of digits ending with 0/5 in reporting BW in the studied MCHs. In conclusion, a rate of LWB was high in rural lowlands where malaria is also endemic, and was associated with high malaria transmission seasons.
Collapse
Affiliation(s)
- B P Mmbando
- National Institute for Medical Research, Tanga Medical Research Centre, Tanzania.
| | | | | | | | | | | | | |
Collapse
|