1
|
Cho H, Froelicher D, Chen J, Edupalli M, Pyrgelis A, Troncoso-Pastoriza JR, Hubaux JP, Berger B. Secure and federated genome-wide association studies for biobank-scale datasets. Nat Genet 2025; 57:809-814. [PMID: 39994472 PMCID: PMC11985345 DOI: 10.1038/s41588-025-02109-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 01/28/2025] [Indexed: 02/26/2025]
Abstract
Sharing data across institutions for genome-wide association studies (GWAS) would enhance the discovery of genetic variation linked to health and disease1,2. However, existing data-sharing regulations limit the scope of such collaborations3. Although cryptographic tools for secure computation promise to enable collaborative analysis with formal privacy guarantees, existing approaches either are computationally impractical or do not implement current state-of-the-art methods4-6. We introduce secure federated genome-wide association studies (SF-GWAS), a combination of secure computation frameworks and distributed algorithms that empowers efficient and accurate GWAS on private data held by multiple entities while ensuring data confidentiality. SF-GWAS supports widely used GWAS pipelines based on principal-component analysis or linear mixed models. We demonstrate the accuracy and practical runtimes of SF-GWAS on five datasets, including a UK Biobank cohort of 410,000 individuals, showcasing an order-of-magnitude improvement in runtime compared to previous methods. Our work enables secure collaborative genomic studies at unprecedented scale.
Collapse
Affiliation(s)
- Hyunghoon Cho
- Department of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, CT, USA.
- Department of Computer Science, Yale University, New Haven, CT, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - David Froelicher
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Computer Science and AI Laboratory, MIT, Cambridge, MA, USA
| | - Jeffrey Chen
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Computer Science and AI Laboratory, MIT, Cambridge, MA, USA
| | | | - Apostolos Pyrgelis
- School of Computer and Communication Sciences, EPFL, Lausanne, Switzerland
| | | | - Jean-Pierre Hubaux
- School of Computer and Communication Sciences, EPFL, Lausanne, Switzerland.
- Tune Insight SA, Lausanne, Switzerland.
| | - Bonnie Berger
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Computer Science and AI Laboratory, MIT, Cambridge, MA, USA.
- Department of Mathematics, MIT, Cambridge, MA, USA.
| |
Collapse
|
2
|
Zainal NH, Liu X, Leong U, Yan X, Chakraborty B. Bridging Innovation and Equity: Advancing Public Health Through Just-in-Time Adaptive Interventions. Annu Rev Public Health 2025; 46:43-68. [PMID: 39656954 DOI: 10.1146/annurev-publhealth-071723-103909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2024]
Abstract
This review explores the transformative potential of just-in-time adaptive interventions (JITAIs) as a scalable solution for addressing health disparities in underserved populations. JITAIs, delivered via mobile health technologies, could provide personalized, context-aware interventions based on real-time data to address public health challenges such as addiction treatment, chronic disease management, and mental health support. JITAIs can dynamically adjust intervention strategies, enhancing accessibility and engagement for marginalized communities. We highlight the utility of JITAIs in reducing opportunity costs associated with traditional in-person health interventions. Examples from various health domains demonstrate the adaptability of JITAIs in tailoring interventions to meet diverse needs. The review also emphasizes the need for community involvement, robust evaluation frameworks, and ethical considerations in implementing JITAIs, particularly in low- and middle-income countries. Sustainable funding models and technological innovations are necessary to ensure equitable access and effectively scale these interventions. By bridging the gap between research and practice, JITAIs could improve health outcomes and reduce disparities in vulnerable populations.
Collapse
Affiliation(s)
- Nur Hani Zainal
- Department of Psychology, National University of Singapore, Singapore
| | - Xueqing Liu
- Centre for Quantitative Medicine, Duke-NUS Medical School, National University of Singapore, Singapore;
| | - Utek Leong
- Department of Psychology, National University of Singapore, Singapore
| | - Xiaoxi Yan
- Centre for Quantitative Medicine, Duke-NUS Medical School, National University of Singapore, Singapore;
| | - Bibhas Chakraborty
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA
- Department of Statistics and Data Science, National University of Singapore, Singapore
- Programme in Health Services and Systems Research, Duke-NUS Medical School, National University of Singapore, Singapore
- Centre for Quantitative Medicine, Duke-NUS Medical School, National University of Singapore, Singapore;
| |
Collapse
|
3
|
Barnes C, Aboy MR, Minssen T, Allen JW, Earp BD, Savulescu J, Mann SP. Enabling Demonstrated Consent for Biobanking with Blockchain and Generative AI. THE AMERICAN JOURNAL OF BIOETHICS : AJOB 2025; 25:96-111. [PMID: 39499856 PMCID: PMC12005476 DOI: 10.1080/15265161.2024.2416117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2024]
Abstract
Participation in research is supposed to be voluntary and informed. Yet it is difficult to ensure people are adequately informed about the potential uses of their biological materials when they donate samples for future research. We propose a novel consent framework which we call "demonstrated consent" that leverages blockchain technology and generative AI to address this problem. In a demonstrated consent model, each donated sample is associated with a unique non-fungible token (NFT) on a blockchain, which records in its metadata information about the planned and past uses of the sample in research, and is updated with each use of the sample. This information is accessible to a large language model (LLM) customized to present this information in an understandable and interactive manner. Thus, our model uses blockchain and generative AI technologies to track, make available, and explain information regarding planned and past uses of donated samples.
Collapse
Affiliation(s)
| | | | | | | | - Brian D. Earp
- University of Oxford
- National University of Singapore
| | | | | |
Collapse
|
4
|
Choi YA, Kim Y, Miao P, Lappalainen T, Gürsoy G. Secure and federated quantitative trait loci mapping with privateQTL. CELL GENOMICS 2025; 5:100769. [PMID: 39947138 PMCID: PMC11872535 DOI: 10.1016/j.xgen.2025.100769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Revised: 12/04/2024] [Accepted: 01/15/2025] [Indexed: 03/05/2025]
Abstract
Understanding the relationship between genotypes and phenotypes is crucial for advancing personalized medicine. Expression quantitative trait loci (eQTL) mapping plays a significant role by correlating genetic variants to gene expression levels. Despite the progress made by large-scale projects, eQTL mapping still faces challenges in statistical power and privacy concerns. Multi-site studies can increase sample sizes but are hindered by privacy issues. We present privateQTL, a novel framework leveraging secure multi-party computation for secure and federated eQTL mapping. When tested in a real-world scenario with data from different studies, privateQTL outperformed meta-analysis by accurately correcting for covariates and batch effect and retaining higher accuracy and precision for both eGene-eVariant mapping and effect size estimation. In addition, privateQTL is modular and scalable, making it adaptable for other molecular phenotypes and large-scale studies. Our results indicate that privateQTL is a practical solution for privacy-preserving collaborative eQTL mapping.
Collapse
Affiliation(s)
- Yoolim Annie Choi
- Columbia University, Department of Biomedical Informatics, New York, NY, USA; New York Genome Center, New York, NY, USA
| | - Yebin Kim
- New York Genome Center, New York, NY, USA
| | - Peihan Miao
- Brown University, Department of Computer Science, Providence, RI, USA
| | - Tuuli Lappalainen
- New York Genome Center, New York, NY, USA; Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Solna, Sweden
| | - Gamze Gürsoy
- Columbia University, Department of Biomedical Informatics, New York, NY, USA; New York Genome Center, New York, NY, USA; Department of Computer Science, Columbia University, New York, NY, USA.
| |
Collapse
|
5
|
Martinson AK, Chin AT, Butte MJ, Rider NL. Artificial Intelligence and Machine Learning for Inborn Errors of Immunity: Current State and Future Promise. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. IN PRACTICE 2024; 12:2695-2704. [PMID: 39127104 DOI: 10.1016/j.jaip.2024.08.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 07/10/2024] [Accepted: 08/01/2024] [Indexed: 08/12/2024]
Abstract
Artificial intelligence (AI) and machine learning (ML) research within medicine has exponentially increased over the last decade, with studies showcasing the potential of AI/ML algorithms to improve clinical practice and outcomes. Ongoing research and efforts to develop AI-based models have expanded to aid in the identification of inborn errors of immunity (IEI). The use of larger electronic health record data sets, coupled with advances in phenotyping precision and enhancements in ML techniques, has the potential to significantly improve the early recognition of IEI, thereby increasing access to equitable care. In this review, we provide a comprehensive examination of AI/ML for IEI, covering the spectrum from data preprocessing for AI/ML analysis to current applications within immunology, and address the challenges associated with implementing clinical decision support systems to refine the diagnosis and management of IEI.
Collapse
Affiliation(s)
| | - Aaron T Chin
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California, Los Angeles, Los Angeles, Calif
| | - Manish J Butte
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California, Los Angeles, Los Angeles, Calif
| | - Nicholas L Rider
- Department of Health Systems & Implementation Science, Virginia Tech Carilion School of Medicine, Roanoke, Va; Department of Medicine, Division of Allergy-Immunology, Carilion Clinic, Roanoke, Va.
| |
Collapse
|
6
|
Schalk D, Rehms R, Hoffmann VS, Bischl B, Mansmann U. Distributed non-disclosive validation of predictive models by a modified ROC-GLM. BMC Med Res Methodol 2024; 24:190. [PMID: 39210301 PMCID: PMC11363434 DOI: 10.1186/s12874-024-02312-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Accepted: 08/20/2024] [Indexed: 09/04/2024] Open
Abstract
BACKGROUND Distributed statistical analyses provide a promising approach for privacy protection when analyzing data distributed over several databases. Instead of directly operating on data, the analyst receives anonymous summary statistics, which are combined into an aggregated result. Further, in discrimination model (prognosis, diagnosis, etc.) development, it is key to evaluate a trained model w.r.t. to its prognostic or predictive performance on new independent data. For binary classification, quantifying discrimination uses the receiver operating characteristics (ROC) and its area under the curve (AUC) as aggregation measure. We are interested to calculate both as well as basic indicators of calibration-in-the-large for a binary classification task using a distributed and privacy-preserving approach. METHODS We employ DataSHIELD as the technology to carry out distributed analyses, and we use a newly developed algorithm to validate the prediction score by conducting distributed and privacy-preserving ROC analysis. Calibration curves are constructed from mean values over sites. The determination of ROC and its AUC is based on a generalized linear model (GLM) approximation of the true ROC curve, the ROC-GLM, as well as on ideas of differential privacy (DP). DP adds noise (quantified by the ℓ 2 sensitivityΔ 2 ( f ^ ) ) to the data and enables a global handling of placement numbers. The impact of DP parameters was studied by simulations. RESULTS In our simulation scenario, the true and distributed AUC measures differ by Δ AUC < 0.01 depending heavily on the choice of the differential privacy parameters. It is recommended to check the accuracy of the distributed AUC estimator in specific simulation scenarios along with a reasonable choice of DP parameters. Here, the accuracy of the distributed AUC estimator may be impaired by too much artificial noise added from DP. CONCLUSIONS The applicability of our algorithms depends on the ℓ 2 sensitivityΔ 2 ( f ^ ) of the underlying statistical/predictive model. The simulations carried out have shown that the approximation error is acceptable for the majority of simulated cases. For models with highΔ 2 ( f ^ ) , the privacy parameters must be set accordingly higher to ensure sufficient privacy protection, which affects the approximation error. This work shows that complex measures, as the AUC, are applicable for validation in distributed setups while preserving an individual's privacy.
Collapse
Affiliation(s)
- Daniel Schalk
- Department of Statistics, LMU Munich, Munich, Germany
- DIFUTURE (DataIntegration for Future Medicine, www.difuture.de), LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), LMU Munich, Munich, Germany
| | - Raphael Rehms
- Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Munich, Germany
| | - Verena S Hoffmann
- Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Munich, Germany
| | - Bernd Bischl
- Department of Statistics, LMU Munich, Munich, Germany
- Munich Center for Machine Learning (MCML), LMU Munich, Munich, Germany
| | - Ulrich Mansmann
- Department of Statistics, LMU Munich, Munich, Germany.
- Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Munich, Germany.
- DIFUTURE (DataIntegration for Future Medicine, www.difuture.de), LMU Munich, Munich, Germany.
| |
Collapse
|
7
|
Cho H, Froelicher D, Dokmai N, Nandi A, Sadhuka S, Hong MM, Berger B. Privacy-Enhancing Technologies in Biomedical Data Science. Annu Rev Biomed Data Sci 2024; 7:317-343. [PMID: 39178425 PMCID: PMC11346580 DOI: 10.1146/annurev-biodatasci-120423-120107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/25/2024]
Abstract
The rapidly growing scale and variety of biomedical data repositories raise important privacy concerns. Conventional frameworks for collecting and sharing human subject data offer limited privacy protection, often necessitating the creation of data silos. Privacy-enhancing technologies (PETs) promise to safeguard these data and broaden their usage by providing means to share and analyze sensitive data while protecting privacy. Here, we review prominent PETs and illustrate their role in advancing biomedicine. We describe key use cases of PETs and their latest technical advances and highlight recent applications of PETs in a range of biomedical domains. We conclude by discussing outstanding challenges and social considerations that need to be addressed to facilitate a broader adoption of PETs in biomedical data science.
Collapse
Affiliation(s)
- Hyunghoon Cho
- Department of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, Connecticut, USA;
| | - David Froelicher
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - Natnatee Dokmai
- Department of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, Connecticut, USA;
| | - Anupama Nandi
- Department of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, Connecticut, USA;
| | - Shuvom Sadhuka
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - Matthew M Hong
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - Bonnie Berger
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
- Department of Mathematics, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| |
Collapse
|
8
|
Wirth FN, Abu Attieh H, Prasser F. OHDSI-compliance: a set of document templates facilitating the implementation and operation of a software stack for real-world evidence generation. Front Med (Lausanne) 2024; 11:1378866. [PMID: 38818399 PMCID: PMC11137233 DOI: 10.3389/fmed.2024.1378866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 05/02/2024] [Indexed: 06/01/2024] Open
Abstract
Introduction The open-source software offered by the Observational Health Data Science and Informatics (OHDSI) collective, including the OMOP-CDM, serves as a major backbone for many real-world evidence networks and distributed health data analytics platforms. While container technology has significantly simplified deployments from a technical perspective, regulatory compliance can remain a major hurdle for the setup and operation of such platforms. In this paper, we present OHDSI-Compliance, a comprehensive set of document templates designed to streamline the data protection and information security-related documentation and coordination efforts required to establish OHDSI installations. Methods To decide on a set of relevant document templates, we first analyzed the legal requirements and associated guidelines with a focus on the General Data Protection Regulation (GDPR). Moreover, we analyzed the software architecture of a typical OHDSI stack and related its components to the different general types of concepts and documentation identified. Then, we created those documents for a prototypical OHDSI installation, based on the so-called Broadsea package, following relevant guidelines from Germany. Finally, we generalized the documents by introducing placeholders and options at places where individual institution-specific content will be needed. Results We present four documents: (1) a record of processing activities, (2) an information security concept, (3) an authorization concept, as well as (4) an operational concept covering the technical details of maintaining the stack. The documents are publicly available under a permissive license. Discussion To the best of our knowledge, there are no other publicly available sets of documents designed to simplify the compliance process for OHDSI deployments. While our documents provide a comprehensive starting point, local specifics need to be added, and, due to the heterogeneity of legal requirements in different countries, further adoptions might be necessary.
Collapse
Affiliation(s)
| | | | - Fabian Prasser
- Berlin Institute of Health at Charité – Universitätsmedizin Berlin, Center of Health Data Science, Berlin, Germany
| |
Collapse
|
9
|
Mendelsohn S, Froelicher D, Loginov D, Bernick D, Berger B, Cho H. sfkit: a web-based toolkit for secure and federated genomic analysis. Nucleic Acids Res 2023; 51:W535-W541. [PMID: 37246709 PMCID: PMC10320181 DOI: 10.1093/nar/gkad464] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 05/03/2023] [Accepted: 05/14/2023] [Indexed: 05/30/2023] Open
Abstract
Advances in genomics are increasingly depending upon the ability to analyze large and diverse genomic data collections, which are often difficult to amass due to privacy concerns. Recent works have shown that it is possible to jointly analyze datasets held by multiple parties, while provably preserving the privacy of each party's dataset using cryptographic techniques. However, these tools have been challenging to use in practice due to the complexities of the required setup and coordination among the parties. We present sfkit, a secure and federated toolkit for collaborative genomic studies, to allow groups of collaborators to easily perform joint analyses of their datasets without compromising privacy. sfkit consists of a web server and a command-line interface, which together support a range of use cases including both auto-configured and user-supplied computational environments. sfkit provides collaborative workflows for the essential tasks of genome-wide association study (GWAS) and principal component analysis (PCA). We envision sfkit becoming a one-stop server for secure collaborative tools for a broad range of genomic analyses. sfkit is open-source and available at: https://sfkit.org.
Collapse
Affiliation(s)
| | - David Froelicher
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Computer Science and AI Laboratory, MIT, Cambridge, MA, USA
| | - Denis Loginov
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - David Bernick
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Bonnie Berger
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Computer Science and AI Laboratory, MIT, Cambridge, MA, USA
- Department of Mathematics, MIT, Cambridge, MA, USA
| | - Hyunghoon Cho
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| |
Collapse
|
10
|
Ohno-Machado L, Jiang X, Kuo TT, Tao S, Chen L, Ram PM, Zhang GQ, Xu H. A hierarchical strategy to minimize privacy risk when linking "De-identified" data in biomedical research consortia. J Biomed Inform 2023; 139:104322. [PMID: 36806328 DOI: 10.1016/j.jbi.2023.104322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 02/10/2023] [Accepted: 02/11/2023] [Indexed: 02/18/2023]
Abstract
Linking data across studies offers an opportunity to enrich data sets and provide a stronger basis for data-driven models for biomedical discovery and/or prognostication. Several techniques to link records have been proposed, and some have been implemented across data repositories holding molecular and clinical data. Not all these techniques guarantee appropriate privacy protection; there are trade-offs between (a) simple strategies that can be associated with data that will be linked and shared with any party and (b) more complex strategies that preserve the privacy of individuals across parties. We propose an intermediary, practical strategy to support linkage in studies that share de-identified data with Data Coordinating Centers. This technology can be extended to link data across multiple data hubs to support privacy preserving record linkage, considering data coordination centers and their awardees, which can be extended to a hierarchy of entities (e.g., awardees, data coordination centers, data hubs, etc.) b.
Collapse
Affiliation(s)
- Lucila Ohno-Machado
- UCSD Health Department of Biomedical Informatics, University of California San Diego Health, La Jolla, CA, USA; Biomedical Informatics & Data Science, Yale School of Medicine, New Haven, CT.
| | - Xiaoqian Jiang
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Tsung-Ting Kuo
- UCSD Health Department of Biomedical Informatics, University of California San Diego Health, La Jolla, CA, USA
| | - Shiqiang Tao
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Luyao Chen
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Pritham M Ram
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Guo-Qiang Zhang
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Hua Xu
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA; Biomedical Informatics & Data Science, Yale School of Medicine, New Haven, CT
| |
Collapse
|
11
|
Washington PY, Puniwai N, Kamaka M, Gürsoy G, Tatonetti N, Brenner SE, Wall DP. Session Introduction: TOWARDS ETHICAL BIOMEDICAL INFORMATICS: LEARNING FROM OLELO NOEAU, HAWAIIAN PROVERBS. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2023; 28:461-471. [PMID: 36541000 PMCID: PMC11095408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Innovations in human-centered biomedical informatics are often developed with the eventual goal of real-world translation. While biomedical research questions are usually answered in terms of how a method performs in a particular context, we argue that it is equally important to consider and formally evaluate the ethical implications of informatics solutions. Several new research paradigms have arisen as a result of the consideration of ethical issues, including but not limited for privacy-preserving computation and fair machine learning. In the spirit of the Pacific Symposium on Biocomputing, we discuss broad and fundamental principles of ethical biomedical informatics in terms of Olelo Noeau, or Hawaiian proverbs and poetical sayings that capture Hawaiian values. While we emphasize issues related to privacy and fairness in particular, there are a multitude of facets to ethical biomedical informatics that can benefit from a critical analysis grounded in ethics.
Collapse
Affiliation(s)
- Peter Y Washington
- Department of Information & Computer Sciences, University of Hawaii at Manoa Honolulu, HI 96822, USA,
| | | | | | | | | | | | | |
Collapse
|
12
|
The ethical and legal landscape of brain data governance. PLoS One 2022; 17:e0273473. [PMID: 36580464 PMCID: PMC9799320 DOI: 10.1371/journal.pone.0273473] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Accepted: 08/09/2022] [Indexed: 12/30/2022] Open
Abstract
Neuroscience research is producing big brain data which informs both advancements in neuroscience research and drives the development of advanced datasets to provide advanced medical solutions. These brain data are produced under different jurisdictions in different formats and are governed under different regulations. The governance of data has become essential and critical resulting in the development of various governance structures to ensure that the quality, availability, findability, accessibility, usability, and utility of data is maintained. Furthermore, data governance is influenced by various ethical and legal principles. However, it is still not clear what ethical and legal principles should be used as a standard or baseline when managing brain data due to varying practices and evolving concepts. Therefore, this study asks what ethical and legal principles shape the current brain data governance landscape? A systematic scoping review and thematic analysis of articles focused on biomedical, neuro and brain data governance was carried out to identify the ethical and legal principles which shape the current brain data governance landscape. The results revealed that there is currently a large variation of how the principles are presented and discussions around the terms are very multidimensional. Some of the principles are still at their infancy and are barely visible. A range of principles emerged during the thematic analysis providing a potential list of principles which can provide a more comprehensive framework for brain data governance and a conceptual expansion of neuroethics.
Collapse
|
13
|
Predictive Modelling in Clinical Bioinformatics: Key Concepts for Startups. BIOTECH 2022; 11:biotech11030035. [PMID: 35997343 PMCID: PMC9397027 DOI: 10.3390/biotech11030035] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 07/30/2022] [Accepted: 08/03/2022] [Indexed: 11/17/2022] Open
Abstract
Clinical bioinformatics is a newly emerging field that applies bioinformatics techniques for facilitating the identification of diseases, discovery of biomarkers, and therapy decision. Mathematical modelling is part of bioinformatics analysis pipelines and a fundamental step to extract clinical insights from genomes, transcriptomes and proteomes of patients. Often, the chosen modelling techniques relies on either statistical, machine learning or deterministic approaches. Research that combines bioinformatics with modelling techniques have been generating innovative biomedical technology, algorithms and models with biotech applications, attracting private investment to develop new business; however, startups that emerge from these technologies have been facing difficulties to implement clinical bioinformatics pipelines, protect their technology and generate profit. In this commentary, we discuss the main concepts that startups should know for enabling a successful application of predictive modelling in clinical bioinformatics. Here we will focus on key modelling concepts, provide some successful examples and briefly discuss the modelling framework choice. We also highlight some aspects to be taken into account for a successful implementation of cost-effective bioinformatics from a business perspective.
Collapse
|
14
|
Abstract
Genomics data are important for advancing biomedical research, improving clinical care, and informing other disciplines such as forensics and genealogy. However, privacy concerns arise when genomic data are shared. In particular, the identifying nature of genetic information, its direct relationship to health status, and the potential financial harm and stigmatization posed to individuals and their blood relatives call for a survey of the privacy issues related to sharing genetic and related data and potential solutions to overcome these issues. In this work, we provide an overview of the importance of genomic privacy, the information gleaned from genomics data, the sources of potential private information leakages in genomics, and ways to preserve privacy while utilizing the genetic information in research. We discuss the relationship between trust in the scientific community and protecting privacy, illuminating a future roadmap for data sharing and study participation.
Collapse
Affiliation(s)
- Gamze Gürsoy
- Department of Biomedical Informatics, Columbia University, New York, NY, USA; .,New York Genome Center, New York, NY, USA
| |
Collapse
|
15
|
Wan Z, Hazel JW, Clayton EW, Vorobeychik Y, Kantarcioglu M, Malin BA. Sociotechnical safeguards for genomic data privacy. Nat Rev Genet 2022; 23:429-445. [PMID: 35246669 PMCID: PMC8896074 DOI: 10.1038/s41576-022-00455-y] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/24/2022] [Indexed: 12/21/2022]
Abstract
Recent developments in a variety of sectors, including health care, research and the direct-to-consumer industry, have led to a dramatic increase in the amount of genomic data that are collected, used and shared. This state of affairs raises new and challenging concerns for personal privacy, both legally and technically. This Review appraises existing and emerging threats to genomic data privacy and discusses how well current legal frameworks and technical safeguards mitigate these concerns. It concludes with a discussion of remaining and emerging challenges and illustrates possible solutions that can balance protecting privacy and realizing the benefits that result from the sharing of genetic information.
Collapse
Affiliation(s)
- Zhiyu Wan
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN, USA
- Department of Computer Science, Vanderbilt University, Nashville, TN, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - James W Hazel
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN, USA
- Center for Biomedical Ethics and Society, Vanderbilt University, Nashville, TN, USA
| | - Ellen Wright Clayton
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN, USA
- Center for Biomedical Ethics and Society, Vanderbilt University, Nashville, TN, USA
- Vanderbilt University Law School, Nashville, TN, USA
| | - Yevgeniy Vorobeychik
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, MO, USA
| | - Murat Kantarcioglu
- Department of Computer Science, University of Texas at Dallas, Richardson, TX, USA
| | - Bradley A Malin
- Center for Genetic Privacy and Identity in Community Settings, Vanderbilt University Medical Center, Nashville, TN, USA.
- Department of Computer Science, Vanderbilt University, Nashville, TN, USA.
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA.
| |
Collapse
|
16
|
Distributed learning for heterogeneous clinical data with application to integrating COVID-19 data across 230 sites. NPJ Digit Med 2022; 5:76. [PMID: 35701668 PMCID: PMC9198031 DOI: 10.1038/s41746-022-00615-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 05/19/2022] [Indexed: 11/09/2022] Open
Abstract
Integrating real-world data (RWD) from several clinical sites offers great opportunities to improve estimation with a more general population compared to analyses based on a single clinical site. However, sharing patient-level data across sites is practically challenging due to concerns about maintaining patient privacy. We develop a distributed algorithm to integrate heterogeneous RWD from multiple clinical sites without sharing patient-level data. The proposed distributed conditional logistic regression (dCLR) algorithm can effectively account for between-site heterogeneity and requires only one round of communication. Our simulation study and data application with the data of 14,215 COVID-19 patients from 230 clinical sites in the UnitedHealth Group Clinical Research Database demonstrate that the proposed distributed algorithm provides an estimator that is robust to heterogeneity in event rates when efficiently integrating data from multiple clinical sites. Our algorithm is therefore a practical alternative to both meta-analysis and existing distributed algorithms for modeling heterogeneous multi-site binary outcomes.
Collapse
|
17
|
Functional genomics data: privacy risk assessment and technological mitigation. Nat Rev Genet 2022; 23:245-258. [PMID: 34759381 DOI: 10.1038/s41576-021-00428-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/18/2021] [Indexed: 12/15/2022]
Abstract
The generation of functional genomics data by next-generation sequencing has increased greatly in the past decade. Broad sharing of these data is essential for research advancement but poses notable privacy challenges, some of which are analogous to those that occur when sharing genetic variant data. However, there are also unique privacy challenges that arise from cryptic information leakage during the processing and summarization of functional genomics data from raw reads to derived quantities, such as gene expression values. Here, we review these challenges and present potential solutions for mitigating privacy risks while allowing broad data dissemination and analysis.
Collapse
|
18
|
Zugman A, Harrewijn A, Cardinale EM, Zwiebel H, Freitag GF, Werwath KE, Bas‐Hoogendam JM, Groenewold NA, Aghajani M, Hilbert K, Cardoner N, Porta‐Casteràs D, Gosnell S, Salas R, Blair KS, Blair JR, Hammoud MZ, Milad M, Burkhouse K, Phan KL, Schroeder HK, Strawn JR, Beesdo‐Baum K, Thomopoulos SI, Grabe HJ, Van der Auwera S, Wittfeld K, Nielsen JA, Buckner R, Smoller JW, Mwangi B, Soares JC, Wu M, Zunta‐Soares GB, Jackowski AP, Pan PM, Salum GA, Assaf M, Diefenbach GJ, Brambilla P, Maggioni E, Hofmann D, Straube T, Andreescu C, Berta R, Tamburo E, Price R, Manfro GG, Critchley HD, Makovac E, Mancini M, Meeten F, Ottaviani C, Agosta F, Canu E, Cividini C, Filippi M, Kostić M, Munjiza A, Filippi CA, Leibenluft E, Alberton BAV, Balderston NL, Ernst M, Grillon C, Mujica‐Parodi LR, van Nieuwenhuizen H, Fonzo GA, Paulus MP, Stein MB, Gur RE, Gur RC, Kaczkurkin AN, Larsen B, Satterthwaite TD, Harper J, Myers M, Perino MT, Yu Q, Sylvester CM, Veltman DJ, Lueken U, Van der Wee NJA, Stein DJ, Jahanshad N, Thompson PM, Pine DS, Winkler AM. Mega-analysis methods in ENIGMA: The experience of the generalized anxiety disorder working group. Hum Brain Mapp 2022; 43:255-277. [PMID: 32596977 PMCID: PMC8675407 DOI: 10.1002/hbm.25096] [Citation(s) in RCA: 54] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 05/26/2020] [Accepted: 05/31/2020] [Indexed: 12/15/2022] Open
Abstract
The ENIGMA group on Generalized Anxiety Disorder (ENIGMA-Anxiety/GAD) is part of a broader effort to investigate anxiety disorders using imaging and genetic data across multiple sites worldwide. The group is actively conducting a mega-analysis of a large number of brain structural scans. In this process, the group was confronted with many methodological challenges related to study planning and implementation, between-country transfer of subject-level data, quality control of a considerable amount of imaging data, and choices related to statistical methods and efficient use of resources. This report summarizes the background information and rationale for the various methodological decisions, as well as the approach taken to implement them. The goal is to document the approach and help guide other research groups working with large brain imaging data sets as they develop their own analytic pipelines for mega-analyses.
Collapse
Affiliation(s)
- André Zugman
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Anita Harrewijn
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Elise M. Cardinale
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Hannah Zwiebel
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Gabrielle F. Freitag
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Katy E. Werwath
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Janna M. Bas‐Hoogendam
- Leiden University Medical Center, Department of PsychiatryLeidenThe Netherlands
- Leiden Institute for Brain and Cognition (LIBC)LeidenThe Netherlands
- Leiden University, Institute of Psychology, Developmental and Educational PsychologyLeidenThe Netherlands
| | - Nynke A. Groenewold
- Department of Psychiatry & Neuroscience InstituteUniversity of Cape TownCape TownSouth Africa
| | - Moji Aghajani
- Department. of PsychiatryAmsterdam UMC/VUMCAmsterdamThe Netherlands
- GGZ InGeestDepartment of Research & InnovationAmsterdamThe Netherlands
| | - Kevin Hilbert
- Department of PsychologyHumboldt‐Universität zu BerlinBerlinGermany
| | - Narcis Cardoner
- Department of Mental HealthUniversity Hospital Parc Taulí‐I3PTBarcelonaSpain
- Department of Psychiatry and Forensic MedicineUniversitat Autònoma de BarcelonaBarcelonaSpain
- Centro de Investigación Biomédica en Red de Salud MentalCarlos III Health InstituteMadridSpain
| | - Daniel Porta‐Casteràs
- Department of Mental HealthUniversity Hospital Parc Taulí‐I3PTBarcelonaSpain
- Department of Psychiatry and Forensic MedicineUniversitat Autònoma de BarcelonaBarcelonaSpain
- Centro de Investigación Biomédica en Red de Salud MentalCarlos III Health InstituteMadridSpain
| | - Savannah Gosnell
- Menninger Department of Psychiatry and Behavioral SciencesBaylor College of MedicineHoustonTexasUSA
| | - Ramiro Salas
- Menninger Department of Psychiatry and Behavioral SciencesBaylor College of MedicineHoustonTexasUSA
| | - Karina S. Blair
- Center for Neurobehavioral ResearchBoys Town National Research HospitalBoys TownNebraskaUSA
| | - James R. Blair
- Center for Neurobehavioral ResearchBoys Town National Research HospitalBoys TownNebraskaUSA
| | - Mira Z. Hammoud
- Department of PsychiatryNew York UniversityNew YorkNew YorkUSA
| | - Mohammed Milad
- Department of PsychiatryNew York UniversityNew YorkNew YorkUSA
| | - Katie Burkhouse
- Department of PsychiatryUniversity of Illinois at ChicagoChicagoIllinoisUSA
| | - K. Luan Phan
- Department of Psychiatry and Behavioral HealthThe Ohio State UniversityColumbusOhioUSA
| | - Heidi K. Schroeder
- Department of Psychiatry & Behavioral NeuroscienceUniversity of CincinnatiCincinnatiOhioUSA
| | - Jeffrey R. Strawn
- Department of Psychiatry & Behavioral NeuroscienceUniversity of CincinnatiCincinnatiOhioUSA
| | - Katja Beesdo‐Baum
- Behavioral EpidemiologyInstitute of Clinical Psychology and Psychotherapy, Technische Universität DresdenDresdenGermany
| | - Sophia I. Thomopoulos
- Imaging Genetics Center, Mark and Mary Stevens Neuroimaging and Informatics Institute, University of Southern CaliforniaMarina del ReyCaliforniaUSA
| | - Hans J. Grabe
- Department of Psychiatry and PsychotherapyUniversity Medicine GreifswaldGreifswaldGermany
- German Center for Neurodegenerative Diseases (DZNE)Site Rostock/GreifswaldGreifswaldGermany
| | - Sandra Van der Auwera
- Department of Psychiatry and PsychotherapyUniversity Medicine GreifswaldGreifswaldGermany
- German Center for Neurodegenerative Diseases (DZNE)Site Rostock/GreifswaldGreifswaldGermany
| | - Katharina Wittfeld
- Department of Psychiatry and PsychotherapyUniversity Medicine GreifswaldGreifswaldGermany
- German Center for Neurodegenerative Diseases (DZNE)Site Rostock/GreifswaldGreifswaldGermany
| | - Jared A. Nielsen
- Department of PsychologyHarvard UniversityCambridgeMassachusettsUSA
- Center for Brain ScienceHarvard UniversityCambridgeMassachusettsUSA
| | - Randy Buckner
- Department of PsychologyHarvard UniversityCambridgeMassachusettsUSA
- Center for Brain ScienceHarvard UniversityCambridgeMassachusettsUSA
- Department of PsychiatryMassachusetts General HospitalBostonMassachusettsUSA
| | - Jordan W. Smoller
- Department of PsychiatryMassachusetts General HospitalBostonMassachusettsUSA
| | - Benson Mwangi
- Center Of Excellence On Mood Disorders, Department of Psychiatry and Behavioral SciencesThe University of Texas Health Science Center at HoustonHoustonTexasUSA
| | - Jair C. Soares
- Center Of Excellence On Mood Disorders, Department of Psychiatry and Behavioral SciencesThe University of Texas Health Science Center at HoustonHoustonTexasUSA
| | - Mon‐Ju Wu
- Center Of Excellence On Mood Disorders, Department of Psychiatry and Behavioral SciencesThe University of Texas Health Science Center at HoustonHoustonTexasUSA
| | - Giovana B. Zunta‐Soares
- Center Of Excellence On Mood Disorders, Department of Psychiatry and Behavioral SciencesThe University of Texas Health Science Center at HoustonHoustonTexasUSA
| | - Andrea P. Jackowski
- LiNC, Department of PsychiatryFederal University of São PauloSão PauloSão PauloBrazil
| | - Pedro M. Pan
- LiNC, Department of PsychiatryFederal University of São PauloSão PauloSão PauloBrazil
| | - Giovanni A. Salum
- Section on Negative Affect and Social Processes, Hospital de Clínicas de Porto Alegre, Universidade Federal do Rio Grande do SulPorto AlegreRio Grande do SulBrazil
| | - Michal Assaf
- Olin Neuropsychiatry Research CenterInstitute of Living, Hartford HospitalHartfordConnecticutUSA
- Department of PsychiatryYale School of MedicineNew HavenConnecticutUSA
| | - Gretchen J. Diefenbach
- Anxiety Disorders CenterInstitute of Living, Hartford HospitalHartfordConnecticutUSA
- Yale School of MedicineNew HavenConnecticutUSA
| | - Paolo Brambilla
- Department of Neurosciences and Mental HealthFondazione IRCCS Ca' Granda Ospedale Maggiore PoliclinicoMilanItaly
| | - Eleonora Maggioni
- Department of Neurosciences and Mental HealthFondazione IRCCS Ca' Granda Ospedale Maggiore PoliclinicoMilanItaly
| | - David Hofmann
- Institute of Medical Psychology and Systems Neuroscience, University of MuensterMuensterGermany
| | - Thomas Straube
- Institute of Medical Psychology and Systems Neuroscience, University of MuensterMuensterGermany
| | - Carmen Andreescu
- Department of PsychiatryUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Rachel Berta
- Department of PsychiatryUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Erica Tamburo
- Department of PsychiatryUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Rebecca Price
- Department of Psychiatry & PsychologyUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Gisele G. Manfro
- Anxiety Disorder ProgramHospital de Clínicas de Porto AlegrePorto AlegreRio Grande do SulBrazil
- Department of PsychiatryFederal University of Rio Grande do SulPorto AlegreRio Grande do SulBrazil
| | - Hugo D. Critchley
- Department of NeuroscienceBrighton and Sussex Medical School, University of SussexBrightonUK
| | - Elena Makovac
- Centre for Neuroimaging ScienceKings College LondonLondonUK
| | - Matteo Mancini
- Department of NeuroscienceBrighton and Sussex Medical School, University of SussexBrightonUK
| | | | | | - Federica Agosta
- Neuroimaging Research Unit, Institute of Experimental Neurology, Division of NeuroscienceIRCCS San Raffaele Scientific InstituteMilanItaly
- Vita‐Salute San Raffaele UniversityMilanItaly
| | - Elisa Canu
- Neuroimaging Research Unit, Institute of Experimental Neurology, Division of NeuroscienceIRCCS San Raffaele Scientific InstituteMilanItaly
| | - Camilla Cividini
- Neuroimaging Research Unit, Institute of Experimental Neurology, Division of NeuroscienceIRCCS San Raffaele Scientific InstituteMilanItaly
| | - Massimo Filippi
- Neuroimaging Research Unit, Institute of Experimental Neurology, Division of NeuroscienceIRCCS San Raffaele Scientific InstituteMilanItaly
- Vita‐Salute San Raffaele UniversityMilanItaly
- Neurology and Neurophysiology UnitIRCCS San Raffaele Scientific InstituteMilanItaly
| | - Milutin Kostić
- Institute of Mental Health, University of BelgradeBelgradeSerbia
- Department of Psychiatry, School of MedicineUniversity of BelgradeBelgradeSerbia
| | - Ana Munjiza
- Institute of Mental Health, University of BelgradeBelgradeSerbia
| | - Courtney A. Filippi
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Ellen Leibenluft
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Bianca A. V. Alberton
- Graduate Program in Electrical and Computer Engineering, Universidade Tecnológica Federal do ParanáCuritibaPuerto RicoBrazil
| | - Nicholas L. Balderston
- Center for Neuromodulation in Depression and StressUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Monique Ernst
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Christian Grillon
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | | | | | - Gregory A. Fonzo
- Department of PsychiatryThe University of Texas at Austin Dell Medical SchoolAustinTexasUSA
| | | | - Murray B. Stein
- Department of Psychiatry & Family Medicine and Public HealthUniversity of CaliforniaLa JollaCaliforniaUSA
| | - Raquel E. Gur
- Department of PsychiatryUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Ruben C. Gur
- Department of PsychiatryUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | | | - Bart Larsen
- Department of PsychiatryUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | | | - Jennifer Harper
- Department of PsychiatryWashington UniversitySt. LouisMissouriUSA
| | - Michael Myers
- Department of PsychiatryWashington UniversitySt. LouisMissouriUSA
| | | | - Qiongru Yu
- Department of PsychiatryWashington UniversitySt. LouisMissouriUSA
| | | | - Dick J. Veltman
- Department. of PsychiatryAmsterdam UMC/VUMCAmsterdamThe Netherlands
| | - Ulrike Lueken
- Department of PsychologyHumboldt‐Universität zu BerlinBerlinGermany
| | - Nic J. A. Van der Wee
- Leiden University Medical Center, Department of PsychiatryLeidenThe Netherlands
- Leiden Institute for Brain and Cognition (LIBC)LeidenThe Netherlands
| | - Dan J. Stein
- Department of Psychiatry & Neuroscience InstituteUniversity of Cape TownCape TownSouth Africa
- SAMRC Unite on Risk & Resilience in Mental Disorders, Department of Psychiatry & Neuroscience InstituteUniversity of Cape TownCape TownSouth Africa
| | - Neda Jahanshad
- Imaging Genetics Center, Mark and Mary Stevens Neuroimaging and Informatics Institute, University of Southern CaliforniaMarina del ReyCaliforniaUSA
| | - Paul M. Thompson
- Imaging Genetics Center, Mark and Mary Stevens Neuroimaging and Informatics Institute, University of Southern CaliforniaMarina del ReyCaliforniaUSA
| | - Daniel S. Pine
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| | - Anderson M. Winkler
- National Institute of Mental Health (NIMH), National Institutes of Health (NIH)BethesdaMarylandUSA
| |
Collapse
|
19
|
El Majdoubi D, El Bakkali H, Sadki S. SmartMedChain: A Blockchain-Based Privacy-Preserving Smart Healthcare Framework. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:4145512. [PMID: 34777733 PMCID: PMC8589497 DOI: 10.1155/2021/4145512] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 09/20/2021] [Accepted: 10/15/2021] [Indexed: 11/17/2022]
Abstract
Nowadays, the adoption of Internet of Things (IoT) technology worldwide is accelerating the digital transformation of healthcare industry. In this context, smart healthcare (s-healthcare) solutions are ensuring better and innovative opportunities for healthcare providers to improve patients' care. However, these solutions raise also new challenges in terms of security and privacy due to the diversity of stakeholders, the centralized data management, and the resulting lack of trustworthiness, accountability, and control. In this paper, we propose an end-to-end Blockchain-based and privacy-preserving framework called SmartMedChain for data sharing in s-healthcare environment. The Blockchain is built on Hyperledger Fabric and stores encrypted health data by using the InterPlanetary File System (IPFS), a distributed data storage solution with high resiliency and scalability. Indeed, compared to other propositions and based on the concept of smart contracts, our solution combines both data access control and data usage auditing measures for both Medical IoT data and Electronic Health Records (EHRs) generated by s-healthcare services. In addition, s-healthcare stakeholders can be held accountable by introducing an innovative Privacy Agreement Management scheme that monitors the execution of the service in respect of patient preferences and in accordance with relevant privacy laws. Security analysis and experimental results show that the proposed SmartMedChain is feasible and efficient for s-healthcare environments.
Collapse
Affiliation(s)
- Driss El Majdoubi
- Rabat IT Center, Smart Systems Laboratory (SSL), ENSIAS, Mohammed V University in Rabat, Rabat, Morocco
| | - Hanan El Bakkali
- Rabat IT Center, Smart Systems Laboratory (SSL), ENSIAS, Mohammed V University in Rabat, Rabat, Morocco
| | - Souad Sadki
- Rabat IT Center, Smart Systems Laboratory (SSL), ENSIAS, Mohammed V University in Rabat, Rabat, Morocco
| |
Collapse
|
20
|
An efficient and accurate distributed learning algorithm for modeling multi-site zero-inflated count outcomes. Sci Rep 2021; 11:19647. [PMID: 34608222 PMCID: PMC8490431 DOI: 10.1038/s41598-021-99078-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Accepted: 09/13/2021] [Indexed: 12/03/2022] Open
Abstract
Clinical research networks (CRNs), made up of multiple healthcare systems each with patient data from several care sites, are beneficial for studying rare outcomes and increasing generalizability of results. While CRNs encourage sharing aggregate data across healthcare systems, individual systems within CRNs often cannot share patient-level data due to privacy regulations, prohibiting multi-site regression which requires an analyst to access all individual patient data pooled together. Meta-analysis is commonly used to model data stored at multiple institutions within a CRN but can result in biased estimation, most notably in rare-event contexts. We present a communication-efficient, privacy-preserving algorithm for modeling multi-site zero-inflated count outcomes within a CRN. Our method, a one-shot distributed algorithm for performing hurdle regression (ODAH), models zero-inflated count data stored in multiple sites without sharing patient-level data across sites, resulting in estimates closely approximating those that would be obtained in a pooled patient-level data analysis. We evaluate our method through extensive simulations and two real-world data applications using electronic health records: examining risk factors associated with pediatric avoidable hospitalization and modeling serious adverse event frequency associated with a colorectal cancer therapy. In simulations, ODAH produced bias less than 0.1% across all settings explored while meta-analysis estimates exhibited bias up to 12.7%, with meta-analysis performing worst in settings with high zero-inflation or low event rates. Across both applied analyses, ODAH estimates had less than 10% bias for 18 of 20 coefficients estimated, while meta-analysis estimates exhibited substantially higher bias. Relative to existing methods for distributed data analysis, ODAH offers a highly accurate, computationally efficient method for modeling multi-site zero-inflated count data.
Collapse
|
21
|
Yang H, Chen L, Cheng Z, Yang M, Wang J, Lin C, Wang Y, Huang L, Chen Y, Peng S, Ke Z, Li W. Deep learning-based six-type classifier for lung cancer and mimics from histopathological whole slide images: a retrospective study. BMC Med 2021; 19:80. [PMID: 33775248 PMCID: PMC8006383 DOI: 10.1186/s12916-021-01953-2] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/26/2020] [Accepted: 02/26/2021] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Targeted therapy and immunotherapy put forward higher demands for accurate lung cancer classification, as well as benign versus malignant disease discrimination. Digital whole slide images (WSIs) witnessed the transition from traditional histopathology to computational approaches, arousing a hype of deep learning methods for histopathological analysis. We aimed at exploring the potential of deep learning models in the identification of lung cancer subtypes and cancer mimics from WSIs. METHODS We initially obtained 741 WSIs from the First Affiliated Hospital of Sun Yat-sen University (SYSUFH) for the deep learning model development, optimization, and verification. Additional 318 WSIs from SYSUFH, 212 from Shenzhen People's Hospital, and 422 from The Cancer Genome Atlas were further collected for multi-centre verification. EfficientNet-B5- and ResNet-50-based deep learning methods were developed and compared using the metrics of recall, precision, F1-score, and areas under the curve (AUCs). A threshold-based tumour-first aggregation approach was proposed and implemented for the label inferencing of WSIs with complex tissue components. Four pathologists of different levels from SYSUFH reviewed all the testing slides blindly, and the diagnosing results were used for quantitative comparisons with the best performing deep learning model. RESULTS We developed the first deep learning-based six-type classifier for histopathological WSI classification of lung adenocarcinoma, lung squamous cell carcinoma, small cell lung carcinoma, pulmonary tuberculosis, organizing pneumonia, and normal lung. The EfficientNet-B5-based model outperformed ResNet-50 and was selected as the backbone in the classifier. Tested on 1067 slides from four cohorts of different medical centres, AUCs of 0.970, 0.918, 0.963, and 0.978 were achieved, respectively. The classifier achieved high consistence to the ground truth and attending pathologists with high intraclass correlation coefficients over 0.873. CONCLUSIONS Multi-cohort testing demonstrated our six-type classifier achieved consistent and comparable performance to experienced pathologists and gained advantages over other existing computational methods. The visualization of prediction heatmap improved the model interpretability intuitively. The classifier with the threshold-based tumour-first label inferencing method exhibited excellent accuracy and feasibility in classifying lung cancers and confused nonneoplastic tissues, indicating that deep learning can resolve complex multi-class tissue classification that conforms to real-world histopathological scenarios.
Collapse
Affiliation(s)
- Huan Yang
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Lili Chen
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Zhiqiang Cheng
- Department of Pathology, Shenzhen People's Hospital, Shenzhen, 518020, China
| | - Minglei Yang
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Jianbo Wang
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Chenghao Lin
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Yuefeng Wang
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Leilei Huang
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Yangshan Chen
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Sui Peng
- Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China.,Molecular Diagnosis Center or Institute of Precision Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Zunfu Ke
- Department of Pathology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China. .,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China. .,Molecular Diagnosis Center or Institute of Precision Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China.
| | - Weizhong Li
- Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China. .,Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China. .,Key Laboratory of Tropical Disease Control (Ministry of Education), Sun Yat-sen University, Guangzhou, 510080, China.
| |
Collapse
|
22
|
Turvey CL, Klein DM, Nazi KM, Haidary ST, Bouhaddou O, Hsing N, Donahue M. Racial differences in patient consent policy preferences for electronic health information exchange. J Am Med Inform Assoc 2021; 27:717-725. [PMID: 32150259 DOI: 10.1093/jamia/ocaa012] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Revised: 01/07/2020] [Accepted: 02/25/2020] [Indexed: 01/27/2023] Open
Abstract
OBJECTIVE This study aimed to explore the association between demographic variables, such as race and gender, and patient consent policy preferences for health information exchange as well as self-report by VHA enrollees of information continuity between Veterans Health Administration (VHA) and community non-VHA heath care providers. MATERIALS AND METHODS Data were collected between March 25, 2016 and August 22, 2016 in an online survey of 19 567 veterans. Three questions from the 2016 Commonwealth Fund International Health Policy Survey, which addressed care continuity, were included. The survey also included questions about consent policy preference regarding opt-out, opt-in, and "break the glass" consent policies. RESULTS VHA enrollees had comparable proportions of unnecessary laboratory testing and conflicting information from providers when compared with the United States sample in the Commonwealth Survey. However, they endorsed medical record information being unavailable between organizations more highly. Demographic variables were associated with gaps in care continuity as well as consent policy preferences, with 56.8% of Whites preferring an opt-out policy as compared with 40.3% of Blacks, 44.9% of Hispanic Latinos, 48.3% of Asian/Pacific Islanders, and 38.3% of Native Americans (P < .001). DISCUSSION Observed large differences by race and ethnicity in privacy preferences for electronic health information exchange should inform implementation of these programs to ensure cultural sensitivity. Veterans experienced care continuity comparable to a general United States sample, except for less effective exchange of health records between heath care organizations. VHA followed an opt-in consent policy at the time of this survey which may underlie this gap.
Collapse
Affiliation(s)
- Carolyn L Turvey
- labelVirtual Specialty Care QUERI Program: Implementing and Evaluating Technology Facilitated Clinical Interventions to Improve Access to High Quality Specialty Care for Rural Veterans, Seattle, Washington & Iowa City, Iowa, USA.,VA Office of Rural Health, Rural Health Resource Center, Iowa City, Iowa, USA.,Iowa City VA Health Care System, Comprehensive Access and Delivery Research and Evaluation Center, Iowa City, Iowa, USA.,Department of Psychiatry, The University of Iowa Carver College of Medicine, Iowa City, Iowa, USA
| | - Dawn M Klein
- VA Office of Rural Health, Rural Health Resource Center, Iowa City, Iowa, USA.,Iowa City VA Health Care System, Comprehensive Access and Delivery Research and Evaluation Center, Iowa City, Iowa, USA.,Department of Psychiatry, The University of Iowa Carver College of Medicine, Iowa City, Iowa, USA.,J P Systems, Clifton, Virginia, USA
| | - Kim M Nazi
- Independent Information Technology Consultant, Coxsackie, New York, USA
| | - Susan T Haidary
- Veterans and Consumers Health Informatics Office, Office of Connected Care, Veterans Health Administration, US Department of Veterans Affairs, Washington, DC, USA
| | - Omar Bouhaddou
- US Department of Veterans Affairs, Veterans Health Information Exchange Program, Washington, DC.,innoVet Health, San Diego, California, USA
| | - Nelson Hsing
- US Department of Veterans Affairs, Veterans Health Information Exchange Program, Washington, DC
| | - Margaret Donahue
- US Department of Veterans Affairs, Veterans Health Information Exchange Program, Washington, DC
| |
Collapse
|
23
|
Garrett-Ruffin S, Hindash AC, Kaczkurkin AN, Mears RP, Morales S, Paul K, Pavlov YG, Keil A. Open science in psychophysiology: An overview of challenges and emerging solutions. Int J Psychophysiol 2021; 162:69-78. [PMID: 33556468 DOI: 10.1016/j.ijpsycho.2021.02.005] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 01/31/2021] [Accepted: 02/03/2021] [Indexed: 12/25/2022]
Abstract
The present review is the result of a one-day workshop on open science, held at the Annual Meeting of the Society for Psychophysiological Research in Washington, DC, September 2019. The contributors represent psychophysiological researchers at different career stages and from a wide spectrum of institutions. The state of open science in psychophysiology is discussed from different perspectives, highlighting key challenges, potential benefits, and emerging solutions that are intended to facilitate open science practices. Three domains are emphasized: data sharing, preregistration, and multi-site studies. In the context of these broader domains, we present potential implementations of specific open science procedures such as data format harmonization, power analysis, data, presentation code and analysis pipeline sharing, suitable for psychophysiological research. Practical steps are discussed that may be taken to facilitate the adoption of open science practices in psychophysiology. These steps include (1) promoting broad and accessible training in the skills needed to implement open science practices, such as collaborative research and computational reproducibility initiatives, (2) establishing mechanisms that provide practical assistance in sharing of processing pipelines, presentation code, and data in an efficient way, and (3) improving the incentive structure for open science approaches. Throughout the manuscript, we provide references and links to available resources for those interested in adopting open science practices in their research.
Collapse
Affiliation(s)
- Sherona Garrett-Ruffin
- Affective Neuroscience and Mental Health Counseling, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Alexandra Cowden Hindash
- VHA Advanced Fellow in Women's Health Research, San Francisco VA Medical Center, Department of Psychiatry and Behavioral Sciences, University of California, San Francisco, USA
| | | | - Ryan P Mears
- Department of Psychology, University of Florida, Gainesville, FL 32611, USA
| | - Santiago Morales
- Department of Human Development and Quantitative Methodology, University of Maryland, College Park, MD 20742, USA
| | - Katharina Paul
- Department of Differential Psychology and Psychological Assessment, University Hamburg, Von Melle Park 5, 20146 Hamburg, Germany
| | - Yuri G Pavlov
- Institute of Medical Psychology and Behavioral Neurobiology, University of Tübingen, 72076 Tübingen, Germany; Department of Psychology, Ural Federal University, Ekaterinburg 620000, Russian Federation
| | - Andreas Keil
- Department of Psychology, Center for the Study of Emotion and Attention, University of Florida, Gainesville, FL 32611, USA.
| |
Collapse
|
24
|
Thapa C, Camtepe S. Precision health data: Requirements, challenges and existing techniques for data security and privacy. Comput Biol Med 2020; 129:104130. [PMID: 33271399 DOI: 10.1016/j.compbiomed.2020.104130] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 11/11/2020] [Accepted: 11/11/2020] [Indexed: 01/21/2023]
Abstract
Precision health leverages information from various sources, including omics, lifestyle, environment, social media, medical records, and medical insurance claims to enable personalized care, prevent and predict illness, and precise treatments. It extensively uses sensing technologies (e.g., electronic health monitoring devices), computations (e.g., machine learning), and communication (e.g., interaction between the health data centers). As health data contain sensitive private information, including the identity of patient and carer and medical conditions of the patient, proper care is required at all times. Leakage of these private information affects the personal life, including bullying, high insurance premium, and loss of job due to the medical history. Thus, the security, privacy of and trust on the information are of utmost importance. Moreover, government legislation and ethics committees demand the security and privacy of healthcare data. Besides, the public, who is the data source, always expects the security, privacy, and trust of their data. Otherwise, they can avoid contributing their data to the precision health system. Consequently, as the public is the targeted beneficiary of the system, the effectiveness of precision health diminishes. Herein, in the light of precision health data security, privacy, ethical and regulatory requirements, finding the best methods and techniques for the utilization of the health data, and thus precision health is essential. In this regard, firstly, this paper explores the regulations, ethical guidelines around the world, and domain-specific needs. Then it presents the requirements and investigates the associated challenges. Secondly, this paper investigates secure and privacy-preserving machine learning methods suitable for the computation of precision health data along with their usage in relevant health projects. Finally, it illustrates the best available techniques for precision health data security and privacy with a conceptual system model that enables compliance, ethics clearance, consent management, medical innovations, and developments in the health domain.
Collapse
|
25
|
Carpenter SM, Menictas M, Nahum-Shani I, Wetter DW, Murphy SA. Developments in Mobile Health Just-in-Time Adaptive Interventions for Addiction Science. CURRENT ADDICTION REPORTS 2020; 7:280-290. [PMID: 33747711 PMCID: PMC7968352 DOI: 10.1007/s40429-020-00322-y] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
PURPOSE OF REVIEW Addiction is a serious and prevalent problem across the globe. An important challenge facing intervention science is how to support addiction treatment and recovery while mitigating the associated cost and stigma. A promising solution is the use of mobile health (mHealth) just-in-time adaptive interventions (JITAIs), in which intervention options are delivered in situ via a mobile device when individuals are most in need. RECENT FINDINGS The present review describes the use of mHealth JITAIs to support addiction treatment and recovery, and provides guidance on when and how the micro-randomized trial (MRT) can be used to optimize a JITAI. We describe the design of five mHealth JITAIs in addiction and three MRT studies, and discuss challenges and future directions. SUMMARY This review aims to provide guidance for constructing effective JITAIs to support addiction treatment and recovery.
Collapse
Affiliation(s)
| | | | - Inbal Nahum-Shani
- Institute for Social Research, University of Michigan, Ann Arbor, MI
| | - David W. Wetter
- Huntsman Cancer Institute and the University of Utah, Salt Lake City, UT
| | | |
Collapse
|
26
|
Fabacher T, Schaeffer M, Tuzin N, Séverac F, Lefebvre F, Mielcarek M, Sauleau EA, Meyer N, Godet J. [Medical biostatistics with GMRC Shiny Stats - learning by doing]. ANNALES PHARMACEUTIQUES FRANÇAISES 2020; 78:499-506. [PMID: 32565157 DOI: 10.1016/j.pharma.2020.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2020] [Revised: 05/29/2020] [Accepted: 06/05/2020] [Indexed: 10/24/2022]
Abstract
Biostatistics are omnipresent in the scientific and medical literature and are an essential skill for any health student. We have developed a practical training tool - GMRC Shiny stats - an interactive application specifically dedicated to medical data statistical analysis. The application has been designed to provide an analysis workflow corresponding to the usual progression of an experienced statistician during data analysis. The most common statistical analyses can be performed (descriptive statistics, inferences according to frequentist methods, survival analyses, correlation, agreement measurements, etc.). GMRC Shiny stats is intuitive and user-friendly and assists students in choosing the most appropriate statistical tests. With all these functionalities, students can learn statistical analysis by doing. Getting involved in the statistical analysis and processing of their own data is likely to improve their biostatistics skills.
Collapse
Affiliation(s)
- T Fabacher
- Groupe méthode en recherche clinique-hôpitaux universitaires de Strasbourg, 1, place de l'hôpital, 67000 Strasbourg, France
| | - M Schaeffer
- Société Alstats, 3, route de Kientzville, 67750 Scherwiller, France
| | - N Tuzin
- Groupe méthode en recherche clinique-hôpitaux universitaires de Strasbourg, 1, place de l'hôpital, 67000 Strasbourg, France
| | - F Séverac
- Groupe méthode en recherche clinique-hôpitaux universitaires de Strasbourg, 1, place de l'hôpital, 67000 Strasbourg, France
| | - François Lefebvre
- Groupe méthode en recherche clinique-hôpitaux universitaires de Strasbourg, 1, place de l'hôpital, 67000 Strasbourg, France
| | - M Mielcarek
- Groupe méthode en recherche clinique-hôpitaux universitaires de Strasbourg, 1, place de l'hôpital, 67000 Strasbourg, France
| | - E-A Sauleau
- Groupe méthode en recherche clinique-hôpitaux universitaires de Strasbourg, 1, place de l'hôpital, 67000 Strasbourg, France
| | - N Meyer
- Groupe méthode en recherche clinique-hôpitaux universitaires de Strasbourg, 1, place de l'hôpital, 67000 Strasbourg, France
| | - J Godet
- Groupe méthode en recherche clinique-hôpitaux universitaires de Strasbourg, 1, place de l'hôpital, 67000 Strasbourg, France.
| |
Collapse
|
27
|
Alter G, Gonzalez-Beltran A, Ohno-Machado L, Rocca-Serra P. The Data Tags Suite (DATS) model for discovering data access and use requirements. Gigascience 2020; 9:giz165. [PMID: 32031623 PMCID: PMC7006671 DOI: 10.1093/gigascience/giz165] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 12/17/2019] [Accepted: 12/27/2019] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Data reuse is often controlled to protect the privacy of subjects and patients. Data discovery tools need ways to inform researchers about restrictions on data access and re-use. RESULTS We present elements in the Data Tags Suite (DATS) metadata schema describing data access, data use conditions, and consent information. DATS metadata are explained in terms of the administrative, legal, and technical systems used to protect confidential data. CONCLUSIONS The access and use metadata items in DATS are designed from the perspective of a researcher who wants to find and re-use existing data. We call for standard ways of describing informed consent and data use agreements that will enable automated systems for managing research data.
Collapse
Affiliation(s)
- George Alter
- University of Michigan, ICPSR 330 Packard Street, Ann Arbor MI 48104, USA
| | - Alejandra Gonzalez-Beltran
- Science and Technology Facilities Council, Scientific Computing Department, Rutherford Appleton Laboratory, Harwell Campus, Didcot, 0X11 0QX, United Kingdom
| | - Lucila Ohno-Machado
- University of California, San Diego, Division of Biomedical Informatics, 9500 Gilman Dr. MC 0728, La Jolla CA 92093-0728, USA
| | - Philippe Rocca-Serra
- Oxford e-Research Centre University of Oxford 7 Keble Road, Oxford, OX1 3QG United Kingdom
| |
Collapse
|