1
|
Huang YN, Munteanu V, Love MI, Ronkowski CF, Deshpande D, Wong-Beringer A, Corbett-Detig R, Dimian M, Moore JH, Garmire LX, Reddy TBK, Butte AJ, Robinson MD, Eskin E, Abedalthagafi MS, Mangul S. Perceptual and technical barriers in sharing and formatting metadata accompanying omics studies. CELL GENOMICS 2025; 5:100845. [PMID: 40215974 DOI: 10.1016/j.xgen.2025.100845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 09/26/2024] [Accepted: 03/12/2025] [Indexed: 04/15/2025]
Abstract
Metadata, or "data about data," is essential for organizing, understanding, and managing large-scale omics datasets. It enhances data discovery, integration, and interpretation, enabling reproducibility, reusability, and secondary analysis. However, metadata sharing remains hindered by perceptual and technical barriers, including the lack of uniform standards, privacy concerns, study design limitations, insufficient incentives, inadequate infrastructure, and a shortage of trained personnel. These challenges compromise data reliability and obstruct integrative meta-analyses. Addressing these issues requires standardization, education, stronger roles for journals and funding agencies, and improved incentives and infrastructure. Looking ahead, emerging technologies such as artificial intelligence and machine learning may offer promising solutions to automate metadata processes, increasing accuracy and scalability. Fostering a collaborative culture of metadata sharing will maximize the value of omics data, accelerating innovation and scientific discovery.
Collapse
Affiliation(s)
- Yu-Ning Huang
- Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Viorel Munteanu
- Department of Computers, Informatics, and Microelectronics, Technical University of Moldova, 2045 Chisinau, Moldova; Department of Biological and Morphofunctional Sciences, College of Medicine and Biological Sciences, Stefan cel Mare University of Suceava, 720229 Suceava, Romania
| | - Michael I Love
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA; Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA
| | - Cynthia Flaire Ronkowski
- Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Dhrithi Deshpande
- Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Annie Wong-Beringer
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | | | - Mihai Dimian
- Department of Computers, Electronics, and Automation, Stefan cel Mare University of Suceava, 720229 Suceava, Romania
| | - Jason H Moore
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA 90069, USA
| | - Lana X Garmire
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48105, USA
| | - T B K Reddy
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Atul J Butte
- Bakar Computational Health Sciences Institute, University of California, San Francisco (UCSF), San Francisco, CA 94143, USA; Center for Data-Driven Insights and Innovation, University of California, Oakland, Oakland, CA 94607, USA
| | - Mark D Robinson
- SIB Swiss Institute of Bioinformatics and Department of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Eleazar Eskin
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA; Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA; Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Malak S Abedalthagafi
- Department of Pathology and Laboratory Medicine, Emory University Hospital, Atlanta, GA, USA
| | - Serghei Mangul
- Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA 90089, USA; Department of Computers, Informatics, and Microelectronics, Technical University of Moldova, 2045 Chisinau, Moldova; Department of Biological and Morphofunctional Sciences, College of Medicine and Biological Sciences, Stefan cel Mare University of Suceava, 720229 Suceava, Romania; Sage Bionetworks, Seattle, WA, USA.
| |
Collapse
|
2
|
Bloodworth S, Willoughby C, Coles SJ. Data accessibility in the chemical sciences: an analysis of recent practice in organic chemistry journals. Beilstein J Org Chem 2025; 21:864-876. [PMID: 40331050 PMCID: PMC12051459 DOI: 10.3762/bjoc.21.70] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2025] [Accepted: 04/23/2025] [Indexed: 05/08/2025] Open
Abstract
The discoverability and reusability of data is critical for machine learning to drive new discovery in the chemical sciences, and the 'FAIR Guiding Principles for scientific data management and stewardship' provide a measurable set of guidelines that can be used to ensure the accessibility of reusable data. We investigate the data practice of researchers publishing in specialist organic chemistry journals, by analyzing the outputs of 240 randomly selected research papers from 12 top-ranked journals published in early 2023. We investigate compliance with recommended (but not compulsory) data policies, assess the accessibility and reusability of data, and if the existence of specific recommendations for publishing NMR data by some journals supports author compliance. We find that, although authors meet mandated requirements, there is very limited compliance with data sharing policies that are only recommended by journals. Overall, there is little evidence to suggest that authors' publishing practice meets FAIR data guidance. We suggest first steps that researchers can take to move towards a positive culture of data sharing in organic chemistry. Routine actions that we encourage as standard practice include deposition of raw and metadata to open repositories, and inclusion of machine-readable structure identifiers for all reported compounds.
Collapse
Affiliation(s)
- Sally Bloodworth
- School of Chemistry and Chemical Engineering, University of Southampton, Highfield, Southampton SO17 1BJ, UK
| | - Cerys Willoughby
- School of Chemistry and Chemical Engineering, University of Southampton, Highfield, Southampton SO17 1BJ, UK
| | - Simon J Coles
- School of Chemistry and Chemical Engineering, University of Southampton, Highfield, Southampton SO17 1BJ, UK
| |
Collapse
|
3
|
Riley M, Kilkenny MF, Robinson K, Leggat SG. Researchers' perceptions of the trustworthiness, for reuse purposes, of government health data in Victoria, Australia: Implications for policy and practice. HEALTH INF MANAG J 2025; 54:139-149. [PMID: 39045683 PMCID: PMC12038074 DOI: 10.1177/18333583241256049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/25/2024]
Abstract
In 2022 the Australian Data Availability and Transparency Act (DATA) commenced, enabling accredited "data users" to access data from "accredited data service providers." However, the DATA Scheme lacks guidance on "trustworthiness" of the data to be utilised for reuse purposes. Objectives: To determine: (i) Do researchers using government health datasets trust the data? (ii) What factors influence their perceptions of data trustworthiness? and (iii) What are the implications for government and data custodians? Method: Authors of published studies (2008-2020) that utilised Victorian government health datasets were surveyed via a case study approach. Twenty-eight trust constructs (identified via literature review) were grouped into data factors, management properties and provider factors. Results: Fifty experienced health researchers responded. Most (88%) believed that Victorian government health data were trustworthy. When grouped, data factors and management properties were more important than data provider factors in building trust. The most important individual trust constructs were: "compliant with ethical regulation" (100%) and "monitoring privacy and confidentiality" (98%). Constructs of least importance were knowledge of "participant consent" (56%) and "major focus of the data provider was research" (50%). Conclusion: Overall, the researchers trusted government health data, but data factors and data management properties were more important than data provider factors in building trust. Implications: Government should ensure the DATA Scheme incorporates mechanisms to validate those data utilised by accredited data users and data providers have sufficient quality (intrinsic and extrinsic) to meet the requirements of "trustworthiness," and that evidentiary documentation is provided to support these "accredited data."
Collapse
Affiliation(s)
| | | | | | - Sandra G Leggat
- La Trobe University, Australia
- James Cook University, Australia
| |
Collapse
|
4
|
Delavenne C, van Schaik G, Frössling J, Cameron A, Faverjon C. Reusability challenges of livestock production data to improve animal health. Sci Data 2025; 12:458. [PMID: 40108251 PMCID: PMC11923216 DOI: 10.1038/s41597-025-04785-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2024] [Accepted: 03/05/2025] [Indexed: 03/22/2025] Open
Abstract
In veterinary epidemiology, using data routinely generated by stakeholders of the livestock production chains offers an opportunity for researchers to access a large amount of information that could be used to improve animal health. However, (re)using these non-scholarly data doesn't come without challenges. This study assesses the reusability for research purposes of 30 European datasets generated by the livestock sector to meet legislative or operational needs. Information about each dataset was collected through a questionnaire survey filled by the data owner or the data user (researchers). Datasets were described, and their compliance with the FAIR principles, a data-sharing standard, and the principle of accountability defined in the General Data Protection Regulation were assessed. The study highlighted major gaps in terms of compliance with data regulations and implementation of good data management practices, specifically considering the rare use of metadata and standard vocabularies. Filling these gaps is essential to reap the full benefits offered by the rapidly growing volume of heterogeneous data available in livestock production systems.
Collapse
Affiliation(s)
| | - Gerdien van Schaik
- Department of Population Health Sciences, section Farm Animal Health, Faculty of Veterinary Medicine, Utrecht University, Utrecht, the Netherlands
- Royal GD, Deventer, the Netherlands
| | - Jenny Frössling
- Department of Epidemiology, Surveillance and Risk Assessment, Swedish Veterinary Agency (SVA), SE-751 89, Uppsala, Sweden
- Department of Applied Animal Science and Welfare, Swedish University of Agricultural Sciences (SLU), Box 234, SE-532 23, Skara, Sweden
| | | | | |
Collapse
|
5
|
Shiell M, Bajari R, Andric D, Eubank J, Chan BF, Richardsson AJ, Ali A, Allabadi B, Alturmessov Y, Baker J, Catton A, Cullion K, DeMaria D, Dos Santos P, Feher H, Gerthoffert F, Ha M, Haw RA, Kachru A, Lepsa A, Li A, Mistry RN, Nahal-Bose HK, Pejovic A, Rich S, Rivera L, Schütte C, Su E, Tisma R, Uddin J, Wang C, Wilmer AN, Xiang L, Zhang J, Stein LD, Ferretti V, Courtot M, Yung CK. Overture: an open-source genomics data platform. Gigascience 2025; 14:giaf038. [PMID: 40272881 PMCID: PMC12020472 DOI: 10.1093/gigascience/giaf038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2024] [Revised: 02/28/2025] [Accepted: 03/07/2025] [Indexed: 04/26/2025] Open
Abstract
BACKGROUND Next-generation sequencing has created many new technological challenges in organizing and distributing genomics datasets, which now can routinely reach petabyte scales. Coupled with data-hungry artificial intelligence and machine learning applications, findable, accessible, interoperable, and reusable genomics datasets have never been more valuable. While major archives like the Genomics Data Commons, Sequence Reads Archive, and European Genome-Phenome Archive have improved researchers' ability to share and reuse data, and general-purpose repositories such as Zenodo and Figshare provide valuable platforms for research data publication, the diversity of genomics research precludes any one-size-fits-all approach. In many cases, bespoke solutions are required, and despite funding agencies and journals increasingly mandating reusable data practices, researchers still lack the technical support needed to meet the multifaceted challenges of data reuse. FINDINGS Overture bridges this gap by providing open-source software for building and deploying customizable genomics data platforms. Its architecture consists of modular microservices, each of which is generalized with narrow responsibilities that together combine to create complete data management systems. These systems enable researchers to organize, share, and explore their genomics data at any scale. Through Overture, researchers can connect their data to both humans and machines, fostering reproducibility and enabling new insights through controlled data sharing and reuse. CONCLUSIONS By making these tools freely available, we can accelerate the development of reliable genomic data management across the research community quickly, flexibly, and at multiple scales. Overture is an open-source project licensed under AGPLv3.0 with all source code publicly available from https://github.com/overture-stack and documentation on development, deployment, and usage available from www.overture.bio.
Collapse
Affiliation(s)
- Mitchell Shiell
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Rosi Bajari
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Dusan Andric
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Jon Eubank
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Brandon F Chan
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | | | - Azher Ali
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Bashar Allabadi
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | | | - Jared Baker
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Ann Catton
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Kim Cullion
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Daniel DeMaria
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | | | - Henrich Feher
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | | | - Minh Ha
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Robin A Haw
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Atul Kachru
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Alexandru Lepsa
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Alexis Li
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Rakesh N Mistry
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | | | | | - Samantha Rich
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Leonardo Rivera
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Ciarán Schütte
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Edmund Su
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Robert Tisma
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Jaser Uddin
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Chang Wang
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Alex N Wilmer
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Linda Xiang
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Junjun Zhang
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| | - Lincoln D Stein
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
- Department of Molecular Genetics, University of Toronto, Toronto, Canada, M5S 3K3
| | - Vincent Ferretti
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
- Research Center of the CHU Sainte-Justine, University of Montreal, Montreal, Canada, QC H3T 1C5
| | - Mélanie Courtot
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
- Department of Medical Biophysics, University of Toronto, Toronto, Canada, M5G 2C4
| | - Christina K Yung
- Ontario Institute for Cancer Research (OICR), Ontario, Canada, M5G 1M1
| |
Collapse
|
6
|
Alam P, Bolio A, Lin L, Larson HJ. Stakeholders' perceptions of personal health data sharing: A scoping review. PLOS DIGITAL HEALTH 2024; 3:e0000652. [PMID: 39565781 PMCID: PMC11578505 DOI: 10.1371/journal.pdig.0000652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 09/24/2024] [Indexed: 11/22/2024]
Abstract
The rapid advancement of digital health technologies has heightened demand for health data for secondary uses, highlighting the importance of understanding global perspectives on personal information sharing. This article examines stakeholder perceptions and attitudes toward the use of personal health data to improve personalized treatments, interventions, and research. It also identifies barriers and facilitators in health data sharing and pinpoints gaps in current research, aiming to inform ethical practices in healthcare settings that utilize digital technologies. We conducted a scoping review of peer reviewed empirical studies based on data pertaining to perceptions and attitudes towards sharing personal health data. The authors searched three electronic databases-Embase, MEDLINE, and Web of Science-for articles published (2015-2023), using terms relating to health data and perceptions. Thirty-nine articles met the inclusion criteria with sample size ranging from 14 to 29,275. We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews guidelines for the design and analysis of this study. We synthesized the included articles using narrative analysis. The review captured multiple stakeholder perspectives with an up-to-date range of diverse barriers and facilitators that impact data-sharing behavior. The included studies were primarily cross-sectional and geographically concentrated in high-income settings; often overlooking diverse demographics and broader global health challenges. Most of the included studies were based within North America and Western Europe, with the United States (n = 8) and the United Kingdom (n = 7) representing the most studied countries. Many reviewed studies were published in 2022 (n = 11) and used quantitative methods (n = 23). Twenty-nine studies examined the perspectives of patients and the public while six looked at healthcare professionals, researchers, and experts. Many of the studies we reviewed reported overall positive attitudes about data sharing with variations around sociodemographic factors, motivations for sharing data, type and recipient of data being shared, consent preference, and trust.
Collapse
Affiliation(s)
- Prima Alam
- The Vaccine Confidence Project, Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Laboratory of Data Discovery for Health Limited (D24H), Hong Kong Science Park, Hong Kong Special Administrative Region, China
| | - Ana Bolio
- The Vaccine Confidence Project, Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Leesa Lin
- The Vaccine Confidence Project, Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Laboratory of Data Discovery for Health Limited (D24H), Hong Kong Science Park, Hong Kong Special Administrative Region, China
| | - Heidi J. Larson
- The Vaccine Confidence Project, Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, United States of America
| |
Collapse
|
7
|
Chabilall J, Brown Q, Cengiz N, Moodley K. Data as scientific currency: Challenges experienced by researchers with sharing health data in sub-Saharan Africa. PLOS DIGITAL HEALTH 2024; 3:e0000635. [PMID: 39446843 PMCID: PMC11500889 DOI: 10.1371/journal.pdig.0000635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 09/09/2024] [Indexed: 10/26/2024]
Abstract
Innovative information-sharing techniques and rapid access to stored research data as scientific currency have proved highly beneficial in healthcare and health research. Yet, researchers often experience conflict between data sharing to promote health-related scientific knowledge for the common good and their personal academic advancement. There is a scarcity of studies exploring the perspectives of health researchers in sub-Saharan Africa (SSA) regarding the challenges with data sharing in the context of data-intensive research. The study began with a quantitative survey and research, after which the researchers engaged in a qualitative study. This qualitative cross-sectional baseline study reports on the challenges faced by health researchers, in terms of data sharing. In-depth interviews were conducted via Microsoft Teams between July 2022 and April 2023 with 16 health researchers from 16 different countries across SSA. We employed purposive and snowballing sampling techniques to invite participants via email. The recorded interviews were transcribed, coded and analysed thematically using ATLAS.ti. Five recurrent themes and several subthemes emerged related to (1) individual researcher concerns (fears regarding data sharing, publication and manuscript pressure), (2) structural issues impacting data sharing, (3) recognition in academia (scooping of research data, acknowledgement and research incentives) (4) ethical challenges experienced by health researchers in SSA (confidentiality and informed consent, commercialisation and benefit sharing) and (5) legal lacunae (gaps in laws and regulations). Significant discomfort about data sharing exists amongst health researchers in this sample of respondents from SSA, resulting in a reluctance to share data despite acknowledging the scientific benefits of such sharing. This discomfort is related to the lack of adequate guidelines and governance processes in the context of health research collaborations, both locally and internationally. Consequently, concerns about ethical and legal issues are increasing. Resources are needed in SSA to improve the quality, value and veracity of data-as these are ethical imperatives. Strengthening data governance via robust guidelines, legislation and appropriate data sharing agreements will increase trust amongst health researchers and data donors alike.
Collapse
Affiliation(s)
- Jyothi Chabilall
- Business Management, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Qunita Brown
- Division of Medical Ethics and Law, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Nezerith Cengiz
- Division of Medical Ethics and Law, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Keymanthri Moodley
- Division of Medical Ethics and Law, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| |
Collapse
|
8
|
Wendelborn C, Anger M, Schickhardt C. Promoting Data Sharing: The Moral Obligations of Public Funding Agencies. SCIENCE AND ENGINEERING ETHICS 2024; 30:35. [PMID: 39105890 PMCID: PMC11303567 DOI: 10.1007/s11948-024-00491-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Accepted: 06/08/2024] [Indexed: 08/07/2024]
Abstract
Sharing research data has great potential to benefit science and society. However, data sharing is still not common practice. Since public research funding agencies have a particular impact on research and researchers, the question arises: Are public funding agencies morally obligated to promote data sharing? We argue from a research ethics perspective that public funding agencies have several pro tanto obligations requiring them to promote data sharing. However, there are also pro tanto obligations that speak against promoting data sharing in general as well as with regard to particular instruments of such promotion. We examine and weigh these obligations and conclude that all things considered funders ought to promote the sharing of data. Even the instrument of mandatory data sharing policies can be justified under certain conditions.
Collapse
Affiliation(s)
- Christian Wendelborn
- Section for Translational Medical Ethics, German Cancer Research Center (DKFZ), National Center for Tumor Diseases (NCT) Heidelberg, Heidelberg, Germany.
- University of Konstanz, Konstanz, Germany.
| | - Michael Anger
- Section for Translational Medical Ethics, German Cancer Research Center (DKFZ), National Center for Tumor Diseases (NCT) Heidelberg, Heidelberg, Germany
| | - Christoph Schickhardt
- Section for Translational Medical Ethics, German Cancer Research Center (DKFZ), National Center for Tumor Diseases (NCT) Heidelberg, Heidelberg, Germany
| |
Collapse
|
9
|
Schmidt C, Boissonnet T, Dohle J, Bernhardt K, Ferrando-May E, Wernet T, Nitschke R, Kunis S, Weidtkamp-Peters S. A practical guide to bioimaging research data management in core facilities. J Microsc 2024; 294:350-371. [PMID: 38752662 DOI: 10.1111/jmi.13317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 04/29/2024] [Accepted: 04/30/2024] [Indexed: 05/21/2024]
Abstract
Bioimage data are generated in diverse research fields throughout the life and biomedical sciences. Its potential for advancing scientific progress via modern, data-driven discovery approaches reaches beyond disciplinary borders. To fully exploit this potential, it is necessary to make bioimaging data, in general, multidimensional microscopy images and image series, FAIR, that is, findable, accessible, interoperable and reusable. These FAIR principles for research data management are now widely accepted in the scientific community and have been adopted by funding agencies, policymakers and publishers. To remain competitive and at the forefront of research, implementing the FAIR principles into daily routines is an essential but challenging task for researchers and research infrastructures. Imaging core facilities, well-established providers of access to imaging equipment and expertise, are in an excellent position to lead this transformation in bioimaging research data management. They are positioned at the intersection of research groups, IT infrastructure providers, the institution´s administration, and microscope vendors. In the frame of German BioImaging - Society for Microscopy and Image Analysis (GerBI-GMB), cross-institutional working groups and third-party funded projects were initiated in recent years to advance the bioimaging community's capability and capacity for FAIR bioimage data management. Here, we provide an imaging-core-facility-centric perspective outlining the experience and current strategies in Germany to facilitate the practical adoption of the FAIR principles closely aligned with the international bioimaging community. We highlight which tools and services are ready to be implemented and what the future directions for FAIR bioimage data have to offer.
Collapse
Affiliation(s)
- Christian Schmidt
- Enabling Technology Department, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Tom Boissonnet
- Center for Advanced Imaging, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Julia Dohle
- Center of Cellular Nanoanalytics, Integrated Bioimaging Facility iBiOs, University of Osnabrück, Osnabrück, Germany
| | - Karen Bernhardt
- Center of Cellular Nanoanalytics, Integrated Bioimaging Facility iBiOs, University of Osnabrück, Osnabrück, Germany
| | - Elisa Ferrando-May
- Enabling Technology Department, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Biology, University of Konstanz, Konstanz, Germany
| | - Tobias Wernet
- Life Imaging Center, University of Freiburg, Freiburg, Germany
| | - Roland Nitschke
- Life Imaging Center, University of Freiburg, Freiburg, Germany
- CIBSS and BIOSS - Centres for Biological Signalling Studies, University of Freiburg, Freiburg, Germany
| | - Susanne Kunis
- Center of Cellular Nanoanalytics, Integrated Bioimaging Facility iBiOs, University of Osnabrück, Osnabrück, Germany
| | | |
Collapse
|
10
|
Steffens S, Schröder K, Krüger M, Maack C, Streckfuss-Bömeke K, Backs J, Backofen R, Baeßler B, Devaux Y, Gilsbach R, Heijman J, Knaus J, Kramann R, Linz D, Lister AL, Maatz H, Maegdefessel L, Mayr M, Meder B, Nussbeck SY, Rog-Zielinska EA, Schulz MH, Sickmann A, Yigit G, Kohl P. The challenges of research data management in cardiovascular science: a DGK and DZHK position paper-executive summary. Clin Res Cardiol 2024; 113:672-679. [PMID: 37847314 PMCID: PMC11026239 DOI: 10.1007/s00392-023-02303-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 09/01/2023] [Indexed: 10/18/2023]
Abstract
The sharing and documentation of cardiovascular research data are essential for efficient use and reuse of data, thereby aiding scientific transparency, accelerating the progress of cardiovascular research and healthcare, and contributing to the reproducibility of research results. However, challenges remain. This position paper, written on behalf of and approved by the German Cardiac Society and German Centre for Cardiovascular Research, summarizes our current understanding of the challenges in cardiovascular research data management (RDM). These challenges include lack of time, awareness, incentives, and funding for implementing effective RDM; lack of standardization in RDM processes; a need to better identify meaningful and actionable data among the increasing volume and complexity of data being acquired; and a lack of understanding of the legal aspects of data sharing. While several tools exist to increase the degree to which data are findable, accessible, interoperable, and reusable (FAIR), more work is needed to lower the threshold for effective RDM not just in cardiovascular research but in all biomedical research, with data sharing and reuse being factored in at every stage of the scientific process. A culture of open science with FAIR research data should be fostered through education and training of early-career and established research professionals. Ultimately, FAIR RDM requires permanent, long-term effort at all levels. If outcomes can be shown to be superior and to promote better (and better value) science, modern RDM will make a positive difference to cardiovascular science and practice. The full position paper is available in the supplementary materials.
Collapse
Affiliation(s)
- Sabine Steffens
- Institute for Cardiovascular Prevention (IPEK), Ludwig-Maximilians-Universität, Munich, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Munich Heart Alliance, Munich, Germany
| | - Katrin Schröder
- Institute for Cardiovascular Physiology, Goethe University, Frankfurt Am Main, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site RheinMain, Frankfurt, Germany
| | - Martina Krüger
- Institute of Cardiovascular Physiology, University Hospital Düsseldorf, Düsseldorf, Germany
- Cardiovascular Research Institute Düsseldorf (CARID), Düsseldorf, Germany
| | - Christoph Maack
- Comprehensive Heart Failure Center (CHFC), University Clinic Würzburg, Würzburg, Germany
- Medical Clinic 1, University Clinic Würzburg, Würzburg, Germany
| | - Katrin Streckfuss-Bömeke
- Clinic for Cardiology and Pneumology, Georg-August University Göttingen, Göttingen, Germany
- DZHK (German Center for Cardiovascular Research), Partner Site Göttingen, Göttingen, Germany
- Institute of Pharmacology and Toxicology, University of Würzburg, Würzburg, Germany
| | - Johannes Backs
- Institute of Experimental Cardiology, University Hospital Heidelberg, Heidelberg, Germany
- DZHK (German Center for Cardiovascular Research), Partner Site Heidelberg/Mannheim, Heidelberg, Germany
| | - Rolf Backofen
- Faculty of Medicine, Institute for Experimental and Clinical Pharmacology and Toxicology, Albert-Ludwigs-University, Freiburg, Germany
| | - Bettina Baeßler
- Department of Diagnostic and Interventional Radiology, University Hospital Würzburg, Würzburg, Germany
| | - Yvan Devaux
- Cardiovascular Research Unit, Department of Precision Health, Luxembourg Institute of Health, Strassen, Luxembourg
| | - Ralf Gilsbach
- Institute of Experimental Cardiology, University Hospital Heidelberg, Heidelberg, Germany
- DZHK (German Center for Cardiovascular Research), Partner Site Heidelberg/Mannheim, Heidelberg, Germany
| | - Jordi Heijman
- Department of Cardiology, CARIM School for Cardiovascular Diseases, Maastricht University, Maastricht, The Netherlands
| | - Jochen Knaus
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
| | - Rafael Kramann
- Institute of Experimental Medicine and Systems Biology, RWTH Aachen Medical Faculty, Aachen, Germany
- Department of Nephrology and Clinical Immunology, RWTH Aachen Medical Faculty, Aachen, Germany
- Department of Internal Medicine, Nephrology and Transplantation, Erasmus MC, Rotterdam, The Netherlands
| | - Dominik Linz
- Department of Cardiology, Maastricht University Medical Centre and Cardiovascular Research Institute Maastricht, Maastricht, The Netherlands
- Department of Biomedical Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Allyson L Lister
- Oxford E-Research Centre (OeRC), Department of Engineering Science, University of Oxford, Oxford, UK
| | - Henrike Maatz
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, Berlin, Germany
| | - Lars Maegdefessel
- DZHK (German Centre for Cardiovascular Research), Partner Site Munich Heart Alliance, Munich, Germany
- Department for Vascular and Endovascular Surgery, Klinikum Rechts Der Isar, Technical University Munich, Munich, Germany
- Department of Medicine, Karolinska Institute, Stockholm, Sweden
| | - Manuel Mayr
- School of Cardiovascular Medicine and Sciences, King's College London British Heart Foundation Centre, London, UK
- Division of Cardiology, Department of Internal Medicine II, Medical University of Vienna, Vienna, Austria
| | - Benjamin Meder
- DZHK (German Center for Cardiovascular Research), Partner Site Heidelberg/Mannheim, Heidelberg, Germany
- Department of Internal Medicine III (Cardiology, Angiology, and Pneumology), University Hospital Heidelberg, Heidelberg, Germany
| | - Sara Y Nussbeck
- Department of Medical Informatics, University Medical Center Göttingen (UMG), Göttingen, Germany
- Central Biobank UMG, UMG, Göttingen, Germany
| | - Eva A Rog-Zielinska
- Institute for Experimental Cardiovascular Medicine, University Heart Center Freiburg-Bad Krozingen, University of Freiburg, Freiburg, Germany
- Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Marcel H Schulz
- DZHK (German Centre for Cardiovascular Research), Partner Site RheinMain, Frankfurt, Germany
- Institute of Cardiovascular Regeneration, Goethe University, Frankfurt, Germany
| | - Albert Sickmann
- Leibniz-Institut Für Analytische Wissenschaften, ISAS, E.V., Dortmund, Germany
- Department of Chemistry, College of Physical Sciences, University of Aberdeen, Aberdeen, UK
- Institute for Virology, University Hospital Essen, University of Duisburg-Essen, Essen, Germany
| | - Gökhan Yigit
- Institute of Human Genetics, University Medical Center Göttingen, Göttingen, Germany
- German Center of Cardiovascular Research (DZHK), Partner Site Göttingen, Göttingen, Germany
| | - Peter Kohl
- Institute for Experimental Cardiovascular Medicine, University Heart Center Freiburg-Bad Krozingen, University of Freiburg, Freiburg, Germany.
- Faculty of Medicine, University of Freiburg, Freiburg, Germany.
- CIBSS Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg, Germany.
| |
Collapse
|
11
|
Anger M, Wendelborn C, Schickhardt C. German funders' data sharing policies-A qualitative interview study. PLoS One 2024; 19:e0296956. [PMID: 38330079 PMCID: PMC10852319 DOI: 10.1371/journal.pone.0296956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 12/21/2023] [Indexed: 02/10/2024] Open
Abstract
BACKGROUND Data sharing is commonly seen as beneficial for science but is not yet common practice. Research funding agencies are known to play a key role in promoting data sharing, but German funders' data sharing policies appear to lag behind in international comparison. This study aims to answer the question of how German data sharing experts inside and outside funding agencies perceive and evaluate German funders' data sharing policies and overall efforts to promote data sharing. METHODS This study is based on sixteen guided expert interviews with representatives of German funders and German research data experts from stakeholder organisations, who shared their perceptions of German' funders efforts to promote data sharing. By applying the method of qualitative content analysis to our interview data, we categorise and describe noteworthy aspects of the German data sharing policy landscape and illustrate our findings with interview passages. RESULTS We present our findings in five sections to distinguish our interviewees' perceptions on a) the status quo of German funders' data sharing policies, b) the role of funders in promoting data sharing, c) current and potential measures by funders to promote data sharing, d) general barriers to those measures, and e) the implementation of more binding data sharing requirements. DISCUSSION AND CONCLUSION Although funders are perceived to be important promoters and facilitators of data sharing throughout our interviews, only few German funding agencies have data sharing policies in place. Several interviewees stated that funders could do more, for example by providing incentives for data sharing or by introducing more concrete policies. Our interviews suggest the academic freedom of grantees is widely perceived as an obstacle for German funders in introducing mandatory data sharing requirements. However, some interviewees stated that stricter data sharing requirements could be justified if data sharing is a part of good scientific practice.
Collapse
Affiliation(s)
- Michael Anger
- Section for Translational Medical Ethics, Clinical Cooperation Unit Applied Tumor Immunity, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Christian Wendelborn
- Section for Translational Medical Ethics, Clinical Cooperation Unit Applied Tumor Immunity, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Christoph Schickhardt
- Section for Translational Medical Ethics, Clinical Cooperation Unit Applied Tumor Immunity, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
12
|
Riley M, Robinson K, Kilkenny MF, Leggat SG. The knowledge and reuse practices of researchers utilising government health information assets, Victoria, Australia, 2008-2020. PLoS One 2024; 19:e0297396. [PMID: 38300890 PMCID: PMC10833579 DOI: 10.1371/journal.pone.0297396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 01/04/2024] [Indexed: 02/03/2024] Open
Abstract
BACKGROUND Using government health datasets for secondary purposes is widespread; however, little is known on researchers' knowledge and reuse practices within Australia. OBJECTIVES To explore researchers' knowledge and experience of governance processes, and their data reuse practices, when using Victorian government health datasets for research between 2008-2020. METHOD A cross-sectional quantitative survey was conducted with authors who utilised selected Victorian, Australia, government health datasets for peer-reviewed research published between 2008-2020. Information was collected on researchers': data reuse practices; knowledge of government health information assets; perceptions of data trustworthiness for reuse; and demographic characteristics. RESULTS When researchers used government health datasets, 45% linked their data, 45% found the data access process easy and 27% found it difficult. Government-curated datasets were significantly more difficult to access compared to other-agency curated datasets (p = 0.009). Many respondents received their data in less than six months (58%), in aggregated or de-identified form (76%). Most reported performing their own data validation checks (70%). To assist in data reuse, almost 71% of researchers utilised (or created) contextual documentation, 69% a data dictionary, and 62% limitations documentation. Almost 20% of respondents were not aware if data quality information existed for the dataset they had accessed. Researchers reported data was managed by custodians with rigorous confidentiality/privacy processes (94%) and good data quality processes (76%), yet half lacked knowledge of what these processes entailed. Many respondents (78%) were unaware if dataset owners had obtained consent from the dataset subjects for research applications of the data. CONCLUSION Confidentiality/privacy processes and quality control activities undertaken by data custodians were well-regarded. Many respondents included data linkage to additional government datasets in their research. Ease of data access was variable. Some documentation types were well provided and used, but improvement is required for the provision of data quality statements and limitations documentation. Provision of information on participants' informed consent in a dataset is required.
Collapse
Affiliation(s)
- Merilyn Riley
- Department of Public Health, School of Psychology and Public Health, La Trobe University, Melbourne, Australia
| | - Kerin Robinson
- Department of Public Health, School of Psychology and Public Health, La Trobe University, Melbourne, Australia
| | - Monique F. Kilkenny
- Stroke and Ageing Research, Department of Medicine, School of Clinical Sciences at Monash Health, Monash University, Victoria, Australia
- Stroke Division, The Florey Institute of Neuroscience and Mental Health, Melbourne Brain Centre, University of Melbourne, Victoria, Australia
| | - Sandra G. Leggat
- Department of Public Health, School of Psychology and Public Health, La Trobe University, Melbourne, Australia
- School of Public Health and Tropical Medicine, James Cook University, Townsville, Australia
| |
Collapse
|
13
|
Belliard F, Maineri AM, Plomp E, Ramos Padilla AF, Sun J, Zare Jeddi M. Ten simple rules for starting FAIR discussions in your community. PLoS Comput Biol 2023; 19:e1011668. [PMID: 38096152 PMCID: PMC10721007 DOI: 10.1371/journal.pcbi.1011668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2023] Open
Abstract
This work presents 10 rules that provide guidance and recommendations on how to start up discussions around the implementation of the FAIR (Findable, Accessible, Interoperable, Reusable) principles and creation of standardised ways of working. These recommendations will be particularly relevant if you are unsure where to start, who to involve, what the benefits and barriers of standardisation are, and if little work has been done in your discipline to standardise research workflows. When applied, these rules will support a more effective way of engaging the community with discussions on standardisation and practical implementation of the FAIR principles.
Collapse
Affiliation(s)
| | - Angelica Maria Maineri
- Erasmus University Rotterdam—Erasmus School of Social and Behavioral Sciences/ODISSEI, Rotterdam, the Netherlands
| | - Esther Plomp
- Delft University of Technology, Faculty of Applied Sciences, Delft, the Netherlands
| | | | - Junzi Sun
- Faculty of Aerospace Engineering, Delft University of Technology, Delft, the Netherlands
| | - Maryam Zare Jeddi
- National Institute for Public Health and the Environment (RIVM), Bilthoven, the Netherlands
| |
Collapse
|
14
|
Ng MY, Youssef A, Miner AS, Sarellano D, Long J, Larson DB, Hernandez-Boussard T, Langlotz CP. Perceptions of Data Set Experts on Important Characteristics of Health Data Sets Ready for Machine Learning: A Qualitative Study. JAMA Netw Open 2023; 6:e2345892. [PMID: 38039004 PMCID: PMC10692863 DOI: 10.1001/jamanetworkopen.2023.45892] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 10/20/2023] [Indexed: 12/02/2023] Open
Abstract
Importance The lack of data quality frameworks to guide the development of artificial intelligence (AI)-ready data sets limits their usefulness for machine learning (ML) research in health care and hinders the diagnostic excellence of developed clinical AI applications for patient care. Objective To discern what constitutes high-quality and useful data sets for health and biomedical ML research purposes according to subject matter experts. Design, Setting, and Participants This qualitative study interviewed data set experts, particularly those who are creators and ML researchers. Semistructured interviews were conducted in English and remotely through a secure video conferencing platform between August 23, 2022, and January 5, 2023. A total of 93 experts were invited to participate. Twenty experts were enrolled and interviewed. Using purposive sampling, experts were affiliated with a diverse representation of 16 health data sets/databases across organizational sectors. Content analysis was used to evaluate survey information and thematic analysis was used to analyze interview data. Main Outcomes and Measures Data set experts' perceptions on what makes data sets AI ready. Results Participants included 20 data set experts (11 [55%] men; mean [SD] age, 42 [11] years), of whom all were health data set creators, and 18 of the 20 were also ML researchers. Themes (3 main and 11 subthemes) were identified and integrated into an AI-readiness framework to show their association within the health data ecosystem. Participants partially determined the AI readiness of data sets using priority appraisal elements of accuracy, completeness, consistency, and fitness. Ethical acquisition and societal impact emerged as appraisal considerations in that participant samples have not been described to date in prior data quality frameworks. Factors that drive creation of high-quality health data sets and mitigate risks associated with data reuse in ML research were also relevant to AI readiness. The state of data availability, data quality standards, documentation, team science, and incentivization were associated with elements of AI readiness and the overall perception of data set usefulness. Conclusions and Relevance In this qualitative study of data set experts, participants contributed to the development of a grounded framework for AI data set quality. Data set AI readiness required the concerted appraisal of many elements and the balancing of transparency and ethical reflection against pragmatic constraints. The movement toward more reliable, relevant, and ethical AI and ML applications for patient care will inevitably require strategic updates to data set creation practices.
Collapse
Affiliation(s)
- Madelena Y. Ng
- Department of Medicine (Biomedical Informatics), Stanford University School of Medicine, Stanford, California
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California
| | - Alaa Youssef
- Department of Radiology, Stanford University School of Medicine, Stanford, California
| | - Adam S. Miner
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, California
| | - Daniela Sarellano
- Department of Radiology, Stanford University School of Medicine, Stanford, California
| | - Jin Long
- Department of Pediatrics, Stanford University School of Medicine, Stanford, California
| | - David B. Larson
- Department of Radiology, Stanford University School of Medicine, Stanford, California
| | - Tina Hernandez-Boussard
- Department of Medicine (Biomedical Informatics), Stanford University School of Medicine, Stanford, California
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California
| | - Curtis P. Langlotz
- Department of Medicine (Biomedical Informatics), Stanford University School of Medicine, Stanford, California
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California
- Department of Radiology, Stanford University School of Medicine, Stanford, California
| |
Collapse
|
15
|
Lagisz M, Aich U, Amin B, Rutkowska J, Sánchez-Mercado A, Lara CE, Nakagawa S. Little transparency and equity in scientific awards for early- and mid-career researchers in ecology and evolution. Nat Ecol Evol 2023; 7:655-665. [PMID: 37012379 DOI: 10.1038/s41559-023-02028-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 02/21/2023] [Indexed: 04/05/2023]
Abstract
Scientific awards can shape scientific careers, helping to secure jobs and grants, but can also contribute to the lack of diversity at senior levels and in the elite networks of scientists. To assess the status quo and historical trends, we evaluated 'best researcher' awards and 'best paper' early- and mid-career awards from broad-scope international journals and societies in ecology and evolution. Specifically, we collated information on eligibility rules, assessment criteria and potential gender bias. Our results reveal that, overall, few awards foster equitable access and assessment. Although many awards now explicitly allow extensions of the eligibility period for substantial career interruptions, there is a general lack of transparency in terms of assessment and consideration of other differences in access to opportunities and resources among junior researchers. Strikingly, open science practices were mentioned and valued in only one award. By highlighting instances of desirable award characteristics, we hope this work will nudge award committees to shift from simple but non-equitable award policies and practices towards strategies enhancing inclusivity and diversity. Such a shift would benefit not only those at the early- and mid-career stages but the whole research community. It is also an untapped opportunity to reward open science practices, promoting transparent and robust science.
Collapse
|
16
|
Waithira N, Kestelyn E, Chotthanawathit K, Osterrieder A, Mukaka M, Lang T, Cheah PY. Investigating the Secondary Use of Clinical Research Data: Protocol for a Mixed Methods Study. JMIR Res Protoc 2023; 12:e44875. [PMID: 36877564 PMCID: PMC10028503 DOI: 10.2196/44875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 02/10/2023] [Accepted: 02/14/2023] [Indexed: 02/16/2023] Open
Abstract
BACKGROUND The increasing emphasis to share patient data from clinical research has resulted in substantial investments in data repositories and infrastructure. However, it is unclear how shared data are used and whether anticipated benefits are being realized. OBJECTIVE The purpose of our study is to examine the current utilization of shared clinical research data sets and assess the effects on both scientific research and public health outcomes. Additionally, the study seeks to identify the factors that hinder or facilitate the ethical and efficient use of existing data based on the perspectives of data users. METHODS The study will utilize a mixed methods design, incorporating a cross-sectional survey and in-depth interviews. The survey will involve at least 400 clinical researchers, while the in-depth interviews will include 20 to 40 participants who have utilized data from repositories or institutional data access committees. The survey will target a global sample, while the in-depth interviews will focus on individuals who have used data collected from low- and middle-income countries. Quantitative data will be summarized by using descriptive statistics, while multivariable analyses will be used to assess the relationships between variables. Qualitative data will be analyzed through thematic analysis, and the findings will be reported in accordance with the COREQ (Consolidated Criteria for Reporting Qualitative Research) guidelines. The study received ethical approval from the Oxford Tropical Research Ethics Committee in 2020 (reference number: 568-20). RESULTS The results of the analysis, including both quantitative data and qualitative data, will be available in 2023. CONCLUSIONS The outcomes of our study will offer crucial understanding into the current status of data reuse in clinical research, serving as a basis for guiding future endeavors to enhance the utilization of shared data for the betterment of public health outcomes and for scientific progress. TRIAL REGISTRATION Thai Clinical Trials Registry TCTR20210301006; https://tinyurl.com/2p9atzhr. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) DERR1-10.2196/44875.
Collapse
Affiliation(s)
- Naomi Waithira
- Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
- Mahidol-Oxford Tropical Medicine Research Unit, Bangkok, Thailand
| | - Evelyne Kestelyn
- Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
- Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
| | | | - Anne Osterrieder
- Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Mavuto Mukaka
- Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
- Mahidol-Oxford Tropical Medicine Research Unit, Bangkok, Thailand
| | - Trudie Lang
- Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Phaik Yeong Cheah
- Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
- Mahidol-Oxford Tropical Medicine Research Unit, Bangkok, Thailand
| |
Collapse
|
17
|
'Small Data' for big insights in ecology. Trends Ecol Evol 2023:S0169-5347(23)00019-8. [PMID: 36797167 DOI: 10.1016/j.tree.2023.01.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 01/18/2023] [Accepted: 01/25/2023] [Indexed: 02/17/2023]
Abstract
Big Data science has significantly furthered our understanding of complex systems by harnessing large volumes of data, generated at high velocity and in great variety. However, there is a risk that Big Data collection is prioritised to the detriment of 'Small Data' (data with few observations). This poses a particular risk to ecology where Small Data abounds. Machine learning experts are increasingly looking to Small Data to drive the next generation of innovation, leading to development in methods for Small Data such as transfer learning, knowledge graphs, and synthetic data. Meanwhile, meta-analysis and causal reasoning approaches are evolving to provide new insights from Small Data. These advances should add value to high-quality Small Data catalysing future insights for ecology.
Collapse
|
18
|
Dos Santos Rocha A, Albrecht E, El-Boghdadly K. Open science should be a pleonasm. Anaesthesia 2023; 78:551-556. [PMID: 36625412 DOI: 10.1111/anae.15962] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/16/2022] [Indexed: 01/11/2023]
Affiliation(s)
- A Dos Santos Rocha
- Department of Anaesthesia, University Hospital of Lausanne and University of Lausanne, Switzerland
| | - E Albrecht
- Department of Anaesthesia, University Hospital of Lausanne and University of Lausanne, Switzerland
| | - K El-Boghdadly
- Department of Anaesthesia and Peri-operative Medicine, Guy's and St Thomas' NHS Foundation Trust, London, UK.,King's College London, UK
| |
Collapse
|
19
|
Gomes DGE, Pottier P, Crystal-Ornelas R, Hudgins EJ, Foroughirad V, Sánchez-Reyes LL, Turba R, Martinez PA, Moreau D, Bertram MG, Smout CA, Gaynor KM. Why don't we share data and code? Perceived barriers and benefits to public archiving practices. Proc Biol Sci 2022; 289:20221113. [PMID: 36416041 PMCID: PMC9682438 DOI: 10.1098/rspb.2022.1113] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 11/02/2022] [Indexed: 08/10/2023] Open
Abstract
The biological sciences community is increasingly recognizing the value of open, reproducible and transparent research practices for science and society at large. Despite this recognition, many researchers fail to share their data and code publicly. This pattern may arise from knowledge barriers about how to archive data and code, concerns about its reuse, and misaligned career incentives. Here, we define, categorize and discuss barriers to data and code sharing that are relevant to many research fields. We explore how real and perceived barriers might be overcome or reframed in the light of the benefits relative to costs. By elucidating these barriers and the contexts in which they arise, we can take steps to mitigate them and align our actions with the goals of open science, both as individual scientists and as a scientific community.
Collapse
Affiliation(s)
- Dylan G. E. Gomes
- NRC Research Associate, Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, WA 98112, USA
- Cooperative Institute for Marine Resources Studies, Hatfield Marine Science Center, Oregon State University, Newport, OR 97365, USA
| | - Patrice Pottier
- Evolution & Ecology Research Centre, School of Biological, Earth and Environmental Sciences, The University of New South Wales, Sydney, New South Wales 2052, Australia
| | - Robert Crystal-Ornelas
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Emma J. Hudgins
- Department of Biology, Carleton University, Ottawa, Canada, K1S 5B6
| | | | | | - Rachel Turba
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095-7239, USA
| | - Paula Andrea Martinez
- Australian Research Data Commons, The University of Queensland, Brisbane 4072, Australia
| | - David Moreau
- School of Psychology and Centre for Brain Research, University of Auckland, Auckland 1010, New Zealand
| | - Michael G. Bertram
- Department of Wildlife, Fish, and Environmental Studies, Swedish University of Agricultural Sciences, Umeå, SE-907 36, Sweden
| | - Cooper A. Smout
- Institute for Globally Distributed Open Research and Education (IGDORE), Brisbane 4001, Australia
| | - Kaitlyn M. Gaynor
- Departments of Zoology and Botany, University of British Columbia, Vancouver, Canada, BC V6T 1Z4
- National Center for Ecological Analysis and Synthesis, Santa Barbara, CA 93101, USA
| |
Collapse
|
20
|
Crystal-Ornelas R, Varadharajan C, O'Ryan D, Beilsmith K, Bond-Lamberty B, Boye K, Burrus M, Cholia S, Christianson DS, Crow M, Damerow J, Ely KS, Goldman AE, Heinz SL, Hendrix VC, Kakalia Z, Mathes K, O'Brien F, Pennington SC, Robles E, Rogers A, Simmonds M, Velliquette T, Weisenhorn P, Welch JN, Whitenack K, Agarwal DA. Enabling FAIR data in Earth and environmental science with community-centric (meta)data reporting formats. Sci Data 2022; 9:700. [PMID: 36376356 PMCID: PMC9663825 DOI: 10.1038/s41597-022-01606-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 08/01/2022] [Indexed: 11/16/2022] Open
Abstract
Research can be more transparent and collaborative by using Findable, Accessible, Interoperable, and Reusable (FAIR) principles to publish Earth and environmental science data. Reporting formats-instructions, templates, and tools for consistently formatting data within a discipline-can help make data more accessible and reusable. However, the immense diversity of data types across Earth science disciplines makes development and adoption challenging. Here, we describe 11 community reporting formats for a diverse set of Earth science (meta)data including cross-domain metadata (dataset metadata, location metadata, sample metadata), file-formatting guidelines (file-level metadata, CSV files, terrestrial model data archiving), and domain-specific reporting formats for some biological, geochemical, and hydrological data (amplicon abundance tables, leaf-level gas exchange, soil respiration, water and sediment chemistry, sensor-based hydrologic measurements). More broadly, we provide guidelines that communities can use to create new (meta)data formats that integrate with their scientific workflows. Such reporting formats have the potential to accelerate scientific discovery and predictions by making it easier for data contributors to provide (meta)data that are more interoperable and reusable.
Collapse
Affiliation(s)
- Robert Crystal-Ornelas
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Github, San Francisco, CA, 94107, USA
| | - Charuleka Varadharajan
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
| | - Dylan O'Ryan
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Environmental Studies Department, California State University, Sacramento, 6000 Jed Smith Dr, Sacramento, CA, 95819, USA
| | | | - Benjamin Bond-Lamberty
- Pacific Northwest National Laboratory, Joint Global Change Research Institute at the University of Maryland-College Park, College Park, MD, 20740, USA
| | - Kristin Boye
- Environmental Geochemistry Group, SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, CA, 94025, USA
| | - Madison Burrus
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Shreyas Cholia
- Scientific Data Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | | | - Michael Crow
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37830, USA
| | - Joan Damerow
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Kim S Ely
- Environmental and Climate Sciences Department, Brookhaven National Laboratory, Upton, NY, 11973, USA
| | - Amy E Goldman
- Pacific Northwest National Laboratory, Richland, WA, 99354, USA
| | - Susan L Heinz
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37830, USA
| | - Valerie C Hendrix
- Scientific Data Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Zarine Kakalia
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Kayla Mathes
- Integrated Life Sciences, Virginia Commonwealth University, Richmond, VA, 23284, USA
| | - Fianna O'Brien
- Scientific Data Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Stephanie C Pennington
- Pacific Northwest National Laboratory, Joint Global Change Research Institute at the University of Maryland-College Park, College Park, MD, 20740, USA
| | - Emily Robles
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Alistair Rogers
- Environmental and Climate Sciences Department, Brookhaven National Laboratory, Upton, NY, 11973, USA
| | - Maegen Simmonds
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Pivot Bio, 2910 Seventh Street, Berkeley, CA, 94710, USA
| | - Terri Velliquette
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37830, USA
| | | | - Jessica Nicole Welch
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37830, USA
| | - Karen Whitenack
- Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Deborah A Agarwal
- Scientific Data Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| |
Collapse
|
21
|
Anger M, Wendelborn C, Winkler EC, Schickhardt C. Neither carrots nor sticks? Challenges surrounding data sharing from the perspective of research funding agencies-A qualitative expert interview study. PLoS One 2022; 17:e0273259. [PMID: 36070283 PMCID: PMC9451069 DOI: 10.1371/journal.pone.0273259] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 08/04/2022] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Data Sharing is widely recognised as crucial for accelerating scientific research and improving its quality. However, data sharing is still not a common practice. Funding agencies tend to facilitate the sharing of research data by both providing incentives and requiring data sharing as part of their policies and conditions for awarding grants. The goal of our article is to answer the following question: What challenges do international funding agencies see when it comes to their own efforts to foster and implement data sharing through their policies? METHODS We conducted a series of sixteen guideline-based expert interviews with representatives of leading international funding agencies. As contact persons for open science at their respective agencies, they offered their perspectives and experiences concerning their organisations' data sharing policies. We performed a qualitative content analysis of the interviews and categorised the challenges perceived by funding agencies. RESULTS We identify and illustrate six challenges surrounding data sharing policies as perceived by leading funding agencies: The design of clear policies, monitoring of compliance, sanctions for non-compliance, incentives, support, and limitations for funders' own capabilities. However, our interviews also show how funders approach potential solutions to overcome these challenges, for example by coordinating with other agencies or adjusting grant evaluation metrics to incentivise data sharing. DISCUSSION AND CONCLUSION Our interviews point to existing flaws in funders' data sharing policies, such as a lack of clarity, a lack of monitoring of funded researchers' data sharing behaviour, and a lack of incentives. A number of agencies could suggest potential solutions but often struggle with the overall complexity of data sharing and the implementation of these measures. Funders cannot solve each challenge by themselves, but they can play an active role and lead joint efforts towards a culture of data sharing.
Collapse
Affiliation(s)
- Michael Anger
- Section for Translational Medical Ethics, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Christian Wendelborn
- Section for Translational Medical Ethics, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Eva C. Winkler
- Section for Translational Medical Ethics/Department of Medical Oncology, National Center for Tumor Diseases (NCT) Heidelberg, University Hospital Heidelberg, Heidelberg, Germany
| | - Christoph Schickhardt
- Section for Translational Medical Ethics, National Center for Tumor Diseases (NCT) Heidelberg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
22
|
Cadwallader L, Hrynaszkiewicz I. A survey of researchers' code sharing and code reuse practices, and assessment of interactive notebook prototypes. PeerJ 2022; 10:e13933. [PMID: 36032954 PMCID: PMC9406794 DOI: 10.7717/peerj.13933] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 08/01/2022] [Indexed: 01/19/2023] Open
Abstract
This research aimed to understand the needs and habits of researchers in relation to code sharing and reuse; gather feedback on prototype code notebooks created by NeuroLibre; and help determine strategies that publishers could use to increase code sharing. We surveyed 188 researchers in computational biology. Respondents were asked about how often and why they look at code, which methods of accessing code they find useful and why, what aspects of code sharing are important to them, and how satisfied they are with their ability to complete these tasks. Respondents were asked to look at a prototype code notebook and give feedback on its features. Respondents were also asked how much time they spent preparing code and if they would be willing to increase this to use a code sharing tool, such as a notebook. As a reader of research articles the most common reason (70%) for looking at code was to gain a better understanding of the article. The most commonly encountered method for code sharing-linking articles to a code repository-was also the most useful method of accessing code from the reader's perspective. As authors, the respondents were largely satisfied with their ability to carry out tasks related to code sharing. The most important of these tasks were ensuring that the code was running in the correct environment, and sharing code with good documentation. The average researcher, according to our results, is unwilling to incur additional costs (in time, effort or expenditure) that are currently needed to use code sharing tools alongside a publication. We infer this means we need different models for funding and producing interactive or executable research outputs if they are to reach a large number of researchers. For the purpose of increasing the amount of code shared by authors, PLOS Computational Biology is, as a result, focusing on policy rather than tools.
Collapse
|
23
|
Gilmore RO. Show your work: Tools for open developmental science. ADVANCES IN CHILD DEVELOPMENT AND BEHAVIOR 2022; 62:37-59. [PMID: 35249685 DOI: 10.1016/bs.acdb.2022.01.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Since grade school, students of many subjects have learned to "show their work" in order to receive full credit for assignments. Many of the reasons for students to show their work extend to the conduct of scientific research. And yet multiple barriers make it challenging to share and show the products of scientific work beyond published findings. This chapter discusses some of these barriers and how web-based data repositories help overcome them. The focus is on Databrary.org, a data library specialized for storing and sharing video data with a restricted community of institutionally approved investigators. Databrary was designed by and for developmental researchers, and so its features and policies reflect many of the specific challenges faced by this community, especially those associated with sharing video and related identifiable data. The chapter argues that developmental science poses some of the most interesting, challenging, and important questions in all of science, and that by openly sharing much more of the products and processes of our work, developmental scientists can accelerate discovery while making our scholarship much more robust and reproducible.
Collapse
Affiliation(s)
- Rick O Gilmore
- Department of Psychology, The Pennsylvania State University, Databrary.org, University Park, PA, United States.
| |
Collapse
|
24
|
Forero DA, Curioso WH, Patrinos GP. The importance of adherence to international standards for depositing open data in public repositories. BMC Res Notes 2021; 14:405. [PMID: 34727971 PMCID: PMC8561348 DOI: 10.1186/s13104-021-05817-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 10/22/2021] [Indexed: 12/14/2022] Open
Abstract
There has been an important global interest in Open Science, which include open data and methods, in addition to open access publications. It has been proposed that public availability of raw data increases the value and the possibility of confirmation of scientific findings, in addition to the potential of reducing research waste. Availability of raw data in open repositories facilitates the adequate development of meta-analysis and the cumulative evaluation of evidence for specific topics. In this commentary, we discuss key elements about data sharing in open repositories and we invite researchers around the world to deposit their data in them.
Collapse
Affiliation(s)
- Diego A Forero
- Health and Sport Sciences Research Group, School of Health and Sport Sciences, Fundación Universitaria del Área Andina, Bogotá, Colombia. .,Professional Program in Respiratory Therapy, School of Health and Sport Sciences, Fundación Universitaria del Área Andina, Bogotá, Colombia.
| | - Walter H Curioso
- Vicerrectorado de Investigación, Universidad Continental, Lima, Peru
| | - George P Patrinos
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece.,Department of Pathology, College of Medicine and Health Sciences, United Arab Emirates University, Al-Ain, UAE.,Zayed Center for Health Sciences, United Arab Emirates University, Al-Ain, UAE
| |
Collapse
|
25
|
Knowledge syntheses in medical education: Meta-research examining author gender, geographic location, and institutional affiliation. PLoS One 2021; 16:e0258925. [PMID: 34699558 PMCID: PMC8547645 DOI: 10.1371/journal.pone.0258925] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 10/10/2021] [Indexed: 11/21/2022] Open
Abstract
Introduction Authors of knowledge syntheses make many subjective decisions during their review process. Those decisions, which are guided in part by author characteristics, can impact the conduct and conclusions of knowledge syntheses, which assimilate much of the evidence base in medical education. To better understand the evidence base, this study describes the characteristics of knowledge synthesis authors, focusing on gender, geography, and institution. Methods In 2020, the authors conducted meta-research to examine authors of 963 knowledge syntheses published between 1999 and 2019 in 14 core medical education journals. Results The authors identified 4,110 manuscript authors across all authorship positions. On average there were 4.3 authors per knowledge synthesis (SD = 2.51, Median = 4, Range = 1–22); 79 knowledge syntheses (8%) were single-author publications. Over time, the average number of authors per synthesis increased (M = 1.80 in 1999; M = 5.34 in 2019). Knowledge syntheses were authored by slightly more females (n = 2047; 50.5%) than males (n = 2005; 49.5%) across all author positions. Authors listed affiliations in 58 countries, and 58 knowledge syntheses (6%) included authors from low- or middle-income countries. Authors from the United States (n = 366; 38%), Canada (n = 233; 24%), and the United Kingdom (n = 180; 19%) published the most knowledge syntheses. Authors listed affiliation at 617 unique institutions, and first authors represented 362 unique institutions with greatest representation from University of Toronto (n = 55, 6%). Across all authorship positions, the large majority of knowledge syntheses (n = 753; 78%) included authors from institutions ranked in the top 200 globally. Conclusion Knowledge synthesis author teams have grown over the past 20 years, and while there is near gender parity across all author positions, authorship has been dominated by North American researchers located at highly ranked institutions. This suggests a potential overrepresentation of certain authors with particular characteristics, which may impact the conduct and conclusions of medical education knowledge syntheses.
Collapse
|
26
|
Meyer A, Faverjon C, Hostens M, Stegeman A, Cameron A. Systematic review of the status of veterinary epidemiological research in two species regarding the FAIR guiding principles. BMC Vet Res 2021; 17:270. [PMID: 34380468 PMCID: PMC8355576 DOI: 10.1186/s12917-021-02971-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 07/06/2021] [Indexed: 01/08/2023] Open
Abstract
Background The FAIR (Findable, Accessible, Interoperable, Reusable) principles were proposed in 2016 to set a path towards reusability of research datasets. In this systematic review, we assessed the FAIRness of datasets associated with peer-reviewed articles in veterinary epidemiology research published since 2017, specifically looking at salmonids and dairy cattle. We considered the differences in practices between molecular epidemiology, the branch of epidemiology using genetic sequences of pathogens and hosts to describe disease patterns, and non-molecular epidemiology. Results A total of 152 articles were included in the assessment. Consistent with previous assessments conducted in other disciplines, our results showed that most datasets used in non-molecular epidemiological studies were not available (i.e., neither findable nor accessible). Data availability was much higher for molecular epidemiology papers, in line with a strong repository base available to scientists in this discipline. The available data objects generally scored favourably for Findable, Accessible and Reusable indicators, but Interoperability was more problematic. Conclusions None of the datasets assessed in this study met all the requirements set by the FAIR principles. Interoperability, in particular, requires specific skills in data management which may not yet be broadly available in the epidemiology community. In the discussion, we present recommendations on how veterinary research could move towards greater reusability according to FAIR principles. Overall, although many initiatives to improve data access have been started in the research community, their impact on the availability of datasets underlying published articles remains unclear to date. Supplementary Information The online version contains supplementary material available at 10.1186/s12917-021-02971-1.
Collapse
Affiliation(s)
- Anne Meyer
- Ausvet Europe, 3 rue Camille Jordan, 69001, Lyon, France. .,Department of Farm Animal Health, Utrecht University, 3512 JE, Utrecht, the Netherlands.
| | | | - Miel Hostens
- Department of Farm Animal Health, Utrecht University, 3512 JE, Utrecht, the Netherlands
| | - Arjan Stegeman
- Department of Farm Animal Health, Utrecht University, 3512 JE, Utrecht, the Netherlands
| | - Angus Cameron
- Ausvet Europe, 3 rue Camille Jordan, 69001, Lyon, France
| |
Collapse
|
27
|
Thoegersen JL, Borlund P. Researcher attitudes toward data sharing in public data repositories: a meta-evaluation of studies on researcher data sharing. JOURNAL OF DOCUMENTATION 2021. [DOI: 10.1108/jd-01-2021-0015] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
PurposeThe purpose of this paper is to report a study of how research literature addresses researchers' attitudes toward data repository use. In particular, the authors are interested in how the term data sharing is defined, how data repository use is reported and whether there is need for greater clarity and specificity of terminology.Design/methodology/approachTo study how the literature addresses researcher data repository use, relevant studies were identified by searching Library Information Science and Technology Abstracts, Library and Information Science Source, Thomas Reuters' Web of Science Core Collection and Scopus. A total of 62 studies were identified for inclusion in this meta-evaluation.FindingsThe study shows a need for greater clarity and consistency in the use of the term data sharing in future studies to better understand the phenomenon and allow for cross-study comparisons. Furthermore, most studies did not address data repository use specifically. In most analyzed studies, it was not possible to segregate results relating to sharing via public data repositories from other types of sharing. When sharing in public repositories was mentioned, the prevalence of repository use varied significantly.Originality/valueResearchers' data sharing is of great interest to library and information science research and practice to inform academic libraries that are implementing data services to support these researchers. This study explores how the literature approaches this issue, especially the use of data repositories, the use of which is strongly encouraged. This paper identifies the potential for additional study focused on this area.
Collapse
|
28
|
Page MJ, Moher D, Fidler FM, Higgins JPT, Brennan SE, Haddaway NR, Hamilton DG, Kanukula R, Karunananthan S, Maxwell LJ, McDonald S, Nakagawa S, Nunan D, Tugwell P, Welch VA, McKenzie JE. The REPRISE project: protocol for an evaluation of REProducibility and Replicability In Syntheses of Evidence. Syst Rev 2021; 10:112. [PMID: 33863381 PMCID: PMC8052676 DOI: 10.1186/s13643-021-01670-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 04/07/2021] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Investigations of transparency, reproducibility and replicability in science have been directed largely at individual studies. It is just as critical to explore these issues in syntheses of studies, such as systematic reviews, given their influence on decision-making and future research. We aim to explore various aspects relating to the transparency, reproducibility and replicability of several components of systematic reviews with meta-analysis of the effects of health, social, behavioural and educational interventions. METHODS The REPRISE (REProducibility and Replicability In Syntheses of Evidence) project consists of four studies. We will evaluate the completeness of reporting and sharing of review data, analytic code and other materials in a random sample of 300 systematic reviews of interventions published in 2020 (Study 1). We will survey authors of systematic reviews to explore their views on sharing review data, analytic code and other materials and their understanding of and opinions about replication of systematic reviews (Study 2). We will then evaluate the extent of variation in results when we (a) independently reproduce meta-analyses using the same computational steps and analytic code (if available) as used in the original review (Study 3), and (b) crowdsource teams of systematic reviewers to independently replicate a subset of methods (searches for studies, selection of studies for inclusion, collection of outcome data, and synthesis of results) in a sample of the original reviews; 30 reviews will be replicated by 1 team each and 2 reviews will be replicated by 15 teams (Study 4). DISCUSSION The REPRISE project takes a systematic approach to determine how reliable systematic reviews of interventions are. We anticipate that results of the REPRISE project will inform strategies to improve the conduct and reporting of future systematic reviews.
Collapse
Affiliation(s)
- Matthew J Page
- School of Public Health and Preventive Medicine, Monash University, 553 St. Kilda Road, Melbourne, Victoria, 3004, Australia.
| | - David Moher
- Centre for Journalology, Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
- School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Fiona M Fidler
- School of BioSciences, University of Melbourne, Melbourne, Australia
- School of Historical and Philosophical Studies, University of Melbourne, Melbourne, Australia
| | - Julian P T Higgins
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Sue E Brennan
- School of Public Health and Preventive Medicine, Monash University, 553 St. Kilda Road, Melbourne, Victoria, 3004, Australia
| | - Neal R Haddaway
- Mercator Research Institute on Global Commons and Climate Change, Berlin, Germany
- African Centre for Evidence, University of Johannesburg, Johannesburg, South Africa
- Stockholm Environment Institute, Linnégatan 87D, Stockholm, Sweden
- The SEI Centre of the Collaboration for Environmental Evidence, Stockholm, Sweden
| | - Daniel G Hamilton
- School of BioSciences, University of Melbourne, Melbourne, Australia
| | - Raju Kanukula
- School of Public Health and Preventive Medicine, Monash University, 553 St. Kilda Road, Melbourne, Victoria, 3004, Australia
| | - Sathya Karunananthan
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Lara J Maxwell
- Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Steve McDonald
- School of Public Health and Preventive Medicine, Monash University, 553 St. Kilda Road, Melbourne, Victoria, 3004, Australia
| | - Shinichi Nakagawa
- Evolution & Ecology Research Centre and School of Biological, Earth and Environmental Sciences, University of New South Wales, Sydney, Australia
| | - David Nunan
- Centre for Evidence-Based Medicine, Oxford University, Oxford, UK
| | - Peter Tugwell
- School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, Canada
- Department of Medicine, Faculty of Medicine, University of Ottawa, Ottawa, Canada
- Bruyère Research Institute, Ottawa, Canada
| | - Vivian A Welch
- School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, Canada
- Bruyère Research Institute, Ottawa, Canada
| | - Joanne E McKenzie
- School of Public Health and Preventive Medicine, Monash University, 553 St. Kilda Road, Melbourne, Victoria, 3004, Australia
| |
Collapse
|
29
|
Zhuang J, Sun H, Sayler G, Kline KL, Dale VH, Jin M, Yu G, Fu B, Löffler FE. Food-Energy-Water Crises in the United States and China: Commonalities and Asynchronous Experiences Support Integration of Global Efforts. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2021; 55:1446-1455. [PMID: 33442981 DOI: 10.1021/acs.est.0c06607] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Food, energy, and water (FEW) systems have been recognized as an issue of critical global importance. Understanding the mechanisms that govern the FEW nexus is essential to develop solutions and avoid humanitarian crises of displacement, famine, and disease. The U.S. and China are the world's leading economies. Although the two nations are shaped by fundamentally different political and economic systems, they share FEW trajectories in several complementary ways. These realities place the U.S. and China in unique positions to engage in problem definition, dialogue, actions, and transdisciplinary convergence of research to achieve productive solutions addressing FEW challenges. By comparing the nexus and functions of the FEW systems in the two nations, this perspective aims to facilitate collaborative innovations that reduce disciplinary silos, mitigate FEW challenges, and enhance environmental sustainability. The review of experiences and challenges facing the U.S. and China also offers valuable insights for other nations seeking to achieve sustainable development goals.
Collapse
Affiliation(s)
- Jie Zhuang
- Department of Biosystems Engineering and Soil Science, The University of Tennessee, Knoxville, Tennessee 37996, United States
- Center for Environmental Biotechnology, The University of Tennessee, Knoxville, Tennessee 37996, United States
| | - Huihui Sun
- Department of Biosystems Engineering and Soil Science, The University of Tennessee, Knoxville, Tennessee 37996, United States
| | - Gary Sayler
- Center for Environmental Biotechnology, The University of Tennessee, Knoxville, Tennessee 37996, United States
| | - Keith L Kline
- Environmental Sciences Division, Climate Change Science Institute, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| | - Virginia H Dale
- Environmental Sciences Division, Climate Change Science Institute, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
- Department of Ecology & Evolutionary Biology, The University of Tennessee, Knoxville, Tennessee 37996, United States
| | | | - Guirui Yu
- Institute of Geographic Science and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
| | - Bojie Fu
- Research Center for Eco-environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Frank E Löffler
- Department of Biosystems Engineering and Soil Science, The University of Tennessee, Knoxville, Tennessee 37996, United States
- Center for Environmental Biotechnology, The University of Tennessee, Knoxville, Tennessee 37996, United States
- Department of Ecology & Evolutionary Biology, The University of Tennessee, Knoxville, Tennessee 37996, United States
- Department of Microbiology, The University of Tennessee, Knoxville, Tennessee 37996, United States
- Department of Civil and Environmental Engineering, The University of Tennessee, Knoxville, Tennessee 37996, United States
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| |
Collapse
|
30
|
Lorenz TK, Holland KJ. Response to Sakaluk (2020): Let's Get Serious About Including Qualitative Researchers in the Open Science Conversation. ARCHIVES OF SEXUAL BEHAVIOR 2020; 49:2761-2763. [PMID: 33029751 DOI: 10.1007/s10508-020-01851-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 09/21/2020] [Accepted: 09/23/2020] [Indexed: 05/15/2023]
Affiliation(s)
- Tierney K Lorenz
- Department of Psychology, University of Nebraska-Lincoln, 238 Burnett Hall, Lincoln, NE, 68588-0156, USA.
- Center for Brain, Biology and Behavior, University of Nebraska-Lincoln, Lincoln, NE, USA.
| | - Kathryn J Holland
- Department of Psychology, University of Nebraska-Lincoln, 238 Burnett Hall, Lincoln, NE, 68588-0156, USA
- Women's and Gender Studies Program, University of Nebraska-Lincoln, Lincoln, NE, USA
| |
Collapse
|
31
|
Correction: The views, perspectives, and experiences of academic researchers with data sharing and reuse: A meta-synthesis. PLoS One 2020; 15:e0234275. [PMID: 32484828 PMCID: PMC7266347 DOI: 10.1371/journal.pone.0234275] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
[This corrects the article DOI: 10.1371/journal.pone.0229182.].
Collapse
|
32
|
COllaborative open platform E-cohorts for research acceleration in trials and epidemiology. J Clin Epidemiol 2020; 124:139-148. [PMID: 32380177 DOI: 10.1016/j.jclinepi.2020.04.021] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 04/17/2020] [Accepted: 04/28/2020] [Indexed: 01/07/2023]
Abstract
BACKGROUND The current clinical research system relies on a "one-off" project-by-project model involving a costly and time-wasting permanent construction and deconstruction of the research infrastructure. We propose a new model of research relying on collaborative principles: the COllaborative Open Platform (COOP') e-cohort. DEVELOPMENT The COOP' e-cohort aims at building a large community of patients willing to participate in research by contributing to the generation of a large database of patient-reported data, passively enriched, at the individual level, by linkage with routinely collected care and/or medico-administrative data. Approved teams can use the platform and benefit from already enrolled participants or collected data or add new online questionnaires to perform observational or interventional studies to answer a broad range of research questions. APPLICATION The Community of Patients for Research (ComPaRe) is a proof-of-concept COOP' e-cohort in the field of chronic conditions that was launched in 2017. As of April 2020, 36,000 patients have joined the project and contributed to more than 4 million data points. Patient-reported data will be enriched by linkage with the French national health system databases and with hospital data for patients receiving care in the Paris region. Since 2017, 150 researchers have used the platform for research projects. Three clinical trials nested in ComPaRe have been funded. CONCLUSION By moving from myriad independent studies to a large collaborative infrastructure of research, COOP' e-cohorts will accelerate the research process by avoiding the redundancy of many steps common to all research projects and by limiting waste of research.
Collapse
|