1
|
Byers N, Parker C, Beecroft C, Reddy TBK, Salamon H, Garrity G, Fagnan K. Identifying genomic data use with the Data Citation Explorer. Sci Data 2024; 11:1200. [PMID: 39505888 PMCID: PMC11541499 DOI: 10.1038/s41597-024-04049-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 10/28/2024] [Indexed: 11/08/2024] Open
Abstract
Increases in sequencing capacity, combined with rapid accumulation of publications and associated data resources, have increased the complexity of maintaining associations between literature and genomic data. As the volume of literature and data have exceeded the capacity of manual curation, automated approaches to maintaining and confirming associations among these resources have become necessary. Here we present the Data Citation Explorer (DCE), which discovers literature incorporating genomic data that was not formally cited. This service provides advantages over manual curation methods including consistent resource coverage, metadata enrichment, documentation of new use cases, and identification of conflicting metadata. The service reduces labor costs associated with manual review, improves the quality of genome metadata maintained by the U.S. Department of Energy Joint Genome Institute (JGI), and increases the number of known publications that incorporate its data products. The DCE facilitates an understanding of JGI impact, improves credit attribution for data generators, and can encourage data sharing by allowing scientists to see how reuse amplifies the impact of their original studies.
Collapse
Affiliation(s)
- Neil Byers
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California, 94720, USA
| | - Charles Parker
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California, 94720, USA
| | - Chris Beecroft
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California, 94720, USA
| | - T B K Reddy
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California, 94720, USA
| | - Hugh Salamon
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California, 94720, USA
| | - George Garrity
- Michigan State University, Department of Microbiology & Molecular Genetics, East Lansing, Michigan, 48824, USA
| | - Kjiersten Fagnan
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California, 94720, USA.
| |
Collapse
|
2
|
Emanuele E, Minoretti P. Measuring the Impact of Data Sharing: From Author-Level Metrics to Quantification of Economic and Non-tangible Benefits. Cureus 2023; 15:e50308. [PMID: 38205488 PMCID: PMC10777335 DOI: 10.7759/cureus.50308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/10/2023] [Indexed: 01/12/2024] Open
Abstract
In early 2023, the National Institutes of Health (NIH) implemented its Data Management and Sharing (DMS) Policy, requiring researchers to share scientific data produced with NIH funding. The policy's objective is to amplify the benefits of public investment in research by promoting the dissemination and reusability of primary data. Given this backdrop, identifying a robust methodology to assess the impact of data sharing across diverse research domains is essential. In this review, we adopted two methodological paradigms, the bottom-up and top-down strategies, and employed content analysis to pinpoint established methodologies and innovative practices within this intricate field. Although numerous author-level metrics are available to gauge the impact of data sharing, their application is still limited. Non-traditional metrics, encompassing economic (e.g., cost savings) and intangible benefits, presently appear to hold more potential for evaluating the impact of primary data sharing. Finally, we address the primary obstacles encountered by open data policies and introduce an innovative "Shared model for shared data" framework to bolster data sharing practices and refine evaluation metrics.
Collapse
|
3
|
Lewis RJ, Marstein KE, Grytnes JA. Incentivising open ecological data using blockchain technology. Sci Data 2023; 10:591. [PMID: 37679374 PMCID: PMC10485047 DOI: 10.1038/s41597-023-02496-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 08/21/2023] [Indexed: 09/09/2023] Open
Affiliation(s)
- Robert John Lewis
- Norwegian Institute for Nature Research, Bergen, Norway.
- Norwegian Institute for Bio-economy Research, Bergen, Norway.
| | | | | |
Collapse
|
4
|
Muraleedharan A, Vanderperre B. The endo-lysosomal system in Parkinson's disease: expanding the horizon. J Mol Biol 2023:168140. [PMID: 37148997 DOI: 10.1016/j.jmb.2023.168140] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 04/22/2023] [Accepted: 04/27/2023] [Indexed: 05/08/2023]
Abstract
Parkinson's disease (PD) is the second most common neurodegenerative disorder after Alzheimer's disease, and its prevalence is increasing with age. A wealth of genetic evidence indicates that the endo-lysosomal system is a major pathway driving PD pathogenesis with a growing number of genes encoding endo-lysosomal proteins identified as risk factors for PD, making it a promising target for therapeutic intervention. However, detailed knowledge and understanding of the molecular mechanisms linking these genes to the disease are available for only a handful of them (e.g. LRRK2, GBA1, VPS35). Taking on the challenge of studying poorly characterized genes and proteins can be daunting, due to the limited availability of tools and knowledge from previous literature. This review aims at providing a valuable source of molecular and cellular insights into the biology of lesser-studied PD-linked endo-lysosomal genes, to help and encourage researchers in filling the knowledge gap around these less popular genetic players. Specific endo-lysosomal pathways discussed range from endocytosis, sorting, and vesicular trafficking to the regulation of membrane lipids of these membrane-bound organelles and the specific enzymatic activities they contain. We also provide perspectives on future challenges that the community needs to tackle and propose approaches to move forward in our understanding of these poorly studied endo-lysosomal genes. This will help harness their potential in designing innovative and efficient treatments to ultimately re-establish neuronal homeostasis in PD but also other diseases involving endo-lysosomal dysfunction.
Collapse
Affiliation(s)
- Amitha Muraleedharan
- Centre d'Excellence en Recherche sur les Maladies Orphelines - Fondation Courtois and Biological Sciences Department, Université du Québec à Montréal
| | - Benoît Vanderperre
- Centre d'Excellence en Recherche sur les Maladies Orphelines - Fondation Courtois and Biological Sciences Department, Université du Québec à Montréal
| |
Collapse
|
5
|
Kabanda SM, Cengiz N, Rajaratnam K, Watson BW, Brown Q, Esterhuizen TM, Moodley K. Data sharing and data governance in sub-Saharan Africa: Perspectives from researchers and scientists engaged in data-intensive research. S AFR J SCI 2023; 119:15129. [PMID: 39483790 PMCID: PMC11526389 DOI: 10.17159/sajs.2023/15129] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 05/05/2023] [Indexed: 11/03/2024] Open
Abstract
The data ecosystem is complex and involves multiple stakeholders. Researchers and scientists engaging in data-intensive research collect, analyse, store, manage and share large volumes of data. Consequently, capturing researchers' and scientists' views from multidisciplinary fields on data use, sharing and governance adds an important African perspective to emerging debates. We conducted a descriptive cross-sectional survey and received 160 responses from researchers and scientists representing 43 sub-Saharan African countries. Whilst most respondents were satisfied with institutional data storage processes, 40% indicated that their organisations or institutions did not have a formally established process for storing data beyond the life cycle of the project. Willingness to share data was generally high, but increased when data privacy was ensured. Robust governance frameworks increased the willingness to share, as did the regulation of access to data on shared platforms. Incentivising data sharing remains controversial. Respondents were satisfied with exchanging their data for co-authorship on publications (89.4%) and collaboration on projects (77.6%). However, respondents were split almost equally in terms of sharing their data for commercial gain. Regarding the process of managing data, 40.6% indicated that their organisations do not provide training on best practices for data management. This could be related to a lack of resources, chronic institutional under-investment, and suboptimal research training and mentorship in sub-Saharan Africa. The sustainability of data sharing may require ethical incentive structures to further encourage researchers and scientists. Tangible infrastructure to facilitate such sharing is a prerequisite. Capacity development in data governance for researchers and scientists is sorely needed. Significance Data sharing is necessary to advance science, yet there are many constraints. In this study, we explored factors that promote a willingness to share, as well as constraining factors. Seeking potential solutions to improve data sharing is a scientific and ethical imperative. The standardisation of basic data sharing and data transfer agreements, and the development of a Data Access Committee will strengthen data governance and facilitate responsible data sharing in sub-Saharan Africa. Funders, institutions, researchers and scientists ought to jointly contribute to fair and equitable data use and sharing during and beyond the life cycle of research projects.
Collapse
Affiliation(s)
- Siti M. Kabanda
- Centre for Medical Ethics and Law, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Nezerith Cengiz
- Centre for Medical Ethics and Law, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Kanshukan Rajaratnam
- School for Data Science and Computational Thinking, Stellenbosch University, Stellenbosch, South Africa
| | - Bruce W. Watson
- School for Data Science and Computational Thinking, Stellenbosch University, Stellenbosch, South Africa
| | - Qunita Brown
- Centre for Medical Ethics and Law, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Tonya M. Esterhuizen
- Division of Epidemiology and Biostatistics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Keymanthri Moodley
- Centre for Medical Ethics and Law, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| |
Collapse
|
6
|
Roche DG, Berberi I, Dhane F, Lauzon F, Soeharjono S, Dakin R, Binning SA. Slow improvement to the archiving quality of open datasets shared by researchers in ecology and evolution. Proc Biol Sci 2022; 289:20212780. [PMID: 35582791 PMCID: PMC9114975 DOI: 10.1098/rspb.2021.2780] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Many leading journals in ecology and evolution now mandate open data upon publication. Yet, there is very little oversight to ensure the completeness and reusability of archived datasets, and we currently have a poor understanding of the factors associated with high-quality data sharing. We assessed 362 open datasets linked to first- or senior-authored papers published by 100 principal investigators (PIs) in the fields of ecology and evolution over a period of 7 years to identify predictors of data completeness and reusability (data archiving quality). Datasets scored low on these metrics: 56.4% were complete and 45.9% were reusable. Data reusability, but not completeness, was slightly higher for more recently archived datasets and PIs with less seniority. Journal open data policy, PI gender and PI corresponding author status were unrelated to data archiving quality. However, PI identity explained a large proportion of the variance in data completeness (27.8%) and reusability (22.0%), indicating consistent inter-individual differences in data sharing practices by PIs across time and contexts. Several PIs consistently shared data of either high or low archiving quality, but most PIs were inconsistent in how well they shared. One explanation for the high intra-individual variation we observed is that PIs often conduct research through students and postdoctoral researchers, who may be responsible for the data collection, curation and archiving. Levels of data literacy vary among trainees and PIs may not regularly perform quality control over archived files. Our findings suggest that research data management training and culture within a PI's group are likely to be more important determinants of data archiving quality than other factors such as a journal's open data policy. Greater incentives and training for individual researchers at all career stages could improve data sharing practices and enhance data transparency and reusability.
Collapse
Affiliation(s)
- Dominique G. Roche
- Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario, Canada K1S 5B6,Département de science biologiques, Université de Montréal, Montréal, Canada H3C 3J7,Institut de Biologie, Université de Neuchâtel, Neuchâtel 2000, Switzerland
| | - Ilias Berberi
- Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario, Canada K1S 5B6
| | - Fares Dhane
- Département de science biologiques, Université de Montréal, Montréal, Canada H3C 3J7
| | - Félix Lauzon
- Département de science biologiques, Université de Montréal, Montréal, Canada H3C 3J7,Department of Biology, McGill University, Montréal, Canada H3A 1B1
| | - Sandrine Soeharjono
- Département de science biologiques, Université de Montréal, Montréal, Canada H3C 3J7
| | - Roslyn Dakin
- Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario, Canada K1S 5B6
| | - Sandra A. Binning
- Département de science biologiques, Université de Montréal, Montréal, Canada H3C 3J7
| |
Collapse
|
7
|
Dalpasso A, Ficetola GF, Giachello S, Lo Parrino E, Manenti R, Muraro M, Falaschi M. Similar species, different fates: Abundance dynamics in spatially structured populations of common and threatened frogs. DIVERS DISTRIB 2022. [DOI: 10.1111/ddi.13483] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Affiliation(s)
- Andrea Dalpasso
- Department of Environmental Science and Policy Università degli Studi di Milano Milan Italy
| | - Gentile Francesco Ficetola
- Department of Environmental Science and Policy Università degli Studi di Milano Milan Italy
- Laboratoire d’Écologie Alpine Univ. Grenoble AlpesUniv. Savoie Mont BlancCNRSLECA Grenoble France
| | - Simone Giachello
- Department of Environmental Science and Policy Università degli Studi di Milano Milan Italy
| | - Elia Lo Parrino
- Department of Environmental Science and Policy Università degli Studi di Milano Milan Italy
| | - Raoul Manenti
- Department of Environmental Science and Policy Università degli Studi di Milano Milan Italy
| | - Martina Muraro
- Department of Environmental Science and Policy Università degli Studi di Milano Milan Italy
| | - Mattia Falaschi
- Department of Environmental Science and Policy Università degli Studi di Milano Milan Italy
| |
Collapse
|