Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Brito JJ, Li J, Moore JH, Greene CS, Nogoy NA, Garmire LX, Mangul S. Recommendations to enhance rigor and reproducibility in biomedical research. Gigascience 2020;9:giaa056. [PMID: 32479592 PMCID: PMC7263079 DOI: 10.1093/gigascience/giaa056] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Received: 02/24/2020] [Revised: 04/08/2020] [Accepted: 05/06/2020] [Indexed: 12/25/2022] Open

Cited by Other Article(s)

Cunha-Oliveira T, Ioannidis JPA, Oliveira PJ. Best practices for data management and sharing in experimental biomedical research. Physiol Rev 2024;104:1387-1408. [PMID: 38451234 DOI: 10.1152/physrev.00043.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/09/2023] [Revised: 02/07/2024] [Accepted: 02/29/2024] [Indexed: 03/08/2024] Open

Ferrena A, Zheng XY, Jackson K, Hoang B, Morrow B, Zheng D. scDAPP: a comprehensive single-cell transcriptomics analysis pipeline optimized for cross-group comparison. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.06.592708. [PMID: 38766089 PMCID: PMC11100619 DOI: 10.1101/2024.05.06.592708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/22/2024]

Alser M, Lawlor B, Abdill RJ, Waymost S, Ayyala R, Rajkumar N, LaPierre N, Brito J, Ribeiro-Dos-Santos AM, Almadhoun N, Sarwal V, Firtina C, Osinski T, Eskin E, Hu Q, Strong D, Kim BDBD, Abedalthagafi MS, Mutlu O, Mangul S. Packaging and containerization of computational methods. Nat Protoc 2024:10.1038/s41596-024-00986-0. [PMID: 38565959 DOI: 10.1038/s41596-024-00986-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2022] [Accepted: 02/12/2024] [Indexed: 04/04/2024]

Affiliation(s)

Mohammed Alser: Department of Information Technology and Electrical Engineering, ETH Zürich, Zurich, Switzerland
Brendan Lawlor: Department of Computer Science, Munster Technological University, Cork, Ireland; Department of Biological Sciences, Munster Technological University, Cork, Ireland
Richard J Abdill: Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
Sharon Waymost: Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
Ram Ayyala: Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Titus Family Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA, USA
Neha Rajkumar: Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA, USA
Nathan LaPierre: Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA; Department of Human Genetics, University of Chicago, Chicago, IL, USA
Jaqueline Brito: Titus Family Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA, USA
André M Ribeiro-Dos-Santos: Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
Nour Almadhoun: Department of Information Technology and Electrical Engineering, ETH Zürich, Zurich, Switzerland
Varuni Sarwal: Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
Can Firtina: Department of Information Technology and Electrical Engineering, ETH Zürich, Zurich, Switzerland
Tomasz Osinski: Center for Advanced Research Computing, University of Southern California, Los Angeles, CA, USA
Eleazar Eskin: Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA; Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA; Department of Human Genetics, University of California, Los Angeles, CA, USA
Qiyang Hu: Office of Advanced Research Computing, University of California, Los Angeles, CA, USA
Derek Strong: Center for Advanced Research Computing, University of Southern California, Los Angeles, CA, USA
Byoung-Do B D Kim: Center for Advanced Research Computing, University of Southern California, Los Angeles, CA, USA
Malak S Abedalthagafi: Department of Pathology & Laboratory Medicine, Emory University Hospital, Atlanta, GA, USA; King Salman Center for Disability Research, Riyadh, Saudi Arabia
Onur Mutlu: Department of Information Technology and Electrical Engineering, ETH Zürich, Zurich, Switzerland
Serghei Mangul: Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Titus Family Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA, USA

Alessandri S, Ratto ML, Rabellino S, Piacenti G, Contaldo SG, Pernice S, Beccuti M, Calogero RA, Alessandri L. CREDO: a friendly Customizable, REproducible, DOcker file generator for bioinformatics applications. BMC Bioinformatics 2024;25:110. [PMID: 38475691 DOI: 10.1186/s12859-024-05695-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/14/2023] [Accepted: 02/09/2024] [Indexed: 03/14/2024] Open

Petersen C, Mucke L, Corces MR. CHOIR improves significance-based detection of cell types and states from single-cell data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.18.576317. [PMID: 38328105 PMCID: PMC10849522 DOI: 10.1101/2024.01.18.576317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/09/2024]

Samuel S, Mietchen D. Computational reproducibility of Jupyter notebooks from biomedical publications. Gigascience 2024;13:giad113. [PMID: 38206590 PMCID: PMC10783158 DOI: 10.1093/gigascience/giad113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/05/2022] [Revised: 08/09/2023] [Accepted: 12/08/2023] [Indexed: 01/12/2024] Open

Abstract

BACKGROUND

Jupyter notebooks facilitate the bundling of executable code with its documentation and output in one interactive environment, and they represent a popular mechanism to document and share computational workflows, including for research publications. The reproducibility of computational aspects of research is a key component of scientific reproducibility but has not yet been assessed at scale for Jupyter notebooks associated with biomedical publications.

APPROACH

We address computational reproducibility at 2 levels: (i) Using fully automated workflows, we analyzed the computational reproducibility of Jupyter notebooks associated with publications indexed in the biomedical literature repository PubMed Central. We identified such notebooks by mining the article's full text, trying to locate them on GitHub, and attempting to rerun them in an environment as close to the original as possible. We documented reproduction success and exceptions and explored relationships between notebook reproducibility and variables related to the notebooks or publications. (ii) This study represents a reproducibility attempt in and of itself, using essentially the same methodology twice on PubMed Central over the course of 2 years, during which the corpus of Jupyter notebooks from articles indexed in PubMed Central has grown in a highly dynamic fashion.

RESULTS

Out of 27,271 Jupyter notebooks from 2,660 GitHub repositories associated with 3,467 publications, 22,578 notebooks were written in Python, including 15,817 that had their dependencies declared in standard requirement files and that we attempted to rerun automatically. For 10,388 of these, all declared dependencies could be installed successfully, and we reran them to assess reproducibility. Of these, 1,203 notebooks ran through without any errors, including 879 that produced results identical to those reported in the original notebook and 324 for which our results differed from the originally reported ones. Running the other notebooks resulted in exceptions.
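The counts in the paragraph above form a funnel, and the proportions are easier to compare once computed. The following is a minimal sketch that uses only the figures quoted above; the stage labels and the helper name `pct` are illustrative assumptions, and the count of notebooks that raised exceptions is derived here rather than quoted from the abstract.

```python
# Counts quoted from the RESULTS paragraph; the stage labels are paraphrases.
stages = [
    ("notebooks found", 27_271),
    ("written in Python", 22_578),
    ("dependencies declared", 15_817),
    ("dependencies installed", 10_388),
    ("ran without errors", 1_203),
    ("identical results", 879),
]
different_results = 324


def pct(part, whole):
    """Return part as a percentage of whole, rounded to one decimal place."""
    return round(100 * part / whole, 1)


counts = dict(stages)

# The two outcome groups should add up to the error-free runs.
assert counts["identical results"] + different_results == counts["ran without errors"]

# Notebooks whose rerun raised an exception (derived, not quoted).
with_exceptions = counts["dependencies installed"] - counts["ran without errors"]

for label, n in stages:
    share = pct(n, counts["notebooks found"])
    print(f"{label:>24}: {n:>6}  ({share}% of all notebooks)")
print(f"{'raised exceptions':>24}: {with_exceptions:>6}")
```

Read this way, the share of reruns that reproduced the original results depends strongly on the denominator chosen: all notebooks, only those whose dependencies installed, or only those that ran without errors.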

CONCLUSIONS

We zoom in on common problems and practices, highlight trends, and discuss potential improvements to Jupyter-related workflows associated with biomedical publications.

Johnson AL, Bouvette M, Rangu N, Morley T, Schultz A, Torgerson T, Vassar M. Data-Sharing Across Otolaryngology: Comparing Journal Policies and Their Adherence to the FAIR Principles. Ann Otol Rhinol Laryngol 2024;133:105-110. [PMID: 37431814 DOI: 10.1177/00034894231185642] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 07/12/2023]

Post AR, Ho N, Rasmussen E, Post I, Cho A, Hofer J, Maness AT, Parnell T, Nix DA. Hypermedia-based software architecture enables Test-Driven Development. JAMIA Open 2023;6:ooad089. [PMID: 37860604 PMCID: PMC10582517 DOI: 10.1093/jamiaopen/ooad089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/31/2023] [Revised: 08/12/2023] [Accepted: 10/04/2023] [Indexed: 10/21/2023] Open

Abstract

Objectives

Using agile software development practices, develop and evaluate an architecture and implementation for reliable and user-friendly self-service management of bioinformatic data stored in the cloud.

Materials and methods

Comprehensive Oncology Research Environment (CORE) Browser is a new open-source web application for cancer researchers to manage sequencing data organized in a flexible format in Amazon Simple Storage Service (S3) buckets. It has a microservices- and hypermedia-based architecture, which we integrated with Test-Driven Development (TDD), the iterative writing of computable specifications for how software should work prior to development. Relying on repeating patterns found in hypermedia-based architectures, we hypothesized that hypermedia would permit developing test "templates" that can be parameterized and executed for each microservice, maximizing code coverage while minimizing effort.
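The idea of a parameterized test "template" can be illustrated with a small sketch. This is not CORE Browser code: the service names, the `_links` response layout, and the functions `fake_service`, `check_self_link`, and `run_template` are all hypothetical stand-ins, chosen only to show how one check can be reused across services that follow the same hypermedia pattern.

```python
# Hypothetical in-memory stand-ins for microservices (not CORE Browser code).
def fake_service(name):
    """Build a getter that returns a hypermedia-style resource for one service."""
    def get(resource_id):
        return {
            "id": resource_id,
            "type": name,
            # Assumed link layout: every resource carries a link to itself.
            "_links": {"self": {"href": f"/{name}/{resource_id}"}},
        }
    return get


# Illustrative service names; each follows the same response pattern.
SERVICES = {name: fake_service(name) for name in ("projects", "buckets", "files")}


def check_self_link(service_name, get, resource_id="42"):
    """Test template: a fetched resource must link back to its own URL."""
    resource = get(resource_id)
    href = resource["_links"]["self"]["href"]
    expected = f"/{service_name}/{resource_id}"
    assert href == expected, f"{service_name}: {href!r} != {expected!r}"
    return href


def run_template(template, services):
    """Execute one template once per service and collect the results."""
    return {name: template(name, get) for name, get in services.items()}


results = run_template(check_self_link, SERVICES)
for name, href in results.items():
    print(f"{name}: ok ({href})")
```

Because every service exposes the same link structure, adding a service to `SERVICES` extends coverage without writing a new test, which is the kind of reuse the paragraph above hypothesizes.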

Results

After one-and-a-half years of development, the CORE Browser backend had 121 test templates and 875 custom tests that were parameterized and executed 3031 times, providing 78% code coverage.

Discussion

Architecting to permit test reuse through a hypermedia approach was a key success factor for our testing efforts. CORE Browser's application of hypermedia and TDD illustrates one way to integrate software engineering methods into data-intensive networked applications. Separating bioinformatic data management from analysis distinguishes this platform from others in bioinformatics and may provide stable data management while permitting analysis methods to advance more rapidly.

Conclusion

Software engineering practices are underutilized in informatics. Similar informatics projects will more likely succeed through application of good architecture and automated testing. Our approach is broadly applicable to data management tools involving cloud data storage.

Yang J, Liu Y, Shang J, Chen Q, Chen Q, Ren L, Zhang N, Yu Y, Li Z, Song Y, Yang S, Scherer A, Tong W, Hong H, Xiao W, Shi L, Zheng Y. The Quartet Data Portal: integration of community-wide resources for multiomics quality control. Genome Biol 2023;24:245. [PMID: 37884999 PMCID: PMC10601216 DOI: 10.1186/s13059-023-03091-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 11/08/2022] [Accepted: 10/17/2023] [Indexed: 10/28/2023] Open

Affiliation(s)

Jingcheng Yang: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China; Greater Bay Area Institute of Precision Medicine, Guangzhou, Guangdong, China
Yaqing Liu: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Jun Shang: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Qiaochu Chen: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Qingwang Chen: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Luyao Ren: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Naixin Zhang: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Ying Yu: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Zhihui Li: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Yueqiang Song: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China
Shengpeng Yang: Intelligent Storage, Alibaba Cloud, Alibaba Group, Hangzhou, Zhejiang, China
Andreas Scherer: Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland; EATRIS ERIC-European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
Weida Tong: Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
Huixiao Hong: Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
Wenming Xiao: Office of Oncological Diseases, Office of New Drugs, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
Leming Shi: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China; International Human Phenome Institutes (Shanghai), Shanghai, China
Yuanting Zheng: State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, China

Mezuk B, Zhong C, Firestone M. Integrative approaches to methods training for early-career scientists: Rationale and process evaluation of the first cohort of the Michigan Integrative Well-Being and Inequality Training Program. J Clin Transl Sci 2023;7:e169. [PMID: 37588674 PMCID: PMC10425869 DOI: 10.1017/cts.2023.595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2023] [Revised: 06/10/2023] [Accepted: 07/12/2023] [Indexed: 08/18/2023] Open

Soriano J, Belmonte-Tebar A, de la Casa-Esperon E. Synaptonemal & CO analyzer: A tool for synaptonemal complex and crossover analysis in immunofluorescence images. Front Cell Dev Biol 2023;11:1005145. [PMID: 36743415 PMCID: PMC9894712 DOI: 10.3389/fcell.2023.1005145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/28/2022] [Accepted: 01/09/2023] [Indexed: 01/20/2023] Open

Teixeira da Silva JA. A Synthesis of the Formats for Correcting Erroneous and Fraudulent Academic Literature, and Associated Challenges. JOURNAL FOR GENERAL PHILOSOPHY OF SCIENCE = ZEITSCHRIFT FUR ALLGEMEINE WISSENSCHAFTSTHEORIE 2022;53:583-599. [PMID: 35669840 PMCID: PMC9159037 DOI: 10.1007/s10838-022-09607-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 07/02/2021] [Revised: 11/14/2021] [Accepted: 02/12/2022] [Indexed: 06/15/2023]

Tumescheit C, Firth AE, Brown K. CIAlign: A highly customisable command line tool to clean, interpret and visualise multiple sequence alignments. PeerJ 2022;10:e12983. [PMID: 35310163 PMCID: PMC8932311 DOI: 10.7717/peerj.12983] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 04/06/2021] [Accepted: 02/01/2022] [Indexed: 01/11/2023] Open

Abstract

Background

Throughout biology, multiple sequence alignments (MSAs) form the basis of much investigation into biological features and relationships. These alignments are at the heart of many bioinformatics analyses. However, sequences in MSAs are often incomplete or very divergent, which can lead to poor alignment and large gaps. This slows down computation and can impact conclusions without being biologically relevant. Cleaning the alignment by removing common issues such as gaps, divergent sequences, large insertions and deletions and poorly aligned sequence ends can substantially improve analyses. Manual editing of MSAs is very widespread but is time-consuming and difficult to reproduce.

Results

We present a comprehensive, user-friendly MSA trimming tool with multiple visualisation options. Our highly customisable command line tool aims to give intervention power to the user by offering various options, and outputs graphical representations of the alignment before and after processing to give the user a clear overview of what has been removed. The main functionalities of the tool include removing regions of low coverage due to insertions, removing gaps, cropping poorly aligned sequence ends and removing sequences that are too divergent or too short. The thresholds for each function can be specified by the user and parameters can be adjusted to each individual MSA. CIAlign is designed with an emphasis on solving specific and common alignment problems and on providing transparency to the user.

Conclusion

CIAlign effectively removes problematic regions and sequences from MSAs and provides novel visualisation options. This tool can be used to fine-tune alignments for further analysis and processing. The tool is aimed at anyone who wishes to automatically clean up parts of an MSA and those requiring a new, accessible way of visualising large MSAs.

Yang J, Liu Y, Shang J, Huang Y, Yu Y, Li Z, Shi L, Ran Z. BioVisReport: A Markdown-based lightweight website builder for reproducible and interactive visualization of results from peer-reviewed publications. Comput Struct Biotechnol J 2022;20:3133-3139. [PMID: 35782729 PMCID: PMC9233186 DOI: 10.1016/j.csbj.2022.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/18/2022] [Revised: 06/05/2022] [Accepted: 06/05/2022] [Indexed: 11/18/2022] Open

Affiliation(s)

Jingcheng Yang: State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, 2005 Songhu Road, Shanghai 200438, China; Greater Bay Area Institute of Precision Medicine, 115 Jiaoxi Road, Guangzhou 511458, China
Yaqing Liu: State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, 2005 Songhu Road, Shanghai 200438, China
Jun Shang: State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, 2005 Songhu Road, Shanghai 200438, China
Yechao Huang: State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, 2005 Songhu Road, Shanghai 200438, China
Ying Yu: State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, 2005 Songhu Road, Shanghai 200438, China
Zhihui Li: State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, 2005 Songhu Road, Shanghai 200438, China
Leming Shi: State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, 2005 Songhu Road, Shanghai 200438, China
Zihan Ran: Department of Research, Shanghai University of Medicine & Health Sciences Affiliated Zhoupu Hospital, 1500 Zhouyuan Road, Shanghai 201318, China; Inspection and Quarantine Department, The College of Medical Technology, Shanghai University of Medicine & Health Sciences, 279 Zhouzhu Road, Shanghai 201318, China. Corresponding author at: Department of Research, Shanghai University of Medicine & Health Sciences Affiliated Zhoupu Hospital, 1500 Zhouyuan Road, Shanghai 201318, China.

Marini F, Ludt A, Linke J, Strauch K. GeneTonic: an R/Bioconductor package for streamlining the interpretation of RNA-seq data. BMC Bioinformatics 2021;22:610. [PMID: 34949163 PMCID: PMC8697502 DOI: 10.1186/s12859-021-04461-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 05/25/2021] [Accepted: 10/26/2021] [Indexed: 01/22/2023] Open

Abstract

BACKGROUND

The interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA-seq) can be a complex task, where the essential information is distributed among different tabular and list formats: normalized expression values, results from differential expression analysis, and results from functional enrichment analyses. A number of tools and databases are widely used for the purpose of identification of relevant functional patterns, yet often their contextualization within the data and results at hand is not straightforward, especially if these analytic components are not combined together efficiently.

RESULTS

We developed the GeneTonic software package, which serves as a comprehensive toolkit for streamlining the interpretation of functional enrichment analyses, by fully leveraging the information of expression values in a differential expression context. GeneTonic is implemented in R and Shiny, leveraging packages that enable HTML-based interactive visualizations for executing drilldown tasks seamlessly, viewing the data at a level of increased detail. GeneTonic is integrated with the core classes of existing Bioconductor workflows, and can accept the output of many widely used tools for pathway analysis, making this approach applicable to a wide range of use cases. Users can effectively navigate interlinked components (otherwise available as flat text or spreadsheet tables), bookmark features of interest during the exploration sessions, and obtain at the end a tailored HTML report, thus combining the benefits of both interactivity and reproducibility.

CONCLUSION

GeneTonic is distributed as an R package in the Bioconductor project (https://bioconductor.org/packages/GeneTonic/) under the MIT license. Offering both bird's-eye views of the components of transcriptome data analysis and the detailed inspection of single genes, individual signatures, and their relationships, GeneTonic aims at simplifying the process of interpretation of complex and compelling RNA-seq datasets for many researchers with different expertise profiles.

Grimes DR, Heathers J. The new normal? Redaction bias in biomedical science. ROYAL SOCIETY OPEN SCIENCE 2021;8:211308. [PMID: 34966555 PMCID: PMC8633797 DOI: 10.1098/rsos.211308] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/02/2021] [Accepted: 11/01/2021] [Indexed: 06/14/2023]

Chetnik K, Benedetti E, Gomari DP, Schweickart A, Batra R, Buyukozkan M, Wang Z, Arnold M, Zierer J, Suhre K, Krumsiek J. maplet: an extensible R toolbox for modular and reproducible metabolomics pipelines. Bioinformatics 2021;38:1168-1170. [PMID: 34694386 PMCID: PMC8796365 DOI: 10.1093/bioinformatics/btab741] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 06/09/2021] [Revised: 09/24/2021] [Accepted: 10/22/2021] [Indexed: 02/03/2023] Open

Peng K, Huang YN, Sarwal V, Alachkar H, Wong‐Beringer A, Mangul S. Integrating big data computational skills in education to facilitate reproducibility and transparency in pharmaceutical sciences. JOURNAL OF THE AMERICAN COLLEGE OF CLINICAL PHARMACY 2021. [DOI: 10.1002/jac5.1519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/07/2022]

Post AR, Luther J, Loveless JM, Ward M, Hewitt S. Enhancing research informatics core user satisfaction through agile practices. JAMIA Open 2021;4:ooab103. [PMID: 34927001 PMCID: PMC8672926 DOI: 10.1093/jamiaopen/ooab103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/10/2021] [Revised: 10/06/2021] [Accepted: 11/18/2021] [Indexed: 11/23/2022] Open

Leipzig J, Nüst D, Hoyt CT, Ram K, Greenberg J. The role of metadata in reproducible computational research. PATTERNS (NEW YORK, N.Y.) 2021;2:100322. [PMID: 34553169 PMCID: PMC8441584 DOI: 10.1016/j.patter.2021.100322] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Indexed: 01/22/2023]

Cervenka M, Pascual JM, Rho JM, Thiele E, Yellen G, Whittemore V, Hartman AL. Metabolism-based therapies for epilepsy: new directions for future cures. Ann Clin Transl Neurol 2021;8:1730-1737. [PMID: 34247456 PMCID: PMC8351378 DOI: 10.1002/acn3.51423] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 05/14/2021] [Accepted: 06/28/2021] [Indexed: 12/22/2022] Open

Hauschild AC, Eick L, Wienbeck J, Heider D. Fostering reproducibility, reusability, and technology transfer in health informatics. iScience 2021;24:102803. [PMID: 34296072 PMCID: PMC8282945 DOI: 10.1016/j.isci.2021.102803] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/31/2023] Open

Righelli D, Angelini C. Easyreporting simplifies the implementation of Reproducible Research layers in R software. PLoS One 2021;16:e0244122. [PMID: 33970927 PMCID: PMC8109797 DOI: 10.1371/journal.pone.0244122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/01/2020] [Accepted: 04/20/2021] [Indexed: 11/19/2022] Open

Samuel S, König-Ries B. Understanding experiments and research practices for reproducibility: an exploratory study. PeerJ 2021;9:e11140. [PMID: 33976964 PMCID: PMC8067906 DOI: 10.7717/peerj.11140] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 09/09/2020] [Accepted: 03/01/2021] [Indexed: 11/20/2022] Open

Rajesh A, Chang Y, Abedalthagafi MS, Wong-Beringer A, Love MI, Mangul S. Improving the completeness of public metadata accompanying omics studies. Genome Biol 2021;22:106. [PMID: 33858487 PMCID: PMC8048353 DOI: 10.1186/s13059-021-02332-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 02/22/2021] [Accepted: 03/29/2021] [Indexed: 12/17/2022] Open

Del Prete E, Facchiano A, Profumo A, Angelini C, Romano P. GeenaR: A Web Tool for Reproducible MALDI-TOF Analysis. Front Genet 2021;12:635814. [PMID: 33854526 PMCID: PMC8039533 DOI: 10.3389/fgene.2021.635814] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 11/30/2020] [Accepted: 03/01/2021] [Indexed: 12/21/2022] Open

Roberts JM, Rich-Edwards JW, McElrath TF, Garmire L, Myatt L. Subtypes of Preeclampsia: Recognition and Determining Clinical Usefulness. Hypertension 2021;77:1430-1441. [PMID: 33775113 DOI: 10.1161/hypertensionaha.120.14781] [Citation(s) in RCA: 92] [Impact Index Per Article: 30.7] [Indexed: 01/05/2023]

Abstract

The concept that preeclampsia is a multisystemic syndrome is appreciated in both research and clinical care. Our understanding of pathophysiology recognizes the role of inflammation, oxidative and endoplasmic reticulum stress, and angiogenic dysfunction. Yet, we have not progressed greatly toward clinically useful prediction nor had substantial success in prevention or treatment. One possibility is that the maternal syndrome may be reached through different pathophysiological pathways, that is, subtypes of preeclampsia, that in their specificity yield more clinical utility. For example, early and late onset preeclampsia are increasingly acknowledged as different pathophysiological processes leading to a common presentation. Other subtypes of preeclampsia are supported by disparate clinical outcomes, long-range prognosis, organ systems involved, and risk factors. These insights have been supplemented by discovery-driven methods, which cluster preeclampsia cases into groups indicating different pathophysiologies. In this presentation, we review likely subtypes based on current knowledge and suggest others. We present a consideration of the requirements for a clinically meaningful preeclampsia subtype. A useful subtype should (1) identify a specific pathophysiological pathway or (2) specifically indicate maternal or fetal outcome, (3) be recognizable in a clinically useful time frame, and (4) these results should be reproducible and generalizable (but at varying frequency) including in low resource settings. We recommend that the default consideration be that preeclampsia includes several subtypes rather than trying to force all cases into a single pathophysiological pathway. The recognition of subtypes and deciphering their different pathophysiologies will provide specific targets for prevention, prediction, and treatment directing personalized care.

Haendel MA, Chute CG, Bennett TD, Eichmann DA, Guinney J, Kibbe WA, Payne PRO, Pfaff ER, Robinson PN, Saltz JH, Spratt H, Suver C, Wilbanks J, Wilcox AB, Williams AE, Wu C, Blacketer C, Bradford RL, Cimino JJ, Clark M, Colmenares EW, Francis PA, Gabriel D, Graves A, Hemadri R, Hong SS, Hripscak G, Jiao D, Klann JG, Kostka K, Lee AM, Lehmann HP, Lingrey L, Miller RT, Morris M, Murphy SN, Natarajan K, Palchuk MB, Sheikh U, Solbrig H, Visweswaran S, Walden A, Walters KM, Weber GM, Zhang XT, Zhu RL, Amor B, Girvin AT, Manna A, Qureshi N, Kurilla MG, Michael SG, Portilla LM, Rutter JL, Austin CP, Gersing KR. The National COVID Cohort Collaborative (N3C): Rationale, design, infrastructure, and deployment. J Am Med Inform Assoc 2021;28:427-443. [PMID: 32805036 PMCID: PMC7454687 DOI: 10.1093/jamia/ocaa196] [Citation(s) in RCA: 285] [Impact Index Per Article: 95.0] [Received: 07/14/2020] [Accepted: 08/14/2020] [Indexed: 01/12/2023] Open

Abstract

Objective

Coronavirus disease 2019 (COVID-19) poses societal challenges that require expeditious data and knowledge sharing. Though organizational clinical data are abundant, these are largely inaccessible to outside researchers. Statistical, machine learning, and causal analyses are most successful with large-scale data beyond what is available in any given organization. Here, we introduce the National COVID Cohort Collaborative (N3C), an open science community focused on analyzing patient-level data from many centers.

Materials and Methods

The Clinical and Translational Science Award Program and scientific community created N3C to overcome technical, regulatory, policy, and governance barriers to sharing and harmonizing individual-level clinical data. We developed solutions to extract, aggregate, and harmonize data across organizations and data models, and created a secure data enclave to enable efficient, transparent, and reproducible collaborative analytics.

Results

Organized in inclusive workstreams, we created legal agreements and governance for organizations and researchers; data extraction scripts to identify and ingest positive, negative, and possible COVID-19 cases; a data quality assurance and harmonization pipeline to create a single harmonized dataset; population of the secure data enclave with data, machine learning, and statistical analytics tools; dissemination mechanisms; and a synthetic data pilot to democratize data access.

Conclusions

The N3C has demonstrated that a multisite collaborative learning health network can overcome barriers to rapidly build a scalable infrastructure incorporating multiorganizational clinical data for COVID-19 analytics. We expect this effort to save lives by enabling rapid collaboration among clinicians, researchers, and data scientists to identify treatments and specialized care and thereby reduce the immediate and long-term impacts of COVID-19.

Affiliation(s)

Melissa A Haendel Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, Oregon, USA; Translational and Integrative Sciences Center, Department of Molecular Toxicology, Oregon State University, Corvallis, Oregon, USA
Christopher G Chute Schools of Medicine, Public Health, and Nursing, Johns Hopkins University, Baltimore, Maryland, USA
Tellen D Bennett Section of Informatics and Data Science, Department of Pediatrics, University of Colorado School of Medicine, University of Colorado, Aurora, Colorado, USA
David A Eichmann School of Library and Information Science, The University of Iowa, Iowa City, Iowa, USA
Justin Guinney Sage Bionetworks, Seattle, Washington, USA
Warren A Kibbe Duke University, Durham, North Carolina, USA
Philip R O Payne Institute for Informatics, Washington University in St. Louis, Saint Louis, Missouri, USA
Emily R Pfaff North Carolina Translational and Clinical Sciences Institute (NC TraCS), University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Peter N Robinson Jackson Laboratory, Bar Harbor, Maine, USA
Joel H Saltz Department of Biomedical Informatics, Stony Brook University, Stony Brook, New York, USA
Heidi Spratt University of Texas Medical Branch, Galveston, Texas, USA
Christine Suver Sage Bionetworks, Seattle, Washington, USA
John Wilbanks Sage Bionetworks, Seattle, Washington, USA
Adam B Wilcox University of Washington, Seattle, Washington, USA
Andrew E Williams Tufts Medical Center Clinical and Translational Science Institute, Tufts Medical Center, Boston, Massachusetts, USA
Chunlei Wu Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, California, USA
Clair Blacketer Janssen Research and Development, LLC, Raritan, New Jersey, USA
Robert L Bradford North Carolina Translational and Clinical Sciences Institute (NC TraCS), University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
James J Cimino University of Alabama-Birmingham, Birmingham, Alabama, USA
Marshall Clark North Carolina Translational and Clinical Sciences Institute (NC TraCS), University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Evan W Colmenares Department of Pharmaceutical Outcomes and Policy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Patricia A Francis Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Davera Gabriel Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Alexis Graves University of Iowa Institute for Clinical and Translational Science, The University of Iowa, Iowa City, Iowa, USA
Raju Hemadri National Center for Advancing Translational Science, Bethesda, Maryland, USA
Stephanie S Hong Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
George Hripscak Department of Biomedical Informatics, Columbia University, New York, New York, USA
Dazhi Jiao Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Jeffrey G Klann Harvard Medical School, Boston, Massachusetts, USA
Kristin Kostka IQVIA, Durham, North Carolina, USA
Adam M Lee University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Harold P Lehmann Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Lora Lingrey TriNetX, Cambridge, Massachusetts, USA
Robert T Miller Tufts Clinical and Translational Science Institute, Tufts University, Boston, Massachusetts, USA
Michele Morris Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Shawn N Murphy Mass General Brigham, Boston, Massachusetts, USA
Karthik Natarajan Irving Medical Center, Columbia University, New York, New York, USA
Matvey B Palchuk TriNetX, Cambridge, Massachusetts, USA
Usman Sheikh National Center for Advancing Translational Science, Bethesda, Maryland, USA
Harold Solbrig Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Shyam Visweswaran Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Anita Walden Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, Oregon, USA; Sage Bionetworks, Seattle, Washington, USA
Kellie M Walters North Carolina Translational and Clinical Sciences Institute (NC TraCS), University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Griffin M Weber Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
Xiaohan Tanner Zhang Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Richard L Zhu Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Benjamin Amor Palantir Technologies, Palo Alto, California, USA
Andrew T Girvin Palantir Technologies, Palo Alto, California, USA
Amin Manna Palantir Technologies, Palo Alto, California, USA
Nabeel Qureshi Palantir Technologies, Palo Alto, California, USA
Michael G Kurilla Division of Clinical Innovation, National Center for Advancing Translational Science, Bethesda, Maryland, USA
Sam G Michael National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA
Lili M Portilla Office of Strategic Alliances, National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA
Joni L Rutter Office of the Director, National Center for Advancing Translational Science, Bethesda, Maryland, USA
Christopher P Austin National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA
Ken R Gersing National Center for Advancing Translational Science, Bethesda, Maryland, USA

Ten simple rules for writing a paper about scientific software. PLoS Comput Biol 2020;16:e1008390. [PMID: 33180774 PMCID: PMC7660560 DOI: 10.1371/journal.pcbi.1008390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/03/2022] Open