1
|
Winter M, Ayobahan SU, Eilebrecht S, Schlich K. Natural but threatening? (I) A systematic aquatic ecotoxicity evaluation of biopolymers and modified natural polymers. ENVIRONMENTAL RESEARCH 2025; 274:121279. [PMID: 40049347 DOI: 10.1016/j.envres.2025.121279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2025] [Revised: 02/21/2025] [Accepted: 02/28/2025] [Indexed: 03/15/2025]
Abstract
Synthetic polymers and modified natural polymers are subject to EC 2023/2055, whereas biopolymers can be considered as important unregulated substituents. However, as regulatory requirements for biopolymers and a regulatory accepted hazard evaluation concept for polymers in general is missing, potential ecotoxicological effects are unknown. Biopolymers are often categorized as environmentally uncritical due to their origin, even though supporting data are missing. To assess potential environmental effects, we considered six biopolymers and modified natural polymers in a systematic ecotoxicity screening with aquatic organisms, hypothesising that the selected polymers are not ecotoxic. We tested alginate, chitosan, cellulose fibres Jelucel® HM 200, xanthan, carboxymethyl cellulose and modified starch Emwaxy® Jel 100 in OECD TG 201 (standard), 202 (miniaturised) and 236 (adapted to include OMICs) with algae, daphnids and zebrafish embryos, respectively. A screening of transcriptomic changes in zebrafish embryos was used to identify potential toxic modes-of-action of the polymers in fish. The polymers were applied in a concentration range of 1-100 mg/L as filtrates to consider potential intrinsic effects. Additionally, the impact of particulate polymers was evaluated by exposing algae and daphnids to polymer suspensions. Physical interactions were determined in algae growth studies leading to reduction of cell growth (lowest LOEC 10 mg/L). Daphnid immobility was observed with exposure to particulate chitosan (LOEC 100 mg/L). In zebrafish embryos, predominantly Emwaxy® Jel 100 affected the expression of genes involved in metabolic and catabolic processes (607 up- and 1002 down-regulated genes). Overall, polymer filtrates usually had no significant impact within the concentration range.
Collapse
Affiliation(s)
- Marie Winter
- Fraunhofer Institute for Molecular Biology and Applied Ecology, Department Ecotoxicology, Auf dem Aberg 1, Schmallenberg, 57392, Germany.
| | - Steve U Ayobahan
- Fraunhofer Institute for Molecular Biology and Applied Ecology, Department Ecotoxicogenomics, Auf dem Aberg 1, Schmallenberg, 57392, Germany
| | - Sebastian Eilebrecht
- Fraunhofer Institute for Molecular Biology and Applied Ecology, Department Ecotoxicogenomics, Auf dem Aberg 1, Schmallenberg, 57392, Germany
| | - Karsten Schlich
- Fraunhofer Institute for Molecular Biology and Applied Ecology, Department Ecotoxicology, Auf dem Aberg 1, Schmallenberg, 57392, Germany
| |
Collapse
|
2
|
Lee I, Wallace ZS, Wang Y, Park S, Nam H, Majithia AR, Ideker T. A genotype-phenotype transformer to assess and explain polygenic risk. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.10.23.619940. [PMID: 40291728 PMCID: PMC12026415 DOI: 10.1101/2024.10.23.619940] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2025]
Abstract
Genome-wide association studies have linked millions of genetic variants to biomedical phenotypes, but their utility has been limited by lack of mechanistic understanding and widespread epistatic interactions. Recently, Transformer models have emerged as a powerful machine learning architecture with potential to address these and other challenges. Accordingly, here we introduce the Genotype-to-Phenotype Transformer (G2PT), a framework for modeling hierarchical information flow among variants, genes, multigenic systems, and phenotypes. As proof-of-concept, we use G2PT to model the genetics of TG/HDL (triglycerides to high-density lipoprotein cholesterol), an indicator of metabolic health. G2PT predicts this trait via attention to 1,395 variants underlying at least 20 systems, including immune response and cholesterol transport, with accuracy exceeding state-of-the-art. It implicates 40 epistatic interactions, including epistasis between APOA4 and CETP in phospholipid transfer, a target pathway for cholesterol modification. This work positions hierarchical graph transformers as a next-generation approach to polygenic risk.
Collapse
|
3
|
Engel SR, Aleksander S, Nash RS, Wong ED, Weng S, Miyasato SR, Sherlock G, Cherry JM. Saccharomyces Genome Database: advances in genome annotation, expanded biochemical pathways, and other key enhancements. Genetics 2025; 229:iyae185. [PMID: 39530598 PMCID: PMC11912841 DOI: 10.1093/genetics/iyae185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Revised: 10/29/2024] [Accepted: 11/02/2024] [Indexed: 11/16/2024] Open
Abstract
Budding yeast (Saccharomyces cerevisiae) is the most extensively characterized eukaryotic model organism and has long been used to gain insight into the fundamentals of genetics, cellular biology, and the functions of specific genes and proteins. The Saccharomyces Genome Database (SGD) is a scientific resource that provides information about the genome and biology of S. cerevisiae. For more than 30 years, SGD has maintained the genetic nomenclature, chromosome maps, and functional annotation for budding yeast along with search and analysis tools to explore these data. Here, we describe recent updates at SGD, including the 2 most recent reference genome annotation updates, expanded biochemical pathway representation, changes to SGD search and data files, and other enhancements to the SGD website and user interface. These activities are part of our continuing effort to promote insights gained from yeast to enable the discovery of functional relationships between sequence and gene products in fungi and higher eukaryotes.
Collapse
Affiliation(s)
- Stacia R Engel
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Suzi Aleksander
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Robert S Nash
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Edith D Wong
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Shuai Weng
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Stuart R Miyasato
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| | - J Michael Cherry
- Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA
| |
Collapse
|
4
|
Symonová R, Jůza T, Tesfaye M, Brabec M, Bartoň D, Blabolil P, Draštík V, Kočvara L, Muška M, Prchalová M, Říha M, Šmejkal M, Souza AT, Sajdlová Z, Tušer M, Vašek M, Skubic C, Brabec J, Kubečka J. Transition to Piscivory Seen Through Brain Transcriptomics in a Juvenile Percid Fish: Complex Interplay of Differential Gene Transcription, Alternative Splicing, and ncRNA Activity. JOURNAL OF EXPERIMENTAL ZOOLOGY. PART A, ECOLOGICAL AND INTEGRATIVE PHYSIOLOGY 2025; 343:257-277. [PMID: 39629900 PMCID: PMC11788885 DOI: 10.1002/jez.2886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 11/11/2024] [Accepted: 11/13/2024] [Indexed: 02/04/2025]
Abstract
Pikeperch (Sander Lucioperca) belongs to main predatory fish species in freshwater bodies throughout Europe playing the key role by reducing planktivorous fish abundance. Two size classes of the young-of-the-year (YOY) pikeperch are known in Europe and North America. Our long-term fish survey elucidates late-summer size distribution of YOY pikeperch in the Lipno Reservoir (Czechia) and recognizes two distinct subcohorts: smaller pelagic planktivores heavily outnumber larger demersal piscivores. To explore molecular mechanisms accompanying the switch from planktivory to piscivory, we compared brain transcriptomes of both subcohorts and identified 148 differentially transcribed genes. The pathway enrichment analyses identified the piscivorous phase to be associated with genes involved in collagen and extracellular matrix generation with numerous Gene Ontology (GO), while the planktivorous phase was associated with genes for non-muscle-myosins (NMM) with less GO terms. Transcripts further upregulated in planktivores from the periphery of the NMM network were Pmchl, Pomcl, and Pyyb, all involved also in appetite control and producing (an)orexigenic neuropeptides. Noncoding RNAs were upregulated in transcriptomes of planktivores including three transcripts of snoRNA U85. Thirty genes mostly functionally unrelated to those differentially transcribed were alternatively spliced between the subcohorts. Our results indicate planktivores as potentially driven by voracity to initiate the switch to piscivory, while piscivores undergo a dynamic brain development. We propose a spatiotemporal spreading of juvenile development over a longer period and larger spatial scales through developmental plasticity as an adaptation to exploiting all types of resources and decreasing the intraspecific competition.
Collapse
Affiliation(s)
- Radka Symonová
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
- Faculty of ScienceUniversity of South BohemiaČeské BudějoviceCzech Republic
| | - Tomáš Jůza
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Million Tesfaye
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
- South Bohemian Research Centre for Aquaculture and Biodiversity of Hydrocenoses, Faculty of Fisheries and Protection of WatersUniversity of South Bohemia in České BudějoviceVodňanyCzech Republic
| | - Marek Brabec
- Institute of Computer ScienceCzech Academy of SciencesPragueCzech Republic
| | - Daniel Bartoň
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Petr Blabolil
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
- Faculty of ScienceUniversity of South BohemiaČeské BudějoviceCzech Republic
| | - Vladislav Draštík
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Luboš Kočvara
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Milan Muška
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Marie Prchalová
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Milan Říha
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Marek Šmejkal
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Allan T. Souza
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
- Institute for Atmospheric and Earth System Research INARForest Sciences, Faculty of Agriculture and Forestry, University of HelsinkiHelsinkiFinland
| | - Zuzana Sajdlová
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Michal Tušer
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Mojmír Vašek
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Cene Skubic
- Institute for Biochemistry and Molecular Genetics, Centre for Functional Genomics and Bio‐Chips, Faculty of MedicineUniversity of LjubljanaLjubljanaSlovenia
| | - Jakub Brabec
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| | - Jan Kubečka
- Institute of HydrobiologyBiology Centre of the Czech Academy of SciencesČeské BudějoviceCzech Republic
| |
Collapse
|
5
|
Raciti D, Van Auken KM, Arnaboldi V, Tabone CJ, Muller HM, Sternberg PW. Characterization and automated classification of sentences in the biomedical literature: a case study for biocuration of gene expression and protein kinase activity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.06.631539. [PMID: 39829858 PMCID: PMC11741306 DOI: 10.1101/2025.01.06.631539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]
Abstract
Biological knowledgebases are essential resources for biomedical researchers, providing ready access to gene function and genomic data. Professional, manual curation of knowledgebases, however, is labor-intensive and thus high-performing machine learning methods that improve biocuration efficiency are needed. Here we report on sentence-level classification to identify biocuration-relevant sentences in the full text of published references for two gene function data types: gene expression and protein kinase activity. We performed a detailed characterization of sentences from references in the WormBase bibliography and used this characterization to define three tasks for classifying sentences as either 1) fully curatable, 2) fully and partially curatable, or 3) all language-related. We evaluated various machine learning (ML) models applied to these tasks and found that GPT and BioBERT achieve the highest average performance, resulting in F1 performance scores ranging from 0.89 to 0.99 depending upon the task. Our findings demonstrate the feasibility of extracting biocuration-relevant sentences from full text. Integrating these models into professional biocuration workflows, such as those used by the Alliance of Genome Resources and the ACKnowledge community curation platform, might well facilitate efficient and accurate annotation of the biomedical literature.
Collapse
Affiliation(s)
- Daniela Raciti
- Division of Biology and Biological Engineering, 1200 E. California Boulevard, California Institute of Technology, Pasadena, CA 91125, USA
| | - Kimberly M. Van Auken
- Division of Biology and Biological Engineering, 1200 E. California Boulevard, California Institute of Technology, Pasadena, CA 91125, USA
| | - Valerio Arnaboldi
- Division of Biology and Biological Engineering, 1200 E. California Boulevard, California Institute of Technology, Pasadena, CA 91125, USA
| | | | - Hans-Michael Muller
- Division of Biology and Biological Engineering, 1200 E. California Boulevard, California Institute of Technology, Pasadena, CA 91125, USA
| | - Paul W. Sternberg
- Division of Biology and Biological Engineering, 1200 E. California Boulevard, California Institute of Technology, Pasadena, CA 91125, USA
| |
Collapse
|
6
|
Bateman A, Martin MJ, Orchard S, Magrane M, Adesina A, Ahmad S, Bowler-Barnett EH, Bye-A-Jee H, Carpentier D, Denny P, Fan J, Garmiri P, Gonzales LJDC, Hussein A, Ignatchenko A, Insana G, Ishtiaq R, Joshi V, Jyothi D, Kandasaamy S, Lock A, Luciani A, Luo J, Lussi Y, Marin JSM, Raposo P, Rice DL, Santos R, Speretta E, Stephenson J, Totoo P, Tyagi N, Urakova N, Vasudev P, Warner K, Wijerathne S, Yu CWH, Zaru R, Bridge AJ, Aimo L, Argoud-Puy G, Auchincloss AH, Axelsen KB, Bansal P, Baratin D, Batista Neto TM, Blatter MC, Bolleman JT, Boutet E, Breuza L, Gil BC, Casals-Casas C, Echioukh KC, Coudert E, Cuche B, de Castro E, Estreicher A, Famiglietti ML, Feuermann M, Gasteiger E, Gaudet P, Gehant S, Gerritsen V, Gos A, Gruaz N, Hulo C, Hyka-Nouspikel N, Jungo F, Kerhornou A, Mercier PL, Lieberherr D, Masson P, Morgat A, Paesano S, Pedruzzi I, Pilbout S, Pourcel L, Poux S, Pozzato M, Pruess M, Redaschi N, Rivoire C, Sigrist CJA, Sonesson K, Sundaram S, Sveshnikova A, Wu CH, Arighi CN, Chen C, Chen Y, Huang H, Laiho K, Lehvaslaiho M, McGarvey P, Natale DA, Ross K, Vinayaka CR, Wang Y, Zhang J. UniProt: the Universal Protein Knowledgebase in 2025. Nucleic Acids Res 2025; 53:D609-D617. [PMID: 39552041 PMCID: PMC11701636 DOI: 10.1093/nar/gkae1010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Revised: 10/14/2024] [Accepted: 10/16/2024] [Indexed: 11/19/2024] Open
Abstract
The aim of the UniProt Knowledgebase (UniProtKB; https://www.uniprot.org/) is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication, we describe ongoing changes to our production pipeline to limit the sequences available in UniProtKB to high-quality, non-redundant reference proteomes. We continue to manually curate the scientific literature to add the latest functional data and use machine learning techniques. We also encourage community curation to ensure key publications are not missed. We provide an update on the automatic annotation methods used by UniProtKB to predict information for unreviewed entries describing unstudied proteins. Finally, updates to the UniProt website are described, including a new tab linking protein to genomic information. In recognition of its value to the scientific community, the UniProt database has been awarded Global Core Biodata Resource status.
Collapse
|
7
|
Yang H, Fu H, Zhang M, Liu Y, He YO, Wang C, Cheng L. EnrichDO: a global weighted model for Disease Ontology enrichment analysis. Gigascience 2025; 14:giaf021. [PMID: 40139908 PMCID: PMC11945307 DOI: 10.1093/gigascience/giaf021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2024] [Revised: 12/18/2024] [Accepted: 02/14/2025] [Indexed: 03/29/2025] Open
Abstract
BACKGROUND Disease Ontology (DO) has been widely studied in biomedical research and clinical practice to describe the roles of genes. DO enrichment analysis is an effective means to discover associations between genes and diseases. Compared to hundreds of Gene Ontology (GO)-based enrichment analysis methods, however, DO-based methods are relatively scarce, and most current DO-based approaches are term-for-term and thus are unable to solve over-enrichment problems caused by the "true-path" rule. RESULTS Here, we describe a novel double-weighted model, EnrichDO, which leverages the latest annotations of the human genome with DO terms and integrates DO graph topology on a global scale. Compared to classic enrichment methods (mainly for GO) and existing DO-based enrichment tools, EnrichDO performs better in both GO and DO enrichment analysis cases. It can accurately identify more specific terms, without ignoring the truly associated parent terms, as shown in the Alzheimer's disease (AD) case (AD ranked first). Moreover, both a simulated test and a data perturbation test validate the accuracy and robustness of EnrichDO. Finally, EnrichDO is applied to other types of datasets to expand its application, including gene expression profile datasets, a host gene set of microorganisms, and hallmark gene sets. Based on the findings reported here, EnrichDO shows significant improvement via all experimental results. CONCLUSIONS EnrichDO provides an effective DO enrichment analysis model for gaining insight into the significance of a particular gene set in the context of disease. To increase the usability of EnrichDO, we have developed an R-based software package, which is freely available through Bioconductor (https://bioconductor.org/packages/release/bioc/html/EnrichDO.html) or at https://github.com/liangcheng-hrbmu/EnrichDO.
Collapse
Affiliation(s)
- Haixiu Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Hongyu Fu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Meiyi Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Yangyang Liu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Yongqun Oliver He
- Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Chao Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Liang Cheng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
- National Health Commission (NHC) Key Laboratory of Molecular Probes and Targeted Diagnosis and Therapy, Harbin Medical University, Harbin 150028, China
| |
Collapse
|
8
|
Carbonneau M, Li Y, Qu Y, Zheng Y, Wood AC, Wang M, Liu C, Huan T, Joehanes R, Guo X, Yao J, Taylor KD, Tracy RP, Peter D, Liu Y, Johnson WC, Post WS, Blackwell T, Rotter JI, Rich SS, Redline S, Fornage M, Wang J, Ning H, Hou L, Lloyd-jones D, Ferrier K, Min YI, Carson AP, Raffield LM, Teumer A, Grabe HJ, Völzke H, Nauck M, Dörr M, Domingo-Relloso A, Fretts A, Tellez-Plaza M, Cole S, Navas-Acien A, Wang M, Murabito JM, Heard-Costa NL, Prescott B, Xanthakis V, Mozaffarian D, Levy D, Ma J. DNA Methylation Signatures of Cardiovascular Health Provide Insights into Diseases. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.11.19.24317587. [PMID: 39606375 PMCID: PMC11601778 DOI: 10.1101/2024.11.19.24317587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
Background The association of overall cardiovascular health (CVH) with changes in DNA methylation (DNAm) has not been well characterized. Methods We calculated the American Heart Association's Life's Essential 8 (LE8) score to reflect CVH in five cohorts with diverse ancestry backgrounds. Epigenome-wide association studies (EWAS) for LE8 score were conducted, followed by bioinformatic analyses. DNAm loci significantly associated with LE8 score were used to calculate a CVH DNAm score. We examined the association of the CVH DNAm score with incident CVD, CVD-specific mortality, and all-cause mortality. Results We identified 609 CpGs associated with LE8 score at false discovery rate (FDR) < 0.05 in the discovery analysis and at Bonferroni corrected P < 0.05 in the multi-cohort replication stage. Most had low-to-moderate heterogeneity (414 CpGs [68.0%] with I2 < 0.2) in replication analysis. Pathway enrichment analyses and phenome-wide association study (PheWAS) search associated these CpGs with inflammatory or autoimmune phenotypes. We observed enrichment for phenotypes in the EWAS catalog, with 29-fold enrichment for stroke (P = 2.4e-15) and 21-fold for ischemic heart disease (P = 7.4e-38). Two-sample Mendelian randomization (MR) analysis showed significant association between 141 CpGs and ten phenotypes (261 CpG-phenotype pairs) at FDR < 0.05. For example, hypomethylation at cg20544516 (MIR33B; SREBF1) associated with lower risk of stroke (P = 8.1e-6). In multivariable prospective analyses, the CVH DNAm score was consistently associated with clinical outcomes across participating cohorts, the reduction in risk of incident CVD, CVD mortality, and all-cause mortality per standard deviation increase in the DNAm score ranged from 19% to 32%, 28% to 40%, and 27% to 45%, respectively. Conclusions We identified new DNAm signatures for CVH across diverse cohorts. Our analyses indicate that immune response-related pathways may be the key mechanism underpinning the association between CVH and clinical outcomes.
Collapse
Affiliation(s)
- Madeleine Carbonneau
- Population Sciences Branch, Division of Intramural Research, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
- Framingham Heart Study, Framingham, MA
| | - Yi Li
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Yishu Qu
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N Lake Shore Drive, Chicago, IL 60611, USA
| | - Yinan Zheng
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N Lake Shore Drive, Chicago, IL 60611, USA
| | - Alexis C. Wood
- United States Department of Agriculture (USDA)/ARS Children’s Nutrition Research Center, Baylor College of Medicine, TX, USA
| | - Mengyao Wang
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Chunyu Liu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
| | - Tianxiao Huan
- Population Sciences Branch, Division of Intramural Research, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
- Framingham Heart Study, Framingham, MA
| | - Roby Joehanes
- Population Sciences Branch, Division of Intramural Research, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
- Framingham Heart Study, Framingham, MA
| | - Xiuqing Guo
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, 1124 W. Carson Street, Torrance, CA 90502, USA
| | - Jie Yao
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, 1124 W. Carson Street, Torrance, CA 90502, USA
| | - Kent D. Taylor
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, 1124 W. Carson Street, Torrance, CA 90502, USA
| | - Russell P. Tracy
- Department of Pathology & Laboratory Medicine, University of Vermont Larner College of Medicine, 360 South Park Drive, Colchester, VT 05446, USA
| | - Durda Peter
- Department of Pathology & Laboratory Medicine, University of Vermont Larner College of Medicine, 360 South Park Drive, Colchester, VT 05446, USA
| | - Yongmei Liu
- Duke Molecular Physiology Institute, Duke University, Durham, NC, USA
| | - W Craig Johnson
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Wendy S. Post
- Division of Cardiology, Department of Medicine, The Johns Hopkins University School of Medicine, Baltimore, USA
| | - Tom Blackwell
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Jerome I. Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, 1124 W. Carson Street, Torrance, CA 90502, USA
| | - Stephen S. Rich
- Department of Genome Sciences, University of Virginia School of Medicine, 1200 Jefferson Park Avenue, Charlottesville, VA 22903, USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Departments of Medicine and Neurology, Brigham & Women’s Hospital & Harvard Medical School, Boston, MA, 02115, USA
- Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Myriam Fornage
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, 1825 Pressler Street, Houston, TX 77030, USA
| | - Jun Wang
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N Lake Shore Drive, Chicago, IL 60611, USA
| | - Hongyan Ning
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N Lake Shore Drive, Chicago, IL 60611, USA
| | - Lifang Hou
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N Lake Shore Drive, Chicago, IL 60611, USA
| | - Donald Lloyd-jones
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 680 N Lake Shore Drive, Chicago, IL 60611, USA
| | - Kendra Ferrier
- Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, Aurora, CO 80045, USA
| | - Yuan-I. Min
- Department of Medicine, University of Mississippi Medical Center, 350 W. Woodrow Wilson Avenue, Suite 701, Jackson, MS 39213, USA
| | - April P. Carson
- Department of Medicine, University of Mississippi Medical Center, 350 W. Woodrow Wilson Avenue, Suite 701, Jackson, MS 39213, USA
| | - Laura M. Raffield
- Department of Genetics, University of North Carolina at Chapel Hill, 120 Mason Farm Road, Chapel Hill, NC 27599, USA
| | - Alexander Teumer
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, Greifswald, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Greifswald, Greifswald, Germany
| | - Hans J. Grabe
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, Greifswald, Germany
- German Centre for Neurodegenerative Diseases (DZNE), Partner Site Rostock/Greifswald, Greifswald, Germany
| | - Henry Völzke
- DZHK (German Centre for Cardiovascular Research), Partner Site Greifswald, Greifswald, Germany
- Department SHIP/Clinical-Epidemiological Research, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Matthias Nauck
- DZHK (German Centre for Cardiovascular Research), Partner Site Greifswald, Greifswald, Germany
- Institute of Clinical Chemistry and Laboratory Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Marcus Dörr
- DZHK (German Centre for Cardiovascular Research), Partner Site Greifswald, Greifswald, Germany
- Department of Internal Medicine B, University Medicine Greifswald, Greifswald, Germany
| | - Arce Domingo-Relloso
- Department of Biostatistics, Columbia University Mailman School of Public Health, New York, NY, USA
- Department of Environmental Health Sciences, Columbia University Mailman School of Public Health, New York, NY, USA
| | - Amanda Fretts
- Department of Epidemiology, Cardiovascular Health Research Unit, University of Washington, Seattle, Washington, USA
| | - Maria Tellez-Plaza
- Department of Chronic Diseases Epidemiology, National Center for Epidemiology, Carlos III Health Institute, Madrid, Spain
| | - Shelley Cole
- Population Health Program, Texas Biomedical Research Institute, San Antonio, TX, USA
| | - Ana Navas-Acien
- Department of Environmental Health Sciences, Columbia University Mailman School of Public Health, New York, NY, USA
| | - Meng Wang
- Nutrition Epidemiology and Data Science, Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA
| | - Joanne M. Murabito
- Framingham Heart Study, Framingham, MA
- Department of Medicine, Section of General Internal Medicine Boston University Chobanian & Avedisian School of Medicine, Boston, MA and Boston Medical Center, Boston, MA
| | - Nancy L. Heard-Costa
- Department of Medicine, Section of General Internal Medicine Boston University Chobanian & Avedisian School of Medicine, Boston, MA and Boston Medical Center, Boston, MA
| | - Brenton Prescott
- Section of Preventive Medicine and Epidemiology, Boston University School of Medicine, Boston, MA
| | - Vanessa Xanthakis
- Framingham Heart Study, Framingham, MA
- Department of Biostatistics, Boston University School of Public Health, Boston, MA
- Section of Preventive Medicine and Epidemiology, Boston University School of Medicine, Boston, MA
| | - Dariush Mozaffarian
- Nutrition Epidemiology and Data Science, Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA
| | - Daniel Levy
- Population Sciences Branch, Division of Intramural Research, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
- Framingham Heart Study, Framingham, MA
| | - Jiantao Ma
- Nutrition Epidemiology and Data Science, Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA
| |
Collapse
|
9
|
Wang Y, Zhang X, Du X, Zhang Z, He Z. Effects of different straw breeding substrates on the growth of tomato seedlings and transcriptome analysis. Sci Rep 2024; 14:22181. [PMID: 39333764 PMCID: PMC11437046 DOI: 10.1038/s41598-024-73135-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 09/13/2024] [Indexed: 09/30/2024] Open
Abstract
Traditional substrate cultivation is now a routine practice in vegetable facility breeding. However, finding renewable substrates that can replace traditional substrates is urgent in today's production. In this study, we used the 'Pindstrup' substrate as control and two types of composite substrates made from fermented corn straw (i.e. 0-3 and 3-5 mm) to identify appropriate substrate conditions for tomato seedling growth under winter greenhouse conditions. Seedling growth potential related data and substrate water content related data were tested to carry out data-oriented support. Since the single physiological data cannot well explain the mechanism of tomato seedlings under winter greenhouse condition, transcriptomic analysis of tomato root and leaf tissues were conducted to provide theoretical basis. The physiological data of tomato seedlings and substrate showed that compared with 0-3 mm and Pindstrup substrate, tomato seedlings planted in 3-5 mm had stronger growth potential and stronger water retention, and were more suitable for planting tomato seedlings. Transcriptome analysis revealed a greater number of DEGs between the Pindstrup and the 3-5 mm. The genes in this group contribute to tomato growth as well as tomato stress response mechanisms, such as ABA-related genes, hormone-related genes and some TFs. The simulation network mechanism diagram adds evidence to the above conclusions. Overall, these results demonstrate the potential benefits of using the fermented corn straw of 3-5 mm for growing tomato seedlings and present a novel method of utilizing corn straw.
Collapse
Affiliation(s)
- Yilian Wang
- Institute of Vegetable Research, Liaoning Academy of Agricultural Sciences, Shenyang, Liaoning, China
| | - Xinyu Zhang
- Institute of Vegetable Research, Liaoning Academy of Agricultural Sciences, Shenyang, Liaoning, China
| | - Xuejing Du
- Institute of Vegetable Research, Liaoning Academy of Agricultural Sciences, Shenyang, Liaoning, China
| | - Zhibo Zhang
- Institute of Vegetable Research, Liaoning Academy of Agricultural Sciences, Shenyang, Liaoning, China
| | - Zhigang He
- Institute of Vegetable Research, Liaoning Academy of Agricultural Sciences, Shenyang, Liaoning, China.
- Institute of Plant Nutrition and Environmental Resources, Liaoning Academy of Agricultural Sciences, Shenyang, 110161, China.
| |
Collapse
|
10
|
Engel SR, Aleksander S, Nash RS, Wong ED, Weng S, Miyasato SR, Sherlock G, Cherry JM. Saccharomyces Genome Database: Advances in Genome Annotation, Expanded Biochemical Pathways, and Other Key Enhancements. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.16.613348. [PMID: 39345624 PMCID: PMC11430078 DOI: 10.1101/2024.09.16.613348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/01/2024]
Abstract
Budding yeast (Saccharomyces cerevisiae) is the most extensively characterized eukaryotic model organism and has long been used to gain insight into the fundamentals of genetics, cellular biology, and the functions of specific genes and proteins. The Saccharomyces Genome Database (SGD) is a scientific resource that provides information about the genome and biology of S. cerevisiae. For more than 30 years, SGD has maintained the genetic nomenclature, chromosome maps, and functional annotation for budding yeast along with search and analysis tools to explore these data. Here we describe recent updates at SGD, including the two most recent reference genome annotation updates, expanded biochemical pathways representation, changes to SGD search and data files, and other enhancements to the SGD website and user interface. These activities are part of our continuing effort to promote insights gained from yeast to enable the discovery of functional relationships between sequence and gene products in fungi and higher eukaryotes.
Collapse
Affiliation(s)
- Stacia R Engel
- Department of Genetics, Stanford University, Palo Alto, CA 94304, USA
| | - Suzi Aleksander
- Department of Genetics, Stanford University, Palo Alto, CA 94304, USA
| | - Robert S Nash
- Department of Genetics, Stanford University, Palo Alto, CA 94304, USA
| | - Edith D Wong
- Department of Genetics, Stanford University, Palo Alto, CA 94304, USA
| | - Shuai Weng
- Department of Genetics, Stanford University, Palo Alto, CA 94304, USA
| | - Stuart R Miyasato
- Department of Genetics, Stanford University, Palo Alto, CA 94304, USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University, Palo Alto, CA 94304, USA
| | - J Michael Cherry
- Department of Genetics, Stanford University, Palo Alto, CA 94304, USA
| |
Collapse
|
11
|
Ma X, Huang T, Li X, Zhou X, Pan H, Du A, Zeng Y, Yuan K, Wang Z. Exploration of the link between COVID-19 and gastric cancer from the perspective of bioinformatics and systems biology. Front Med (Lausanne) 2024; 11:1428973. [PMID: 39371335 PMCID: PMC11449776 DOI: 10.3389/fmed.2024.1428973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 09/04/2024] [Indexed: 10/08/2024] Open
Abstract
Background Coronavirus disease 2019 (COVID-19), an infectious disease caused by severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), has caused a global pandemic. Gastric cancer (GC) poses a great threat to people's health, which is a high-risk factor for COVID-19. Previous studies have found some associations between GC and COVID-19, whereas the underlying molecular mechanisms are not well understood. Methods We employed bioinformatics and systems biology to explore these links between GC and COVID-19. Gene expression profiles of COVID-19 (GSE196822) and GC (GSE179252) were obtained from the Gene Expression Omnibus (GEO) database. After identifying the shared differentially expressed genes (DEGs) for GC and COVID-19, functional annotation, protein-protein interaction (PPI) network, hub genes, transcriptional regulatory networks and candidate drugs were analyzed. Results We identified 209 shared DEGs between COVID-19 and GC. Functional analyses highlighted immune-related pathways as key players in both diseases. Ten hub genes (CDK1, KIF20A, TPX2, UBE2C, HJURP, CENPA, PLK1, MKI67, IFI6, IFIT2) were identified. The transcription factor/gene and miRNA/gene interaction networks identified 38 transcription factors (TFs) and 234 miRNAs. More importantly, we identified ten potential therapeutic agents, including ciclopirox, resveratrol, etoposide, methotrexate, trifluridine, enterolactone, troglitazone, calcitriol, dasatinib and deferoxamine, some of which have been reported to improve and treat GC and COVID-19. Conclusion This research offer valuable insights into the molecular interplay between COVID-19 and GC, potentially guiding future therapeutic strategies.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Kefei Yuan
- Division of Liver Surgery, Department of General Surgery and Laboratory of Liver Surgery, and State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, Chengdu, China
| | - Zhen Wang
- Division of Liver Surgery, Department of General Surgery and Laboratory of Liver Surgery, and State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, Chengdu, China
| |
Collapse
|
12
|
Bai D, Ellington CN, Mo S, Song L, Xing EP. AttentionPert: accurately modeling multiplexed genetic perturbations with multi-scale effects. Bioinformatics 2024; 40:i453-i461. [PMID: 38940174 PMCID: PMC11211811 DOI: 10.1093/bioinformatics/btae244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
MOTIVATION Genetic perturbations (e.g. knockouts, variants) have laid the foundation for our understanding of many diseases, implicating pathogenic mechanisms and indicating therapeutic targets. However, experimental assays are fundamentally limited by the number of measurable perturbations. Computational methods can fill this gap by predicting perturbation effects under novel conditions, but accurately predicting the transcriptional responses of cells to unseen perturbations remains a significant challenge. RESULTS We address this by developing a novel attention-based neural network, AttentionPert, which accurately predicts gene expression under multiplexed perturbations and generalizes to unseen conditions. AttentionPert integrates global and local effects in a multi-scale model, representing both the nonuniform system-wide impact of the genetic perturbation and the localized disturbance in a network of gene-gene similarities, enhancing its ability to predict nuanced transcriptional responses to both single and multi-gene perturbations. In comprehensive experiments, AttentionPert demonstrates superior performance across multiple datasets outperforming the state-of-the-art method in predicting differential gene expressions and revealing novel gene regulations. AttentionPert marks a significant improvement over current methods, particularly in handling the diversity of gene perturbations and in predicting out-of-distribution scenarios. AVAILABILITY AND IMPLEMENTATION Code is available at https://github.com/BaiDing1234/AttentionPert.
Collapse
Affiliation(s)
- Ding Bai
- Machine Learning Department, Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, 00000, United Arabic Emirates
| | - Caleb N Ellington
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, 15213, United States
| | - Shentong Mo
- Machine Learning Department, Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, 00000, United Arabic Emirates
| | - Le Song
- Machine Learning Department, Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, 00000, United Arabic Emirates
| | - Eric P Xing
- Machine Learning Department, Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, 00000, United Arabic Emirates
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA, 15213, United States
| |
Collapse
|
13
|
Yamagata Y, Fukuyama T, Onami S, Masuya H. Prototyping an Ontological Framework for Cellular Senescence Mechanisms: A Homeostasis Imbalance Perspective. Sci Data 2024; 11:485. [PMID: 38729991 PMCID: PMC11087592 DOI: 10.1038/s41597-024-03331-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 04/29/2024] [Indexed: 05/12/2024] Open
Abstract
Although cellular senescence is a key factor in organismal aging, with both positive and negative effects on individuals, its mechanisms remain largely unknown. Thus, integrating knowledge is essential to explain how cellular senescence manifests in tissue damage and age-related diseases. Here, we propose an ontological model that organizes knowledge of cellular senescence in a computer-readable form. We manually annotated and defined cellular senescence processes, molecules, anatomical structures, phenotypes, and other entities based on the Homeostasis Imbalance Process ontology (HOIP). We described the mechanisms as causal relationships of processes and modelled a homeostatic imbalance between stress and stress response in cellular senescence for a unified framework. HOIP was assessed formally, and the relationships between cellular senescence and diseases were inferred for higher-order knowledge processing. We visualized cellular senescence processes to support knowledge utilization. Our study provides a knowledge base to help elucidate mechanisms linking cellular and organismal aging.
Collapse
Affiliation(s)
- Yuki Yamagata
- Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan.
- Life Science Data Sharing Unit, Infrastructure Research and Development Division, RIKEN Information R&D and Strategy Headquarters, 2-2-3 Minatojima-Minamimachi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan.
| | - Tsubasa Fukuyama
- AXIOHELIX CO. LTD., 8F Kubota Bldg., 1-12-17 Kandaizumicho, Chiyoda-ku, Tokyo, 101-0024, Japan
| | - Shuichi Onami
- Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan
- Life Science Data Sharing Unit, Infrastructure Research and Development Division, RIKEN Information R&D and Strategy Headquarters, 2-2-3 Minatojima-Minamimachi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan
| | - Hiroshi Masuya
- Life Science Data Sharing Unit, Infrastructure Research and Development Division, RIKEN Information R&D and Strategy Headquarters, 2-2-3 Minatojima-Minamimachi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan.
- Integrated Bioresource Information Division, RIKEN BioResource Research Center, Kouyadai 3-1-1 Tsukuba, Ibaraki, 305-0074, Japan.
| |
Collapse
|
14
|
Sternberg PW, Van Auken K, Wang Q, Wright A, Yook K, Zarowiecki M, Arnaboldi V, Becerra A, Brown S, Cain S, Chan J, Chen WJ, Cho J, Davis P, Diamantakis S, Dyer S, Grigoriadis D, Grove CA, Harris T, Howe K, Kishore R, Lee R, Longden I, Luypaert M, Müller HM, Nuin P, Quinton-Tulloch M, Raciti D, Schedl T, Schindelman G, Stein L. WormBase 2024: status and transitioning to Alliance infrastructure. Genetics 2024; 227:iyae050. [PMID: 38573366 PMCID: PMC11075546 DOI: 10.1093/genetics/iyae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/19/2024] [Accepted: 03/20/2024] [Indexed: 04/05/2024] Open
Abstract
WormBase has been the major repository and knowledgebase of information about the genome and genetics of Caenorhabditis elegans and other nematodes of experimental interest for over 2 decades. We have 3 goals: to keep current with the fast-paced C. elegans research, to provide better integration with other resources, and to be sustainable. Here, we discuss the current state of WormBase as well as progress and plans for moving core WormBase infrastructure to the Alliance of Genome Resources (the Alliance). As an Alliance member, WormBase will continue to interact with the C. elegans community, develop new features as needed, and curate key information from the literature and large-scale projects.
Collapse
Affiliation(s)
- Paul W Sternberg
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Qinghua Wang
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Adam Wright
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Karen Yook
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Magdalena Zarowiecki
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Valerio Arnaboldi
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Andrés Becerra
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Stephanie Brown
- School of Infection and Immunity, University of Glasgow, Glasgow G12 8TA, UK
| | - Scott Cain
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Juancarlos Chan
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Wen J Chen
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Jaehyoung Cho
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Paul Davis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Stavros Diamantakis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Sarah Dyer
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | | | - Christian A Grove
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Todd Harris
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Kevin Howe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Ranjana Kishore
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Raymond Lee
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Ian Longden
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Manuel Luypaert
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Hans-Michael Müller
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Paulo Nuin
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Mark Quinton-Tulloch
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Daniela Raciti
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Tim Schedl
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Gary Schindelman
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Lincoln Stein
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| |
Collapse
|
15
|
Rutherford KM, Lera-Ramírez M, Wood V. PomBase: a Global Core Biodata Resource-growth, collaboration, and sustainability. Genetics 2024; 227:iyae007. [PMID: 38376816 PMCID: PMC11075564 DOI: 10.1093/genetics/iyae007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 01/13/2024] [Indexed: 02/21/2024] Open
Abstract
PomBase (https://www.pombase.org), the model organism database (MOD) for fission yeast, was recently awarded Global Core Biodata Resource (GCBR) status by the Global Biodata Coalition (GBC; https://globalbiodata.org/) after a rigorous selection process. In this MOD review, we present PomBase's continuing growth and improvement over the last 2 years. We describe these improvements in the context of the qualitative GCBR indicators related to scientific quality, comprehensivity, accelerating science, user stories, and collaborations with other biodata resources. This review also showcases the depth of existing connections both within the biocuration ecosystem and between PomBase and its user community.
Collapse
Affiliation(s)
- Kim M Rutherford
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| | - Manuel Lera-Ramírez
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Valerie Wood
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| |
Collapse
|
16
|
Aleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Cherry JM, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, dos Santos G, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Gramates LS, Grove CA, Hale P, Harris T, Hayman GT, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Van Auken K, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, et alAleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Cherry JM, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, dos Santos G, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Gramates LS, Grove CA, Hale P, Harris T, Hayman GT, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Van Auken K, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, Zytkovicz M. Updates to the Alliance of Genome Resources central infrastructure. Genetics 2024; 227:iyae049. [PMID: 38552170 PMCID: PMC11075569 DOI: 10.1093/genetics/iyae049] [Show More Authors] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 02/28/2024] [Accepted: 02/29/2024] [Indexed: 04/09/2024] Open
Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, Caenorhabditis elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and application programming interfaces (APIs). Here, we focus on developments over the last 2 years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse software. We describe our progress toward a central persistent database to support curation, the data modeling that underpins harmonization, and progress toward a state-of-the-art literature curation system with integrated artificial intelligence and machine learning (AI/ML).
Collapse
Affiliation(s)
| | | | | | - Giulia Antonazzo
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Valerio Arnaboldi
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Helen Attrill
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Andrés Becerra
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Susan M Bello
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Olin Blodgett
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | | | - Carol J Bult
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Scott Cain
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Brian R Calvi
- Department of Biology, Indiana University , Bloomington, IN 47408 , USA
| | - Seth Carbon
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Juancarlos Chan
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Wen J Chen
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - J Michael Cherry
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Jaehyoung Cho
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Madeline A Crosby
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Jeffrey L De Pons
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | | | - Stavros Diamantakis
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Mary E Dolan
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gilberto dos Santos
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Sarah Dyer
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Dustin Ebert
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Stacia R Engel
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - David Fashena
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Malcolm Fisher
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Saoirse Foley
- Department of Biological Sciences, Carnegie Mellon University , 5000 Forbes Ave, Pittsburgh, PA 15203
| | - Adam C Gibson
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Varun R Gollapally
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - L Sian Gramates
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Christian A Grove
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Paul Hale
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Todd Harris
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - G Thomas Hayman
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Yanhui Hu
- Department of Genetics, Howard Hughes Medical Institute , Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115 , USA
| | - Christina James-Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Kamran Karimi
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Kalpana Karra
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Ranjana Kishore
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Anne E Kwitek
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Stanley J F Laulederkind
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Raymond Lee
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Ian Longden
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Manuel Luypaert
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Nicholas Markarian
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Steven J Marygold
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Beverley Matthews
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Monica S McAndrews
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gillian Millburn
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Stuart Miyasato
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Howie Motenko
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Sierra Moxon
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Hans-Michael Muller
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Christopher J Mungall
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory , Berkeley, CA
| | - Anushya Muruganujan
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Tremayne Mushayahama
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Robert S Nash
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Paulo Nuin
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Holly Paddock
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Troy Pells
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Norbert Perrimon
- Department of Genetics, Howard Hughes Medical Institute , Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115 , USA
| | - Christian Pich
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Mark Quinton-Tulloch
- European Molecular Biology Laboratory, European Bioinformatics Institute , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , UK
| | - Daniela Raciti
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | | | | | - Susan Russo Gelbart
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Leyla Ruzicka
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Gary Schindelman
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - David R Shaw
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Ajay Shrivatsav
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Amy Singer
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Constance M Smith
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Cynthia L Smith
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Jennifer R Smith
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Lincoln Stein
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Paul W Sternberg
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Christopher J Tabone
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033 , USA
| | - Ketaki Thorat
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Jyothi Thota
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Monika Tomczuk
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Vitor Trovisco
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Marek A Tutaj
- Medical College of Wisconsin—Rat Genome Database, Departments of Physiology and Biomedical Engineering , Medical College of Wisconsin, Milwaukee, WI 53226 , USA
| | - Jose-Maria Urbano
- Department of Physiology, Development and Neuroscience , University of Cambridge, Downing Street, Cambridge CB2 3DY , UK
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Ceri E Van Slyke
- Institute of Neuroscience, University of Oregon , Eugene, OR 97403
| | - Peter D Vize
- Department of Biological Sciences, University of Calgary , 507 Campus Dr NW, Calgary, AB T2N 4V8 , Canada
| | - Qinghua Wang
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Shuai Weng
- Department of Genetics, Stanford University , Stanford, CA 94305
| | | | - Laurens G Wilming
- The Jackson Laboratory for Mammalian Genomics, Bar Harbor , ME 04609 , USA
| | - Edith D Wong
- Department of Genetics, Stanford University , Stanford, CA 94305
| | - Adam Wright
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research , Toronto, ON M5G0A3 , Canada
| | - Karen Yook
- Division of Biology and Biological Engineering 140-18, California Institute of Technology , Pasadena, CA 91125 , USA
| | - Pinglei Zhou
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| | - Aaron Zorn
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center , 3333 Burnet Ave, Cincinnati, OH 45229 , USA
| | - Mark Zytkovicz
- The Biological Laboratories, Harvard University , 16 Divinity Avenue, Cambridge, MA 02138 , USA
| |
Collapse
|
17
|
Chaowongdee S, Vannatim N, Malichan S, Kuncharoen N, Tongyoo P, Siriwan W. Comparative transcriptomics analysis reveals defense mechanisms of Manihot esculenta Crantz against Sri Lanka Cassava MosaicVirus. BMC Genomics 2024; 25:436. [PMID: 38698332 PMCID: PMC11067156 DOI: 10.1186/s12864-024-10315-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 04/16/2024] [Indexed: 05/05/2024] Open
Abstract
BACKGROUND Cassava mosaic disease (CMD), caused by Sri Lankan cassava mosaic virus (SLCMV) infection, has been identified as a major pernicious disease in Manihot esculenta Crantz (cassava) plantations. It is widespread in Southeast Asia, especially in Thailand, which is one of the main cassava supplier countries. With the aim of restricting the spread of SLCMV, we explored the gene expression of a tolerant cassava cultivar vs. a susceptible cassava cultivar from the perspective of transcriptional regulation and the mechanisms underlying plant immunity and adaptation. RESULTS Transcriptomic analysis of SLCMV-infected tolerant (Kasetsart 50 [KU 50]) and susceptible (Rayong 11 [R 11]) cultivars at three infection stages-that is, at 21 days post-inoculation (dpi) (early/asymptomatic), 32 dpi (middle/recovery), and 67 dpi (late infection/late recovery)-identified 55,699 expressed genes. Differentially expressed genes (DEGs) between SLCMV-infected KU 50 and R 11 cultivars at (i) 21 dpi to 32 dpi (the early to middle stage), and (ii) 32 dpi to 67 dpi (the middle stage to late stage) were then identified and validated by real-time quantitative PCR (RT-qPCR). DEGs among different infection stages represent genes that respond to and regulate the viral infection during specific stages. The transcriptomic comparison between the tolerant and susceptible cultivars highlighted the role of gene expression regulation in tolerant and susceptible phenotypes. CONCLUSIONS This study identified genes involved in epigenetic modification, transcription and transcription factor activities, plant defense and oxidative stress response, gene expression, hormone- and metabolite-related pathways, and translation and translational initiation activities, particularly in KU 50 which represented the tolerant cultivar in this study.
Collapse
Affiliation(s)
- Somruthai Chaowongdee
- Center of Excellence on Agricultural Biotechnology (AG-BIO/MHESI), Bangkok, 10900, Thailand
- Center for Agricultural Biotechnology, Kasetsart University, Kamphaengsaen Campus, Nakhon Pathom, 73140, Thailand
| | - Nattachai Vannatim
- Department of Plant Pathology, Faculty of Agriculture, Kasetsart University, Bangkok, 10900, Thailand
| | - Srihunsa Malichan
- Department of Plant Pathology, Faculty of Agriculture, Kasetsart University, Bangkok, 10900, Thailand
| | - Nattakorn Kuncharoen
- Department of Plant Pathology, Faculty of Agriculture, Kasetsart University, Bangkok, 10900, Thailand
| | - Pumipat Tongyoo
- Center of Excellence on Agricultural Biotechnology (AG-BIO/MHESI), Bangkok, 10900, Thailand
- Center for Agricultural Biotechnology, Kasetsart University, Kamphaengsaen Campus, Nakhon Pathom, 73140, Thailand
| | - Wanwisa Siriwan
- Department of Plant Pathology, Faculty of Agriculture, Kasetsart University, Bangkok, 10900, Thailand.
| |
Collapse
|
18
|
Abstract
Nearly one-third of all stroke patients develop depression at any time after a stroke, and its presence is associated with unfavorable outcomes. This narrative review aims to provide a synopsis of possible pharmacological and non-pharmacological treatment modalities for post-stroke depression (PSD). Several studies have demonstrated the efficacy and safety of selective serotonin reuptake inhibitors in treating the symptoms of this clinical condition. The treatment of PSD has been recently enhanced by innovative approaches, such as cognitive-behavioral therapy, virtual reality, telehealth, repetitive transcranial magnetic stimulation, and non-conventional therapies, which might improve depression treatment in stroke survivors. Future high-quality randomized controlled trials are necessary to confirm this hypothesis.
Collapse
Affiliation(s)
- Alberto Raggi
- Unit of Neurology, G.B. Morgagni - L. Pierantoni Civic Hospital, Forlì
| | | | - Raffaele Ferri
- Department of Neurology, Oasi Research Institute - IRCCS, Troina, Italy
| |
Collapse
|
19
|
Sun H, He Z, Gao Y, Yang Y, Wang Y, Gu A, Xu J, Quan Y, Yang Y. Polyoxyethylene tallow amine and glyphosate exert different developmental toxicities on human pluripotent stem cells-derived heart organoid model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 918:170675. [PMID: 38316312 DOI: 10.1016/j.scitotenv.2024.170675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 01/27/2024] [Accepted: 02/02/2024] [Indexed: 02/07/2024]
Abstract
The early stage of heart development is highly susceptible to various environmental factors. While the use of animal models has aided in identifying numerous environmental risk factors, the variability between species and the low throughput limit their translational potential. Recently, a type of self-assembling cardiac structures, known as human heart organoids (hHOs), exhibits a remarkable biological consistency with human heart. However, the feasibility of hHOs for assessing cardiac developmental risk factors remains unexplored. Here, we focused on the cardiac developmental effects of core components of Glyphosate-based herbicides (GBHs), the most widely used herbicides, to evaluate the reliability of hHOs for the prediction of possible cardiogenesis toxicity. GBHs have been proven toxic to cardiac development based on multiple animal models, with the mechanism remaining unknown. We found that polyoxyethylene tallow amine (POEA), the most common surfactant in GBHs formulations, played a dominant role in GBHs' heart developmental toxicity. Though there were a few differences in transcriptive features, hHOs exposed to sole POEA and combined POEA and Glyphosate would suffer from both disruption of heart contraction and disturbance of commitment in cardiomyocyte isoforms. By contrast, Glyphosate only caused mild epicardial hyperplasia. This study not only sheds light on the toxic mechanism of GBHs, but also serves as a methodological demonstration, showcasing its effectiveness in recognizing and evaluating environmental risk factors, and deciphering toxic mechanisms.
Collapse
Affiliation(s)
- Hao Sun
- State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| | - Zhazheng He
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China
| | - Yao Gao
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China
| | - Yanhan Yang
- State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| | - Yachang Wang
- State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| | - Aihua Gu
- State Key Laboratory of Reproductive Medicine and Offspring Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
| | - Jin Xu
- State Key Laboratory of Reproductive Medicine and Offspring Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
| | - Yingyi Quan
- State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| | - Yang Yang
- State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China.
| |
Collapse
|
20
|
Liu Y, Li X, Chen C, Ding N, Zheng P, Chen X, Ma S, Yang M. TCMNPAS: a comprehensive analysis platform integrating network formulaology and network pharmacology for exploring traditional Chinese medicine. Chin Med 2024; 19:50. [PMID: 38519956 PMCID: PMC10958928 DOI: 10.1186/s13020-024-00924-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Accepted: 03/11/2024] [Indexed: 03/25/2024] Open
Abstract
The application of network formulaology and network pharmacology has significantly advanced the scientific understanding of traditional Chinese medicine (TCM) treatment mechanisms in disease. The field of herbal biology is experiencing a surge in data generation. However, researchers are encountering challenges due to the fragmented nature of the data and the reliance on programming tools for data analysis. We have developed TCMNPAS, a comprehensive analysis platform that integrates network formularology and network pharmacology. This platform is designed to investigate in-depth the compatibility characteristics of TCM formulas and their potential molecular mechanisms. TCMNPAS incorporates multiple resources and offers a range of functions designed for automated analysis implementation, including prescription mining, molecular docking, network pharmacology analysis, and visualization. These functions enable researchers to analyze and obtain core herbs and core formulas from herbal prescription data through prescription mining. Additionally, TCMNPAS facilitates virtual screening of active compounds in TCM and its formulas through batch molecular docking, allowing for the rapid construction and analysis of networks associated with "herb-compound-target-pathway" and disease targets. Built upon the integrated analysis concept of network formulaology and network pharmacology, TCMNPAS enables quick point-and-click completion of network-based association analysis, spanning from core formula mining from clinical data to the exploration of therapeutic targets for disease treatment. TCMNPAS serves as a powerful platform for uncovering the combinatorial rules and mechanism of TCM formulas holistically. We distribute TCMNPAS within an open-source R package at GitHub ( https://github.com/yangpluszhu/tcmnpas ), and the project is freely available at http://54.223.75.62:3838/ .
Collapse
Affiliation(s)
- Yishu Liu
- LongHua Hospital Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China
| | - Xue Li
- LongHua Hospital Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China
| | - Chao Chen
- LongHua Hospital Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China
| | - Nan Ding
- LongHua Hospital Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China
| | - Peiyong Zheng
- LongHua Hospital Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China
| | - Xiaoyun Chen
- LongHua Hospital Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China
| | - Shiyu Ma
- Ruijin Hospital Affiliated to Shanghai Jiaotong University School of Medicine, Shanghai, 200025, China.
| | - Ming Yang
- LongHua Hospital Shanghai University of Traditional Chinese Medicine, Shanghai, 200032, China.
| |
Collapse
|
21
|
Markarian N, Van Auken KM, Ebert D, Sternberg PW. Enrichment on steps, not genes, improves inference of differentially expressed pathways. PLoS Comput Biol 2024; 20:e1011968. [PMID: 38527066 PMCID: PMC10994554 DOI: 10.1371/journal.pcbi.1011968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 04/04/2024] [Accepted: 03/05/2024] [Indexed: 03/27/2024] Open
Abstract
Enrichment analysis is frequently used in combination with differential expression data to investigate potential commonalities amongst lists of genes and generate hypotheses for further experiments. However, current enrichment analysis approaches on pathways ignore the functional relationships between genes in a pathway, particularly OR logic that occurs when a set of proteins can each individually perform the same step in a pathway. As a result, these approaches miss pathways with large or multiple sets because of an inflation of pathway size (when measured as the total gene count) relative to the number of steps. We address this problem by enriching on step-enabling entities in pathways. We treat sets of protein-coding genes as single entities, and we also weight sets to account for the number of genes in them using the multivariate Fisher's noncentral hypergeometric distribution. We then show three examples of pathways that are recovered with this method and find that the results have significant proportions of pathways not found in gene list enrichment analysis.
Collapse
Affiliation(s)
- Nicholas Markarian
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
- Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Kimberly M. Van Auken
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Dustin Ebert
- Division of Bioinformatics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Paul W. Sternberg
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| |
Collapse
|
22
|
Milacic M, Beavers D, Conley P, Gong C, Gillespie M, Griss J, Haw R, Jassal B, Matthews L, May B, Petryszak R, Ragueneau E, Rothfels K, Sevilla C, Shamovsky V, Stephan R, Tiwari K, Varusai T, Weiser J, Wright A, Wu G, Stein L, Hermjakob H, D’Eustachio P. The Reactome Pathway Knowledgebase 2024. Nucleic Acids Res 2024; 52:D672-D678. [PMID: 37941124 PMCID: PMC10767911 DOI: 10.1093/nar/gkad1025] [Citation(s) in RCA: 290] [Impact Index Per Article: 290.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 10/14/2023] [Accepted: 10/20/2023] [Indexed: 11/10/2023] Open
Abstract
The Reactome Knowledgebase (https://reactome.org), an Elixir and GCBR core biological data resource, provides manually curated molecular details of a broad range of normal and disease-related biological processes. Processes are annotated as an ordered network of molecular transformations in a single consistent data model. Reactome thus functions both as a digital archive of manually curated human biological processes and as a tool for discovering functional relationships in data such as gene expression profiles or somatic mutation catalogs from tumor cells. Here we review progress towards annotation of the entire human proteome, targeted annotation of disease-causing genetic variants of proteins and of small-molecule drugs in a pathway context, and towards supporting explicit annotation of cell- and tissue-specific pathways. Finally, we briefly discuss issues involved in making Reactome more fully interoperable with other related resources such as the Gene Ontology and maintaining the resulting community resource network.
Collapse
Affiliation(s)
- Marija Milacic
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Deidre Beavers
- Oregon Health and Science University, Portland, OR 97239, USA
| | - Patrick Conley
- Oregon Health and Science University, Portland, OR 97239, USA
| | - Chuqiao Gong
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Marc Gillespie
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
- College of Pharmacy and Health Sciences, St. John's University, Queens, NY 11439, USA
| | - Johannes Griss
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
- Department of Dermatology, Medical University of Vienna, 1090 Vienna, Austria
| | - Robin Haw
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Bijay Jassal
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Lisa Matthews
- NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Bruce May
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | | | - Eliot Ragueneau
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Karen Rothfels
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Cristoffer Sevilla
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | | | - Ralf Stephan
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
- Institute for Globally Distributed Open Research and Education (IGDORE)
| | - Krishna Tiwari
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Thawfeek Varusai
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
- Open Targets, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Joel Weiser
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Adam Wright
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Guanming Wu
- Oregon Health and Science University, Portland, OR 97239, USA
| | - Lincoln Stein
- Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A1, Canada
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | | |
Collapse
|
23
|
Antonazzo G, Gaudet P, Lovering RC, Attrill H. Representation of non-coding RNA-mediated regulation of gene expression using the Gene Ontology. RNA Biol 2024; 21:36-48. [PMID: 39374113 PMCID: PMC11459742 DOI: 10.1080/15476286.2024.2408523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Revised: 09/16/2024] [Accepted: 09/20/2024] [Indexed: 10/09/2024] Open
Abstract
Regulatory non-coding RNAs (ncRNAs) are increasingly recognized as integral to the control of biological processes. This is often through the targeted regulation of mRNA expression, but this is by no means the only mechanism through which regulatory ncRNAs act. The Gene Ontology (GO) has long been used for the systematic annotation of protein-coding and ncRNA gene function, but rapid progress in the understanding of ncRNAs meant that the ontology needed to be revised to accurately reflect current knowledge. Here, a targeted effort to revise GO terms used for the annotation of regulatory ncRNAs is described, focusing on microRNAs (miRNAs), long non-coding RNAs (lncRNAs), small interfering RNAs (siRNAs) and PIWI-interacting RNAs (piRNAs). This paper provides guidance to biocurators annotating ncRNA-mediated processes using the GO and serves as background for researchers wishing to make use of the GO in their studies of ncRNAs and the biological processes they regulate.
Collapse
Affiliation(s)
- Giulia Antonazzo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| | - Pascale Gaudet
- SIB Swiss Institute of Bioinformatics, Swiss-Prot Group, Geneva, Switzerland
| | - Ruth C. Lovering
- Functional Gene Annotation, Institute of Cardiovascular Science, University College London, London, UK
| | - Helen Attrill
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| |
Collapse
|
24
|
Bult CJ, Sternberg PW. The alliance of genome resources: transforming comparative genomics. Mamm Genome 2023; 34:531-544. [PMID: 37666946 PMCID: PMC10628019 DOI: 10.1007/s00335-023-10015-2] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 08/11/2023] [Indexed: 09/06/2023]
Abstract
Comparing genomic and biological characteristics across multiple species is essential to using model systems to investigate the molecular and cellular mechanisms underlying human biology and disease and to translate mechanistic insights from studies in model organisms for clinical applications. Building a scalable knowledge commons platform that supports cross-species comparison of rich, expertly curated knowledge regarding gene function, phenotype, and disease associations available for model organisms and humans is the primary mission of the Alliance of Genome Resources (the Alliance). The Alliance is a consortium of seven model organism knowledgebases (mouse, rat, yeast, nematode, zebrafish, frog, fruit fly) and the Gene Ontology resource. The Alliance uses a common set of gene ortholog assertions as the basis for comparing biological annotations across the organisms represented in the Alliance. The major types of knowledge associated with genes that are represented in the Alliance database currently include gene function, phenotypic alleles and variants, human disease associations, pathways, gene expression, and both protein-protein and genetic interactions. The Alliance has enhanced the ability of researchers to easily compare biological annotations for common data types across model organisms and human through the implementation of shared programmatic access mechanisms, data-specific web pages with a unified "look and feel", and interactive user interfaces specifically designed to support comparative biology. The modular infrastructure developed by the Alliance allows the resource to serve as an extensible "knowledge commons" capable of expanding to accommodate additional model organisms.
Collapse
|
25
|
Aleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Michael Cherry J, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, Santos GD, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Sian Gramates L, Grove CA, Hale P, Harris T, Thomas Hayman G, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Auken KV, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, et alAleksander SA, Anagnostopoulos AV, Antonazzo G, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Michael Cherry J, Cho J, Crosby MA, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, Santos GD, Dyer S, Ebert D, Engel SR, Fashena D, Fisher M, Foley S, Gibson AC, Gollapally VR, Sian Gramates L, Grove CA, Hale P, Harris T, Thomas Hayman G, Hu Y, James-Zorn C, Karimi K, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, Markarian N, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nash RS, Nuin P, Paddock H, Pells T, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schindelman G, Shaw DR, Sherlock G, Shrivatsav A, Singer A, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Tomczuk M, Trovisco V, Tutaj MA, Urbano JM, Auken KV, Van Slyke CE, Vize PD, Wang Q, Weng S, Westerfield M, Wilming LG, Wong ED, Wright A, Yook K, Zhou P, Zorn A, Zytkovicz M. Updates to the Alliance of Genome Resources Central Infrastructure Alliance of Genome Resources Consortium. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.20.567935. [PMID: 38045425 PMCID: PMC10690154 DOI: 10.1101/2023.11.20.567935] [Show More Authors] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively-studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organisms currently represented in the Alliance are budding yeast, C. elegans, Drosophila, zebrafish, frog, laboratory mouse, laboratory rat, and the Gene Ontology Consortium. The project is in a rapid development phase to harmonize knowledge, store it, analyze it, and present it to the community through a web portal, direct downloads, and APIs. Here we focus on developments over the last two years. Specifically, we added and enhanced tools for browsing the genome (JBrowse), downloading sequences, mining complex data (AllianceMine), visualizing pathways, full-text searching of the literature (Textpresso), and sequence similarity searching (SequenceServer). We enhanced existing interactive data tables and added an interactive table of paralogs to complement our representation of orthology. To support individual model organism communities, we implemented species-specific "landing pages" and will add disease-specific portals soon; in addition, we support a common community forum implemented in Discourse. We describe our progress towards a central persistent database to support curation, the data modeling that underpins harmonization, and progress towards a state-of-the art literature curation system with integrated Artificial Intelligence and Machine Learning (AI/ML).
Collapse
|
26
|
Clarke JL, Cooper LD, Poelchau MF, Berardini TZ, Elser J, Farmer AD, Ficklin S, Kumari S, Laporte MA, Nelson RT, Sadohara R, Selby P, Thessen AE, Whitehead B, Sen TZ. Data sharing and ontology use among agricultural genetics, genomics, and breeding databases and resources of the Agbiodata Consortium. Database (Oxford) 2023; 2023:baad076. [PMID: 37971715 PMCID: PMC10653126 DOI: 10.1093/database/baad076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 10/17/2023] [Indexed: 11/19/2023]
Abstract
Over the last couple of decades, there has been a rapid growth in the number and scope of agricultural genetics, genomics and breeding databases and resources. The AgBioData Consortium (https://www.agbiodata.org/) currently represents 44 databases and resources (https://www.agbiodata.org/databases) covering model or crop plant and animal GGB data, ontologies, pathways, genetic variation and breeding platforms (referred to as 'databases' throughout). One of the goals of the Consortium is to facilitate FAIR (Findable, Accessible, Interoperable, and Reusable) data management and the integration of datasets which requires data sharing, along with structured vocabularies and/or ontologies. Two AgBioData working groups, focused on Data Sharing and Ontologies, respectively, conducted a Consortium-wide survey to assess the current status and future needs of the members in those areas. A total of 33 researchers responded to the survey, representing 37 databases. Results suggest that data-sharing practices by AgBioData databases are in a fairly healthy state, but it is not clear whether this is true for all metadata and data types across all databases; and that, ontology use has not substantially changed since a similar survey was conducted in 2017. Based on our evaluation of the survey results, we recommend (i) providing training for database personnel in a specific data-sharing techniques, as well as in ontology use; (ii) further study on what metadata is shared, and how well it is shared among databases; (iii) promoting an understanding of data sharing and ontologies in the stakeholder community; (iv) improving data sharing and ontologies for specific phenotypic data types and formats; and (v) lowering specific barriers to data sharing and ontology use, by identifying sustainability solutions, and the identification, promotion, or development of data standards. Combined, these improvements are likely to help AgBioData databases increase development efforts towards improved ontology use, and data sharing via programmatic means. Database URL https://www.agbiodata.org/databases.
Collapse
Affiliation(s)
- Jennifer L Clarke
- Department of Statistics and Department of Food Science and Technology, University of Nebraska–Lincoln, 340 Hardin Hall North Wing, Lincoln, NE 68583, USA
| | - Laurel D Cooper
- Department of Botany and Plant Pathology, Oregon State University, 2503 Cordley Hall, Corvallis, OR 97331, USA
| | - Monica F Poelchau
- USDA, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Ave, Beltsville 20705, USA
| | - Tanya Z Berardini
- The Arabidopsis Information Resource and Phoenix Bioinformatic, 39899 Balentine Drive, Suite 200, Newark, CA, USA
| | - Justin Elser
- Department of Botany and Plant Pathology, Oregon State University, 2503 Cordley Hall, Corvallis, OR 97331, USA
| | - Andrew D Farmer
- National Center for Genome Resources, 2935 Rodeo Park Dr. E., Santa Fe, NM 87505, USA
| | - Stephen Ficklin
- Department of Horticulture, Washington State University, 249 Clark Hall, PO Box 646414, Pullman, WA 99164, USA
| | - Sunita Kumari
- Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Marie-Angélique Laporte
- Digital Inclusion, Bioversity International, Parc Scientifique Agropolis II, 1990 Bd de la Lironde, Montpellier 34397, France
| | - Rex T Nelson
- USDA, Agricultural Research Service, Corn Insects and Crop Genetics Research Unit, Iowa State University, 716 Farmhouse Lane, Ames, IA 50011, USA
| | - Rie Sadohara
- Department of Plant, Soil, and Microbial Sciences, Michigan State University, 1066 Bogue St, East Lansing, MI 48824, USA
| | - Peter Selby
- School of Integrative Plant Science, College of Agriculture and Life Sciences, Cornell University, 215 Garden Avenue, Ithaca, NY 14850, USA
| | - Anne E Thessen
- Department of Biomedical Informatics, University of Colorado Anschutz, 1890 N. Revere Court, Mailstop F600, Aurora CO 80045, USA
| | - Brandon Whitehead
- Data Science and Informatics, Manaaki Whenua—Landcare Research, Ltd., Riddet Road, Massey University, Palmerston North 4472, New Zealand
| | - Taner Z Sen
- USDA, Agricultural Research Service, Crop Improvement Genetics Research Unit, Western Regional Research Center, 800 Buchanan St, Albany 94710, USA
- Department of Bioengineering, University of California, 306 Stanley Hall, Berkeley, CA 94720, USA
| |
Collapse
|
27
|
Prakash SJ, Van Auken KM, Hill DP, Sternberg PW. Semantic representation of neural circuit knowledge in Caenorhabditis elegans. Brain Inform 2023; 10:30. [PMID: 37947958 PMCID: PMC10638142 DOI: 10.1186/s40708-023-00208-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 09/22/2023] [Indexed: 11/12/2023] Open
Abstract
In modern biology, new knowledge is generated quickly, making it challenging for researchers to efficiently acquire and synthesise new information from the large volume of primary publications. To address this problem, computational approaches that generate machine-readable representations of scientific findings in the form of knowledge graphs have been developed. These representations can integrate different types of experimental data from multiple papers and biological knowledge bases in a unifying data model, providing a complementary method to manual review for interacting with published knowledge. The Gene Ontology Consortium (GOC) has created a semantic modelling framework that extends individual functional gene annotations to structured descriptions of causal networks representing biological processes (Gene Ontology-Causal Activity Modelling, or GO-CAM). In this study, we explored whether the GO-CAM framework could represent knowledge of the causal relationships between environmental inputs, neural circuits and behavior in the model nematode C. elegans [C. elegans Neural-Circuit Causal Activity Modelling (CeN-CAM)]. We found that, given extensions to several relevant ontologies, a wide variety of author statements from the literature about the neural circuit basis of egg-laying and carbon dioxide (CO2) avoidance behaviors could be faithfully represented with CeN-CAM. Through this process, we were able to generate generic data models for several categories of experimental results. We also discuss how semantic modelling may be used to functionally annotate the C. elegans connectome. Thus, Gene Ontology-based semantic modelling has the potential to support various machine-readable representations of neurobiological knowledge.
Collapse
Affiliation(s)
- Sharan J Prakash
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Kimberly M Van Auken
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - David P Hill
- The Jackson Laboratory, Bar Harbor, ME, 04609, USA
| | - Paul W Sternberg
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA.
| |
Collapse
|
28
|
Hill DP, Drabkin HJ, Smith CL, Van Auken KM, D’Eustachio P. Biochemical pathways represented by Gene Ontology-Causal Activity Models identify distinct phenotypes resulting from mutations in pathways. Genetics 2023; 225:iyad152. [PMID: 37579192 PMCID: PMC10550311 DOI: 10.1093/genetics/iyad152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 07/13/2023] [Accepted: 08/02/2023] [Indexed: 08/16/2023] Open
Abstract
Gene inactivation can affect the process(es) in which that gene acts and causally downstream ones, yielding diverse mutant phenotypes. Identifying the genetic pathways resulting in a given phenotype helps us understand how individual genes interact in a functional network. Computable representations of biological pathways include detailed process descriptions in the Reactome Knowledgebase and causal activity flows between molecular functions in Gene Ontology-Causal Activity Models (GO-CAMs). A computational process has been developed to convert Reactome pathways to GO-CAMs. Laboratory mice are widely used models of normal and pathological human processes. We have converted human Reactome GO-CAMs to orthologous mouse GO-CAMs, as a resource to transfer pathway knowledge between humans and model organisms. These mouse GO-CAMs allowed us to define sets of genes that function in a causally connected way. To demonstrate that individual variant genes from connected pathways result in similar but distinguishable phenotypes, we used the genes in our pathway models to cross-query mouse phenotype annotations in the Mouse Genome Database (MGD). Using GO-CAM representations of 2 related but distinct pathways, gluconeogenesis and glycolysis, we show that individual causal paths in gene networks give rise to discrete phenotypic outcomes resulting from perturbations of glycolytic and gluconeogenic genes. The accurate and detailed descriptions of gene interactions recovered in this analysis of well-studied processes suggest that this strategy can be applied to less well-understood processes in less well-studied model systems to predict phenotypic outcomes of novel gene variants and to identify potential gene targets in altered processes.
Collapse
Affiliation(s)
- David P Hill
- The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | | | | | - Kimberly M Van Auken
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - Peter D’Eustachio
- Department of Biochemistry and Molecular Pharmacology, New York University Grossman School of Medicine, New York, NY 10016, USA
| |
Collapse
|
29
|
Davis AP, Wiegers TC, Wiegers J, Wyatt B, Johnson RJ, Sciaky D, Barkalow F, Strong M, Planchart A, Mattingly CJ. CTD tetramers: a new online tool that computationally links curated chemicals, genes, phenotypes, and diseases to inform molecular mechanisms for environmental health. Toxicol Sci 2023; 195:155-168. [PMID: 37486259 PMCID: PMC10535784 DOI: 10.1093/toxsci/kfad069] [Citation(s) in RCA: 41] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/25/2023] Open
Abstract
The molecular mechanisms connecting environmental exposures to adverse endpoints are often unknown, reflecting knowledge gaps. At the Comparative Toxicogenomics Database (CTD), we developed a bioinformatics approach that integrates manually curated, literature-based interactions from CTD to generate a "CGPD-tetramer": a 4-unit block of information organized as a step-wise molecular mechanism linking an initiating Chemical, an interacting Gene, a Phenotype, and a Disease outcome. Here, we describe a novel, user-friendly tool called CTD Tetramers that generates these evidence-based CGPD-tetramers for any curated chemical, gene, phenotype, or disease of interest. Tetramers offer potential solutions for the unknown underlying mechanisms and intermediary phenotypes connecting a chemical exposure to a disease. Additionally, multiple tetramers can be assembled to construct detailed modes-of-action for chemical-induced disease pathways. As well, tetramers can help inform environmental influences on adverse outcome pathways (AOPs). We demonstrate the tool's utility with relevant use cases for a variety of environmental chemicals (eg, perfluoroalkyl substances, bisphenol A), phenotypes (eg, apoptosis, spermatogenesis, inflammatory response), and diseases (eg, asthma, obesity, male infertility). Finally, we map AOP adverse outcome terms to corresponding CTD terms, allowing users to query for tetramers that can help augment AOP pathways with additional stressors, genes, and phenotypes, as well as formulate potential AOP disease networks (eg, liver cirrhosis and prostate cancer). This novel tool, as part of the complete suite of tools offered at CTD, provides users with computational datasets and their supporting evidence to potentially fill exposure knowledge gaps and develop testable hypotheses about environmental health.
Collapse
Affiliation(s)
- Allan Peter Davis
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Thomas C Wiegers
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Jolene Wiegers
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Brent Wyatt
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Robin J Johnson
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Daniela Sciaky
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Fern Barkalow
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Melissa Strong
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Antonio Planchart
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
- Center for Human Health and the Environment, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Carolyn J Mattingly
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
- Center for Human Health and the Environment, North Carolina State University, Raleigh, North Carolina 27695, USA
| |
Collapse
|
30
|
Qian G, Fang H, Chen A, Sun Z, Huang M, Luo M, Cheng E, Zhang S, Wang X, Fang H. A hub gene signature as a therapeutic target and biomarker for sepsis and geriatric sepsis-induced ARDS concomitant with COVID-19 infection. Front Immunol 2023; 14:1257834. [PMID: 37822934 PMCID: PMC10562607 DOI: 10.3389/fimmu.2023.1257834] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 09/08/2023] [Indexed: 10/13/2023] Open
Abstract
Background COVID-19 and sepsis represent formidable public health challenges, characterized by incompletely elucidated molecular mechanisms. Elucidating the interplay between COVID-19 and sepsis, particularly in geriatric patients suffering from sepsis-induced acute respiratory distress syndrome (ARDS), is of paramount importance for identifying potential therapeutic interventions to mitigate hospitalization and mortality risks. Methods We employed bioinformatics and systems biology approaches to identify hub genes, shared pathways, molecular biomarkers, and candidate therapeutics for managing sepsis and sepsis-induced ARDS in the context of COVID-19 infection, as well as co-existing or sequentially occurring infections. We corroborated these hub genes utilizing murine sepsis-ARDS models and blood samples derived from geriatric patients afflicted by sepsis-induced ARDS. Results Our investigation revealed 189 differentially expressed genes (DEGs) shared among COVID-19 and sepsis datasets. We constructed a protein-protein interaction network, unearthing pivotal hub genes and modules. Notably, nine hub genes displayed significant alterations and correlations with critical inflammatory mediators of pulmonary injury in murine septic lungs. Simultaneously, 12 displayed significant changes and correlations with a neutrophil-recruiting chemokine in geriatric patients with sepsis-induced ARDS. Of these, six hub genes (CD247, CD2, CD40LG, KLRB1, LCN2, RETN) showed significant alterations across COVID-19, sepsis, and geriatric sepsis-induced ARDS. Our single-cell RNA sequencing analysis of hub genes across diverse immune cell types furnished insights into disease pathogenesis. Functional analysis underscored the interconnection between sepsis/sepsis-ARDS and COVID-19, enabling us to pinpoint potential therapeutic targets, transcription factor-gene interactions, DEG-microRNA co-regulatory networks, and prospective drug and chemical compound interactions involving hub genes. Conclusion Our investigation offers potential therapeutic targets/biomarkers, sheds light on the immune response in geriatric patients with sepsis-induced ARDS, emphasizes the association between sepsis/sepsis-ARDS and COVID-19, and proposes prospective alternative pathways for targeted therapeutic interventions.
Collapse
Affiliation(s)
- Guojun Qian
- Department of Anesthesiology, Zhongshan Hospital, Fudan University, Shanghai, China
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
| | - Hongwei Fang
- Department of Anesthesiology, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Anning Chen
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
| | - Zhun Sun
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
| | - Meiying Huang
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
| | - Mengyuan Luo
- Department of Anesthesiology, Minhang Branch, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Erdeng Cheng
- Department of Anesthesiology, Minhang Branch, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Shengyi Zhang
- Department of Thoracic Surgery, Songjiang Hospital Affiliated to Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Xiaokai Wang
- Department of Interventional and Vascular Surgery, Xuzhou First People's Hospital, Xuzhou, China
| | - Hao Fang
- Department of Anesthesiology, Zhongshan Hospital, Fudan University, Shanghai, China
- Department of Anesthesiology, Minhang Branch, Zhongshan Hospital, Fudan University, Shanghai, China
- Fudan Zhangjiang Institute, Shanghai, China
- Department of Anesthesiology, Shanghai Geriatric Medical Center, Shanghai, China
| |
Collapse
|
31
|
Prakash SJ, Van Auken KM, Hill DP, Sternberg PW. Semantic Representation of Neural Circuit Knowledge in Caenorhabditis elegans. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.28.538760. [PMID: 37162850 PMCID: PMC10168330 DOI: 10.1101/2023.04.28.538760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
In modern biology, new knowledge is generated quickly, making it challenging for researchers to efficiently acquire and synthesise new information from the large volume of primary publications. To address this problem, computational approaches that generate machine-readable representations of scientific findings in the form of knowledge graphs have been developed. These representations can integrate different types of experimental data from multiple papers and biological knowledge bases in a unifying data model, providing a complementary method to manual review for interacting with published knowledge. The Gene Ontology Consortium (GOC) has created a semantic modelling framework that extends individual functional gene annotations to structured descriptions of causal networks representing biological processes (Gene Ontology Causal Activity Modelling, or GO-CAM). In this study, we explored whether the GO-CAM framework could represent knowledge of the causal relationships between environmental inputs, neural circuits and behavior in the model nematode C. elegans (C. elegans Neural Circuit Causal Activity Modelling (CeN-CAM)). We found that, given extensions to several relevant ontologies, a wide variety of author statements from the literature about the neural circuit basis of egg-laying and carbon dioxide (CO2) avoidance behaviors could be faithfully represented with CeN-CAM. Through this process, we were able to generate generic data models for several categories of experimental results. We also discuss how semantic modelling may be used to functionally annotate the C. elegans connectome. Thus, Gene Ontology-based semantic modelling has the potential to support various machine-readable representations of neurobiological knowledge.
Collapse
Affiliation(s)
- Sharan J Prakash
- 1. Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - Kimberly M Van Auken
- 1. Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - David P Hill
- 2. The Jackson Laboratory, Bar Harbor, ME, 04609 USA
| | - Paul W Sternberg
- 1. Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| |
Collapse
|
32
|
Chen C, Luo L, Zheng C, Ding P, Liu H, Luo H. Self-prediction of relations in GO facilitates its quality auditing. J Biomed Inform 2023; 144:104441. [PMID: 37437682 DOI: 10.1016/j.jbi.2023.104441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 06/29/2023] [Accepted: 07/05/2023] [Indexed: 07/14/2023]
Abstract
As applications of the gene ontology (GO) increase rapidly in the biomedical field, quality auditing of it is becoming more and more important. Existing auditing methods are mostly based on rules, observed patterns or hypotheses. In this study, we propose a machine-learning-based framework for GO to audit itself: we first predict the IS-A relations among concepts in GO, then use differences between predicted results and existing relations to uncover potential errors. Specifically, we transfer the taxonomy of GO 2020 January release into a dataset with concept pairs as items and relations between them as labels(pairs with no direct IS-A relation are labeled as ndrs). To fully obtain the representation of each pair, we integrate the embeddings for the concept name, concept definition, as well as concept node in a substring-based topological graph. We divide the dataset into 10 parts, and rotate over all the parts by choosing one part as the testing set and the remaining as the training set each time. After 10 rotations, the prediction model predicted 4,640 existing IS-A pairs as ndrs. In the GO 2022 March release, 340 of these predictions were validated, demonstrating significance with a p-value of 1.60e-46 when compared to the results of randomly selected pairs. On the other hand, the model predicted 2,840 out of 17,079 selected ndrs in GO to be IS-A's relations. After deleting those that caused redundancies and circles, 924 predicted IS-A's relations remained. Among 200 pairs randomly selected, 30 were validated as missing IS-A's by domain experts. In conclusion, this study investigates a novel way of auditing biomedical ontologies by predicting the relations in it, which was shown to be useful for discovering potential errors.
Collapse
Affiliation(s)
- Cheng Chen
- School of Computer Science, University of South China, Hengyang, Hunan, 421001, China
| | - Lingyun Luo
- School of Computer Science, University of South China, Hengyang, Hunan, 421001, China.
| | - Chunlei Zheng
- VA Boston Cooperative Studies Program, MAVERIC, VA Boston Healthcare System, Boston, MA, USA
| | - Pingjian Ding
- School of Computer Science, University of South China, Hengyang, Hunan, 421001, China.
| | - Huan Liu
- School of Computer Science, University of South China, Hengyang, Hunan, 421001, China
| | - Hanyu Luo
- School of Computer Science, University of South China, Hengyang, Hunan, 421001, China
| |
Collapse
|
33
|
Zhao T, Sun S, Gao Y, Rong Y, Wang H, Qi S, Li Y. Luteolin and triptolide: Potential therapeutic compounds for post-stroke depression via protein STAT. Heliyon 2023; 9:e18622. [PMID: 37600392 PMCID: PMC10432979 DOI: 10.1016/j.heliyon.2023.e18622] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 07/18/2023] [Accepted: 07/24/2023] [Indexed: 08/22/2023] Open
Abstract
Post stroke depression (PSD) is a common neuropsychiatric complication following stroke closely associated with the immune system. The development of medications for PSD remains to be a considerable challenge due to the unclear mechanism of PSD. Multiple researches agree that the functions of gene ontology (GO) are efficient for the investigation of disease mechanisms, and DeepPurpose (DP) is extremely valuable for the mining of new drugs. However, GO terms and DP have not yet been applied to explore the pathogenesis and drug treatment of PSD. This study aimed to interpret the mechanism of PSD and discover important drug candidates targeting risk proteins, based on immune-related risk GO functions and informatics algorithms. According to the risk genes of PSD, we identified 335 immune-related risk GO functions and 37 compounds. Based on the construction of the GO function network, we found that STAT protein may be a pivot protein in underlying the mechanism of PSD. Additionally, we also established networks of Protein-Protein Interaction as well as Gene-GO function to facilitate the evaluation of key genes. Based on DP, a total of 37 candidate compounds targeting 7 key proteins were identified with a potential for the therapy of PSD. Furthermore, we noted that the mechanisms by which luteolin and triptolide acting on STAT-related GO function might involve three crucial pathways, including specifically hsa04010 (MAPK signaling pathway), hsa04151 (PI3K-Akt signaling pathway) and hsa04060 (Cytokine-cytokine receptor interaction). Thus, this study provided fresh and powerful information for the mechanism and therapeutic strategies of PSD.
Collapse
Affiliation(s)
- Tianyang Zhao
- Department of Anesthesiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Siqi Sun
- Department of Anesthesiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Yueyue Gao
- Department of Anesthesiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Yuting Rong
- Department of Anesthesiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Hanwenchen Wang
- The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Sihua Qi
- Department of Anesthesiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Yan Li
- Department of Anesthesiology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| |
Collapse
|
34
|
Hill DP, Drabkin HJ, Smith CL, Van Auken KM, D’Eustachio P. Biochemical Pathways Represented by Gene Ontology Causal Activity Models Identify Distinct Phenotypes Resulting from Mutations in Pathways. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.22.541760. [PMID: 37293039 PMCID: PMC10245817 DOI: 10.1101/2023.05.22.541760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Gene inactivation can affect the process(es) in which that gene acts and causally downstream ones, yielding diverse mutant phenotypes. Identifying the genetic pathways resulting in a given phenotype helps us understand how individual genes interact in a functional network. Computable representations of biological pathways include detailed process descriptions in the Reactome Knowledgebase, and causal activity flows between molecular functions in Gene Ontology-Causal Activity Models (GO-CAMs). A computational process has been developed to convert Reactome pathways to GO-CAMs. Laboratory mice are widely used models of normal and pathological human processes. We have converted human Reactome GO-CAMs to orthologous mouse GO-CAMs, as a resource to transfer pathway knowledge between humans and model organisms. These mouse GO-CAMs allowed us to define sets of genes that function in a connected and well-defined way. To test whether individual genes from well-defined pathways result in similar and distinguishable phenotypes, we used the genes in our pathway models to cross-query mouse phenotype annotations in the Mouse Genome Database (MGD). Using GO-CAM representations of two related but distinct pathways, gluconeogenesis and glycolysis, we can identify causal paths in gene networks that give rise to discrete phenotypic outcomes for perturbations of glycolysis and gluconeogenesis. The accurate and detailed descriptions of gene interactions recovered in this analysis of well-studied processes suggest that this strategy can be applied to less well-understood processes in less well-studied model systems to predict phenotypic outcomes of novel gene variants and to identify potential gene targets in altered processes.
Collapse
Affiliation(s)
| | | | | | - Kimberly M Van Auken
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena CA 91125 USA
| | - Peter D’Eustachio
- Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York NY 10016 USA
| |
Collapse
|
35
|
Malec SA, Taneja SB, Albert SM, Elizabeth Shaaban C, Karim HT, Levine AS, Munro P, Callahan TJ, Boyce RD. Causal feature selection using a knowledge graph combining structured knowledge from the biomedical literature and ontologies: A use case studying depression as a risk factor for Alzheimer's disease. J Biomed Inform 2023; 142:104368. [PMID: 37086959 PMCID: PMC10355339 DOI: 10.1016/j.jbi.2023.104368] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 03/03/2023] [Accepted: 04/17/2023] [Indexed: 04/24/2023]
Abstract
BACKGROUND Causal feature selection is essential for estimating effects from observational data. Identifying confounders is a crucial step in this process. Traditionally, researchers employ content-matter expertise and literature review to identify confounders. Uncontrolled confounding from unidentified confounders threatens validity, conditioning on intermediate variables (mediators) weakens estimates, and conditioning on common effects (colliders) induces bias. Additionally, without special treatment, erroneous conditioning on variables combining roles introduces bias. However, the vast literature is growing exponentially, making it infeasible to assimilate this knowledge. To address these challenges, we introduce a novel knowledge graph (KG) application enabling causal feature selection by combining computable literature-derived knowledge with biomedical ontologies. We present a use case of our approach specifying a causal model for estimating the total causal effect of depression on the risk of developing Alzheimer's disease (AD) from observational data. METHODS We extracted computable knowledge from a literature corpus using three machine reading systems and inferred missing knowledge using logical closure operations. Using a KG framework, we mapped the output to target terminologies and combined it with ontology-grounded resources. We translated epidemiological definitions of confounder, collider, and mediator into queries for searching the KG and summarized the roles played by the identified variables. We compared the results with output from a complementary method and published observational studies and examined a selection of confounding and combined role variables in-depth. RESULTS Our search identified 128 confounders, including 58 phenotypes, 47 drugs, 35 genes, 23 collider, and 16 mediator phenotypes. However, only 31 of the 58 confounder phenotypes were found to behave exclusively as confounders, while the remaining 27 phenotypes played other roles. Obstructive sleep apnea emerged as a potential novel confounder for depression and AD. Anemia exemplified a variable playing combined roles. CONCLUSION Our findings suggest combining machine reading and KG could augment human expertise for causal feature selection. However, the complexity of causal feature selection for depression with AD highlights the need for standardized field-specific databases of causal variables. Further work is needed to optimize KG search and transform the output for human consumption.
Collapse
Affiliation(s)
- Scott A Malec
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Sanya B Taneja
- Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA
| | - Steven M Albert
- Department of Behavioral and Community Health Sciences, School of Public Health, University of Pittsburgh, Pittsburgh, PA, USA
| | - C Elizabeth Shaaban
- Department of Epidemiology, School of Public Health, University of Pittsburgh, Pittsburgh, PA, USA
| | - Helmet T Karim
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA, USA; Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, USA
| | - Arthur S Levine
- Department of Neurobiology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA; The Brain Institute, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Paul Munro
- School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, USA
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - Richard D Boyce
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA; Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
36
|
Aleksander SA, Balhoff J, Carbon S, Cherry JM, Drabkin HJ, Ebert D, Feuermann M, Gaudet P, Harris NL, Hill DP, Lee R, Mi H, Moxon S, Mungall CJ, Muruganugan A, Mushayahama T, Sternberg PW, Thomas PD, Van Auken K, Ramsey J, Siegele DA, Chisholm RL, Fey P, Aspromonte MC, Nugnes MV, Quaglia F, Tosatto S, Giglio M, Nadendla S, Antonazzo G, Attrill H, Dos Santos G, Marygold S, Strelets V, Tabone CJ, Thurmond J, Zhou P, Ahmed SH, Asanitthong P, Luna Buitrago D, Erdol MN, Gage MC, Ali Kadhum M, Li KYC, Long M, Michalak A, Pesala A, Pritazahra A, Saverimuttu SCC, Su R, Thurlow KE, Lovering RC, Logie C, Oliferenko S, Blake J, Christie K, Corbani L, Dolan ME, Drabkin HJ, Hill DP, Ni L, Sitnikov D, Smith C, Cuzick A, Seager J, Cooper L, Elser J, Jaiswal P, Gupta P, Jaiswal P, Naithani S, Lera-Ramirez M, Rutherford K, Wood V, De Pons JL, Dwinell MR, Hayman GT, Kaldunski ML, Kwitek AE, Laulederkind SJF, Tutaj MA, Vedi M, Wang SJ, D'Eustachio P, Aimo L, Axelsen K, Bridge A, Hyka-Nouspikel N, Morgat A, Aleksander SA, Cherry JM, Engel SR, Karra K, Miyasato SR, Nash RS, Skrzypek MS, Weng S, Wong ED, Bakker E, Berardini TZ, et alAleksander SA, Balhoff J, Carbon S, Cherry JM, Drabkin HJ, Ebert D, Feuermann M, Gaudet P, Harris NL, Hill DP, Lee R, Mi H, Moxon S, Mungall CJ, Muruganugan A, Mushayahama T, Sternberg PW, Thomas PD, Van Auken K, Ramsey J, Siegele DA, Chisholm RL, Fey P, Aspromonte MC, Nugnes MV, Quaglia F, Tosatto S, Giglio M, Nadendla S, Antonazzo G, Attrill H, Dos Santos G, Marygold S, Strelets V, Tabone CJ, Thurmond J, Zhou P, Ahmed SH, Asanitthong P, Luna Buitrago D, Erdol MN, Gage MC, Ali Kadhum M, Li KYC, Long M, Michalak A, Pesala A, Pritazahra A, Saverimuttu SCC, Su R, Thurlow KE, Lovering RC, Logie C, Oliferenko S, Blake J, Christie K, Corbani L, Dolan ME, Drabkin HJ, Hill DP, Ni L, Sitnikov D, Smith C, Cuzick A, Seager J, Cooper L, Elser J, Jaiswal P, Gupta P, Jaiswal P, Naithani S, Lera-Ramirez M, Rutherford K, Wood V, De Pons JL, Dwinell MR, Hayman GT, Kaldunski ML, Kwitek AE, Laulederkind SJF, Tutaj MA, Vedi M, Wang SJ, D'Eustachio P, Aimo L, Axelsen K, Bridge A, Hyka-Nouspikel N, Morgat A, Aleksander SA, Cherry JM, Engel SR, Karra K, Miyasato SR, Nash RS, Skrzypek MS, Weng S, Wong ED, Bakker E, Berardini TZ, Reiser L, Auchincloss A, Axelsen K, Argoud-Puy G, Blatter MC, Boutet E, Breuza L, Bridge A, Casals-Casas C, Coudert E, Estreicher A, Livia Famiglietti M, Feuermann M, Gos A, Gruaz-Gumowski N, Hulo C, Hyka-Nouspikel N, Jungo F, Le Mercier P, Lieberherr D, Masson P, Morgat A, Pedruzzi I, Pourcel L, Poux S, Rivoire C, Sundaram S, Bateman A, Bowler-Barnett E, Bye-A-Jee H, Denny P, Ignatchenko A, Ishtiaq R, Lock A, Lussi Y, Magrane M, Martin MJ, Orchard S, Raposo P, Speretta E, Tyagi N, Warner K, Zaru R, Diehl AD, Lee R, Chan J, Diamantakis S, Raciti D, Zarowiecki M, Fisher M, James-Zorn C, Ponferrada V, Zorn A, Ramachandran S, Ruzicka L, Westerfield M. The Gene Ontology knowledgebase in 2023. Genetics 2023; 224:iyad031. [PMID: 36866529 PMCID: PMC10158837 DOI: 10.1093/genetics/iyad031] [Show More Authors] [Citation(s) in RCA: 819] [Impact Index Per Article: 409.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 02/10/2023] [Accepted: 02/11/2023] [Indexed: 03/04/2023] Open
Abstract
The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO-a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations-evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)-mechanistic models of molecular "pathways" (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project.
Collapse
|
37
|
Lv YW, Du Y, Ma SS, Shi YC, Xu HC, Deng L, Chen XY. Proanthocyanidins attenuates ferroptosis against influenza-induced acute lung injury in mice by reducing IFN-γ. Life Sci 2023; 314:121279. [PMID: 36526043 DOI: 10.1016/j.lfs.2022.121279] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 11/24/2022] [Accepted: 12/07/2022] [Indexed: 12/15/2022]
Abstract
BACKGROUND Acute lung injury (ALI) is associated with high morbidity and mortality and is partly driven promoted by ferroptosis. Proanthocyanidins (PAs) is a natural bioactive flavonoid with anti-inflammatory and antioxidant activities. PAs can also significantly protect against acute lung inflammation and ferroptosis in alveolar epithelial cells. However, it is unclear whether PAs can alleviate ALI by reducing ferroptosis. This study aimed to evaluate the protective effects of PAs and the potential mechanisms against Influenza A virus (IAV)-induced ALI. METHODS Mice were inoculated nasally with IAV to induce ALI. IAV-induced pulmonary inflammation and ferroptosis was tested by measuring the levels of malondialdehyde (MDA), glutathione (GSH), glutathione peroxidase 4 (GPX4), solute carrier family 7 member 11 (SLC7A11) and acyl-CoA synthetase long-chain family member (ACSL4) in lung tissue. The potential targets that PAs protect against IAV-induced ALI were determined via a systemic pharmacological analysis. The molecular mechanism of PAs in ALI treatment was investigated by assessing the level of inflammation and ferroptosis markers using Western Blot and quantitative real-time PCR. RESULTS Systemic pharmacological analysis suggested that PAs protect against IAV-induced pneumonia thorough TGF-β1 and its relative signaling pathway. PAs effectively alleviated histopathological lung injury, reduced inflammatory cytokines and chemokines secretion, which were increased in IAV-infected mice. Meanwhile, PAs further prevented mouse airway inflammation in ALI, concomitant with the decreased expression TGF-β1, smad2/3, p-Smad2, p-Smad3 and ferroptosis mediator IFN-γ. Furthermore,IFN-γ promotes cell lipid peroxidation and ferroptosis,PAs significantly reduced MDA and ACSL4 levels and upregulated GSH, GPX4, and SLC7A11. CONCLUSION Overall, PAs can attenuate ferroptosis against IAV-induced ALI via the TGF-β1/Smad2/3 pathway and is a promising novel therapeutic candidate for ALI.
Collapse
Affiliation(s)
- Yi-Wen Lv
- College of Traditional Chinese Medicine, Jinan University, Guangzhou, China
| | - Yang Du
- College of Traditional Chinese Medicine, Jinan University, Guangzhou, China
| | - Sheng-Suo Ma
- College of Traditional Chinese Medicine, Jinan University, Guangzhou, China
| | - Yu-Cong Shi
- College of Traditional Chinese Medicine, Jinan University, Guangzhou, China
| | - Hua-Chong Xu
- College of Traditional Chinese Medicine, Jinan University, Guangzhou, China.
| | - Li Deng
- College of Traditional Chinese Medicine, Jinan University, Guangzhou, China; State Key Laboratory of Dampness Syndrome of Chinese Medicine, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine (Guangdong Provincial Hospital of Chinese Medicine), Guangzhou, China; Guangzhou Key Laboratory of Formula-Pattern of Traditional Chinese Medicine, Guangzhou, China.
| | - Xiao-Yin Chen
- College of Traditional Chinese Medicine, Jinan University, Guangzhou, China; State Key Laboratory of Dampness Syndrome of Chinese Medicine, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine (Guangdong Provincial Hospital of Chinese Medicine), Guangzhou, China; Guangzhou Key Laboratory of Formula-Pattern of Traditional Chinese Medicine, Guangzhou, China.
| |
Collapse
|
38
|
Yu H, Li L, Huffman A, Beverley J, Hur J, Merrell E, Huang HH, Wang Y, Liu Y, Ong E, Cheng L, Zeng T, Zhang J, Li P, Liu Z, Wang Z, Zhang X, Ye X, Handelman SK, Sexton J, Eaton K, Higgins G, Omenn GS, Athey B, Smith B, Chen L, He Y. A new framework for host-pathogen interaction research. Front Immunol 2022; 13:1066733. [PMID: 36591248 PMCID: PMC9797517 DOI: 10.3389/fimmu.2022.1066733] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 11/14/2022] [Indexed: 12/23/2022] Open
Abstract
COVID-19 often manifests with different outcomes in different patients, highlighting the complexity of the host-pathogen interactions involved in manifestations of the disease at the molecular and cellular levels. In this paper, we propose a set of postulates and a framework for systematically understanding complex molecular host-pathogen interaction networks. Specifically, we first propose four host-pathogen interaction (HPI) postulates as the basis for understanding molecular and cellular host-pathogen interactions and their relations to disease outcomes. These four postulates cover the evolutionary dispositions involved in HPIs, the dynamic nature of HPI outcomes, roles that HPI components may occupy leading to such outcomes, and HPI checkpoints that are critical for specific disease outcomes. Based on these postulates, an HPI Postulate and Ontology (HPIPO) framework is proposed to apply interoperable ontologies to systematically model and represent various granular details and knowledge within the scope of the HPI postulates, in a way that will support AI-ready data standardization, sharing, integration, and analysis. As a demonstration, the HPI postulates and the HPIPO framework were applied to study COVID-19 with the Coronavirus Infectious Disease Ontology (CIDO), leading to a novel approach to rational design of drug/vaccine cocktails aimed at interrupting processes occurring at critical host-coronavirus interaction checkpoints. Furthermore, the host-coronavirus protein-protein interactions (PPIs) relevant to COVID-19 were predicted and evaluated based on prior knowledge of curated PPIs and domain-domain interactions, and how such studies can be further explored with the HPI postulates and the HPIPO framework is discussed.
Collapse
Affiliation(s)
- Hong Yu
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and National Health Commission (NHC) Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou Province, Guiyang, Guizhou, China
- Department of Basic Medicine, Guizhou University Medical College, Guiyang, Guizhou, China
| | - Li Li
- Department of Genetics, Harvard Medical School, Boston, MA, United States
| | - Anthony Huffman
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - John Beverley
- Department of Philosophy, University at Buffalo, Buffalo, NY, United States
- Asymmetric Operations Sector, Johns Hopkins University Applied Physics Laboratory, Laurel, MD, United States
| | - Junguk Hur
- Department of Biomedical Sciences, University of North Dakota School of Medicine and Health Sciences, Grand Forks, ND, United States
| | - Eric Merrell
- Department of Philosophy, University at Buffalo, Buffalo, NY, United States
| | - Hsin-hui Huang
- University of Michigan Medical School, Ann Arbor, MI, United States
- Department of Biotechnology and Laboratory Science in Medicine, National Yang-Ming University, Taipei, Taiwan
| | - Yang Wang
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and National Health Commission (NHC) Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou Province, Guiyang, Guizhou, China
- Department of Basic Medicine, Guizhou University Medical College, Guiyang, Guizhou, China
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Yingtong Liu
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Edison Ong
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Liang Cheng
- Department of Bioinformatics, Harbin Medical University, Harbin, Helongjian, China
| | - Tao Zeng
- Key Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Jingsong Zhang
- Key Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Pengpai Li
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, China
| | - Zhiping Liu
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, China
| | - Zhigang Wang
- Department of Biomedical Engineering, Institute of Basic Medical Sciences and School of Basic Medicine, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, China
| | - Xiangyan Zhang
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and National Health Commission (NHC) Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou Province, Guiyang, Guizhou, China
- Department of Basic Medicine, Guizhou University Medical College, Guiyang, Guizhou, China
| | - Xianwei Ye
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People’s Hospital and National Health Commission (NHC) Key Laboratory of Immunological Diseases, People’s Hospital of Guizhou Province, Guiyang, Guizhou, China
- Department of Basic Medicine, Guizhou University Medical College, Guiyang, Guizhou, China
| | | | - Jonathan Sexton
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Kathryn Eaton
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Gerry Higgins
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Gilbert S. Omenn
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Brian Athey
- University of Michigan Medical School, Ann Arbor, MI, United States
| | - Barry Smith
- Department of Philosophy, University at Buffalo, Buffalo, NY, United States
| | - Luonan Chen
- Key Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Yongqun He
- University of Michigan Medical School, Ann Arbor, MI, United States
| |
Collapse
|
39
|
Lu Z, Chen C, Gao Y, Li Y, Zhao X, Zhang H, Wei Q, Zeng X, Li Y, Wan M. Screening target genes for the treatment of PCOS via analysis of single-cell sequencing data. Ann Med 2022; 54:2975-2989. [PMID: 36286390 PMCID: PMC9621251 DOI: 10.1080/07853890.2022.2136401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND Polycystic ovary syndrome (PCOS) is a condition of the female reproductive system and it remains imperative to identify target genes responsible for its pathogenesis and develop therapeutic drugs capable of effectively treating it. METHODS We performed primary screening, staging, functional analysis as well as screening of target genes and therapeutic drugs based on single cell sequencing data of 34 oocytes from the GEO database. RESULTS Oxidative phosphorylation played a pivotal role in the development of oocytes, insulin resistance and ovulation disorders. At the cellular level, GV and MI phases were particularly critical for the biology of pregnancy. We screened PGR, SIRT1 and ADAMTS1 as hub differentially expressed genes (DEGs) and found relevant drugs using the Drug-Gene Interaction Database. In clinical study, oral contraceptives and insulin sensitisers were found to be effective in the treatment of PCOS. CONCLUSION PGR, SIRT1 and ADAMTS1 were found to be down-regulated in oocytes, ovulation and female pregnancy. These 3 genes are likely biomarkers important in the treatment of PCOS. Insulin sensitiser in combination with oral contraceptive administration were found to significantly improve PCOS.Key messagesOur study used a new bioinformatics approach to find target genes for the treatment of PCOS.Our study sought to identify target genes that affect human oocyte quality by analysing single-cell sequencing data from oocytes.We testified to our data by analysing a subset of clinical data.
Collapse
Affiliation(s)
- Zhenzhen Lu
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Chunyan Chen
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Ying Gao
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Yanhui Li
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Xiaojie Zhao
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Hanke Zhang
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Qiongqiong Wei
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Xinliu Zeng
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Yajie Li
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Min Wan
- Department of Gynecology and Obstetrics, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
40
|
Wu C, Wu Z, Chen Y, Huang X, Tian B. Potential core genes associated with COVID-19 identified via weighted gene co-expression network analysis. Swiss Med Wkly 2022; 152:40033. [PMID: 36509426 DOI: 10.57187/smw.2022.40033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
AIMS Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a novel virus belonging to the Coronaviridae family that causes coronavirus disease (COVID-19). This disease rapidly reached pandemic status, presenting a serious threat to global health. However, the detailed molecular mechanism contributing to COVID-19 has not yet been elucidated. METHODS The expression profiles, including the mRNA levels, of samples from patients infected with SARS-CoV-2 along with clinical data were obtained from the GSE152075 dataset in the Gene Expression Omnibus (GEO) database. Weighted gene co-expression network analysis (WGCNA) was used to identify co-expression modules, which were then implemented to evaluate the relationships between fundamental modules and clinical traits. The differentially expressed genes (DEGs), gene ontology (GO) functional enrichment, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway were evaluated using R software packages. RESULTS A total of 377 SARS-CoV-2-infected samples and 54 normal samples with available clinical and genetic data were obtained from the GEO database. There were 1444 DEGs identified between the sample types, which were used to screen out 11 co-expression modules in the WGCNA. Six co-expression modules were significantly associated with three clinical traits (SARS-CoV-2 positivity, age, and sex). Among the DEGs in two modules significantly correlated with SARS-CoV-2 positivity, enrichment was observed in the biological process of viral infection strategies (viral translation) in the GO analysis. The KEGG signalling pathway analysis demonstrated that the DEGs in the two modules were commonly enriched in oxidative phosphorylation, ribosome, and thermogenesis pathways. Moreover, a five-core gene set (RPL35A, RPL7A, RPS15, RPS20, and RPL17) with top connectivity with other genes was identified in the SARS-CoV-2 infection modules, suggesting that these genes may be indispensable in viral transcription after infection. CONCLUSION The identified core genes and signalling pathways associated with SARS-CoV-2 infection can significantly supplement the current understanding of COVID-19. The five core genes encoding ribosomal proteins may be indispensable in viral protein biosynthesis after SARS-CoV-2 infection and serve as therapeutic targets for COVID-19 treatment. These findings can be used as a basis for creating a hypothetical model for future experimental studies regarding associations of SARS-CoV-2 infection with ribosomal protein function.
Collapse
Affiliation(s)
- Chao Wu
- Department of Pancreatic Surgery, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Zuowei Wu
- Department of Pancreatic Surgery, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Yang Chen
- Department of Pancreatic Surgery, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Xing Huang
- Department of Pancreatic Surgery, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| | - Bole Tian
- Department of Pancreatic Surgery, West China Hospital, Sichuan University, Chengdu, Sichuan Province, China
| |
Collapse
|
41
|
Fang H, Sun Z, Chen Z, Chen A, Sun D, Kong Y, Fang H, Qian G. Bioinformatics and systems-biology analysis to determine the effects of Coronavirus disease 2019 on patients with allergic asthma. Front Immunol 2022; 13:988479. [PMID: 36211429 PMCID: PMC9537444 DOI: 10.3389/fimmu.2022.988479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 08/30/2022] [Indexed: 12/05/2022] Open
Abstract
Background The coronavirus disease (COVID-19) pandemic has posed a significant challenge for global health systems. Increasing evidence shows that asthma phenotypes and comorbidities are major risk factors for COVID-19 symptom severity. However, the molecular mechanisms underlying the association between COVID-19 and asthma are poorly understood. Therefore, we conducted bioinformatics and systems biology analysis to identify common pathways and molecular biomarkers in patients with COVID-19 and asthma, as well as potential molecular mechanisms and candidate drugs for treating patients with both COVID-19 and asthma. Methods Two sets of differentially expressed genes (DEGs) from the GSE171110 and GSE143192 datasets were intersected to identify common hub genes, shared pathways, and candidate drugs. In addition, murine models were utilized to explore the expression levels and associations of the hub genes in asthma and lung inflammation/injury. Results We discovered 157 common DEGs between the asthma and COVID-19 datasets. A protein–protein-interaction network was built using various combinatorial statistical approaches and bioinformatics tools, which revealed several hub genes and critical modules. Six of the hub genes were markedly elevated in murine asthmatic lungs and were positively associated with IL-5, IL-13 and MUC5AC, which are the key mediators of allergic asthma. Gene Ontology and pathway analysis revealed common associations between asthma and COVID-19 progression. Finally, we identified transcription factor–gene interactions, DEG–microRNA coregulatory networks, and potential drug and chemical-compound interactions using the hub genes. Conclusion We identified the top 15 hub genes that can be used as novel biomarkers of COVID-19 and asthma and discovered several promising candidate drugs that might be helpful for treating patients with COVID-19 and asthma.
Collapse
Affiliation(s)
- Hongwei Fang
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
- Department of Anesthesiology, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Zhun Sun
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
| | - Zhouyi Chen
- Department of Anesthesiology, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Anning Chen
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
| | - Donglin Sun
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
| | - Yan Kong
- Department of Anesthesiology (High-Tech Branch), The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Hao Fang
- Department of Anesthesiology, Zhongshan Hospital, Fudan University, Shanghai, China
- Department of Anesthesiology, Minhang Hospital, Fudan University, Shanghai, China
- *Correspondence: Guojun Qian, ; Hao Fang,
| | - Guojun Qian
- Affiliated Cancer Hospital and Institute of Guangzhou Medical University, Guangzhou, China
- *Correspondence: Guojun Qian, ; Hao Fang,
| |
Collapse
|
42
|
de Crécy-lagard V, Amorin de Hegedus R, Arighi C, Babor J, Bateman A, Blaby I, Blaby-Haas C, Bridge AJ, Burley SK, Cleveland S, Colwell LJ, Conesa A, Dallago C, Danchin A, de Waard A, Deutschbauer A, Dias R, Ding Y, Fang G, Friedberg I, Gerlt J, Goldford J, Gorelik M, Gyori BM, Henry C, Hutinet G, Jaroch M, Karp PD, Kondratova L, Lu Z, Marchler-Bauer A, Martin MJ, McWhite C, Moghe GD, Monaghan P, Morgat A, Mungall CJ, Natale DA, Nelson WC, O’Donoghue S, Orengo C, O’Toole KH, Radivojac P, Reed C, Roberts RJ, Rodionov D, Rodionova IA, Rudolf JD, Saleh L, Sheynkman G, Thibaud-Nissen F, Thomas PD, Uetz P, Vallenet D, Carter EW, Weigele PR, Wood V, Wood-Charlson EM, Xu J. A roadmap for the functional annotation of protein families: a community perspective. Database (Oxford) 2022; 2022:baac062. [PMID: 35961013 PMCID: PMC9374478 DOI: 10.1093/database/baac062] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 06/28/2022] [Accepted: 08/03/2022] [Indexed: 12/23/2022]
Abstract
Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3-4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.
Collapse
Affiliation(s)
- Valérie de Crécy-lagard
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | | | - Cecilia Arighi
- Department of Computer and Information Sciences, University of Delaware, Newark, DE 19713, USA
| | - Jill Babor
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Ian Blaby
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Crysten Blaby-Haas
- Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Alan J Bridge
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva 4 CH-1211, Switzerland
| | - Stephen K Burley
- RCSB Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Stacey Cleveland
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Lucy J Colwell
- Departmenf of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
| | - Ana Conesa
- Spanish National Research Council, Institute for Integrative Systems Biology, Paterna, Valencia 46980, Spain
| | - Christian Dallago
- TUM (Technical University of Munich) Department of Informatics, Bioinformatics & Computational Biology, i12, Boltzmannstr. 3, Garching/Munich 85748, Germany
| | - Antoine Danchin
- School of Biomedical Sciences, Li KaShing Faculty of Medicine, The University of Hong Kong, 21 Sassoon Road, Pokfulam, SAR Hong Kong 999077, China
| | - Anita de Waard
- Research Collaboration Unit, Elsevier, Jericho, VT 05465, USA
| | - Adam Deutschbauer
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Raquel Dias
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Yousong Ding
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, USA
| | - Gang Fang
- NYU-Shanghai, Shanghai 200120, China
| | - Iddo Friedberg
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA 50011, USA
| | - John Gerlt
- Institute for Genomic Biology and Departments of Biochemistry and Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Joshua Goldford
- Physics of Living Systems, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Mark Gorelik
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Benjamin M Gyori
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| | - Christopher Henry
- Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Geoffrey Hutinet
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Marshall Jaroch
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | - Peter D Karp
- Bioinformatics Research Group, SRI International, Menlo Park, CA 94025, USA
| | | | - Zhiyong Lu
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
| | - Aron Marchler-Bauer
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
| | - Maria-Jesus Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Claire McWhite
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
| | - Gaurav D Moghe
- Plant Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Paul Monaghan
- Department of Agricultural Education and Communication, University of Florida, Gainesville, FL 32611, USA
| | - Anne Morgat
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva 4 CH-1211, Switzerland
| | - Christopher J Mungall
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Darren A Natale
- Georgetown University Medical Center, Washington, DC 20007, USA
| | - William C Nelson
- Biological Sciences Division, Pacific Northwest National Laboratories, Richland, WA 99354, USA
| | - Seán O’Donoghue
- School of Biotechnology and Biomolecular Sciences, University of NSW, Sydney, NSW 2052, Australia
| | - Christine Orengo
- Department of Structural and Molecular Biology, University College London, London WC1E 6BT, UK
| | | | - Predrag Radivojac
- Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, USA
| | - Colbie Reed
- Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
| | | | - Dmitri Rodionov
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | - Irina A Rodionova
- Department of Bioengineering, Division of Engineering, University of California at San Diego, La Jolla, CA 92093-0412, USA
| | - Jeffrey D Rudolf
- Department of Chemistry, University of Florida, Gainesville, FL 32611, USA
| | - Lana Saleh
- New England Biolabs, Ipswich, MA 01938, USA
| | - Gloria Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - Francoise Thibaud-Nissen
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
| | - Paul D Thomas
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
| | - Peter Uetz
- Center for Biological Data Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - David Vallenet
- LABGeM, Génomique Métabolique, CEA, Genoscope, Institut François Jacob, Université d’Évry, Université Paris-Saclay, CNRS, Evry 91057, France
| | - Erica Watson Carter
- Department of Plant Pathology, University of Florida Citrus Research and Education Center, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA
| | | | - Valerie Wood
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| | - Elisha M Wood-Charlson
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Jin Xu
- Department of Plant Pathology, University of Florida Citrus Research and Education Center, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA
| |
Collapse
|
43
|
McGill JR, Lagassé HAD, Hernandez N, Hopkins L, Jankowski W, McCormick Q, Simhadri V, Golding B, Sauna ZE. A structural homology approach to identify potential cross-reactive antibody responses following SARS-CoV-2 infection. Sci Rep 2022; 12:11388. [PMID: 35794133 PMCID: PMC9259575 DOI: 10.1038/s41598-022-15225-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 06/21/2022] [Indexed: 11/09/2022] Open
Abstract
The emergence of the novel SARS-CoV-2 virus is the most important public-health issue of our time. Understanding the diverse clinical presentations of the ensuing disease, COVID-19, remains a critical unmet need. Here we present a comprehensive listing of the diverse clinical indications associated with COVID-19. We explore the theory that anti-SARS-CoV-2 antibodies could cross-react with endogenous human proteins driving some of the pathologies associated with COVID-19. We describe a novel computational approach to estimate structural homology between SARS-CoV-2 proteins and human proteins. Antibodies are more likely to interrogate 3D-structural epitopes than continuous linear epitopes. This computational workflow identified 346 human proteins containing a domain with high structural homology to a SARS-CoV-2 Wuhan strain protein. Of these, 102 proteins exhibit functions that could contribute to COVID-19 clinical pathologies. We present a testable hypothesis to delineate unexplained clinical observations vis-à-vis COVID-19 and a tool to evaluate the safety-risk profile of potential COVID-19 therapies.
Collapse
Affiliation(s)
- Joseph R McGill
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - H A Daniel Lagassé
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Nancy Hernandez
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Louis Hopkins
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Wojciech Jankowski
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Quinn McCormick
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Vijaya Simhadri
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Basil Golding
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA
| | - Zuben E Sauna
- Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, USA.
| |
Collapse
|
44
|
Chen J, Goudey B, Zobel J, Geard N, Verspoor K. Exploring automatic inconsistency detection for literature-based gene ontology annotation. Bioinformatics 2022; 38:i273-i281. [PMID: 35758780 PMCID: PMC9235499 DOI: 10.1093/bioinformatics/btac230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/08/2022] [Indexed: 11/12/2022] Open
Abstract
Motivation Literature-based gene ontology annotations (GOA) are biological database records that use controlled vocabulary to uniformly represent gene function information that is described in the primary literature. Assurance of the quality of GOA is crucial for supporting biological research. However, a range of different kinds of inconsistencies in between literature as evidence and annotated GO terms can be identified; these have not been systematically studied at record level. The existing manual-curation approach to GOA consistency assurance is inefficient and is unable to keep pace with the rate of updates to gene function knowledge. Automatic tools are therefore needed to assist with GOA consistency assurance. This article presents an exploration of different GOA inconsistencies and an early feasibility study of automatic inconsistency detection. Results We have created a reliable synthetic dataset to simulate four realistic types of GOA inconsistency in biological databases. Three automatic approaches are proposed. They provide reasonable performance on the task of distinguishing the four types of inconsistency and are directly applicable to detect inconsistencies in real-world GOA database records. Major challenges resulting from such inconsistencies in the context of several specific application settings are reported. This is the first study to introduce automatic approaches that are designed to address the challenges in current GOA quality assurance workflows. The data underlying this article are available in Github at https://github.com/jiyuc/AutoGOAConsistency.
Collapse
Affiliation(s)
- Jiyu Chen
- School of Computing and Information Systems, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Benjamin Goudey
- School of Computing and Information Systems, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Justin Zobel
- School of Computing and Information Systems, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Nicholas Geard
- School of Computing and Information Systems, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Karin Verspoor
- School of Computing and Information Systems, The University of Melbourne, Parkville, VIC 3010, Australia.,School of Computer Technologies, RMIT University, Melbourne, VIC 3000, Australia
| |
Collapse
|
45
|
Wood V, Sternberg PW, Lipshitz HD. Making biological knowledge useful for humans and machines. Genetics 2022; 220:6563297. [PMID: 35380659 PMCID: PMC8982017 DOI: 10.1093/genetics/iyac001] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
|
46
|
Agapite J, Albou LP, Aleksander SA, Alexander M, Anagnostopoulos AV, Antonazzo G, Argasinska J, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blake JA, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Michael Cherry J, Cho J, Christie KR, Crosby MA, Davis P, da Veiga Beltrame E, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, dos Santos G, Douglass E, Dunn B, Eagle A, Ebert D, Engel SR, Fashena D, Foley S, Frazer K, Gao S, Gibson AC, Gondwe F, Goodman J, Sian Gramates L, Grove CA, Hale P, Harris T, Thomas Hayman G, Hill DP, Howe DG, Howe KL, Hu Y, Jha S, Kadin JA, Kaufman TC, Kalita P, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, MacPherson KA, Martin R, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nalabolu HS, Nash RS, Ng P, Nuin P, Paddock H, Paulini M, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schaper K, Schindelman G, Shimoyama M, Simison M, Shaw DR, Shrivatsav A, Singer A, Skrzypek M, Smith CM, Smith CL, et alAgapite J, Albou LP, Aleksander SA, Alexander M, Anagnostopoulos AV, Antonazzo G, Argasinska J, Arnaboldi V, Attrill H, Becerra A, Bello SM, Blake JA, Blodgett O, Bradford YM, Bult CJ, Cain S, Calvi BR, Carbon S, Chan J, Chen WJ, Michael Cherry J, Cho J, Christie KR, Crosby MA, Davis P, da Veiga Beltrame E, De Pons JL, D’Eustachio P, Diamantakis S, Dolan ME, dos Santos G, Douglass E, Dunn B, Eagle A, Ebert D, Engel SR, Fashena D, Foley S, Frazer K, Gao S, Gibson AC, Gondwe F, Goodman J, Sian Gramates L, Grove CA, Hale P, Harris T, Thomas Hayman G, Hill DP, Howe DG, Howe KL, Hu Y, Jha S, Kadin JA, Kaufman TC, Kalita P, Karra K, Kishore R, Kwitek AE, Laulederkind SJF, Lee R, Longden I, Luypaert M, MacPherson KA, Martin R, Marygold SJ, Matthews B, McAndrews MS, Millburn G, Miyasato S, Motenko H, Moxon S, Muller HM, Mungall CJ, Muruganujan A, Mushayahama T, Nalabolu HS, Nash RS, Ng P, Nuin P, Paddock H, Paulini M, Perrimon N, Pich C, Quinton-Tulloch M, Raciti D, Ramachandran S, Richardson JE, Gelbart SR, Ruzicka L, Schaper K, Schindelman G, Shimoyama M, Simison M, Shaw DR, Shrivatsav A, Singer A, Skrzypek M, Smith CM, Smith CL, Smith JR, Stein L, Sternberg PW, Tabone CJ, Thomas PD, Thorat K, Thota J, Toro S, Tomczuk M, Trovisco V, Tutaj MA, Tutaj M, Urbano JM, Van Auken K, Van Slyke CE, Wang Q, Wang SJ, Weng S, Westerfield M, Williams G, Wilming LG, Wong ED, Wright A, Yook K, Zarowiecki M, Zhou P, Zytkovicz M. Harmonizing model organism data in the Alliance of Genome Resources. Genetics 2022; 220:iyac022. [PMID: 35380658 PMCID: PMC8982023 DOI: 10.1093/genetics/iyac022] [Show More Authors] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2021] [Accepted: 01/26/2022] [Indexed: 02/06/2023] Open
Abstract
The Alliance of Genome Resources (the Alliance) is a combined effort of 7 knowledgebase projects: Saccharomyces Genome Database, WormBase, FlyBase, Mouse Genome Database, the Zebrafish Information Network, Rat Genome Database, and the Gene Ontology Resource. The Alliance seeks to provide several benefits: better service to the various communities served by these projects; a harmonized view of data for all biomedical researchers, bioinformaticians, clinicians, and students; and a more sustainable infrastructure. The Alliance has harmonized cross-organism data to provide useful comparative views of gene function, gene expression, and human disease relevance. The basis of the comparative views is shared calls of orthology relationships and the use of common ontologies. The key types of data are alleles and variants, gene function based on gene ontology annotations, phenotypes, association to human disease, gene expression, protein-protein and genetic interactions, and participation in pathways. The information is presented on uniform gene pages that allow facile summarization of information about each gene in each of the 7 organisms covered (budding yeast, roundworm Caenorhabditis elegans, fruit fly, house mouse, zebrafish, brown rat, and human). The harmonized knowledge is freely available on the alliancegenome.org portal, as downloadable files, and by APIs. We expect other existing and emerging knowledge bases to join in the effort to provide the union of useful data and features that each knowledge base currently provides.
Collapse
|
47
|
Collado-Vides J, Gaudet P, de Lorenzo V. Missing Links Between Gene Function and Physiology in Genomics. Front Physiol 2022; 13:815874. [PMID: 35295568 PMCID: PMC8918662 DOI: 10.3389/fphys.2022.815874] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 01/28/2022] [Indexed: 11/25/2022] Open
Abstract
Knowledge of biological organisms at the molecular level that has been gathered is now organized into databases, often within ontological frameworks. To enable computational comparisons of annotations across different genomes and organisms, controlled vocabularies have been essential, as is the case in the functional annotation classifications used for bacteria, such as MultiFun and the more widely used Gene Ontology. The function of individual gene products as well as the processes in which collections of them participate constitute a wealth of classes that describe the biological role of gene products in a large number of organisms in the three kingdoms of life. In this contribution, we highlight from a qualitative perspective some limitations of these frameworks and discuss challenges that need to be addressed to bridge the gap between annotation as currently captured by ontologies and databases and our understanding of the basic principles in the organization and functioning of organisms; we illustrate these challenges with some examples in bacteria. We hope that raising awareness of these issues will encourage users of Gene Ontology and similar ontologies to be careful about data interpretation and lead to improved data representation.
Collapse
Affiliation(s)
- Julio Collado-Vides
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
- Department of Biomedical Engineering, Boston University, Boston, MA, United States
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Universitat Pompeu Fabra, Barcelona, Spain
| | - Pascale Gaudet
- SIB Swiss Institute of Bioinformatics, Swiss-Prot Group, Geneva, Switzerland
| | - Víctor de Lorenzo
- Department of Systems Biology, Centro Nacional de Biotecnología CSIC, Universidad Autónoma de Madrid, Madrid, Spain
| |
Collapse
|
48
|
Yang L, Zhang W, Li M, Dam J, Huang K, Wang Y, Qiu Z, Sun T, Chen P, Zhang Z, Zhang W. Evaluation of the Prognostic Relevance of Differential Claudin Gene Expression Highlights Claudin-4 as Being Suppressed by TGFβ1 Inhibitor in Colorectal Cancer. Front Genet 2022; 13:783016. [PMID: 35281827 PMCID: PMC8907593 DOI: 10.3389/fgene.2022.783016] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Accepted: 01/25/2022] [Indexed: 11/13/2022] Open
Abstract
Background: Claudins (CLDNs) are a family of closely related transmembrane proteins that have been linked to oncogenic transformation and metastasis across a range of cancers, suggesting that they may be valuable diagnostic and/or prognostic biomarkers that can be used to evaluate patient outcomes. However, CLDN expression patterns associated with colorectal cancer (CRC) remain to be defined.Methods: The mRNA levels of 21 different CLDN family genes were assessed across 20 tumor types using the Oncomine database. Correlations between these genes and patient clinical outcomes, immune cell infiltration, clinicopathological staging, lymph node metastasis, and mutational status were analyzed using the GEPIA, UALCAN, Human Protein Atlas, Tumor Immune Estimation Resource, STRING, Genenetwork, cBioportal, and DAVID databases in an effort to clarify the potential functional roles of different CLDN protein in CRC. Molecular docking analyses were used to probe potential interactions between CLDN4 and TGFβ1. Levels of CLDN4 and CLDN11 mRNA expression in clinical CRC patient samples and in the HT29 and HCT116 cell lines were assessed via qPCR. CLDN4 expression levels in these 2 cell lines were additionally assessed following TGFβ1 inhibitor treatment.Results: These analyses revealed that COAD and READ tissues exhibited the upregulation of CLDN1, CLDN2, CLDN3, CLDN4, CLDN7, and CLDN12 as well as the downregulation of CLDN5 and CLDN11 relative to control tissues. Higher CLDN11 and CLDN14 expression as well as lower CLDN23 mRNA levels were associated with poorer overall survival (OS) outcomes. Moreover, CLDN2 and CLDN3 or CLDN11 mRNA levels were significantly associated with lymph node metastatic progression in COAD or READ lower in COAD and READ tissues. A positive correlation between the expression of CLDN11 and predicted macrophage, dendritic cell, and CD4+ T cell infiltration was identified in CRC, with CLDN12 expression further being positively correlated with CD4+ T cell infiltration whereas a negative correlation was observed between such infiltration and the expression of CLDN3 and CLDN15. A positive correlation between CLDN1, CLDN16, and neutrophil infiltration was additionally detected, whereas neutrophil levels were negatively correlated with the expression of CLDN3 and CLDN15. Molecular docking suggested that CLDN4 was able to directly bind via hydrogen bond with TGFβ1. Relative to paracancerous tissues, clinical CRC tumor tissue samples exhibited CLDN4 and CLDN11 upregulation and downregulation, respectively. LY364947 was able to suppress the expression of CLDN4 in both the HT29 and HCT116 cell lines.Conclusion: Together, these results suggest that the expression of different CLDN family genes is closely associated with CRC tumor clinicopathological staging and immune cell infiltration. Moreover, CLDN4 expression is closely associated with TGFβ1 in CRC, suggesting that it and other CLDN family members may represent viable targets for antitumor therapeutic intervention.
Collapse
Affiliation(s)
- Linqi Yang
- Department of Pharmacology, Hebei University of Chinese Medicine, Shijiazhuang, China
| | - Wenqi Zhang
- Department of Hematology, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Meng Li
- Department of Pharmacology, Hebei University of Chinese Medicine, Shijiazhuang, China
| | - Jinxi Dam
- College of Natural Science, Michigan State University, East Lansing, MI, United States
| | - Kai Huang
- Department of Pharmacology, Hebei University of Chinese Medicine, Shijiazhuang, China
| | - Yihan Wang
- Department of Pharmacology, Hebei University of Chinese Medicine, Shijiazhuang, China
| | - Zhicong Qiu
- Department of Pharmacology, Hebei University of Chinese Medicine, Shijiazhuang, China
| | - Tao Sun
- Department of Pharmacology, Hebei University of Chinese Medicine, Shijiazhuang, China
| | - Pingping Chen
- Department of Pharmacology, Hebei University of Chinese Medicine, Shijiazhuang, China
- *Correspondence: Wei Zhang, ; Pingping Chen, ; Zhenduo Zhang,
| | - Zhenduo Zhang
- Shijiazhuang People’s Hospital, Shijiazhuang, China
- *Correspondence: Wei Zhang, ; Pingping Chen, ; Zhenduo Zhang,
| | - Wei Zhang
- Department of Pharmacology, Hebei University of Chinese Medicine, Shijiazhuang, China
- *Correspondence: Wei Zhang, ; Pingping Chen, ; Zhenduo Zhang,
| |
Collapse
|
49
|
Davis P, Zarowiecki M, Arnaboldi V, Becerra A, Cain S, Chan J, Chen WJ, Cho J, da Veiga Beltrame E, Diamantakis S, Gao S, Grigoriadis D, Grove CA, Harris TW, Kishore R, Le T, Lee RYN, Luypaert M, Müller HM, Nakamura C, Nuin P, Paulini M, Quinton-Tulloch M, Raciti D, Rodgers FH, Russell M, Schindelman G, Singh A, Stickland T, Van Auken K, Wang Q, Williams G, Wright AJ, Yook K, Berriman M, Howe KL, Schedl T, Stein L, Sternberg PW. WormBase in 2022-data, processes, and tools for analyzing Caenorhabditis elegans. Genetics 2022; 220:6521733. [PMID: 35134929 PMCID: PMC8982018 DOI: 10.1093/genetics/iyac003] [Citation(s) in RCA: 196] [Impact Index Per Article: 65.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 12/17/2021] [Indexed: 02/06/2023] Open
Abstract
WormBase (www.wormbase.org) is the central repository for the genetics and genomics of the nematode Caenorhabditis elegans. We provide the research community with data and tools to facilitate the use of C. elegans and related nematodes as model organisms for studying human health, development, and many aspects of fundamental biology. Throughout our 22-year history, we have continued to evolve to reflect progress and innovation in the science and technologies involved in the study of C. elegans. We strive to incorporate new data types and richer data sets, and to provide integrated displays and services that avail the knowledge generated by the published nematode genetics literature. Here, we provide a broad overview of the current state of WormBase in terms of data type, curation workflows, analysis, and tools, including exciting new advances for analysis of single-cell data, text mining and visualization, and the new community collaboration forum. Concurrently, we continue the integration and harmonization of infrastructure, processes, and tools with the Alliance of Genome Resources, of which WormBase is a founding member.
Collapse
Affiliation(s)
- Paul Davis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Magdalena Zarowiecki
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Valerio Arnaboldi
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Andrés Becerra
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Scott Cain
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Juancarlos Chan
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Wen J Chen
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Jaehyoung Cho
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Eduardo da Veiga Beltrame
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Stavros Diamantakis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sibyl Gao
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Dionysis Grigoriadis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Christian A Grove
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Todd W Harris
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Ranjana Kishore
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Tuan Le
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Raymond Y N Lee
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Manuel Luypaert
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Hans-Michael Müller
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Cecilia Nakamura
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Paulo Nuin
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Michael Paulini
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mark Quinton-Tulloch
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniela Raciti
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Faye H Rodgers
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Matthew Russell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gary Schindelman
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Archana Singh
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Tim Stickland
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Qinghua Wang
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Gary Williams
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Adam J Wright
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Karen Yook
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Matt Berriman
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Kevin L Howe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tim Schedl
- Department of Genetics, Washington University School of Medicine, St Louis, MO 63110, USA
| | - Lincoln Stein
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Paul W Sternberg
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| |
Collapse
|
50
|
Bansal P, Morgat A, Axelsen KB, Muthukrishnan V, Coudert E, Aimo L, Hyka-Nouspikel N, Gasteiger E, Kerhornou A, Neto TB, Pozzato M, Blatter MC, Ignatchenko A, Redaschi N, Bridge A. Rhea, the reaction knowledgebase in 2022. Nucleic Acids Res 2022; 50:D693-D700. [PMID: 34755880 PMCID: PMC8728268 DOI: 10.1093/nar/gkab1016] [Citation(s) in RCA: 84] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 10/08/2021] [Accepted: 11/09/2021] [Indexed: 12/15/2022] Open
Abstract
Rhea (https://www.rhea-db.org) is an expert-curated knowledgebase of biochemical reactions based on the chemical ontology ChEBI (Chemical Entities of Biological Interest) (https://www.ebi.ac.uk/chebi). In this paper, we describe a number of key developments in Rhea since our last report in the database issue of Nucleic Acids Research in 2019. These include improved reaction coverage in Rhea, the adoption of Rhea as the reference vocabulary for enzyme annotation in the UniProt knowledgebase UniProtKB (https://www.uniprot.org), the development of a new Rhea website, and the designation of Rhea as an ELIXIR Core Data Resource. We hope that these and other developments will enhance the utility of Rhea as a reference resource to study and engineer enzymes and the metabolic systems in which they function.
Collapse
Affiliation(s)
- Parit Bansal
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Anne Morgat
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Kristian B Axelsen
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Venkatesh Muthukrishnan
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Elisabeth Coudert
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Lucila Aimo
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Nevila Hyka-Nouspikel
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Elisabeth Gasteiger
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Arnaud Kerhornou
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Teresa Batista Neto
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Monica Pozzato
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Marie-Claude Blatter
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Alex Ignatchenko
- EMBL-EBI European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nicole Redaschi
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| | - Alan Bridge
- Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
| |
Collapse
|