1
|
Köhler CA, Grün S, Denker M. Improving data sharing and knowledge transfer via the Neuroelectrophysiology Analysis Ontology (NEAO). Sci Data 2025; 12:907. [PMID: 40442093 PMCID: PMC12122730 DOI: 10.1038/s41597-025-05213-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2025] [Accepted: 05/15/2025] [Indexed: 06/02/2025] Open
Abstract
Describing the analysis of data from electrophysiology experiments investigating the function of neural systems is challenging. On the one hand, data can be analyzed by distinct methods with similar purposes, such as different algorithms to estimate the spectral power content of a measured time series. On the other hand, different software codes can implement the same analysis algorithm, while adopting different names to identify functions and parameters. These ambiguities complicate reporting analysis results, e.g., in a manuscript or on a scientific platform. Here, we illustrate how an ontology to describe the analysis process can assist in improving clarity, rigour and comprehensibility by complementing, simplifying and classifying the details of the implementation. We implemented the Neuroelectrophysiology Analysis Ontology (NEAO) to define a vocabulary and to standardize the descriptions of processes for neuroelectrophysiology data analysis. Real-world examples demonstrate how NEAO can annotate provenance information describing an analysis. Based on such provenance, we detail how it supports querying information (e.g., using knowledge graphs) that enable researchers to find, understand and reuse analysis results.
Collapse
Affiliation(s)
- Cristiano A Köhler
- Institute for Advanced Simulation (IAS-6), Jülich Research Centre, Jülich, Germany.
- RWTH Aachen University, Aachen, Germany.
| | - Sonja Grün
- Institute for Advanced Simulation (IAS-6), Jülich Research Centre, Jülich, Germany
- Theoretical Systems Neurobiology, RWTH Aachen University, Aachen, Germany
- JARA-Institute Brain Structure-Function Relationships (INM-10), Jülich Research Centre, Jülich, Germany
| | - Michael Denker
- Institute for Advanced Simulation (IAS-6), Jülich Research Centre, Jülich, Germany
| |
Collapse
|
2
|
Shen X, Yan S, Zeng T, Xia F, Jiang D, Wan G, Cao D, Wu R. TarIKGC: A Target Identification Tool Using Semantics-Enhanced Knowledge Graph Completion with Application to CDK2 Inhibitor Discovery. J Med Chem 2025; 68:1793-1809. [PMID: 39745279 DOI: 10.1021/acs.jmedchem.4c02543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2025]
Abstract
Target identification is a critical stage in the drug discovery pipeline. Various computational methodologies have been dedicated to enhancing the classification performance of compound-target interactions, yet significant room remains for improving the recommendation performance. To address this challenge, we developed TarIKGC, a tool for target prioritization that leverages semantics enhanced knowledge graph (KG) completion. This method harnesses knowledge representation learning within a heterogeneous compound-target-disease network. Specifically, TarIKGC combines an attention-based aggregation graph neural network with a multimodal feature extractor network to simultaneously learn internal semantic features from biomedical entities and topological features from the KG. Furthermore, a KG embedding model is employed to identify missing relationships among compounds and targets. In silico evaluations highlighted the superior performance of TarIKGC in drug repositioning tasks. In addition, TarIKGC successfully identified two potential cyclin-dependent kinase 2 (CDK2) inhibitors with novel scaffolds through reverse target fishing. Both compounds exhibited antiproliferative activities across multiple therapeutic indications targeting CDK2.
Collapse
Affiliation(s)
- Xiaojuan Shen
- State Key Laboratory of Anti-Infective Drug Discovery and Development, School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou 510006, China
| | - Shijia Yan
- State Key Laboratory of Anti-Infective Drug Discovery and Development, School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou 510006, China
| | - Tao Zeng
- State Key Laboratory of Anti-Infective Drug Discovery and Development, School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou 510006, China
| | - Fei Xia
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - Dejun Jiang
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, Hunan 410013, China
| | - Guohui Wan
- State Key Laboratory of Anti-Infective Drug Discovery and Development, School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou 510006, China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, Hunan 410013, China
| | - Ruibo Wu
- State Key Laboratory of Anti-Infective Drug Discovery and Development, School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou 510006, China
| |
Collapse
|
3
|
Ji Z, Guo S, Qiao Y, McDougal RA. Automating literature screening and curation with applications to computational neuroscience. J Am Med Inform Assoc 2024; 31:1463-1470. [PMID: 38722233 PMCID: PMC11187430 DOI: 10.1093/jamia/ocae097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 03/19/2024] [Accepted: 04/16/2024] [Indexed: 06/21/2024] Open
Abstract
OBJECTIVE ModelDB (https://modeldb.science) is a discovery platform for computational neuroscience, containing over 1850 published model codes with standardized metadata. These codes were mainly supplied from unsolicited model author submissions, but this approach is inherently limited. For example, we estimate we have captured only around one-third of NEURON models, the most common type of models in ModelDB. To more completely characterize the state of computational neuroscience modeling work, we aim to identify works containing results derived from computational neuroscience approaches and their standardized associated metadata (eg, cell types, research topics). MATERIALS AND METHODS Known computational neuroscience work from ModelDB and identified neuroscience work queried from PubMed were included in our study. After pre-screening with SPECTER2 (a free document embedding method), GPT-3.5, and GPT-4 were used to identify likely computational neuroscience work and relevant metadata. RESULTS SPECTER2, GPT-4, and GPT-3.5 demonstrated varied but high abilities in identification of computational neuroscience work. GPT-4 achieved 96.9% accuracy and GPT-3.5 improved from 54.2% to 85.5% through instruction-tuning and Chain of Thought. GPT-4 also showed high potential in identifying relevant metadata annotations. DISCUSSION Accuracy in identification and extraction might further be improved by dealing with ambiguity of what are computational elements, including more information from papers (eg, Methods section), improving prompts, etc. CONCLUSION Natural language processing and large language model techniques can be added to ModelDB to facilitate further model discovery, and will contribute to a more standardized and comprehensive framework for establishing domain-specific resources.
Collapse
Affiliation(s)
- Ziqing Ji
- Biostatistics, Yale School of Public Health, Yale University, New Haven, CT 06510, United States
| | - Siyan Guo
- Biostatistics, Yale School of Public Health, Yale University, New Haven, CT 06510, United States
| | - Yujie Qiao
- Biostatistics, Yale School of Public Health, Yale University, New Haven, CT 06510, United States
- Integrative Genomics, Princeton University, Princeton, NJ 08540, United States
| | - Robert A McDougal
- Biostatistics, Yale School of Public Health, Yale University, New Haven, CT 06510, United States
- Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT 06510, United States
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06510, United States
- Wu Tsai Institute, Yale University, New Haven, CT 06510, United States
| |
Collapse
|
4
|
Surles-Zeigler MC, Sincomb T, Gillespie TH, de Bono B, Bresnahan J, Mawe GM, Grethe JS, Tappan S, Heal M, Martone ME. Extending and using anatomical vocabularies in the stimulating peripheral activity to relieve conditions project. Front Neuroinform 2022; 16:819198. [PMID: 36090663 PMCID: PMC9449460 DOI: 10.3389/fninf.2022.819198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Accepted: 07/18/2022] [Indexed: 11/25/2022] Open
Abstract
The stimulating peripheral activity to relieve conditions (SPARC) program is a US National Institutes of Health-funded effort to improve our understanding of the neural circuitry of the autonomic nervous system (ANS) in support of bioelectronic medicine. As part of this effort, the SPARC project is generating multi-species, multimodal data, models, simulations, and anatomical maps supported by a comprehensive knowledge base of autonomic circuitry. To facilitate the organization of and integration across multi-faceted SPARC data and models, SPARC is implementing the findable, accessible, interoperable, and reusable (FAIR) data principles to ensure that all SPARC products are findable, accessible, interoperable, and reusable. We are therefore annotating and describing all products with a common FAIR vocabulary. The SPARC Vocabulary is built from a set of community ontologies covering major domains relevant to SPARC, including anatomy, physiology, experimental techniques, and molecules. The SPARC Vocabulary is incorporated into tools researchers use to segment and annotate their data, facilitating the application of these ontologies for annotation of research data. However, since investigators perform deep annotations on experimental data, not all terms and relationships are available in community ontologies. We therefore implemented a term management and vocabulary extension pipeline where SPARC researchers may extend the SPARC Vocabulary using InterLex, an online vocabulary management system. To ensure the quality of contributed terms, we have set up a curated term request and review pipeline specifically for anatomical terms involving expert review. Accepted terms are added to the SPARC Vocabulary and, when appropriate, contributed back to community ontologies to enhance ANS coverage. Here, we provide an overview of the SPARC Vocabulary, the infrastructure and process for implementing the term management and review pipeline. In an analysis of >300 anatomical contributed terms, the majority represented composite terms that necessitated combining terms within and across existing ontologies. Although these terms are not good candidates for community ontologies, they can be linked to structures contained within these ontologies. We conclude that the term request pipeline serves as a useful adjunct to community ontologies for annotating experimental data and increases the FAIRness of SPARC data.
Collapse
Affiliation(s)
| | - Troy Sincomb
- Department of Neuroscience, University of California, San Diego, La Jolla, CA, United States
| | - Thomas H. Gillespie
- Department of Neuroscience, University of California, San Diego, La Jolla, CA, United States
| | - Bernard de Bono
- Whitby et al., Inc., Indianapolis, IN, United States
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Jacqueline Bresnahan
- Brain and Spinal Injury Center, University of California, San Francisco, San Francisco, CA, United States
| | - Gary M. Mawe
- Department of Neurological Sciences, University of Vermont, Burlington, VT, United States
| | - Jeffrey S. Grethe
- Department of Neuroscience, University of California, San Diego, La Jolla, CA, United States
| | | | - Maci Heal
- MBF Bioscience, Williston, VT, United States
| | - Maryann E. Martone
- Department of Neuroscience, University of California, San Diego, La Jolla, CA, United States
| |
Collapse
|
5
|
Eriksson O, Bhalla US, Blackwell KT, Crook SM, Keller D, Kramer A, Linne ML, Saudargienė A, Wade RC, Hellgren Kotaleski J. Combining hypothesis- and data-driven neuroscience modeling in FAIR workflows. eLife 2022; 11:e69013. [PMID: 35792600 PMCID: PMC9259018 DOI: 10.7554/elife.69013] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 05/13/2022] [Indexed: 12/22/2022] Open
Abstract
Modeling in neuroscience occurs at the intersection of different points of view and approaches. Typically, hypothesis-driven modeling brings a question into focus so that a model is constructed to investigate a specific hypothesis about how the system works or why certain phenomena are observed. Data-driven modeling, on the other hand, follows a more unbiased approach, with model construction informed by the computationally intensive use of data. At the same time, researchers employ models at different biological scales and at different levels of abstraction. Combining these models while validating them against experimental data increases understanding of the multiscale brain. However, a lack of interoperability, transparency, and reusability of both models and the workflows used to construct them creates barriers for the integration of models representing different biological scales and built using different modeling philosophies. We argue that the same imperatives that drive resources and policy for data - such as the FAIR (Findable, Accessible, Interoperable, Reusable) principles - also support the integration of different modeling approaches. The FAIR principles require that data be shared in formats that are Findable, Accessible, Interoperable, and Reusable. Applying these principles to models and modeling workflows, as well as the data used to constrain and validate them, would allow researchers to find, reuse, question, validate, and extend published models, regardless of whether they are implemented phenomenologically or mechanistically, as a few equations or as a multiscale, hierarchical system. To illustrate these ideas, we use a classical synaptic plasticity model, the Bienenstock-Cooper-Munro rule, as an example due to its long history, different levels of abstraction, and implementation at many scales.
Collapse
Affiliation(s)
- Olivia Eriksson
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
| | - Upinder Singh Bhalla
- National Center for Biological Sciences, Tata Institute of Fundamental ResearchBangaloreIndia
| | - Kim T Blackwell
- Department of Bioengineering, Volgenau School of Engineering, George Mason UniversityFairfaxUnited States
| | - Sharon M Crook
- School of Mathematical and Statistical Sciences, Arizona State UniversityTempeUnited States
| | - Daniel Keller
- Blue Brain Project, École Polytechnique Fédérale de LausanneLausanneSwitzerland
| | - Andrei Kramer
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| | - Marja-Leena Linne
- Faculty of Medicine and Health Technology, Tampere UniversityTampereFinland
| | - Ausra Saudargienė
- Neuroscience Institute, Lithuanian University of Health SciencesKaunasLithuania
- Department of Informatics, Vytautas Magnus UniversityKaunasLithuania
| | - Rebecca C Wade
- Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS)HeidelbergGermany
- Center for Molecular Biology (ZMBH), ZMBH-DKFZ Alliance, University of HeidelbergHeidelbergGermany
- Interdisciplinary Center for Scientific Computing (IWR), Heidelberg UniversityHeidelbergGermany
| | - Jeanette Hellgren Kotaleski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| |
Collapse
|
6
|
Gillespie TH, Tripathy SJ, Sy MF, Martone ME, Hill SL. The Neuron Phenotype Ontology: A FAIR Approach to Proposing and Classifying Neuronal Types. Neuroinformatics 2022; 20:793-809. [PMID: 35267146 PMCID: PMC9547803 DOI: 10.1007/s12021-022-09566-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/19/2022] [Indexed: 12/31/2022]
Abstract
The challenge of defining and cataloging the building blocks of the brain requires a standardized approach to naming neurons and organizing knowledge about their properties. The US Brain Initiative Cell Census Network, Human Cell Atlas, Blue Brain Project, and others are generating vast amounts of data and characterizing large numbers of neurons throughout the nervous system. The neuroscientific literature contains many neuron names (e.g. parvalbumin-positive interneuron or layer 5 pyramidal cell) that are commonly used and generally accepted. However, it is often unclear how such common usage types relate to many evidence-based types that are proposed based on the results of new techniques. Further, comparing different types across labs remains a significant challenge. Here, we propose an interoperable knowledge representation, the Neuron Phenotype Ontology (NPO), that provides a standardized and automatable approach for naming cell types and normalizing their constituent phenotypes using identifiers from community ontologies as a common language. The NPO provides a framework for systematically organizing knowledge about cellular properties and enables interoperability with existing neuron naming schemes. We evaluate the NPO by populating a knowledge base with three independent cortical neuron classifications derived from published data sets that describe neurons according to molecular, morphological, electrophysiological, and synaptic properties. Competency queries to this knowledge base demonstrate that the NPO knowledge model enables interoperability between the three test cases and neuron names commonly used in the literature.
Collapse
Affiliation(s)
- Thomas H Gillespie
- Department of Neuroscience, University of California, San Diego, CA, USA
| | - Shreejoy J Tripathy
- Department of Psychiatry, University of Toronto, Toronto, ON, Canada
- Department of Physiology, University of Toronto, Toronto, ON, Canada
- Krembil Centre for Neuroinformatics, Centre for Addiction and Mental Health, Toronto, ON, Canada
| | - Mohameth François Sy
- Blue Brain Project, École Polytechnique Fédérale de Lausanne (EPFL), Campus Biotech, 1202, Geneva, Switzerland
| | - Maryann E Martone
- Department of Neuroscience, University of California, San Diego, CA, USA
| | - Sean L Hill
- Department of Psychiatry, University of Toronto, Toronto, ON, Canada.
- Department of Physiology, University of Toronto, Toronto, ON, Canada.
- Krembil Centre for Neuroinformatics, Centre for Addiction and Mental Health, Toronto, ON, Canada.
- Blue Brain Project, École Polytechnique Fédérale de Lausanne (EPFL), Campus Biotech, 1202, Geneva, Switzerland.
| |
Collapse
|
7
|
Schürmann F, Courcol JD, Ramaswamy S. Computational Concepts for Reconstructing and Simulating Brain Tissue. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1359:237-259. [PMID: 35471542 DOI: 10.1007/978-3-030-89439-9_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
It has previously been shown that it is possible to derive a new class of biophysically detailed brain tissue models when one computationally analyzes and exploits the interdependencies or the multi-modal and multi-scale organization of the brain. These reconstructions, sometimes referred to as digital twins, enable a spectrum of scientific investigations. Building such models has become possible because of increase in quantitative data but also advances in computational capabilities, algorithmic and methodological innovations. This chapter presents the computational science concepts that provide the foundation to the data-driven approach to reconstructing and simulating brain tissue as developed by the EPFL Blue Brain Project, which was originally applied to neocortical microcircuitry and extended to other brain regions. Accordingly, the chapter covers aspects such as a knowledge graph-based data organization and the importance of the concept of a dataset release. We illustrate algorithmic advances in finding suitable parameters for electrical models of neurons or how spatial constraints can be exploited for predicting synaptic connections. Furthermore, we explain how in silico experimentation with such models necessitates specific addressing schemes or requires strategies for an efficient simulation. The entire data-driven approach relies on the systematic validation of the model. We conclude by discussing complementary strategies that not only enable judging the fidelity of the model but also form the basis for its systematic refinements.
Collapse
Affiliation(s)
- Felix Schürmann
- Blue Brain Project, École polytechnique fédérale de Lausanne (EPFL), Geneva, Switzerland.
| | - Jean-Denis Courcol
- Blue Brain Project, École polytechnique fédérale de Lausanne (EPFL), Geneva, Switzerland
| | - Srikanth Ramaswamy
- Blue Brain Project, École polytechnique fédérale de Lausanne (EPFL), Geneva, Switzerland
| |
Collapse
|
8
|
Abrams MB, Bjaalie JG, Das S, Egan GF, Ghosh SS, Goscinski WJ, Grethe JS, Kotaleski JH, Ho ETW, Kennedy DN, Lanyon LJ, Leergaard TB, Mayberg HS, Milanesi L, Mouček R, Poline JB, Roy PK, Strother SC, Tang TB, Tiesinga P, Wachtler T, Wójcik DK, Martone ME. A Standards Organization for Open and FAIR Neuroscience: the International Neuroinformatics Coordinating Facility. Neuroinformatics 2022; 20:25-36. [PMID: 33506383 PMCID: PMC9036053 DOI: 10.1007/s12021-020-09509-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/28/2020] [Indexed: 01/07/2023]
Abstract
There is great need for coordination around standards and best practices in neuroscience to support efforts to make neuroscience a data-centric discipline. Major brain initiatives launched around the world are poised to generate huge stores of neuroscience data. At the same time, neuroscience, like many domains in biomedicine, is confronting the issues of transparency, rigor, and reproducibility. Widely used, validated standards and best practices are key to addressing the challenges in both big and small data science, as they are essential for integrating diverse data and for developing a robust, effective, and sustainable infrastructure to support open and reproducible neuroscience. However, developing community standards and gaining their adoption is difficult. The current landscape is characterized both by a lack of robust, validated standards and a plethora of overlapping, underdeveloped, untested and underutilized standards and best practices. The International Neuroinformatics Coordinating Facility (INCF), an independent organization dedicated to promoting data sharing through the coordination of infrastructure and standards, has recently implemented a formal procedure for evaluating and endorsing community standards and best practices in support of the FAIR principles. By formally serving as a standards organization dedicated to open and FAIR neuroscience, INCF helps evaluate, promulgate, and coordinate standards and best practices across neuroscience. Here, we provide an overview of the process and discuss how neuroscience can benefit from having a dedicated standards body.
Collapse
Affiliation(s)
| | - Jan G. Bjaalie
- Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway
| | - Samir Das
- McGill Centre for Integrative Neuroscience, McGill University, Montreal, QC Canada
| | - Gary F. Egan
- Monash Biomedical Imaging, Monash University, Clayton, VIC Australia
| | - Satrajit S. Ghosh
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA USA ,Department of Otolaryngology - Head and Neck Surgery Harvard Medical School Boston, Boston, MA USA
| | | | - Jeffrey S. Grethe
- Department of Neuroscience, School of Medicine, University of California, San Diego, La Jolla, CA USA
| | | | - Eric Tatt Wei Ho
- Centre for Intelligent Signal and Imaging Research, Institute of Health and Analytics, Universiti Teknologi PETRONAS, Perak, Malaysia
| | - David N. Kennedy
- Department of Psychiatry, University of Massachusetts Medical School, Worchester, MA USA
| | | | | | - Helen S. Mayberg
- Nash Family Center for Advanced Circuit Therapeutics, Icahn School of Medicine, New York, NY USA
| | - Luciano Milanesi
- Institute of Biomedical Technologies, National Research Council (CNR), Milan, Italy
| | - Roman Mouček
- Department of Computer Science and Engineering, Faculty of Applied Sciences, University of West Bohemia, Pilsen, Czech Republic
| | - J. B. Poline
- Montreal Neurological Institute, Faculty of Medicine and Health Sciences, McGill University, Montreal, Canada
| | - Prasun K. Roy
- Computational Neuroscience & Neuroimaging Laboratory, School of Bio-Medical Engineering, Indian Institute of Technology (BHU), Varanasi, UP India
| | - Stephen C. Strother
- Rotman Research Institute, Baycrest Centre, Department of Medical Biophysics, University of Toronto, Ontario, ON Canada
| | - Tong Boon Tang
- Centre for Intelligent Signal and Imaging Research, Institute of Health and Analytics, Universiti Teknologi PETRONAS, Bandar Seri Iskandar, Malaysia
| | - Paul Tiesinga
- Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen, Nijmegen, Netherlands
| | - Thomas Wachtler
- Department of Biology II, Ludwig-Maximilians-Universität München, Martinsried, Planegg Germany
| | - Daniel K. Wójcik
- Laboratory of Neuroinformatics, Nencki Institute of Experimental Biology of Polish Academy of Sciences, Warsaw, Poland
| | - Maryann E. Martone
- Department of Neuroscience, School of Medicine, University of California, San Diego, La Jolla, CA USA
| |
Collapse
|
9
|
Osanlouy M, Bandrowski A, de Bono B, Brooks D, Cassarà AM, Christie R, Ebrahimi N, Gillespie T, Grethe JS, Guercio LA, Heal M, Lin M, Kuster N, Martone ME, Neufeld E, Nickerson DP, Soltani EG, Tappan S, Wagenaar JB, Zhuang K, Hunter PJ. The SPARC DRC: Building a Resource for the Autonomic Nervous System Community. Front Physiol 2021; 12:693735. [PMID: 34248680 PMCID: PMC8265045 DOI: 10.3389/fphys.2021.693735] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Accepted: 05/28/2021] [Indexed: 02/01/2023] Open
Abstract
The Data and Resource Center (DRC) of the NIH-funded SPARC program is developing databases, connectivity maps, and simulation tools for the mammalian autonomic nervous system. The experimental data and mathematical models supplied to the DRC by the SPARC consortium are curated, annotated and semantically linked via a single knowledgebase. A data portal has been developed that allows discovery of data and models both via semantic search and via an interface that includes Google Map-like 2D flatmaps for displaying connectivity, and 3D anatomical organ scaffolds that provide a common coordinate framework for cross-species comparisons. We discuss examples that illustrate the data pipeline, which includes data upload, curation, segmentation (for image data), registration against the flatmaps and scaffolds, and finally display via the web portal, including the link to freely available online computational facilities that will enable neuromodulation hypotheses to be investigated by the autonomic neuroscience community and device manufacturers.
Collapse
Affiliation(s)
- Mahyar Osanlouy
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Anita Bandrowski
- Department of Neuroscience, University of California, San Diego, San Diego, CA, United States
| | - Bernard de Bono
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - David Brooks
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | | | - Richard Christie
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Nazanin Ebrahimi
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Tom Gillespie
- Department of Neuroscience, University of California, San Diego, San Diego, CA, United States
| | - Jeffrey S. Grethe
- Department of Neuroscience, University of California, San Diego, San Diego, CA, United States
| | | | - Maci Heal
- MBF Bioscience, Williston, VT, United States
| | - Mabelle Lin
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Niels Kuster
- IT'IS Foundation, Zurich, Switzerland
- Department of Information Technology and Electrical Engineering, Swiss Federal Institute of Technology (ETHZ), Zurich, Switzerland
| | - Maryann E. Martone
- Department of Neuroscience, University of California, San Diego, San Diego, CA, United States
| | - Esra Neufeld
- IT'IS Foundation, Zurich, Switzerland
- Department of Information Technology and Electrical Engineering, Swiss Federal Institute of Technology (ETHZ), Zurich, Switzerland
| | - David P. Nickerson
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Elias G. Soltani
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | | | | | | | - Peter J. Hunter
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| |
Collapse
|
10
|
Bijari K, Akram MA, Ascoli GA. An open-source framework for neuroscience metadata management applied to digital reconstructions of neuronal morphology. Brain Inform 2020; 7:2. [PMID: 32219575 PMCID: PMC7098402 DOI: 10.1186/s40708-020-00103-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Accepted: 03/14/2020] [Indexed: 12/21/2022] Open
Abstract
Research advancements in neuroscience entail the production of a substantial amount of data requiring interpretation, analysis, and integration. The complexity and diversity of neuroscience data necessitate the development of specialized databases and associated standards and protocols. NeuroMorpho.Org is an online repository of over one hundred thousand digitally reconstructed neurons and glia shared by hundreds of laboratories worldwide. Every entry of this public resource is associated with essential metadata describing animal species, anatomical region, cell type, experimental condition, and additional information relevant to contextualize the morphological content. Until recently, the lack of a user-friendly, structured metadata annotation system relying on standardized terminologies constituted a major hindrance in this effort, limiting the data release pace. Over the past 2 years, we have transitioned the original spreadsheet-based metadata annotation system of NeuroMorpho.Org to a custom-developed, robust, web-based framework for extracting, structuring, and managing neuroscience information. Here we release the metadata portal publicly and explain its functionality to enable usage by data contributors. This framework facilitates metadata annotation, improves terminology management, and accelerates data sharing. Moreover, its open-source development provides the opportunity of adapting and extending the code base to other related research projects with similar requirements. This metadata portal is a beneficial web companion to NeuroMorpho.Org which saves time, reduces errors, and aims to minimize the barrier for direct knowledge sharing by domain experts. The underlying framework can be progressively augmented with the integration of increasingly autonomous machine intelligence components.
Collapse
Affiliation(s)
- Kayvan Bijari
- Krasnow Institute for Advanced Study, George Mason University, Fairfax, VA USA
| | - Masood A. Akram
- Krasnow Institute for Advanced Study, George Mason University, Fairfax, VA USA
| | - Giorgio A. Ascoli
- Krasnow Institute for Advanced Study, George Mason University, Fairfax, VA USA
| |
Collapse
|
11
|
Shepherd GM, Marenco L, Hines ML, Migliore M, McDougal RA, Carnevale NT, Newton AJH, Surles-Zeigler M, Ascoli GA. Neuron Names: A Gene- and Property-Based Name Format, With Special Reference to Cortical Neurons. Front Neuroanat 2019; 13:25. [PMID: 30949034 DOI: 10.3389/fnana.2019.00025/full] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Accepted: 02/07/2019] [Indexed: 05/25/2023] Open
Abstract
Precision in neuron names is increasingly needed. We are entering a new era in which classical anatomical criteria are only the beginning toward defining the identity of a neuron as carried in its name. New criteria include patterns of gene expression, membrane properties of channels and receptors, pharmacology of neurotransmitters and neuropeptides, physiological properties of impulse firing, and state-dependent variations in expression of characteristic genes and proteins. These gene and functional properties are increasingly defining neuron types and subtypes. Clarity will therefore be enhanced by conveying as much as possible the genes and properties in the neuron name. Using a tested format of parent-child relations for the region and subregion for naming a neuron, we show how the format can be extended so that these additional properties can become an explicit part of a neuron's identity and name, or archived in a linked properties database. Based on the mouse, examples are provided for neurons in several brain regions as proof of principle, with extension to the complexities of neuron names in the cerebral cortex. The format has dual advantages, of ensuring order in archiving the hundreds of neuron types across all brain regions, as well as facilitating investigation of a given neuron type or given gene or property in the context of all its properties. In particular, we show how the format is extensible to the variety of neuron types and subtypes being revealed by RNA-seq and optogenetics. As current research reveals increasingly complex properties, the proposed approach can facilitate a consensus that goes beyond traditional neuron types.
Collapse
Affiliation(s)
- Gordon M Shepherd
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Yale Center for Medical Informatics, New Haven, CT, United States
| | - Luis Marenco
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Yale Center for Medical Informatics, New Haven, CT, United States
| | - Michael L Hines
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
| | - Michele Migliore
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Institute of Biophysics, National Research Council, Palermo, Italy
| | - Robert A McDougal
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Yale Center for Medical Informatics, New Haven, CT, United States
| | - Nicholas T Carnevale
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
| | - Adam J H Newton
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Department of Physiology and Pharmacology, SUNY Downstate Medical Center, Brooklyn, NY, United States
| | - Monique Surles-Zeigler
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Yale Center for Medical Informatics, New Haven, CT, United States
| | - Giorgio A Ascoli
- Bioengineering Department and Center for Neural Informatics, Krasnow Institute for Advanced Study, George Mason University, Fairfax, VA, United States
| |
Collapse
|
12
|
Shepherd GM, Marenco L, Hines ML, Migliore M, McDougal RA, Carnevale NT, Newton AJH, Surles-Zeigler M, Ascoli GA. Neuron Names: A Gene- and Property-Based Name Format, With Special Reference to Cortical Neurons. Front Neuroanat 2019; 13:25. [PMID: 30949034 PMCID: PMC6437103 DOI: 10.3389/fnana.2019.00025] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Accepted: 02/07/2019] [Indexed: 12/15/2022] Open
Abstract
Precision in neuron names is increasingly needed. We are entering a new era in which classical anatomical criteria are only the beginning toward defining the identity of a neuron as carried in its name. New criteria include patterns of gene expression, membrane properties of channels and receptors, pharmacology of neurotransmitters and neuropeptides, physiological properties of impulse firing, and state-dependent variations in expression of characteristic genes and proteins. These gene and functional properties are increasingly defining neuron types and subtypes. Clarity will therefore be enhanced by conveying as much as possible the genes and properties in the neuron name. Using a tested format of parent-child relations for the region and subregion for naming a neuron, we show how the format can be extended so that these additional properties can become an explicit part of a neuron's identity and name, or archived in a linked properties database. Based on the mouse, examples are provided for neurons in several brain regions as proof of principle, with extension to the complexities of neuron names in the cerebral cortex. The format has dual advantages, of ensuring order in archiving the hundreds of neuron types across all brain regions, as well as facilitating investigation of a given neuron type or given gene or property in the context of all its properties. In particular, we show how the format is extensible to the variety of neuron types and subtypes being revealed by RNA-seq and optogenetics. As current research reveals increasingly complex properties, the proposed approach can facilitate a consensus that goes beyond traditional neuron types.
Collapse
Affiliation(s)
- Gordon M. Shepherd
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Yale Center for Medical Informatics, New Haven, CT, United States
| | - Luis Marenco
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Yale Center for Medical Informatics, New Haven, CT, United States
| | - Michael L. Hines
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
| | - Michele Migliore
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Institute of Biophysics, National Research Council, Palermo, Italy
| | - Robert A. McDougal
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Yale Center for Medical Informatics, New Haven, CT, United States
| | | | - Adam J. H. Newton
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Department of Physiology and Pharmacology, SUNY Downstate Medical Center, Brooklyn, NY, United States
| | - Monique Surles-Zeigler
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, United States
- Yale Center for Medical Informatics, New Haven, CT, United States
| | - Giorgio A. Ascoli
- Bioengineering Department and Center for Neural Informatics, Krasnow Institute for Advanced Study, George Mason University, Fairfax, VA, United States
| |
Collapse
|
13
|
Blanch A, García R, Planes J, Gil R, Balada F, Blanco E, Aluja A. Ontologies About Human Behavior. EUROPEAN PSYCHOLOGIST 2017. [DOI: 10.1027/1016-9040/a000295] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Abstract. The development of information and communication technologies has stimulated a variety of data and informational resources about human behavior. This is contributing toward collaborative efforts in the formalization and systematization of an overwhelming volume of scientific information. Several tools are helpful for this endeavor, among which the ontology is growing in popularity. Most of the available informational resources adopt the ontology to organize a shared conceptualization of a given body of knowledge. In the present study, we reviewed ontology resources (n = 17) that can be of interest to researchers and scholars involved in human behavior and psychological research. The selected ontologies were contrasted on the three main components of ontologies, classes, individuals, and properties, and on scheme and knowledge metrics. Moreover, we recorded the associations of the terms within a given ontology with terms of other ontologies (mappings), the number of projects using a particular ontology, and whether an ontology was available within the Bioportal, an extensive repository about biomedical ontologies. A few working examples were also provided to clarify how these resources might contribute to improve the analysis, understanding, and research cooperation about human behavior and psychological research.
Collapse
Affiliation(s)
- Angel Blanch
- Department of Psychology, Faculty of Education, Psychology and Social Work, University of Lleida, Spain
- Institute of Biomedical Research (IRB Lleida), Spain
| | - Roberto García
- Department of Computing Science and Industrial Engineering, University of Lleida, Spain
| | - Jordi Planes
- Department of Computing Science and Industrial Engineering, University of Lleida, Spain
| | - Rosa Gil
- Department of Computing Science and Industrial Engineering, University of Lleida, Spain
| | - Ferran Balada
- Department of Psychobiology, Institute of Neurosciences, Universitat Autònoma de Barcelona, Spain
| | - Eduardo Blanco
- Department of Psychology, Faculty of Education, Psychology and Social Work, University of Lleida, Spain
- Institute of Biomedical Research (IRB Lleida), Spain
| | - Anton Aluja
- Department of Psychology, Faculty of Education, Psychology and Social Work, University of Lleida, Spain
- Institute of Biomedical Research (IRB Lleida), Spain
| |
Collapse
|
14
|
Polavaram S, Ascoli GA. An ontology-based search engine for digital reconstructions of neuronal morphology. Brain Inform 2017; 4:123-134. [PMID: 28337675 PMCID: PMC5413594 DOI: 10.1007/s40708-017-0062-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2016] [Accepted: 03/13/2017] [Indexed: 11/27/2022] Open
Abstract
Neuronal morphology is extremely diverse across and within animal species, developmental stages, brain regions, and cell types. This diversity is functionally important because neuronal structure strongly affects synaptic integration, spiking dynamics, and network connectivity. Digital reconstructions of axonal and dendritic arbors are thus essential to quantify and model information processing in the nervous system. NeuroMorpho.Org is an established repository containing tens of thousands of digitally reconstructed neurons shared by several hundred laboratories worldwide. Each neuron is annotated with specific metadata based on the published references and additional details provided by data owners. The number of represented metadata concepts has grown over the years in parallel with the increase of available data. Until now, however, the lack of standardized terminologies and of an adequately structured metadata schema limited the effectiveness of user searches. Here we present a new organization of NeuroMorpho.Org metadata grounded on a set of interconnected hierarchies focusing on the main dimensions of animal species, anatomical regions, and cell types. We have comprehensively mapped each metadata term in NeuroMorpho.Org to this formal ontology, explicitly resolving all ambiguities caused by synonymy and homonymy. Leveraging this consistent framework, we introduce OntoSearch, a powerful functionality that seamlessly enables retrieval of morphological data based on expert knowledge and logical inferences through an intuitive string-based user interface with auto-complete capability. In addition to returning the data directly matching the search criteria, OntoSearch also identifies a pool of possible hits by taking into consideration incomplete metadata annotation.
Collapse
Affiliation(s)
- Sridevi Polavaram
- Center for Neural Informatics, Structures and Plasticity, Krasnow Institute for Advanced Study, George Mason University, Fairfax, VA, USA
| | - Giorgio A Ascoli
- Center for Neural Informatics, Structures and Plasticity, Krasnow Institute for Advanced Study, George Mason University, Fairfax, VA, USA.
| |
Collapse
|
15
|
Bigdely-Shamlo N, Cockfield J, Makeig S, Rognon T, La Valle C, Miyakoshi M, Robbins KA. Hierarchical Event Descriptors (HED): Semi-Structured Tagging for Real-World Events in Large-Scale EEG. Front Neuroinform 2016; 10:42. [PMID: 27799907 PMCID: PMC5065975 DOI: 10.3389/fninf.2016.00042] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Accepted: 09/21/2016] [Indexed: 11/13/2022] Open
Abstract
Real-world brain imaging by EEG requires accurate annotation of complex subject-environment interactions in event-rich tasks and paradigms. This paper describes the evolution of the Hierarchical Event Descriptor (HED) system for systematically describing both laboratory and real-world events. HED version 2, first described here, provides the semantic capability of describing a variety of subject and environmental states. HED descriptions can include stimulus presentation events on screen or in virtual worlds, experimental or spontaneous events occurring in the real world environment, and events experienced via one or multiple sensory modalities. Furthermore, HED 2 can distinguish between the mere presence of an object and its actual (or putative) perception by a subject. Although the HED framework has implicit ontological and linked data representations, the user-interface for HED annotation is more intuitive than traditional ontological annotation. We believe that hiding the formal representations allows for a more user-friendly interface, making consistent, detailed tagging of experimental, and real-world events possible for research users. HED is extensible while retaining the advantages of having an enforced common core vocabulary. We have developed a collection of tools to support HED tag assignment and validation; these are available at hedtags.org. A plug-in for EEGLAB (sccn.ucsd.edu/eeglab), CTAGGER, is also available to speed the process of tagging existing studies.
Collapse
Affiliation(s)
| | - Jeremy Cockfield
- Department of Computer Science, University of Texas at San Antonio San Antonio, TX, USA
| | - Scott Makeig
- Swartz Center for Computational Neuroscience, University of California, San Diego San Diego, CA, USA
| | - Thomas Rognon
- Department of Computer Science, University of Texas at San Antonio San Antonio, TX, USA
| | - Chris La Valle
- Department of Computer Science, University of Texas at San Antonio San Antonio, TX, USA
| | - Makoto Miyakoshi
- Swartz Center for Computational Neuroscience, University of California, San Diego San Diego, CA, USA
| | - Kay A Robbins
- Department of Computer Science, University of Texas at San Antonio San Antonio, TX, USA
| |
Collapse
|
16
|
Gökdeniz E, Özgür A, Canbeyli R. Automated Neuroanatomical Relation Extraction: A Linguistically Motivated Approach with a PVT Connectivity Graph Case Study. Front Neuroinform 2016; 10:39. [PMID: 27708573 PMCID: PMC5030238 DOI: 10.3389/fninf.2016.00039] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2016] [Accepted: 08/23/2016] [Indexed: 11/13/2022] Open
Abstract
Identifying the relations among different regions of the brain is vital for a better understanding of how the brain functions. While a large number of studies have investigated the neuroanatomical and neurochemical connections among brain structures, their specific findings are found in publications scattered over a large number of years and different types of publications. Text mining techniques have provided the means to extract specific types of information from a large number of publications with the aim of presenting a larger, if not necessarily an exhaustive picture. By using natural language processing techniques, the present paper aims to identify connectivity relations among brain regions in general and relations relevant to the paraventricular nucleus of the thalamus (PVT) in particular. We introduce a linguistically motivated approach based on patterns defined over the constituency and dependency parse trees of sentences. Besides the presence of a relation between a pair of brain regions, the proposed method also identifies the directionality of the relation, which enables the creation and analysis of a directional brain region connectivity graph. The approach is evaluated over the manually annotated data sets of the WhiteText Project. In addition, as a case study, the method is applied to extract and analyze the connectivity graph of PVT, which is an important brain region that is considered to influence many functions ranging from arousal, motivation, and drug-seeking behavior to attention. The results of the PVT connectivity graph show that PVT may be a new target of research in mood assessment.
Collapse
Affiliation(s)
- Erinç Gökdeniz
- Department of Computer Engineering, Boğaziçi University İstanbul, Turkey
| | - Arzucan Özgür
- Department of Computer Engineering, Boğaziçi University İstanbul, Turkey
| | - Reşit Canbeyli
- Department of Psychology, Boğaziçi University İstanbul, Turkey
| |
Collapse
|
17
|
Diehl AD, Meehan TF, Bradford YM, Brush MH, Dahdul WM, Dougall DS, He Y, Osumi-Sutherland D, Ruttenberg A, Sarntivijai S, Van Slyke CE, Vasilevsky NA, Haendel MA, Blake JA, Mungall CJ. The Cell Ontology 2016: enhanced content, modularization, and ontology interoperability. J Biomed Semantics 2016; 7:44. [PMID: 27377652 PMCID: PMC4932724 DOI: 10.1186/s13326-016-0088-7] [Citation(s) in RCA: 172] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Accepted: 06/23/2016] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND The Cell Ontology (CL) is an OBO Foundry candidate ontology covering the domain of canonical, natural biological cell types. Since its inception in 2005, the CL has undergone multiple rounds of revision and expansion, most notably in its representation of hematopoietic cells. For in vivo cells, the CL focuses on vertebrates but provides general classes that can be used for other metazoans, which can be subtyped in species-specific ontologies. CONSTRUCTION AND CONTENT Recent work on the CL has focused on extending the representation of various cell types, and developing new modules in the CL itself, and in related ontologies in coordination with the CL. For example, the Kidney and Urinary Pathway Ontology was used as a template to populate the CL with additional cell types. In addition, subtypes of the class 'cell in vitro' have received improved definitions and labels to provide for modularity with the representation of cells in the Cell Line Ontology and Reagent Ontology. Recent changes in the ontology development methodology for CL include a switch from OBO to OWL for the primary encoding of the ontology, and an increasing reliance on logical definitions for improved reasoning. UTILITY AND DISCUSSION The CL is now mandated as a metadata standard for large functional genomics and transcriptomics projects, and is used extensively for annotation, querying, and analyses of cell type specific data in sequencing consortia such as FANTOM5 and ENCODE, as well as for the NIAID ImmPort database and the Cell Image Library. The CL is also a vital component used in the modular construction of other biomedical ontologies-for example, the Gene Ontology and the cross-species anatomy ontology, Uberon, use CL to support the consistent representation of cell types across different levels of anatomical granularity, such as tissues and organs. CONCLUSIONS The ongoing improvements to the CL make it a valuable resource to both the OBO Foundry community and the wider scientific community, and we continue to experience increased interest in the CL both among developers and within the user community.
Collapse
Affiliation(s)
- Alexander D. Diehl
- />Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, Buffalo, NY 14203 USA
| | - Terrence F. Meehan
- />European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD UK
| | - Yvonne M. Bradford
- />ZFIN, the Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97403 USA
| | - Matthew H. Brush
- />Ontology Development Group, Library, Oregon Health and Science University, Portland, Oregon 97239 USA
| | - Wasila M. Dahdul
- />Department of Biology, University of South Dakota, Vermillion, SD 57069 USA
- />National Evolutionary Synthesis Center, Durham, NC 27705 USA
| | - David S. Dougall
- />Southwestern Medical Center, University of Texas, Dallas, TX 75235 USA
| | - Yongqun He
- />Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109 USA
| | - David Osumi-Sutherland
- />European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD UK
| | - Alan Ruttenberg
- />Oral Diagnostics Sciences, University at Buffalo School of Dental Medicine, Buffalo, NY 14210 USA
| | - Sirarat Sarntivijai
- />European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD UK
| | - Ceri E. Van Slyke
- />ZFIN, the Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97403 USA
| | - Nicole A. Vasilevsky
- />Ontology Development Group, Library, Oregon Health and Science University, Portland, Oregon 97239 USA
| | - Melissa A. Haendel
- />Ontology Development Group, Library, Oregon Health and Science University, Portland, Oregon 97239 USA
| | | | | |
Collapse
|
18
|
Name-calling in the hippocampus (and beyond): coming to terms with neuron types and properties. Brain Inform 2016; 4:1-12. [PMID: 27747821 PMCID: PMC5319951 DOI: 10.1007/s40708-016-0053-3] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2016] [Accepted: 05/24/2016] [Indexed: 01/25/2023] Open
Abstract
Widely spread naming inconsistencies in neuroscience pose a vexing obstacle to effective communication within and across areas of expertise. This problem is particularly acute when identifying neuron types and their properties. Hippocampome.org is a web-accessible neuroinformatics resource that organizes existing data about essential properties of all known neuron types in the rodent hippocampal formation. Hippocampome.org links evidence supporting the assignment of a property to a type with direct pointers to quotes and figures. Mining this knowledge from peer-reviewed reports reveals the troubling extent of terminological ambiguity and undefined terms. Examples span simple cases of using multiple synonyms and acronyms for the same molecular biomarkers (or other property) to more complex cases of neuronal naming. New publications often use different terms without mapping them to previous terms. As a result, neurons of the same type are assigned disparate names, while neurons of different types are bestowed the same name. Furthermore, non-unique properties are frequently used as names, and several neuron types are not named at all. In order to alleviate this nomenclature confusion regarding hippocampal neuron types and properties, we introduce a new functionality of Hippocampome.org: a fully searchable, curated catalog of human and machine-readable definitions, each linked to the corresponding neuron and property terms. Furthermore, we extend our robust approach to providing each neuron type with an informative name and unique identifier by mapping all encountered synonyms and homonyms.
Collapse
|
19
|
Ozyurt IB, Grethe JS, Martone ME, Bandrowski AE. Resource Disambiguator for the Web: Extracting Biomedical Resources and Their Citations from the Scientific Literature. PLoS One 2016; 11:e0146300. [PMID: 26730820 PMCID: PMC5156472 DOI: 10.1371/journal.pone.0146300] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Accepted: 12/15/2015] [Indexed: 11/19/2022] Open
Abstract
The NIF Registry developed and maintained by the Neuroscience Information Framework is a cooperative project aimed at cataloging research resources, e.g., software tools, databases and tissue banks, funded largely by governments and available as tools to research scientists. Although originally conceived for neuroscience, the NIF Registry has over the years broadened in the scope to include research resources of general relevance to biomedical research. The current number of research resources listed by the Registry numbers over 13K. The broadening in scope to biomedical science led us to re-christen the NIF Registry platform as SciCrunch. The NIF/SciCrunch Registry has been cataloging the resource landscape since 2006; as such, it serves as a valuable dataset for tracking the breadth, fate and utilization of these resources. Our experience shows research resources like databases are dynamic objects, that can change location and scope over time. Although each record is entered manually and human-curated, the current size of the registry requires tools that can aid in curation efforts to keep content up to date, including when and where such resources are used. To address this challenge, we have developed an open source tool suite, collectively termed RDW: Resource Disambiguator for the (Web). RDW is designed to help in the upkeep and curation of the registry as well as in enhancing the content of the registry by automated extraction of resource candidates from the literature. The RDW toolkit includes a URL extractor from papers, resource candidate screen, resource URL change tracker, resource content change tracker. Curators access these tools via a web based user interface. Several strategies are used to optimize these tools, including supervised and unsupervised learning algorithms as well as statistical text analysis. The complete tool suite is used to enhance and maintain the resource registry as well as track the usage of individual resources through an innovative literature citation index honed for research resources. Here we present an overview of the Registry and show how the RDW tools are used in curation and usage tracking.
Collapse
|
20
|
Whetzel PL, Grethe JS, Banks DE, Martone ME. The NIDDK Information Network: A Community Portal for Finding Data, Materials, and Tools for Researchers Studying Diabetes, Digestive, and Kidney Diseases. PLoS One 2015; 10:e0136206. [PMID: 26393351 PMCID: PMC4578941 DOI: 10.1371/journal.pone.0136206] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2015] [Accepted: 07/30/2015] [Indexed: 11/19/2022] Open
Abstract
The NIDDK Information Network (dkNET; http://dknet.org) was launched to serve the needs of basic and clinical investigators in metabolic, digestive and kidney disease by facilitating access to research resources that advance the mission of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK). By research resources, we mean the multitude of data, software tools, materials, services, projects and organizations available to researchers in the public domain. Most of these are accessed via web-accessible databases or web portals, each developed, designed and maintained by numerous different projects, organizations and individuals. While many of the large government funded databases, maintained by agencies such as European Bioinformatics Institute and the National Center for Biotechnology Information, are well known to researchers, many more that have been developed by and for the biomedical research community are unknown or underutilized. At least part of the problem is the nature of dynamic databases, which are considered part of the "hidden" web, that is, content that is not easily accessed by search engines. dkNET was created specifically to address the challenge of connecting researchers to research resources via these types of community databases and web portals. dkNET functions as a "search engine for data", searching across millions of database records contained in hundreds of biomedical databases developed and maintained by independent projects around the world. A primary focus of dkNET are centers and projects specifically created to provide high quality data and resources to NIDDK researchers. Through the novel data ingest process used in dkNET, additional data sources can easily be incorporated, allowing it to scale with the growth of digital data and the needs of the dkNET community. Here, we provide an overview of the dkNET portal and its functions. We show how dkNET can be used to address a variety of use cases that involve searching for research resources.
Collapse
Affiliation(s)
- Patricia L. Whetzel
- Center for Research in Biological Systems, University of California, San Diego, San Diego, California, United States of America
| | - Jeffrey S. Grethe
- Center for Research in Biological Systems, University of California, San Diego, San Diego, California, United States of America
| | - Davis E. Banks
- Center for Research in Biological Systems, University of California, San Diego, San Diego, California, United States of America
| | - Maryann E. Martone
- Center for Research in Biological Systems, University of California, San Diego, San Diego, California, United States of America
- Dept of Neurosciences, University of California, San Diego, San Diego, California, United States of America
- * E-mail:
| |
Collapse
|
21
|
Turner JA, Pasquerello D, Turner MD, Keator DB, Alpert K, King M, Landis D, Calhoun VD, Potkin SG, Tallis M, Ambite JL, Wang L. Terminology development towards harmonizing multiple clinical neuroimaging research repositories. DATA INTEGRATION IN THE LIFE SCIENCES : ... INTERNATIONAL WORKSHOP, DILS ... : PROCEEDINGS. DILS (CONFERENCE) 2015; 9162:104-117. [PMID: 26688838 PMCID: PMC4682911 DOI: 10.1007/978-3-319-21843-4_8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Data sharing and mediation across disparate neuroimaging repositories requires extensive effort to ensure that the different domains of data types are referred to by commonly agreed upon terms. Within the SchizConnect project, which enables querying across decentralized databases of neuroimaging, clinical, and cognitive data from various studies of schizophrenia, we developed a model for each data domain, identified common usable terms that could be agreed upon across the repositories, and linked them to standard ontological terms where possible. We had the goal of facilitating both the current user experience in querying and future automated computations and reasoning regarding the data. We found that existing terminologies are incomplete for these purposes, even with the history of neuroimaging data sharing in the field; and we provide a model for efforts focused on querying multiple clinical neuroimaging repositories.
Collapse
Affiliation(s)
- Jessica A. Turner
- Georgia State University, Atlanta, Georgia, USA
- Mind Research Network, Albuquerque, New Mexico, USA
| | | | | | | | | | | | - Drew Landis
- Mind Research Network, Albuquerque, New Mexico, USA
| | - Vince D. Calhoun
- Mind Research Network, Albuquerque, New Mexico, USA
- University of New Mexico, Albuquerque, New Mexico, USA
| | | | - Marcelo Tallis
- University of Southern California, Los Angeles, California, USA
| | | | - Lei Wang
- Northwestern University, Chicago, Illinois, USA
| |
Collapse
|
22
|
Tiesinga P, Bakker R, Hill S, Bjaalie JG. Feeding the human brain model. Curr Opin Neurobiol 2015; 32:107-14. [DOI: 10.1016/j.conb.2015.02.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Revised: 02/06/2015] [Accepted: 02/06/2015] [Indexed: 10/23/2022]
|
23
|
French L, Liu P, Marais O, Koreman T, Tseng L, Lai A, Pavlidis P. Text mining for neuroanatomy using WhiteText with an updated corpus and a new web application. Front Neuroinform 2015; 9:13. [PMID: 26052282 PMCID: PMC4439553 DOI: 10.3389/fninf.2015.00013] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Accepted: 05/07/2015] [Indexed: 11/13/2022] Open
Abstract
We describe the WhiteText project, and its progress towards automatically extracting statements of neuroanatomical connectivity from text. We review progress to date on the three main steps of the project: recognition of brain region mentions, standardization of brain region mentions to neuroanatomical nomenclature, and connectivity statement extraction. We further describe a new version of our manually curated corpus that adds 2,111 connectivity statements from 1,828 additional abstracts. Cross-validation classification within the new corpus replicates results on our original corpus, recalling 67% of connectivity statements at 51% precision. The resulting merged corpus provides 5,208 connectivity statements that can be used to seed species-specific connectivity matrices and to better train automated techniques. Finally, we present a new web application that allows fast interactive browsing of the over 70,000 sentences indexed by the system, as a tool for accessing the data and assisting in further curation. Software and data are freely available at http://www.chibi.ubc.ca/WhiteText/.
Collapse
Affiliation(s)
- Leon French
- Rotman Research Institute, University of Toronto Toronto, ON, Canada
| | - Po Liu
- Department of Psychiatry, University of British Columbia Vancouver, BC, Canada
| | - Olivia Marais
- Department of Psychiatry, University of British Columbia Vancouver, BC, Canada
| | - Tianna Koreman
- Department of Psychiatry, University of British Columbia Vancouver, BC, Canada
| | - Lucia Tseng
- Department of Psychiatry, University of British Columbia Vancouver, BC, Canada
| | - Artemis Lai
- Department of Psychiatry, University of British Columbia Vancouver, BC, Canada
| | - Paul Pavlidis
- Department of Psychiatry, University of British Columbia Vancouver, BC, Canada ; Centre for High-Throughput Biology, University of British Columbia Vancouver, BC, Canada
| |
Collapse
|
24
|
Abstract
The maturation of in vivo neuroimaging has led to incredible quantities of digital information about the human brain. While much is made of the data deluge in science, neuroimaging represents the leading edge of this onslaught of "big data". A range of neuroimaging databasing approaches has streamlined the transmission, storage, and dissemination of data from such brain imaging studies. Yet few, if any, common solutions exist to support the science of neuroimaging. In this article, we discuss how modern neuroimaging research represents a multifactorial and broad ranging data challenge, involving the growing size of the data being acquired; sociological and logistical sharing issues; infrastructural challenges for multi-site, multi-datatype archiving; and the means by which to explore and mine these data. As neuroimaging advances further, e.g. aging, genetics, and age-related disease, new vision is needed to manage and process this information while marshalling of these resources into novel results. Thus, "big data" can become "big" brain science.
Collapse
|
25
|
Ullmann JFP, Janke AL, Reutens D, Watson C. Development of MRI-based atlases of non-human brains. J Comp Neurol 2014; 523:391-405. [PMID: 25236843 DOI: 10.1002/cne.23678] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2014] [Revised: 09/15/2014] [Accepted: 09/17/2014] [Indexed: 12/12/2022]
Abstract
Brain atlases are a fundamental resource for neuroscience research. In the past few decades they have undergone a transition from traditional printed histological atlases to digital atlases made up of multiple data sets from multiple modalities, and atlases based on magnetic resonance imaging (MRI) have become widespread. Here we discuss the methods involved in making an MRI brain atlas, including registration of multiple data sets into a model, ontological classification, segmentation of a minimum deformation model, dissemination strategies, and applications of these atlases. Finally, we discuss possible future directions in the development of brain atlases.
Collapse
Affiliation(s)
- Jeremy F P Ullmann
- Centre for Advanced Imaging, The University of Queensland, Brisbane, Queensland, 4072, Australia
| | | | | | | |
Collapse
|
26
|
A neuroinformatics of brain modeling and its implementation in the Brain Operation Database BODB. Neuroinformatics 2014; 12:5-26. [PMID: 24234915 DOI: 10.1007/s12021-013-9209-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
We present principles for an integrated neuroinformatics framework which makes explicit how models are grounded on empirical evidence, explain (or not) existing empirical results and make testable predictions. The new ontological framework makes explicit how models bring together structural, functional, and related empirical observations. We emphasize schematics of the model’s operation linked to summaries of empirical data (SEDs) used in both the design and testing of the model, with tests comparing SEDs to summaries of simulation results (SSRs) from the model. We stress the importance of protocols for models as well as experiments. We complement the structural ontology of nested brain structures with a functional ontology of Brain Operating Principles (BOPs) for observed neural function and an ontological framework for grounding models in empirical data. We present an implementation of this ontological framework in the Brain Operation Database (BODB), an environment in which modelers and experimentalists can work together by making use of their shared empirical data, models and expertise.
Collapse
|
27
|
Marenco LN, Wang R, Bandrowski AE, Grethe JS, Shepherd GM, Miller PL. Extending the NIF DISCO framework to automate complex workflow: coordinating the harvest and integration of data from diverse neuroscience information resources. Front Neuroinform 2014; 8:58. [PMID: 25018728 PMCID: PMC4071641 DOI: 10.3389/fninf.2014.00058] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Accepted: 05/06/2014] [Indexed: 11/15/2022] Open
Abstract
This paper describes how DISCO, the data aggregator that supports the Neuroscience Information Framework (NIF), has been extended to play a central role in automating the complex workflow required to support and coordinate the NIF’s data integration capabilities. The NIF is an NIH Neuroscience Blueprint initiative designed to help researchers access the wealth of data related to the neurosciences available via the Internet. A central component is the NIF Federation, a searchable database that currently contains data from 231 data and information resources regularly harvested, updated, and warehoused in the DISCO system. In the past several years, DISCO has greatly extended its functionality and has evolved to play a central role in automating the complex, ongoing process of harvesting, validating, integrating, and displaying neuroscience data from a growing set of participating resources. This paper provides an overview of DISCO’s current capabilities and discusses a number of the challenges and future directions related to the process of coordinating the integration of neuroscience data within the NIF Federation.
Collapse
Affiliation(s)
- Luis N Marenco
- Center for Medical Informatics, Yale University School of Medicine New Haven, CT, USA ; VA Connecticut Healthcare System, US Department of Veterans Affairs West Haven, CT, USA ; Department of Neurobiology, Yale University School of Medicine New Haven, CT, USA
| | - Rixin Wang
- Center for Medical Informatics, Yale University School of Medicine New Haven, CT, USA
| | - Anita E Bandrowski
- Department of Neurosciences, Center for Research in Biological Systems, University of California at San Diego La Jolla, CA, USA
| | - Jeffrey S Grethe
- Department of Neurosciences, Center for Research in Biological Systems, University of California at San Diego La Jolla, CA, USA
| | - Gordon M Shepherd
- Department of Neurobiology, Yale University School of Medicine New Haven, CT, USA
| | - Perry L Miller
- Center for Medical Informatics, Yale University School of Medicine New Haven, CT, USA ; VA Connecticut Healthcare System, US Department of Veterans Affairs West Haven, CT, USA ; Department of Anesthesiology, Yale University School of Medicine New Haven, CT, USA ; Department of Molecular, Cellular and Developmental Biology, Yale University New Haven, CT, USA
| |
Collapse
|
28
|
Veeraraghavan H, Miller JV. Faceted visualization of three dimensional neuroanatomy by combining ontology with faceted search. Neuroinformatics 2014; 12:245-59. [PMID: 24006207 PMCID: PMC3943828 DOI: 10.1007/s12021-013-9202-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
In this work, we present a faceted-search based approach for visualization of anatomy by combining a three dimensional digital atlas with an anatomy ontology. Specifically, our approach provides a drill-down search interface that exposes the relevant pieces of information (obtained by searching the ontology) for a user query. Hence, the user can produce visualizations starting with minimally specified queries. Furthermore, by automatically translating the user queries into the controlled terminology our approach eliminates the need for the user to use controlled terminology. We demonstrate the scalability of our approach using an abdominal atlas and the same ontology. We implemented our visualization tool on the opensource 3D Slicer software. We present results of our visualization approach by combining a modified Foundational Model of Anatomy (FMA) ontology with the Surgical Planning Laboratory (SPL) Brain 3D digital atlas, and geometric models specific to patients computed using the SPL brain tumor dataset.
Collapse
|
29
|
Abstract
Autism Spectrum Disorder (ASD) is highly heritable, and although there has been active research in an attempt to discover the genetic factors underlying ASD, diagnosis still depends heavily on behavioral assessments. Recently, several large-scale initiatives, including those of the Autism Consortium, have contributed to the collection of extensive information from families affected by ASD. Our goal was to develop an ontology that can be used 1) to provide improved access to the data collected by those who study ASD and other neurodevelopmental disorders, and 2) to assess and compare the characteristics of the instruments that are used in the assessment of ASD. We analyzed two dozen instruments used to assess ASD, studying the nature of the questions asked and items assessed, the method of delivery, and the overall scope of the content. These data together with the extensive literature on ASD contributed to our iterative development of an ASD phenotype ontology. The final ontology comprises 283 concepts distributed across three high-level classes, 'Personal Traits', 'Social Competence', and 'Medical History'. The ontology is fully integrated with the Autism Consortium database, allowing researchers to pose ontology-based questions. The ontology also allows researchers to assess the degree of overlap among a set of candidate instruments according to several objective criteria. The ASD phenotype ontology has promise for use in research settings where extensive phenotypic data have been collected, allowing a concept-based approach to identifying behavioral features of importance and for correlating these with genotypic data.
Collapse
Affiliation(s)
- Alexa T McCray
- Center for Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Boston, MA, 02115, USA,
| | | | | |
Collapse
|
30
|
Quantitative imaging biomarker ontology (QIBO) for knowledge representation of biomedical imaging biomarkers. J Digit Imaging 2014; 26:630-41. [PMID: 23589184 DOI: 10.1007/s10278-013-9599-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
A widening array of novel imaging biomarkers is being developed using ever more powerful clinical and preclinical imaging modalities. These biomarkers have demonstrated effectiveness in quantifying biological processes as they occur in vivo and in the early prediction of therapeutic outcomes. However, quantitative imaging biomarker data and knowledge are not standardized, representing a critical barrier to accumulating medical knowledge based on quantitative imaging data. We use an ontology to represent, integrate, and harmonize heterogeneous knowledge across the domain of imaging biomarkers. This advances the goal of developing applications to (1) improve precision and recall of storage and retrieval of quantitative imaging-related data using standardized terminology; (2) streamline the discovery and development of novel imaging biomarkers by normalizing knowledge across heterogeneous resources; (3) effectively annotate imaging experiments thus aiding comprehension, re-use, and reproducibility; and (4) provide validation frameworks through rigorous specification as a basis for testable hypotheses and compliance tests. We have developed the Quantitative Imaging Biomarker Ontology (QIBO), which currently consists of 488 terms spanning the following upper classes: experimental subject, biological intervention, imaging agent, imaging instrument, image post-processing algorithm, biological target, indicated biology, and biomarker application. We have demonstrated that QIBO can be used to annotate imaging experiments with standardized terms in the ontology and to generate hypotheses for novel imaging biomarker-disease associations. Our results established the utility of QIBO in enabling integrated analysis of quantitative imaging data.
Collapse
|
31
|
Goh SYM, Irimia A, Torgerson CM, Horn JDV. Neuroinformatics challenges to the structural, connectomic, functional and electrophysiological multimodal imaging of human traumatic brain injury. Front Neuroinform 2014; 8:19. [PMID: 24616696 PMCID: PMC3935464 DOI: 10.3389/fninf.2014.00019] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 02/11/2014] [Indexed: 01/14/2023] Open
Abstract
Throughout the past few decades, the ability to treat and rehabilitate traumatic brain injury (TBI) patients has become critically reliant upon the use of neuroimaging to acquire adequate knowledge of injury-related effects upon brain function and recovery. As a result, the need for TBI neuroimaging analysis methods has increased in recent years due to the recognition that spatiotemporal computational analyses of TBI evolution are useful for capturing the effects of TBI dynamics. At the same time, however, the advent of such methods has brought about the need to analyze, manage, and integrate TBI neuroimaging data using informatically inspired approaches which can take full advantage of their large dimensionality and informational complexity. Given this perspective, we here discuss the neuroinformatics challenges for TBI neuroimaging analysis in the context of structural, connectivity, and functional paradigms. Within each of these, the availability of a wide range of neuroimaging modalities can be leveraged to fully understand the heterogeneity of TBI pathology; consequently, large-scale computer hardware resources and next-generation processing software are often required for efficient data storage, management, and analysis of TBI neuroimaging data. However, each of these paradigms poses challenges in the context of informatics such that the ability to address them is critical for augmenting current capabilities to perform neuroimaging analysis of TBI and to improve therapeutic efficacy.
Collapse
Affiliation(s)
- S Y Matthew Goh
- Department of Neurology, Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California Los Angeles, CA, USA
| | - Andrei Irimia
- Department of Neurology, Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California Los Angeles, CA, USA
| | - Carinna M Torgerson
- Department of Neurology, Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California Los Angeles, CA, USA
| | - John D Van Horn
- Department of Neurology, Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California Los Angeles, CA, USA
| |
Collapse
|
32
|
Nichols BN, Mejino JL, Detwiler LT, Nilsen TT, Martone ME, Turner JA, Rubin DL, Brinkley JF. Neuroanatomical domain of the foundational model of anatomy ontology. J Biomed Semantics 2014; 5:1. [PMID: 24398054 PMCID: PMC3944952 DOI: 10.1186/2041-1480-5-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2013] [Accepted: 12/24/2013] [Indexed: 11/10/2022] Open
Abstract
Background The diverse set of human brain structure and function analysis methods represents a difficult challenge for reconciling multiple views of neuroanatomical organization. While different views of organization are expected and valid, no widely adopted approach exists to harmonize different brain labeling protocols and terminologies. Our approach uses the natural organizing framework provided by anatomical structure to correlate terminologies commonly used in neuroimaging. Description The Foundational Model of Anatomy (FMA) Ontology provides a semantic framework for representing the anatomical entities and relationships that constitute the phenotypic organization of the human body. In this paper we describe recent enhancements to the neuroanatomical content of the FMA that models cytoarchitectural and morphological regions of the cerebral cortex, as well as white matter structure and connectivity. This modeling effort is driven by the need to correlate and reconcile the terms used in neuroanatomical labeling protocols. By providing an ontological framework that harmonizes multiple views of neuroanatomical organization, the FMA provides developers with reusable and computable knowledge for a range of biomedical applications. Conclusions A requirement for facilitating the integration of basic and clinical neuroscience data from diverse sources is a well-structured ontology that can incorporate, organize, and associate neuroanatomical data. We applied the ontological framework of the FMA to align the vocabularies used by several human brain atlases, and to encode emerging knowledge about structural connectivity in the brain. We highlighted several use cases of these extensions, including ontology reuse, neuroimaging data annotation, and organizing 3D brain models.
Collapse
|
33
|
Jensen M, Cox AP, Chaudhry N, Ng M, Sule D, Duncan W, Ray P, Weinstock-Guttman B, Smith B, Ruttenberg A, Szigeti K, Diehl AD. The neurological disease ontology. J Biomed Semantics 2013; 4:42. [PMID: 24314207 PMCID: PMC4028878 DOI: 10.1186/2041-1480-4-42] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2013] [Accepted: 11/29/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND We are developing the Neurological Disease Ontology (ND) to provide a framework to enable representation of aspects of neurological diseases that are relevant to their treatment and study. ND is a representational tool that addresses the need for unambiguous annotation, storage, and retrieval of data associated with the treatment and study of neurological diseases. ND is being developed in compliance with the Open Biomedical Ontology Foundry principles and builds upon the paradigm established by the Ontology for General Medical Science (OGMS) for the representation of entities in the domain of disease and medical practice. Initial applications of ND will include the annotation and analysis of large data sets and patient records for Alzheimer's disease, multiple sclerosis, and stroke. DESCRIPTION ND is implemented in OWL 2 and currently has more than 450 terms that refer to and describe various aspects of neurological diseases. ND directly imports the development version of OGMS, which uses BFO 2. Term development in ND has primarily extended the OGMS terms 'disease', 'diagnosis', 'disease course', and 'disorder'. We have imported and utilize over 700 classes from related ontology efforts including the Foundational Model of Anatomy, Ontology for Biomedical Investigations, and Protein Ontology. ND terms are annotated with ontology metadata such as a label (term name), term editors, textual definition, definition source, curation status, and alternative terms (synonyms). Many terms have logical definitions in addition to these annotations. Current development has focused on the establishment of the upper-level structure of the ND hierarchy, as well as on the representation of Alzheimer's disease, multiple sclerosis, and stroke. The ontology is available as a version-controlled file at http://code.google.com/p/neurological-disease-ontology along with a discussion list and an issue tracker. CONCLUSION ND seeks to provide a formal foundation for the representation of clinical and research data pertaining to neurological diseases. ND will enable its users to connect data in a robust way with related data that is annotated using other terminologies and ontologies in the biomedical domain.
Collapse
Affiliation(s)
- Mark Jensen
- Department of Philosophy, University at Buffalo, 135 Park Hall, Buffalo, NY 14260, USA
| | - Alexander P Cox
- Department of Philosophy, University at Buffalo, 135 Park Hall, Buffalo, NY 14260, USA
| | - Naveed Chaudhry
- Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, 701 Ellicott Street, Buffalo, NY 14203, USA
| | - Marcus Ng
- Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, 701 Ellicott Street, Buffalo, NY 14203, USA
| | - Donat Sule
- Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, 701 Ellicott Street, Buffalo, NY 14203, USA
| | - William Duncan
- Department of Philosophy, University at Buffalo, 135 Park Hall, Buffalo, NY 14260, USA
| | - Patrick Ray
- Department of Philosophy, University at Buffalo, 135 Park Hall, Buffalo, NY 14260, USA
| | - Bianca Weinstock-Guttman
- Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, 701 Ellicott Street, Buffalo, NY 14203, USA
| | - Barry Smith
- Department of Philosophy, University at Buffalo, 135 Park Hall, Buffalo, NY 14260, USA
- Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, 701 Ellicott Street, Buffalo, NY 14203, USA
| | - Alan Ruttenberg
- Department of Oral Diagnostic Sciences, University at Buffalo School of Dental Medicine, 355 Squire Hall, Buffalo, NY 14214, USA
| | - Kinga Szigeti
- Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, 701 Ellicott Street, Buffalo, NY 14203, USA
| | - Alexander D Diehl
- Department of Neurology, University at Buffalo School of Medicine and Biomedical Sciences, 701 Ellicott Street, Buffalo, NY 14203, USA
| |
Collapse
|
34
|
Roncaglia P, Martone ME, Hill DP, Berardini TZ, Foulger RE, Imam FT, Drabkin H, Mungall CJ, Lomax J. The Gene Ontology (GO) Cellular Component Ontology: integration with SAO (Subcellular Anatomy Ontology) and other recent developments. J Biomed Semantics 2013; 4:20. [PMID: 24093723 PMCID: PMC3852282 DOI: 10.1186/2041-1480-4-20] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Accepted: 09/24/2013] [Indexed: 12/31/2022] Open
Abstract
Background The Gene Ontology (GO) (http://www.geneontology.org/) contains a set of terms for describing the activity and actions of gene products across all kingdoms of life. Each of these activities is executed in a location within a cell or in the vicinity of a cell. In order to capture this context, the GO includes a sub-ontology called the Cellular Component (CC) ontology (GO-CCO). The primary use of this ontology is for GO annotation, but it has also been used for phenotype annotation, and for the annotation of images. Another ontology with similar scope to the GO-CCO is the Subcellular Anatomy Ontology (SAO), part of the Neuroscience Information Framework Standard (NIFSTD) suite of ontologies. The SAO also covers cell components, but in the domain of neuroscience. Description Recently, the GO-CCO was enriched in content and links to the Biological Process and Molecular Function branches of GO as well as to other ontologies. This was achieved in several ways. We carried out an amalgamation of SAO terms with GO-CCO ones; as a result, nearly 100 new neuroscience-related terms were added to the GO. The GO-CCO also contains relationships to GO Biological Process and Molecular Function terms, as well as connecting to external ontologies such as the Cell Ontology (CL). Terms representing protein complexes in the Protein Ontology (PRO) reference GO-CCO terms for their species-generic counterparts. GO-CCO terms can also be used to search a variety of databases. Conclusions In this publication we provide an overview of the GO-CCO, its overall design, and some recent extensions that make use of additional spatial information. One of the most recent developments of the GO-CCO was the merging in of the SAO, resulting in a single unified ontology designed to serve the needs of GO annotators as well as the specific needs of the neuroscience community.
Collapse
Affiliation(s)
- Paola Roncaglia
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK.
| | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Ethier JF, Dameron O, Curcin V, McGilchrist MM, Verheij RA, Arvanitis TN, Taweel A, Delaney BC, Burgun A. A unified structural/terminological interoperability framework based on LexEVS: application to TRANSFoRm. J Am Med Inform Assoc 2013; 20:986-94. [PMID: 23571850 PMCID: PMC3756256 DOI: 10.1136/amiajnl-2012-001312] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2012] [Revised: 03/10/2013] [Accepted: 03/14/2013] [Indexed: 12/17/2022] Open
Abstract
OBJECTIVE Biomedical research increasingly relies on the integration of information from multiple heterogeneous data sources. Despite the fact that structural and terminological aspects of interoperability are interdependent and rely on a common set of requirements, current efforts typically address them in isolation. We propose a unified ontology-based knowledge framework to facilitate interoperability between heterogeneous sources, and investigate if using the LexEVS terminology server is a viable implementation method. MATERIALS AND METHODS We developed a framework based on an ontology, the general information model (GIM), to unify structural models and terminologies, together with relevant mapping sets. This allowed a uniform access to these resources within LexEVS to facilitate interoperability by various components and data sources from implementing architectures. RESULTS Our unified framework has been tested in the context of the EU Framework Program 7 TRANSFoRm project, where it was used to achieve data integration in a retrospective diabetes cohort study. The GIM was successfully instantiated in TRANSFoRm as the clinical data integration model, and necessary mappings were created to support effective information retrieval for software tools in the project. CONCLUSIONS We present a novel, unifying approach to address interoperability challenges in heterogeneous data sources, by representing structural and semantic models in one framework. Systems using this architecture can rely solely on the GIM that abstracts over both the structure and coding. Information models, terminologies and mappings are all stored in LexEVS and can be accessed in a uniform manner (implementing the HL7 CTS2 service functional model). The system is flexible and should reduce the effort needed from data sources personnel for implementing and managing the integration.
Collapse
Affiliation(s)
| | | | - Vasa Curcin
- Department of Computing, Imperial College London, London, UK
| | | | | | - Theodoros N Arvanitis
- School of Electronic, Electrical and Computer Engineering, University of Birmingham, Birmingham, UK
| | - Adel Taweel
- Department of Computer Science, King's College London, London, UK
| | - Brendan C Delaney
- Department of Primary Care and Public Health Sciences, King's College London, London, UK
| | - Anita Burgun
- INSERM UMR936, Université de Rennes 1, Rennes, France
| |
Collapse
|
36
|
Larson SD, Martone ME. NeuroLex.org: an online framework for neuroscience knowledge. Front Neuroinform 2013; 7:18. [PMID: 24009581 PMCID: PMC3757470 DOI: 10.3389/fninf.2013.00018] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2012] [Accepted: 08/06/2013] [Indexed: 11/13/2022] Open
Abstract
The ability to transmit, organize, and query information digitally has brought with it the challenge of how to best use this power to facilitate scientific inquiry. Today, few information systems are able to provide detailed answers to complex questions about neuroscience that account for multiple spatial scales, and which cross the boundaries of diverse parts of the nervous system such as molecules, cellular parts, cells, circuits, systems and tissues. As a result, investigators still primarily seek answers to their questions in an increasingly densely populated collection of articles in the literature, each of which must be digested individually. If it were easier to search a knowledge base that was structured to answer neuroscience questions, such a system would enable questions to be answered in seconds that would otherwise require hours of literature review. In this article, we describe NeuroLex.org, a wiki-based website and knowledge management system. Its goal is to bring neurobiological knowledge into a framework that allows neuroscientists to review the concepts of neuroscience, with an emphasis on multiscale descriptions of the parts of nervous systems, aggregate their understanding with that of other scientists, link them to data sources and descriptions of important concepts in neuroscience, and expose parts that are still controversial or missing. To date, the site is tracking ~25,000 unique neuroanatomical parts and concepts in neurobiology spanning experimental techniques, behavioral paradigms, anatomical nomenclature, genes, proteins and molecules. Here we show how the structuring of information about these anatomical parts in the nervous system can be reused to answer multiple neuroscience questions, such as displaying all known GABAergic neurons aggregated in NeuroLex or displaying all brain regions that are known within NeuroLex to send axons into the cerebellar cortex.
Collapse
Affiliation(s)
- Stephen D Larson
- Department of Neurosciences, University of California San Diego La Jolla, CA, USA
| | | |
Collapse
|
37
|
Puelles L, Harrison M, Paxinos G, Watson C. A developmental ontology for the mammalian brain based on the prosomeric model. Trends Neurosci 2013; 36:570-8. [PMID: 23871546 DOI: 10.1016/j.tins.2013.06.004] [Citation(s) in RCA: 170] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2011] [Revised: 06/12/2013] [Accepted: 06/20/2013] [Indexed: 12/22/2022]
Abstract
In the past, attempts to create a hierarchical classification of brain structures (an ontology) have been limited by the lack of adequate data on developmental processes. Recent studies on gene expression during brain development have demonstrated the true morphologic interrelations of different parts of the brain. A developmental ontology takes into account the progressive rostrocaudal and dorsoventral differentiation of the neural tube, and the radial migration of derivatives from progenitor areas, using fate mapping and other experimental techniques. In this review, we used the prosomeric model of brain development to build a hierarchical classification of brain structures based chiefly on gene expression. Because genomic control of neural morphogenesis is remarkably conservative, this ontology should prove essentially valid for all vertebrates, aiding terminological unification.
Collapse
Affiliation(s)
- Luis Puelles
- Department of Human Anatomy, University of Murcia, Murcia 30003, Spain
| | | | | | | |
Collapse
|
38
|
Abstract
As open science neuroinformatics databases the Brede Database and Brede Wiki seek to make distribution and federation of their content as easy and transparent as possible. The databases rely on simple formats and allow other online tools to reuse their content. This paper describes the possible interconnections on different levels between the Brede tools and other databases.
Collapse
Affiliation(s)
- Finn Årup Nielsen
- DTU Compute, Technical University of Denmark, Kongens Lyngby, Denmark,
| |
Collapse
|
39
|
Maynard SM, Mungall CJ, Lewis SE, Imam FT, Martone ME. A knowledge based approach to matching human neurodegenerative disease and animal models. Front Neuroinform 2013; 7:7. [PMID: 23717278 PMCID: PMC3653101 DOI: 10.3389/fninf.2013.00007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2012] [Accepted: 04/09/2013] [Indexed: 12/19/2022] Open
Abstract
Neurodegenerative diseases present a wide and complex range of biological and clinical features. Animal models are key to translational research, yet typically only exhibit a subset of disease features rather than being precise replicas of the disease. Consequently, connecting animal to human conditions using direct data-mining strategies has proven challenging, particularly for diseases of the nervous system, with its complicated anatomy and physiology. To address this challenge we have explored the use of ontologies to create formal descriptions of structural phenotypes across scales that are machine processable and amenable to logical inference. As proof of concept, we built a Neurodegenerative Disease Phenotype Ontology (NDPO) and an associated Phenotype Knowledge Base (PKB) using an entity-quality model that incorporates descriptions for both human disease phenotypes and those of animal models. Entities are drawn from community ontologies made available through the Neuroscience Information Framework (NIF) and qualities are drawn from the Phenotype and Trait Ontology (PATO). We generated ~1200 structured phenotype statements describing structural alterations at the subcellular, cellular and gross anatomical levels observed in 11 human neurodegenerative conditions and associated animal models. PhenoSim, an open source tool for comparing phenotypes, was used to issue a series of competency questions to compare individual phenotypes among organisms and to determine which animal models recapitulate phenotypic aspects of the human disease in aggregate. Overall, the system was able to use relationships within the ontology to bridge phenotypes across scales, returning non-trivial matches based on common subsumers that were meaningful to a neuroscientist with an advanced knowledge of neuroanatomy. The system can be used both to compare individual phenotypes and also phenotypes in aggregate. This proof of concept suggests that expressing complex phenotypes using formal ontologies provides considerable benefit for comparing phenotypes across scales and species.
Collapse
Affiliation(s)
- Sarah M Maynard
- Department of Neurosciences, Center for Research in Biological Systems, University of California San Diego, San Diego, CA, USA
| | | | | | | | | |
Collapse
|
40
|
A survey of the neuroscience resource landscape: perspectives from the neuroscience information framework. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2013. [PMID: 23195120 DOI: 10.1016/b978-0-12-388408-4.00003-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/05/2024]
Abstract
The number of available neuroscience resources (databases, tools, materials, and networks) available via the Web continues to expand, particularly in light of newly implemented data sharing policies required by funding agencies and journals. However, the nature of dense, multifaceted neuroscience data and the design of classic search engine systems make efficient, reliable, and relevant discovery of such resources a significant challenge. This challenge is especially pertinent for online databases, whose dynamic content is largely opaque to contemporary search engines. The Neuroscience Information Framework was initiated to address this problem of finding and utilizing neuroscience-relevant resources. Since its first production release in 2008, NIF has been surveying the resource landscape for the neurosciences, identifying relevant resources and working to make them easily discoverable by the neuroscience community. In this chapter, we provide a survey of the resource landscape for neuroscience: what types of resources are available, how many there are, what they contain, and most importantly, ways in which these resources can be utilized by the research community to advance neuroscience research.
Collapse
|
41
|
Vita R, Overton JA, Greenbaum JA, Sette A, Peters B. Query enhancement through the practical application of ontology: the IEDB and OBI. J Biomed Semantics 2013; 4 Suppl 1:S6. [PMID: 23734660 PMCID: PMC3633001 DOI: 10.1186/2041-1480-4-s1-s6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Ontologies categorize entities, express relationships between them, and provide standardized definitions. Thus, they can be used to present and enforce the specific relationships between database components. The Immune Epitope Database (IEDB, http://www.iedb.org) utilizes the Ontology for Biomedical Investigations (OBI) and several additional ontologies to represent immune epitope mapping experiments. Here, we describe our experiences utilizing this representation in order to provide enhanced database search functionality. We applied a simple approach to incorporate the benefits of the information captured in a formal ontology directly into the user web interface, resulting in an improved user experience with minimal changes to the database itself. The integration is easy to maintain, provides standardized terms and definitions, and allows for subsumption queries. In addition to these immediate benefits, our long-term goal is to enable true semantic integration of data and knowledge in the biomedical domain. We describe our progress towards that goal and what we perceive as the main obstacles.
Collapse
Affiliation(s)
- Randi Vita
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, 9420 Athena Circle, La Jolla, CA 92037, USA.
| | | | | | | | | | | |
Collapse
|
42
|
Aranguren ME, Fernández-Breis JT, Mungall C, Antezana E, González AR, Wilkinson MD. OPPL-Galaxy, a Galaxy tool for enhancing ontology exploitation as part of bioinformatics workflows. J Biomed Semantics 2013; 4:2. [PMID: 23286517 PMCID: PMC3643862 DOI: 10.1186/2041-1480-4-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 12/27/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Biomedical ontologies are key elements for building up the Life Sciences Semantic Web. Reusing and building biomedical ontologies requires flexible and versatile tools to manipulate them efficiently, in particular for enriching their axiomatic content. The Ontology Pre Processor Language (OPPL) is an OWL-based language for automating the changes to be performed in an ontology. OPPL augments the ontologists' toolbox by providing a more efficient, and less error-prone, mechanism for enriching a biomedical ontology than that obtained by a manual treatment. RESULTS We present OPPL-Galaxy, a wrapper for using OPPL within Galaxy. The functionality delivered by OPPL (i.e. automated ontology manipulation) can be combined with the tools and workflows devised within the Galaxy framework, resulting in an enhancement of OPPL. Use cases are provided in order to demonstrate OPPL-Galaxy's capability for enriching, modifying and querying biomedical ontologies. CONCLUSIONS Coupling OPPL-Galaxy with other bioinformatics tools of the Galaxy framework results in a system that is more than the sum of its parts. OPPL-Galaxy opens a new dimension of analyses and exploitation of biomedical ontologies, including automated reasoning, paving the way towards advanced biological data analyses.
Collapse
Affiliation(s)
- Mikel Egaña Aranguren
- Ontology Engineering Group, School of Computer Science, Technical University of Madrid (UPM), Boadilla del Monte, 28660, Spain
- Biological Informatics Group, Centre for Plant Biotechnology and Genomics (CBGP), Technical University of Madrid (UPM), Pozuelo de Alarcón, 28223, Spain
| | | | - Chris Mungall
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, US
| | - Erick Antezana
- Department of Biology, Norwegian University of Science and Technology (NTNU), Høgskoleringen 5, Trondheim, N-7491, Norway
| | - Alejandro Rodríguez González
- Biological Informatics Group, Centre for Plant Biotechnology and Genomics (CBGP), Technical University of Madrid (UPM), Pozuelo de Alarcón, 28223, Spain
| | - Mark D Wilkinson
- Biological Informatics Group, Centre for Plant Biotechnology and Genomics (CBGP), Technical University of Madrid (UPM), Pozuelo de Alarcón, 28223, Spain
| |
Collapse
|
43
|
Calabrese E, Johnson GA, Watson C. An ontology-based segmentation scheme for tracking postnatal changes in the developing rodent brain with MRI. Neuroimage 2012; 67:375-84. [PMID: 23246176 DOI: 10.1016/j.neuroimage.2012.11.037] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2012] [Revised: 10/11/2012] [Accepted: 11/15/2012] [Indexed: 10/27/2022] Open
Abstract
The postnatal period of neurodevelopment has been implicated in a number of brain disorders including autism and schizophrenia. Rodent models have proven to be invaluable in advancing our understanding of the human brain, and will almost certainly play a pivotal role in future studies on postnatal neurodevelopment. The growing field of magnetic resonance microscopy has the potential to revolutionize our understanding of neurodevelopment, if it can be successfully and appropriately assimilated into the vast body of existing neuroscience research. In this study, we demonstrate the utility of a developmental neuro-ontology designed specifically for tracking regional changes in MR biomarkers throughout postnatal neurodevelopment. Using this ontological classification as a segmentation guide, we track regional changes in brain volume in rats between postnatal day zero and postnatal day 80 and demonstrate differential growth rates in axial versus paraxial brain regions. Both the ontology and the associated label volumes are provided as a foundation for future MR-based studies of postnatal neurodevelopment in normal and disease states.
Collapse
Affiliation(s)
- Evan Calabrese
- Center for In Vivo Microscopy, Department of Radiology, Box 3302 Duke University Medical Center, Durham, NC 27710, USA
| | | | | |
Collapse
|
44
|
Brown JA, Rudie JD, Bandrowski A, Van Horn JD, Bookheimer SY. The UCLA multimodal connectivity database: a web-based platform for brain connectivity matrix sharing and analysis. Front Neuroinform 2012; 6:28. [PMID: 23226127 PMCID: PMC3508475 DOI: 10.3389/fninf.2012.00028] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2012] [Accepted: 11/14/2012] [Indexed: 11/13/2022] Open
Abstract
Brain connectomics research has rapidly expanded using functional MRI (fMRI) and diffusion-weighted MRI (dwMRI). A common product of these varied analyses is a connectivity matrix (CM). A CM stores the connection strength between any two regions (“nodes”) in a brain network. This format is useful for several reasons: (1) it is highly distilled, with minimal data size and complexity, (2) graph theory can be applied to characterize the network's topology, and (3) it retains sufficient information to capture individual differences such as age, gender, intelligence quotient (IQ), or disease state. Here we introduce the UCLA Multimodal Connectivity Database (http://umcd.humanconnectomeproject.org), an openly available website for brain network analysis and data sharing. The site is a repository for researchers to publicly share CMs derived from their data. The site also allows users to select any CM shared by another user, compute graph theoretical metrics on the site, visualize a report of results, or download the raw CM. To date, users have contributed over 2000 individual CMs, spanning different imaging modalities (fMRI, dwMRI) and disorders (Alzheimer's, autism, Attention Deficit Hyperactive Disorder). To demonstrate the site's functionality, whole brain functional and structural connectivity matrices are derived from 60 subjects' (ages 26–45) resting state fMRI (rs-fMRI) and dwMRI data and uploaded to the site. The site is utilized to derive graph theory global and regional measures for the rs-fMRI and dwMRI networks. Global and nodal graph theoretical measures between functional and structural networks exhibit low correspondence. This example demonstrates how this tool can enhance the comparability of brain networks from different imaging modalities and studies. The existence of this connectivity-based repository should foster broader data sharing and enable larger-scale meta-analyses comparing networks across imaging modality, age group, and disease state.
Collapse
Affiliation(s)
- Jesse A Brown
- Center for Cognitive Neuroscience, University of California Los Angeles Los Angeles, CA, USA ; Department of Psychiatry and Biobehavioral Sciences, University of California Los Angeles Los Angeles, CA, USA ; Interdepartmental Program in Neuroscience, University of California Los Angeles Los Angeles, CA, USA
| | | | | | | | | |
Collapse
|
45
|
Poldrack RA, Mumford JA, Schonberg T, Kalar D, Barman B, Yarkoni T. Discovering relations between mind, brain, and mental disorders using topic mapping. PLoS Comput Biol 2012; 8:e1002707. [PMID: 23071428 PMCID: PMC3469446 DOI: 10.1371/journal.pcbi.1002707] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 08/02/2012] [Indexed: 11/18/2022] Open
Abstract
Neuroimaging research has largely focused on the identification of associations between brain activation and specific mental functions. Here we show that data mining techniques applied to a large database of neuroimaging results can be used to identify the conceptual structure of mental functions and their mapping to brain systems. This analysis confirms many current ideas regarding the neural organization of cognition, but also provides some new insights into the roles of particular brain systems in mental function. We further show that the same methods can be used to identify the relations between mental disorders. Finally, we show that these two approaches can be combined to empirically identify novel relations between mental disorders and mental functions via their common involvement of particular brain networks. This approach has the potential to discover novel endophenotypes for neuropsychiatric disorders and to better characterize the structure of these disorders and the relations between them. One of the major challenges of neuroscience research is to integrate the results of the large number of published research studies in order to better understand how psychological functions are mapped onto brain systems. In this research, we take advantage of a large database of neuroimaging studies, along with text mining methods, to extract information about the topics that are found in the brain imaging literature and their mapping onto reported brain activation data. We also show that this method can be used to identify new relations between psychological functions and mental disorders, through their shared brain activity patterns. This work provides a new way to discover the underlying structure that relates brain function and mental processes.
Collapse
Affiliation(s)
- Russell A Poldrack
- Imaging Research Center and Departments of Psychology and Neurobiology, University of Texas, Austin, Texas, United States of America.
| | | | | | | | | | | |
Collapse
|
46
|
French L, Pavlidis P. Using text mining to link journal articles to neuroanatomical databases. J Comp Neurol 2012; 520:1772-83. [PMID: 22120205 DOI: 10.1002/cne.23012] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The electronic linking of neuroscience information, including data embedded in the primary literature, would permit powerful queries and analyses driven by structured databases. This task would be facilitated by automated procedures that can identify biological concepts in journals. Here we apply an approach for automatically mapping formal identifiers of neuroanatomical regions to text found in journal abstracts, applying it to a large body of abstracts from the Journal of Comparative Neurology (JCN). The analyses yield over 100,000 brain region mentions, which we map to 8,225 brain region concepts in multiple organisms. Based on the analysis of a manually annotated corpus, we estimate mentions are mapped at 95% precision and 63% recall. Our results provide insights into the patterns of publication on brain regions and species of study in JCN but also point to important challenges in the standardization of neuroanatomical nomenclatures. We find that many terms in the formal terminologies never appear in a JCN abstract, and, conversely, many terms that authors use are not reflected in the terminologies. To improve the terminologies, we deposited 136 unrecognized brain regions into the Neuroscience Lexicon (NeuroLex). The training data, terminologies, normalizations, evaluations, and annotated journal abstracts are freely available at http://www.chibi.ubc.ca/WhiteText/.
Collapse
Affiliation(s)
- Leon French
- Bioinformatics Graduate Program, University of British Columbia, Vancouver, British Columbia, V6T 1Z4 Canada
| | | |
Collapse
|
47
|
Kennedy DN, Hodge SM, Gao Y, Frazier JA, Haselgrove C. The internet brain volume database: a public resource for storage and retrieval of volumetric data. Neuroinformatics 2012; 10:129-40. [PMID: 21931990 DOI: 10.1007/s12021-011-9130-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Every month, numerous publications appear that include neuroanatomic volumetric observations. The current and past literature that includes volumetric measurements is vast, but variable with respect to specific species, structures, and subject characteristics (such as gender, age, pathology, etc.). In this report we introduce the Internet Brain Volume Database (IBVD), www.nitrc.org/projects/ibvd , a site devoted to facilitating access to and utilization of neuroanatomic volumetric observations as published in the literature. We review the design and functionality of the site. The IBVD is the first database dedicated to integrating, exposing and sharing brain volumetric observations across species and disease. It offers valuable functionality for quality assurance assessment of results as well as support for meta-analysis across large segments of the published literature that are obscured from traditional text-based search engines.
Collapse
Affiliation(s)
- David N Kennedy
- Department of Psychiatry, University of Massachusetts Medical School, 356 Plantation St, Biotech 1, Suite 100, Worcester, MA 01605, USA.
| | | | | | | | | |
Collapse
|
48
|
Köhler S, Doelken SC, Rath A, Aymé S, Robinson PN. Ontological phenotype standards for neurogenetics. Hum Mutat 2012; 33:1333-9. [PMID: 22573485 DOI: 10.1002/humu.22112] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Accepted: 04/13/2012] [Indexed: 12/22/2022]
Abstract
Neurological disorders comprise one of the largest groups of human diseases. Due to the myriad symptoms and the extreme degree of clinical variability characteristic of many neurological diseases, the differential diagnosis process is extremely challenging. Even though most neurogenetic diseases are individually rare, collectively, the subgroup of neurogenetic disorders is large, comprising more than 2,400 different disorders. Recently, increasing efforts have been undertaken to unravel the molecular basis of neurogenetic diseases and to correlate pathogenetic mechanisms with clinical signs and symptoms. In order to enable computer-based analyses, the systematic representation of the neurological phenotype is of major importance. We demonstrate how the Human Phenotype Ontology (HPO) can be incorporated into these efforts by providing a systematic semantic representation of phenotypic abnormalities encountered in human genetic diseases. The combination of the HPO together with the Orphanet disease classification represents a promising resource for automated disease classification, performing computational clustering and analysis of the neurogenetic phenome. Furthermore, standardized representations of neurologic phenotypic abnormalities employing the HPO link neurological phenotypic abnormalities to anatomical and functional entities represented in other biomedical ontologies through the semantic references provided by the HPO.
Collapse
Affiliation(s)
- Sebastian Köhler
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | | | | | | | | |
Collapse
|
49
|
Imam FT, Larson SD, Bandrowski A, Grethe JS, Gupta A, Martone ME. Development and use of Ontologies Inside the Neuroscience Information Framework: A Practical Approach. Front Genet 2012; 3:111. [PMID: 22737162 PMCID: PMC3381282 DOI: 10.3389/fgene.2012.00111] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2012] [Accepted: 05/29/2012] [Indexed: 11/16/2022] Open
Abstract
An initiative of the NIH Blueprint for neuroscience research, the Neuroscience Information Framework (NIF) project advances neuroscience by enabling discovery and access to public research data and tools worldwide through an open source, semantically enhanced search portal. One of the critical components for the overall NIF system, the NIF Standardized Ontologies (NIFSTD), provides an extensive collection of standard neuroscience concepts along with their synonyms and relationships. The knowledge models defined in the NIFSTD ontologies enable an effective concept-based search over heterogeneous types of web-accessible information entities in NIF’s production system. NIFSTD covers major domains in neuroscience, including diseases, brain anatomy, cell types, sub-cellular anatomy, small molecules, techniques, and resource descriptors. Since the first production release in 2008, NIF has grown significantly in content and functionality, particularly with respect to the ontologies and ontology-based services that drive the NIF system. We present here on the structure, design principles, community engagement, and the current state of NIFSTD ontologies.
Collapse
Affiliation(s)
- Fahim T Imam
- Neuroscience Information Framework, Center for Research in Biological Systems, University of California San Diego La Jolla, CA, USA
| | | | | | | | | | | |
Collapse
|
50
|
Schwartz Y, Barbot A, Thyreau B, Frouin V, Varoquaux G, Siram A, Marcus DS, Poline JB. PyXNAT: XNAT in Python. Front Neuroinform 2012; 6:12. [PMID: 22654752 PMCID: PMC3354345 DOI: 10.3389/fninf.2012.00012] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2011] [Accepted: 03/28/2012] [Indexed: 11/13/2022] Open
Abstract
As neuroimaging databases grow in size and complexity, the time researchers spend investigating and managing the data increases to the expense of data analysis. As a result, investigators rely more and more heavily on scripting using high-level languages to automate data management and processing tasks. For this, a structured and programmatic access to the data store is necessary. Web services are a first step toward this goal. They however lack in functionality and ease of use because they provide only low-level interfaces to databases. We introduce here PyXNAT, a Python module that interacts with The Extensible Neuroimaging Archive Toolkit (XNAT) through native Python calls across multiple operating systems. The choice of Python enables PyXNAT to expose the XNAT Web Services and unify their features with a higher level and more expressive language. PyXNAT provides XNAT users direct access to all the scientific packages in Python. Finally PyXNAT aims to be efficient and easy to use, both as a back-end library to build XNAT clients and as an alternative front-end from the command line.
Collapse
|