1
|
Abasi M, Kianmehr A, Variji A, Sangali P, Mahrooz A. microRNAs as molecular tools for brain health: Neuroprotective potential in neurodegenerative disorders. Neuroscience 2025; 574:83-103. [PMID: 40210196 DOI: 10.1016/j.neuroscience.2025.04.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 03/09/2025] [Accepted: 04/05/2025] [Indexed: 04/12/2025]
Abstract
As research on microRNAs (miRNAs) advances, it is becoming increasingly clear that these small molecules play crucial roles in the central nervous system (CNS). They are involved in various essential neuronal functions, with specific miRNAs preferentially expressed in different cell types within the nervous system. Notably, certain miRNAs are found at higher levels in the brain and spinal cord compared to other tissues, suggesting they may have specialized functions in the CNS. miRNAs associated with long-term neurodegenerative changes could serve as valuable tools for early treatment decisions and disease monitoring. The significance of miRNAs such as miR-320, miR-146 and miR-29 in the early diagnosis of neurodegenerative disorders becomes evident, especially considering that many neurological and physical symptoms manifest only after substantial degeneration of specific neurons. Interestingly, serum miRNA levels such as miR-92 and miR-486 may correlate with various MRI parameters in multiple sclerosis. Targeting miRNAs using antisense strategies, such as antisense miR-146 and miR-485, may provide advantages over targeting mRNAs, as a single anti-miRNA can regulate multiple disease-related genes. In the future, anti-miRNA-based therapeutic approaches could be integrated into the clinical management of neurological diseases. Certain miRNAs, including miR-223, miR-106, miR-181, and miR-146, contribute to the pathogenesis of various neurodegenerative diseases and thus warrant greater attention. This knowledge could pave the way for the identification of new diagnostic, prognostic, and theranostic biomarkers, and potentially guiding the development of RNA-based therapeutic strategies. This review highlights recent research on the roles of miRNAs in the nervous system, particularly their protective functions in neurodegenerative disorders.
Collapse
Affiliation(s)
- Mozhgan Abasi
- Department of Tissue Engineering and Applied Cell Sciences, Faculty of Advanced Technologies in Medicine, Mazandaran University of Medical Sciences, Sari, Iran; Immunogenetics Research Center, Faculty of Medicine, Mazandaran University of Medical Sciences, Sari, Iran
| | - Anvarsadat Kianmehr
- Medical Cellular and Molecular Research Center, Golestan University of Medical Sciences, Gorgan, Iran
| | - Athena Variji
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia; Lifelong Health Theme, South Australian Health and Medical Research Institute, Adelaide, Australia
| | - Parisa Sangali
- Department of Clinical Biochemistry and Medical Genetics, Faculty of Medicine, Mazandaran University of Medical Sciences, Sari, Iran
| | - Abdolkarim Mahrooz
- Immunogenetics Research Center, Faculty of Medicine, Mazandaran University of Medical Sciences, Sari, Iran.
| |
Collapse
|
2
|
Jansen JA, Manukyan A, Al Khoury N, Akalin A. Leveraging large language models for data analysis automation. PLoS One 2025; 20:e0317084. [PMID: 39982913 PMCID: PMC11844886 DOI: 10.1371/journal.pone.0317084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Accepted: 12/22/2024] [Indexed: 02/23/2025] Open
Abstract
Data analysis is constrained by a shortage of skilled experts, particularly in biology, where detailed data analysis and subsequent interpretation is vital for understanding complex biological processes and developing new treatments and diagnostics. One possible solution to this shortage in experts would be making use of Large Language Models (LLMs) for generating data analysis pipelines. However, although LLMs have shown great potential when used for code generation tasks, questions regarding the accuracy of LLMs when prompted with domain expert questions such as omics related data analysis questions, remain unanswered. To address this, we developed mergen, an R package that leverages LLMs for data analysis code generation and execution. We evaluated the performance of this data analysis system using various data analysis tasks for genomics. Our primary goal is to enable researchers to conduct data analysis by simply describing their objectives and the desired analyses for specific datasets through clear text. Our approach improves code generation via specialized prompt engineering and error feedback mechanisms. In addition, our system can execute the data analysis workflows prescribed by the LLM providing the results of the data analysis workflow for human review. Our evaluation of this system reveals that while LLMs effectively generate code for some data analysis tasks, challenges remain in executable code generation, especially for complex data analysis tasks. The best performance was seen with the self-correction mechanism, in which self-correct was able to increase the percentage of executable code when compared to the simple strategy by 22.5% for tasks of complexity 2. For tasks for complexity 3, 4 and 5, this increase was 52.5%, 27.5% and 15% respectively. Using a chi-squared test, it was shown that significant differences could be found using the different prompting strategies. Our study contributes to a better understanding of LLM capabilities and limitations, providing software infrastructure and practical insights for their effective integration into data analysis workflows.
Collapse
Affiliation(s)
- Jacqueline A. Jansen
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
- Bioinformatics and Omics Data Science Platform, Berlin, Germany
- University of Potsdam, Potsdam, Brandenburg, Germany
| | - Artür Manukyan
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
- Bioinformatics and Omics Data Science Platform, Berlin, Germany
| | - Nour Al Khoury
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
- Bioinformatics and Omics Data Science Platform, Berlin, Germany
- Free University of Berlin, Berlin, Germany
| | - Altuna Akalin
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
- Bioinformatics and Omics Data Science Platform, Berlin, Germany
| |
Collapse
|
3
|
Silkotch C, Garcia-Milian R, Hersey D. Partnering with health sciences libraries to address challenges in bioimaging data management and sharing. Histochem Cell Biol 2023; 160:193-198. [PMID: 37247072 DOI: 10.1007/s00418-023-02198-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/13/2023] [Indexed: 05/30/2023]
Abstract
Federal mandates, publishing requirements, and an interest in open science have all generated renewed attention on research data management and, in particular, data sharing practices. Due to the size and types of data they produce, bioimaging researchers confront specific challenges in aligning their data with FAIR principles, ensuring that it is findable, accessible, interoperable, and reusable. Although not always recognized by researchers, libraries can, and have been, offering support for data throughout its lifecycle by assisting with data management planning, acquisition, processing and analysis, and sharing and reuse of data. Libraries can educate researchers on best practices for research data management and sharing, facilitate connections to experts by coordinating sessions using peer educators and appropriate vendors, help assess the needs of different researcher groups to identify challenges or gaps, recommend appropriate repositories to make data as accessible as possible, and comply with funder and publisher requirements. As a centralized service within an institution, health sciences libraries have the capability to bridge silos and connect bioimaging researchers with specialized data support across campus and beyond.
Collapse
Affiliation(s)
- Christie Silkotch
- David W. Howe Memorial Library, University of Vermont, Burlington, VT, 05405, USA
| | - Rolando Garcia-Milian
- Bioinformatics Support Hub, Harvey Cushing/John Whitney Medical Library, Yale University, New Haven, CT, 06510, USA
| | - Denise Hersey
- Dana Health Sciences Library, University of Vermont, Burlington, VT, 05405, USA.
| |
Collapse
|
4
|
Li AM, Chen ZL, Qin CX, Li ZT, Liao F, Wang MQ, Lakshmanan P, Li YR, Wang M, Pan YQ, Huang DL. Proteomics data analysis using multiple statistical approaches identified proteins and metabolic networks associated with sucrose accumulation in sugarcane. BMC Genomics 2022; 23:532. [PMID: 35869434 PMCID: PMC9308345 DOI: 10.1186/s12864-022-08768-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Accepted: 07/13/2022] [Indexed: 11/17/2022] Open
Abstract
Background Sugarcane is the most important sugar crop, contributing > 80% of global sugar production. High sucrose content is a key target of sugarcane breeding, yet sucrose improvement in sugarcane remains extremely slow for decades. Molecular breeding has the potential to break through the genetic bottleneck of sucrose improvement. Dissecting the molecular mechanism(s) and identifying the key genetic elements controlling sucrose accumulation will accelerate sucrose improvement by molecular breeding. In our previous work, a proteomics dataset based on 12 independent samples from high- and low-sugar genotypes treated with ethephon or water was established. However, in that study, employing conventional analysis, only 25 proteins involved in sugar metabolism were identified . Results In this work, the proteomics dataset used in our previous study was reanalyzed by three different statistical approaches, which include a logistic marginal regression, a penalized multiple logistic regression named Elastic net, as well as a Bayesian multiple logistic regression method named Stochastic search variable selection (SSVS) to identify more sugar metabolism-associated proteins. A total of 507 differentially abundant proteins (DAPs) were identified from this dataset, with 5 of them were validated by western blot. Among the DAPs, 49 proteins were found to participate in sugar metabolism-related processes including photosynthesis, carbon fixation as well as carbon, amino sugar, nucleotide sugar, starch and sucrose metabolism. Based on our studies, a putative network of key proteins regulating sucrose accumulation in sugarcane is proposed, with glucose-6-phosphate isomerase, 2-phospho-D-glycerate hydrolyase, malate dehydrogenase and phospho-glycerate kinase, as hub proteins. Conclusions The sugar metabolism-related proteins identified in this work are potential candidates for sucrose improvement by molecular breeding. Further, this work provides an alternative solution for omics data processing. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08768-2.
Collapse
|
5
|
Rahman NT, Meyer C, Thakral D, Cai WL, Chen AT, Obaid R, Garcia-Milian R. Peer Teaching as Bioinformatics Training Strategy: Incentives, Challenges, and Benefits. Med Ref Serv Q 2022; 41:13-25. [PMID: 35225737 DOI: 10.1080/02763869.2022.2020568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Bioinformatics is essential for basic and clinical research. Peer-to-peer (P2P) teaching was used to respond to the bioinformatics training needs at a research-intensive institution. In addition to the data collected from the workshops, personal experiences of the teachers were used to understand incentives, challenges, and benefits of P2P teaching. Developing communication skills such as confidence in teaching, explaining complex concepts, and better understanding of topics benefited P2P teachers. Lack of time and classroom management were identified as major challenges. Hence, P2P teaching can be beneficial not only for bioinformatics trainees but also as a professional development opportunity for peer teachers.
Collapse
Affiliation(s)
- Nur-Taz Rahman
- Bioinformatics Support Program, Research and Education Services, Harvey Cushing/John Hay Whitney Medical Library, Yale University, New Haven, Connecticut, USA
| | - Caitlin Meyer
- Bioinformatics Support Program, Research and Education Services, Harvey Cushing/John Hay Whitney Medical Library, Yale University, New Haven, Connecticut, USA
| | - Durga Thakral
- Department of Dermatology, Yale School of Medicine, New Haven, Connecticut, USA
| | - Wesley L Cai
- Department of Pathology, Yale School of Medicine, New Haven, Connecticut, USA
| | - Ann T Chen
- Department of Biomedical Engineering, Yale University, New Haven, Connecticut, USA
| | - Razib Obaid
- RARAF Radiological Research Accelerator Facility, Nevis Laboratory, Columbia University, Irvington, New York, USA
| | - Rolando Garcia-Milian
- Bioinformatics Support Program, Research and Education Services, Harvey Cushing/John Hay Whitney Medical Library, Yale University, New Haven, Connecticut, USA
| |
Collapse
|
6
|
John Cremin C, Dash S, Huang X. Big Data: Historic Advances and Emerging Trends in Biomedical Research. CURRENT RESEARCH IN BIOTECHNOLOGY 2022. [DOI: 10.1016/j.crbiot.2022.02.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open
|
7
|
Madhu A, Cherian I, Gautam AK. Interdisciplinary approach to biomedical research: a panacea to efficient research output during the global pandemic. CORONAVIRUS DRUG DISCOVERY 2022. [PMCID: PMC9217733 DOI: 10.1016/b978-0-323-85156-5.00018-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Biomedical research is rapidly growing due to inventions and developments in science and technology. Several interdisciplinary fields should be combined to find the remedy of diseases including pandemics. To accomplish this, interdisciplinary research is a prerequisite. Using improved techniques in microscopy and genetic engineering, the systemic perspective of the human body and related diseases can be found. Recent genetic-based inheritance studies of diseases, understanding various omics, stem cell systems, and gene editing tools including CRISPR relevant to biomedical research require multidisciplinary approach. Improvements in the field of bioinformatics and efficient use of model organisms in clinical testing including drug assessment are important disciplines common in different researches. The merging of different closely related areas of medical research will produce suitable changes in diagnosis and treatment. In the present scenario of increased global pandemic hits like COVID-19, an understanding on the interdisciplinary approach is needed for controlling the spread and finding vaccines.
Collapse
|
8
|
Mitchell K, Ronas J, Dao C, Freise AC, Mangul S, Shapiro C, Moberg Parker J. PUMAA: A Platform for Accessible Microbiome Analysis in the Undergraduate Classroom. Front Microbiol 2020; 11:584699. [PMID: 33123113 PMCID: PMC7573227 DOI: 10.3389/fmicb.2020.584699] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Accepted: 09/14/2020] [Indexed: 12/22/2022] Open
Abstract
Improvements in high-throughput sequencing makes targeted amplicon analysis an ideal method for the study of human and environmental microbiomes by undergraduates. Multiple bioinformatics programs are available to process and interpret raw microbial diversity datasets, and the choice of programs to use in curricula is largely determined by student learning goals. Many of the most commonly used microbiome bioinformatics platforms offer end-to-end data processing and data analysis using a command line interface (CLI), but the downside for novice microbiome researchers is the steep learning curve often required. Alternatively, some sequencing providers include processing of raw data and taxonomy assignments as part of their pipelines. This, when coupled with available web-based or graphical user interface (GUI) analysis and visualization tools, eliminates the need for students or instructors to have extensive CLI experience. However, lack of universal data formats can make integration of these tools challenging. For example, tools for upstream and downstream analyses frequently use multiple different data formats which then require writing custom scripts or hours of manual work to make the files compatible. Here, we describe a microbial ecology bioinformatics curriculum that focuses on data analysis, visualization, and statistical reasoning by taking advantage of existing web-based and GUI tools. We created the Program for Unifying Microbiome Analysis Applications (PUMAA), which solves the problem of inconsistent files by formatting the output files from several raw data processing programs to seamlessly transition to a suite of GUI programs for analysis and visualization of microbiome taxonomic and inferred functional profiles. Additionally, we created a series of tutorials to accompany each of the microbiome analysis curricular modules. From pre- and post-course surveys, students in this curriculum self-reported conceptual and confidence gains in bioinformatics and data analysis skills. Students also demonstrated gains in biologically relevant statistical reasoning based on rubric-guided evaluations of open-ended survey questions and the Statistical Reasoning in Biology Concept Inventory. The PUMAA program and associated analysis tutorials enable students and researchers with no computational experience to effectively analyze real microbiome datasets to investigate real-world research questions.
Collapse
Affiliation(s)
- Keith Mitchell
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Jiem Ronas
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Christopher Dao
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Amanda C Freise
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Serghei Mangul
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, CA, United States
| | - Casey Shapiro
- Center for Educational Assessment, Center for the Advancement of Teaching, University of California, Los Angeles, Los Angeles, CA, United States
| | - Jordan Moberg Parker
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| |
Collapse
|
9
|
Chattopadhyay A, Iwema CL, Epstein BA, Lee AV, Levine AS. Molecular Biology Information Service: an innovative medical library-based bioinformatics support service for biomedical researchers. Brief Bioinform 2020; 21:876-884. [PMID: 30949666 DOI: 10.1093/bib/bbz035] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2019] [Revised: 01/28/2019] [Accepted: 03/03/2019] [Indexed: 11/13/2022] Open
Abstract
Biomedical researchers are increasingly reliant on obtaining bioinformatics training in order to conduct their research. Here we present a model that academic institutions may follow to provide such training for their researchers, based on the Molecular Biology Information Service (MBIS) of the Health Sciences Library System, University of Pittsburgh (Pitt). The MBIS runs a four-facet service with the following goals: (1) identify, procure and implement commercially licensed bioinformatics software, (2) teach hands-on workshops using bioinformatics tools to solve research questions, (3) provide in-person and email consultations on software/databases and (4) maintain a web portal providing overall guidance on the access and use of bioinformatics resources and MBIS-created webtools. This paper describes these facets of MBIS activities from 2006 to 2018, including outcomes from a survey measuring attitudes of Pitt researchers about MBIS service and performance.
Collapse
Affiliation(s)
| | - Carrie L Iwema
- University of Pittsburgh, Health Sciences Library System
| | | | - Adrian V Lee
- University of Pittsburgh, Health Sciences Library System
| | | |
Collapse
|
10
|
A Combined Visualization Method for Multivariate Data Analysis. Application to Knee Kinematic and Clinical Parameters Relationships. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10051762] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This paper aims to analyze the correlation structure between the kinematic and clinical parameters of an end-staged knee osteoarthritis population. The kinematic data are a set of characteristics derived from 3D knee kinematic patterns. The clinical parameters include the answers of a clinical questionnaire and the patient’s demographic characteristics. The proposed method performs, first, a regularized canonical correlation analysis (RCCA) to evaluate the multivariate relationship between the clinical and kinematic datasets, and second, a combined visualization method to better understand the relationships between these multivariate data. Results show the efficiency of using different and complementary visual representation tools to highlight hidden relationships and find insights in data.
Collapse
|
11
|
NetR and AttR, Two New Bioinformatic Tools to Integrate Diverse Datasets into Cytoscape Network and Attribute Files. Genes (Basel) 2019; 10:genes10060423. [PMID: 31159440 PMCID: PMC6628208 DOI: 10.3390/genes10060423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Revised: 05/25/2019] [Accepted: 05/27/2019] [Indexed: 11/17/2022] Open
Abstract
High-throughput technologies have allowed researchers to obtain genome-wide data from a wide array of experimental model systems. Unfortunately, however, new data generation tends to significantly outpace data re-utilization, and most high throughput datasets are only rarely used in subsequent studies or to generate new hypotheses to be tested experimentally. The reasons behind such data underutilization include a widespread lack of programming expertise among experimentalist biologists to carry out the necessary file reformatting that is often necessary to integrate published data from disparate sources. We have developed two programs (NetR and AttR), which allow experimental biologists with little to no programming background to integrate publicly available datasets into files that can be later visualized with Cytoscape to display hypothetical networks that result from combining individual datasets, as well as a series of published attributes related to the genes or proteins in the network. NetR also allows users to import protein and genetic interaction data from InterMine, which can further enrich a network model based on curated information. We expect that NetR/AttR will allow experimental biologists to mine a largely unexploited wealth of data in their fields and facilitate their integration into hypothetical models to be tested experimentally.
Collapse
|
12
|
Oliver JC, Kollen C, Hickson B, Rios F. Data Science Support at the Academic Library. JOURNAL OF LIBRARY ADMINISTRATION 2019. [DOI: 10.1080/01930826.2019.1583015] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Jeffrey C. Oliver
- Data Science Specialist, Office of Digital Innovation and Stewardship, University Libraries, University of Arizona, Tucson, AZ, USA
| | - Christine Kollen
- Data Curation Librarian, Office of Digital Innovation and Stewardship, University Libraries, University of Arizona, Tucson, AZ, USA
| | - Benjamin Hickson
- Geospatial Specialist, Office of Digital Innovation and Stewardship, University Libraries, University of Arizona, Tucson, AZ, USA
| | - Fernando Rios
- Research Data Management Specialist, Office of Digital Innovation and Stewardship, University Libraries, University of Arizona, Tucson, AZ, USA
| |
Collapse
|