1
|
Grady SK, Peterson KA, Murray SA, Baker EJ, Langston MA, Chesler EJ. A graph theoretical approach to experimental prioritization in genome-scale investigations. Mamm Genome 2024; 35:724-733. [PMID: 39191873 PMCID: PMC11522061 DOI: 10.1007/s00335-024-10066-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 08/14/2024] [Indexed: 08/29/2024]
Abstract
The goal of systems biology is to gain a network level understanding of how gene interactions influence biological states, and ultimately inform upon human disease. Given the scale and scope of systems biology studies, resource constraints often limit researchers when validating genome-wide phenomena and potentially lead to an incomplete understanding of the underlying mechanisms. Further, prioritization strategies are often biased towards known entities (e.g. previously studied genes/proteins with commercially available reagents), and other technical issues that limit experimental breadth. Here, heterogeneous biological information is modeled as an association graph to which a high-performance minimum dominating set solver is applied to maximize coverage across the graph, and thus increase the breadth of experimentation. First, we tested our model on retrieval of existing gene functional annotations and demonstrated that minimum dominating set returns more diverse terms when compared to other computational methods. Next, we utilized our heterogenous network and minimum dominating set solver to assist in the process of identifying understudied genes to be interrogated by the International Mouse Phenotyping Consortium. Using an unbiased algorithmic strategy, poorly studied genes are prioritized from the remaining thousands of genes yet to be characterized. This method is tunable and extensible with the potential to incorporate additional user-defined prioritizing information. The minimum dominating set approach can be applied to any biological network in order to identify a tractable subset of features to test experimentally or to assist in prioritizing candidate genes associated with human disease.
Collapse
Affiliation(s)
- Stephen K Grady
- Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN, USA.
| | | | | | - Erich J Baker
- Department of Computer Science, Baylor University, Waco, TX, USA
| | - Michael A Langston
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, USA
| | | |
Collapse
|
2
|
Madsen EB, Andersen JP. Funding priorities and health outcomes in Danish medical research. Soc Sci Med 2024; 360:117347. [PMID: 39299153 DOI: 10.1016/j.socscimed.2024.117347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 09/09/2024] [Accepted: 09/13/2024] [Indexed: 09/22/2024]
Abstract
External research funding is an essential component of the infrastructure of modern, academic research. Priorities in funding decisions drive what knowledge is generated, and how scientists' careers are shaped. For health research, it can ultimately have implications for health outcomes. The aim of this paper is to illustrate how funding information can be used to track priorities in health research, linking them to disease burdens and research outputs. Furthermore, funding concentrations are analysed from both researcher and disease perspectives, to estimate the influence of personal Matthew-effects on the distribution of health research funding. Denmark is used as the case, including funding information from all major public and private research foundations in the period 2004-2016. Grant information is linked to research outputs and disability-adjusted life-years (DALY rates), for 34,160 publications linked to 2630 grants, receiving DKK 4.8 billion in funding. Data show poor correlation between funding priorities, research activity and disease burdens, with several diseases receiving disproportionate amounts of funding. A research opportunity index is calculated to identify diseases with the highest potential for future investments from a burden-centred point of view. Funding is highly concentrated, both on people and on specific diseases. High funding concentrations on researchers can be a driving factor behind the observed funding-to-burden imbalances, and may risk knowledge stagnation through monopolisation of the market place of ideas. Results indicate that funders of clinical and translational research, as well as some types of biomedical research, need to supplement traditional considerations of scientific excellence with measures of societal challenges and relevance.
Collapse
Affiliation(s)
- Emil Bargmann Madsen
- Danish Centre for Studies in Research and Research Policy, Department of Political Science, Aarhus University, Bartholins Allé 7, DK-8000, Aarhus C, Denmark.
| | - Jens Peter Andersen
- Danish Centre for Studies in Research and Research Policy, Department of Political Science, Aarhus University, Bartholins Allé 7, DK-8000, Aarhus C, Denmark.
| |
Collapse
|
3
|
Correa Marrero M, Jänes J, Baptista D, Beltrao P. Integrating Large-Scale Protein Structure Prediction into Human Genetics Research. Annu Rev Genomics Hum Genet 2024; 25:123-140. [PMID: 38621234 DOI: 10.1146/annurev-genom-120622-020615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2024]
Abstract
The last five years have seen impressive progress in deep learning models applied to protein research. Most notably, sequence-based structure predictions have seen transformative gains in the form of AlphaFold2 and related approaches. Millions of missense protein variants in the human population lack annotations, and these computational methods are a valuable means to prioritize variants for further analysis. Here, we review the recent progress in deep learning models applied to the prediction of protein structure and protein variants, with particular emphasis on their implications for human genetics and health. Improved prediction of protein structures facilitates annotations of the impact of variants on protein stability, protein-protein interaction interfaces, and small-molecule binding pockets. Moreover, it contributes to the study of host-pathogen interactions and the characterization of protein function. As genome sequencing in large cohorts becomes increasingly prevalent, we believe that better integration of state-of-the-art protein informatics technologies into human genetics research is of paramount importance.
Collapse
Affiliation(s)
- Miguel Correa Marrero
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland;
| | - Jürgen Jänes
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland;
| | | | - Pedro Beltrao
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, Zurich, Switzerland;
| |
Collapse
|
4
|
Munro V, Kelly V, Messner CB, Kustatscher G. Cellular control of protein levels: A systems biology perspective. Proteomics 2024; 24:e2200220. [PMID: 38012370 DOI: 10.1002/pmic.202200220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Revised: 11/13/2023] [Accepted: 11/15/2023] [Indexed: 11/29/2023]
Abstract
How cells regulate protein levels is a central question of biology. Over the past decades, molecular biology research has provided profound insights into the mechanisms and the molecular machinery governing each step of the gene expression process, from transcription to protein degradation. Recent advances in transcriptomics and proteomics have complemented our understanding of these fundamental cellular processes with a quantitative, systems-level perspective. Multi-omic studies revealed significant quantitative, kinetic and functional differences between the genome, transcriptome and proteome. While protein levels often correlate with mRNA levels, quantitative investigations have demonstrated a substantial impact of translation and protein degradation on protein expression control. In addition, protein-level regulation appears to play a crucial role in buffering protein abundances against undesirable mRNA expression variation. These findings have practical implications for many fields, including gene function prediction and precision medicine.
Collapse
Affiliation(s)
- Victoria Munro
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK
| | - Van Kelly
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK
| | - Christoph B Messner
- Precision Proteomics Center, Swiss Institute of Allergy and Asthma Research (SIAF), University of Zurich, Davos, Switzerland
| | - Georg Kustatscher
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
5
|
Noureddine R, Baba H, Aqillouch S, Abounouh K, Laazaazia O, Elmessaoudi-Idrissi M, Bahmani FZ, Tanouti IA, Ouladlahsen A, Sarih M, Dehbi H, Ezzikouri S. The Interleukin-6 gene variants may protect against SARS-CoV-2 infection and the severity of COVID-19: a case-control study in a Moroccan population. BMC Med Genomics 2024; 17:139. [PMID: 38783290 PMCID: PMC11112821 DOI: 10.1186/s12920-024-01911-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 05/14/2024] [Indexed: 05/25/2024] Open
Abstract
The symptoms of SARS-CoV-2 infection vary widely, ranging from asymptomatic cases to severe forms marked by acute respiratory distress syndrome, multi-organ damage, and fatalities. Studies indicate a correlation between specific genes and susceptibility to SARS-CoV-2 infection and disease severity, particularly involving variants in genes linked to inflammation and immune responses. The objective of this study is to investigate the association between rs1800795 (- 174 G > C) and rs1800797 (- 597 A > G) variants in the interleukin-6 (IL-6) promoter region and susceptibility to SARS-CoV-2 infection. Additionally, we aim to explore their correlation with COVID-19 severity in a Moroccan population. In this case-control study, we enrolled 270 unvaccinated COVID-19 patients, consisting of 132 with severe COVID-19 and 138 with asymptomatic-moderate COVID-19. Additionally, we included 339 SARS-CoV-2-negative group. Genotyping of rs1800795 and rs1800797 polymorphisms of the IL-6 gene was performed using predesigned TaqMan SNP genotyping. The median age of SARS-CoV-2-negative controls was 50 years, while severe COVID-19 cases exhibited a median age of 61 years. Additionally, individuals with asymptomatic to moderate COVID-19 had a median age of 36 years. We observed a significant age difference between severe and mild COVID-19 patients (p < 0.0001), and an association was noted between gender and the severity of COVID-19 (p = 0.011). The allele and genotype frequencies of the IL-6 - 597G > A and - 174G > C variants did not show significant associations with susceptibility to SARS-CoV-2 infection (p > 0.05). However, further analysis revealed that the linkage disequilibrium between rs1800797 and rs1800795 indicated that individuals with the GC* haplotype (OR = 0.04, 95% CI 0.01-0.30, p = 0.001) and AG* haplotype (OR = 0.11, 95% CI 0.03-0.46, p = 0.002) were significantly associated with protection against SARS-CoV-2 infection. Moreover, in the overdominant model, the IL-6 - 174 G/C genotype was found to be protective against the development of severe disease compared to those with the G/G-C/C genotypes (p = 0.03; OR = 0.41, 95% CI 0.18-0.96). However, correlations between complete blood count markers, hematological markers, D-dimer, C-reactive protein, and ferritin levels according to - 597 A > G and - 174G > C genotypes showed no significant differences (all p > 0.05). Our findings provide valuable insights into the pathogenesis of COVID-19, suggesting that genetic variations at the IL-6 gene may contribute to the susceptibility to severe SARS-CoV-2 infection within the Moroccan population.
Collapse
Affiliation(s)
- Rachid Noureddine
- Virology Unit, Viral Hepatitis Laboratory, Institut Pasteur du Maroc, 1 Place Louis Pasteur, Casablanca, 20360, Maroc
- Laboratory of Genetics and Molecular Pathology, Medical School, University Hassan II, Casablanca, Maroc
- Laboratoire Morizgo d'analyses médicales, Casablanca, Maroc
| | - Hanâ Baba
- Virology Unit, Viral Hepatitis Laboratory, Institut Pasteur du Maroc, 1 Place Louis Pasteur, Casablanca, 20360, Maroc
| | - Safaa Aqillouch
- Virology Unit, Viral Hepatitis Laboratory, Institut Pasteur du Maroc, 1 Place Louis Pasteur, Casablanca, 20360, Maroc
| | - Karima Abounouh
- Virology Unit, Viral Hepatitis Laboratory, Institut Pasteur du Maroc, 1 Place Louis Pasteur, Casablanca, 20360, Maroc
| | - Oumaima Laazaazia
- Virology Unit, Viral Hepatitis Laboratory, Institut Pasteur du Maroc, 1 Place Louis Pasteur, Casablanca, 20360, Maroc
| | - Mohcine Elmessaoudi-Idrissi
- Virology Unit, Viral Hepatitis Laboratory, Institut Pasteur du Maroc, 1 Place Louis Pasteur, Casablanca, 20360, Maroc
| | | | - Ikram Allah Tanouti
- Virology Unit, Viral Hepatitis Laboratory, Institut Pasteur du Maroc, 1 Place Louis Pasteur, Casablanca, 20360, Maroc
| | - Ahd Ouladlahsen
- Service des maladies Infectieuses, CHU Ibn Rochd, Casablanca, Maroc
| | - M'hammed Sarih
- Service de Parasitologie et des Maladies Vectorielles, Institut Pasteur du Maroc, Casablanca, Morocco
| | - Hind Dehbi
- Laboratory of Genetics and Molecular Pathology, Medical School, University Hassan II, Casablanca, Maroc
| | - Sayeh Ezzikouri
- Virology Unit, Viral Hepatitis Laboratory, Institut Pasteur du Maroc, 1 Place Louis Pasteur, Casablanca, 20360, Maroc.
| |
Collapse
|
6
|
Richardson R, Tejedor Navarro H, Amaral LAN, Stoeger T. Meta-Research: Understudied genes are lost in a leaky pipeline between genome-wide assays and reporting of results. eLife 2024; 12:RP93429. [PMID: 38546716 PMCID: PMC10977968 DOI: 10.7554/elife.93429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/01/2024] Open
Abstract
Present-day publications on human genes primarily feature genes that already appeared in many publications prior to completion of the Human Genome Project in 2003. These patterns persist despite the subsequent adoption of high-throughput technologies, which routinely identify novel genes associated with biological processes and disease. Although several hypotheses for bias in the selection of genes as research targets have been proposed, their explanatory powers have not yet been compared. Our analysis suggests that understudied genes are systematically abandoned in favor of better-studied genes between the completion of -omics experiments and the reporting of results. Understudied genes remain abandoned by studies that cite these -omics experiments. Conversely, we find that publications on understudied genes may even accrue a greater number of citations. Among 45 biological and experimental factors previously proposed to affect which genes are being studied, we find that 33 are significantly associated with the choice of hit genes presented in titles and abstracts of -omics studies. To promote the investigation of understudied genes, we condense our insights into a tool, find my understudied genes (FMUG), that allows scientists to engage with potential bias during the selection of hits. We demonstrate the utility of FMUG through the identification of genes that remain understudied in vertebrate aging. FMUG is developed in Flutter and is available for download at fmug.amaral.northwestern.edu as a MacOS/Windows app.
Collapse
Affiliation(s)
- Reese Richardson
- Interdisciplinary Biological Sciences, Northwestern UniversityEvanstonUnited States
- Department of Chemical and Biological Engineering, Northwestern UniversityEvanstonUnited States
| | - Heliodoro Tejedor Navarro
- Department of Chemical and Biological Engineering, Northwestern UniversityEvanstonUnited States
- Northwestern Institute on Complex Systems, Northwestern UniversityEvanstonUnited States
| | - Luis A Nunes Amaral
- Department of Chemical and Biological Engineering, Northwestern UniversityEvanstonUnited States
- Northwestern Institute on Complex Systems, Northwestern UniversityEvanstonUnited States
- Department of Molecular Biosciences, Northwestern UniversityEvanstonUnited States
- Department of Physics and Astronomy, Northwestern UniversityEvanstonUnited States
| | - Thomas Stoeger
- Department of Chemical and Biological Engineering, Northwestern UniversityEvanstonUnited States
- The Potocsnak Longevity Institute, Northwestern UniversityChicagoUnited States
- Simpson Querrey Lung Institute for Translational Science, Northwestern UniversityChicagoUnited States
| |
Collapse
|
7
|
Richardson RAK, Tejedor Navarro H, Amaral LAN, Stoeger T. Meta-Research: understudied genes are lost in a leaky pipeline between genome-wide assays and reporting of results. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.28.530483. [PMID: 36909550 PMCID: PMC10002660 DOI: 10.1101/2023.02.28.530483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Abstract
Present-day publications on human genes primarily feature genes that already appeared in many publications prior to completion of the Human Genome Project in 2003. These patterns persist despite the subsequent adoption of high-throughput technologies, which routinely identify novel genes associated with biological processes and disease. Although several hypotheses for bias in the selection of genes as research targets have been proposed, their explanatory powers have not yet been compared. Our analysis suggests that understudied genes are systematically abandoned in favor of better-studied genes between the completion of -omics experiments and the reporting of results. Understudied genes remain abandoned by studies that cite these -omics experiments. Conversely, we find that publications on understudied genes may even accrue a greater number of citations. Among 45 biological and experimental factors previously proposed to affect which genes are being studied, we find that 33 are significantly associated with the choice of hit genes presented in titles and abstracts of - omics studies. To promote the investigation of understudied genes we condense our insights into a tool, find my understudied genes (FMUG), that allows scientists to engage with potential bias during the selection of hits. We demonstrate the utility of FMUG through the identification of genes that remain understudied in vertebrate aging. FMUG is developed in Flutter and is available for download at fmug.amaral.northwestern.edu as a MacOS/Windows app.
Collapse
Affiliation(s)
- Reese AK Richardson
- Interdisciplinary Biological Sciences, Northwestern University
- Department of Chemical and Biological Engineering, Northwestern University
| | - Heliodoro Tejedor Navarro
- Department of Chemical and Biological Engineering, Northwestern University
- Northwestern Institute on Complex Systems, Northwestern University
| | - Luis A Nunes Amaral
- Department of Chemical and Biological Engineering, Northwestern University
- Northwestern Institute on Complex Systems, Northwestern University
- Department of Physics and Astronomy, Northwestern University
- Department of Molecular Biosciences, Northwestern University
| | - Thomas Stoeger
- Department of Chemical and Biological Engineering, Northwestern University
- The Potocsnak Longevity Institute, Northwestern University
- Simpson Querrey Lung Institute for Translational Science, Northwestern University
| |
Collapse
|
8
|
Tantoso E, Eisenhaber B, Sinha S, Jensen LJ, Eisenhaber F. Did the early full genome sequencing of yeast boost gene function discovery? Biol Direct 2023; 18:46. [PMID: 37574542 PMCID: PMC10424406 DOI: 10.1186/s13062-023-00403-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Accepted: 08/01/2023] [Indexed: 08/15/2023] Open
Abstract
BACKGROUND Although the genome of Saccharomyces cerevisiae (S. cerevisiae) was the first one of a eukaryote organism that was fully sequenced (in 1996), a complete understanding of the potential of encoded biomolecular mechanisms has not yet been achieved. Here, we wish to quantify how far the goal of a full list of S. cerevisiae gene functions still is. RESULTS The scientific literature about S. cerevisiae protein-coding genes has been mapped onto the yeast genome via the mentioning of names for genomic regions in scientific publications. The match was quantified with the ratio of a given gene name's occurrences to those of any gene names in the article. We find that ~ 230 elite genes with ≥ 75 full publication equivalents (FPEs, FPE = 1 is an idealized publication referring to just a single gene) command ~ 45% of all literature. At the same time, about two thirds of the genes (each with less than 10 FPEs) are described in just 12% of the literature (in average each such gene has just ~ 1.5% of the literature of an elite gene). About 600 genes have not been mentioned in any dedicated article. Compared with other groups of genes, the literature growth rates were highest for uncharacterized or understudied genes until late nineties of the twentieth century. Yet, these growth rates deteriorated and became negative thereafter. Thus, yeast function discovery for previously uncharacterized genes has returned to the level of ~ 1980. At the same time, literature for anyhow well-studied genes (with a threshold T10 (≥ 10 FPEs) and higher) remains steadily growing. CONCLUSIONS Did the early full genome sequencing of yeast boost gene function discovery? The data proves that the moment of publishing the full genome in reality coincides with the onset of decline of gene function discovery for previously uncharacterized genes. If the current status of literature about yeast molecular mechanisms can be extrapolated into the future, it will take about another ~ 50 years to complete the yeast gene function list. We found that a small group of scientific journals contributed extraordinarily to publishing early reports relevant to yeast gene function discoveries.
Collapse
Affiliation(s)
- Erwin Tantoso
- Agency for Science, Technology and Research (A*STAR), Bioinformatics Institute (BII), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore.
- Agency for Science, Technology and Research (A*STAR), Genome Institute of Singapore (GIS), 60 Biopolis Street, Singapore, 138672, Republic of Singapore.
| | - Birgit Eisenhaber
- Agency for Science, Technology and Research (A*STAR), Bioinformatics Institute (BII), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore.
- Agency for Science, Technology and Research (A*STAR), Genome Institute of Singapore (GIS), 60 Biopolis Street, Singapore, 138672, Republic of Singapore.
- LASA - Lausitz Advanced Scientific Applications gGmbH, Straße Der Einheit 2-24, 02943, Weißwasser, Federal Republic of Germany.
| | - Swati Sinha
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Lars Juhl Jensen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Frank Eisenhaber
- Agency for Science, Technology and Research (A*STAR), Bioinformatics Institute (BII), 30 Biopolis Street #07-01, Matrix Building, Singapore, 138671, Republic of Singapore.
- Agency for Science, Technology and Research (A*STAR), Genome Institute of Singapore (GIS), 60 Biopolis Street, Singapore, 138672, Republic of Singapore.
- LASA - Lausitz Advanced Scientific Applications gGmbH, Straße Der Einheit 2-24, 02943, Weißwasser, Federal Republic of Germany.
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Republic of Singapore.
| |
Collapse
|
9
|
Elsamad G, Mecawi AS, Pauža AG, Gillard B, Paterson A, Duque VJ, Šarenac O, Žigon NJ, Greenwood M, Greenwood MP, Murphy D. Ageing restructures the transcriptome of the hypothalamic supraoptic nucleus and alters the response to dehydration. NPJ AGING 2023; 9:12. [PMID: 37264028 PMCID: PMC10234251 DOI: 10.1038/s41514-023-00108-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 05/04/2023] [Indexed: 06/03/2023]
Abstract
Ageing is associated with altered neuroendocrine function. In the context of the hypothalamic supraoptic nucleus, which makes the antidiuretic hormone vasopressin, ageing alters acute responses to hyperosmotic cues, rendering the elderly more susceptible to dehydration. Chronically, vasopressin has been associated with numerous diseases of old age, including type 2 diabetes and metabolic syndrome. Bulk RNAseq transcriptome analysis has been used to catalogue the polyadenylated supraoptic nucleus transcriptomes of adult (3 months) and aged (18 months) rats in basal euhydrated and stimulated dehydrated conditions. Gene ontology and Weighted Correlation Network Analysis revealed that ageing is associated with alterations in the expression of extracellular matrix genes. Interestingly, whilst the transcriptomic response to dehydration is overall blunted in aged animals compared to adults, there is a specific enrichment of differentially expressed genes related to neurodegenerative processes in the aged cohort, suggesting that dehydration itself may provoke degenerative consequences in aged rats.
Collapse
Affiliation(s)
- Ghadir Elsamad
- Molecular Neuroendocrinology Research Group, Bristol Medical School: Translational Health Sciences, Dorothy Hodgkin Building, University of Bristol, Bristol, England
| | - André Souza Mecawi
- Laboratory of Molecular Neuroendocrinology, Department of Biophysics, Paulista School of Medicine, Federal University of São Paulo, São Paulo, Brazil
| | - Audrys G Pauža
- Molecular Neuroendocrinology Research Group, Bristol Medical School: Translational Health Sciences, Dorothy Hodgkin Building, University of Bristol, Bristol, England
- Translational Cardio-Respiratory Research Group, Department of Physiology, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| | - Benjamin Gillard
- Molecular Neuroendocrinology Research Group, Bristol Medical School: Translational Health Sciences, Dorothy Hodgkin Building, University of Bristol, Bristol, England
| | - Alex Paterson
- Molecular Neuroendocrinology Research Group, Bristol Medical School: Translational Health Sciences, Dorothy Hodgkin Building, University of Bristol, Bristol, England
- Insilico Consulting Ltd., Wapping Wharf, Bristol, England
| | - Victor J Duque
- Laboratory of Molecular Neuroendocrinology, Department of Biophysics, Paulista School of Medicine, Federal University of São Paulo, São Paulo, Brazil
| | - Olivera Šarenac
- Institute of Pharmacology, Clinical Pharmacology and Toxicology, Faculty of Medicine, University of Belgrade, Belgrade, Serbia
- Department of Safety Pharmacology, Abbvie, North Chicago, Illinois, USA
| | - Nina Japundžić Žigon
- Institute of Pharmacology, Clinical Pharmacology and Toxicology, Faculty of Medicine, University of Belgrade, Belgrade, Serbia
| | - Mingkwan Greenwood
- Molecular Neuroendocrinology Research Group, Bristol Medical School: Translational Health Sciences, Dorothy Hodgkin Building, University of Bristol, Bristol, England
| | - Michael P Greenwood
- Molecular Neuroendocrinology Research Group, Bristol Medical School: Translational Health Sciences, Dorothy Hodgkin Building, University of Bristol, Bristol, England
| | - David Murphy
- Molecular Neuroendocrinology Research Group, Bristol Medical School: Translational Health Sciences, Dorothy Hodgkin Building, University of Bristol, Bristol, England.
| |
Collapse
|
10
|
Messner CB, Demichev V, Wang Z, Hartl J, Kustatscher G, Mülleder M, Ralser M. Mass spectrometry-based high-throughput proteomics and its role in biomedical studies and systems biology. Proteomics 2023; 23:e2200013. [PMID: 36349817 DOI: 10.1002/pmic.202200013] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 10/13/2022] [Accepted: 10/13/2022] [Indexed: 11/11/2022]
Abstract
There are multiple reasons why the next generation of biological and medical studies require increasing numbers of samples. Biological systems are dynamic, and the effect of a perturbation depends on the genetic background and environment. As a consequence, many conditions need to be considered to reach generalizable conclusions. Moreover, human population and clinical studies only reach sufficient statistical power if conducted at scale and with precise measurement methods. Finally, many proteins remain without sufficient functional annotations, because they have not been systematically studied under a broad range of conditions. In this review, we discuss the latest technical developments in mass spectrometry (MS)-based proteomics that facilitate large-scale studies by fast and efficient chromatography, fast scanning mass spectrometers, data-independent acquisition (DIA), and new software. We further highlight recent studies which demonstrate how high-throughput (HT) proteomics can be applied to capture biological diversity, to annotate gene functions or to generate predictive and prognostic models for human diseases.
Collapse
Affiliation(s)
- Christoph B Messner
- Precision Proteomics Center, Swiss Institute of Allergy and Asthma Research (SIAF), University of Zurich, Davos, Switzerland
| | - Vadim Demichev
- Institute of Biochemistry, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Ziyue Wang
- Institute of Biochemistry, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Johannes Hartl
- Institute of Biochemistry, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Georg Kustatscher
- Wellcome Centre for Cell Biology, University of Edinburgh, Max Born Crescent, Edinburgh, Scotland, UK
| | - Michael Mülleder
- Core Facility High Throughput Mass Spectrometry, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Markus Ralser
- Institute of Biochemistry, Charité - Universitätsmedizin Berlin, Berlin, Germany
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| |
Collapse
|
11
|
Byrne JA, Park Y, Richardson RAK, Pathmendra P, Sun M, Stoeger T. Protection of the human gene research literature from contract cheating organizations known as research paper mills. Nucleic Acids Res 2022; 50:12058-12070. [PMID: 36477580 PMCID: PMC9757046 DOI: 10.1093/nar/gkac1139] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 11/08/2022] [Accepted: 11/14/2022] [Indexed: 12/12/2022] Open
Abstract
Human gene research generates new biology insights with translational potential, yet few studies have considered the health of the human gene literature. The accessibility of human genes for targeted research, combined with unreasonable publication pressures and recent developments in scholarly publishing, may have created a market for low-quality or fraudulent human gene research articles, including articles produced by contract cheating organizations known as paper mills. This review summarises the evidence that paper mills contribute to the human gene research literature at scale and outlines why targeted gene research may be particularly vulnerable to systematic research fraud. To raise awareness of targeted gene research from paper mills, we highlight features of problematic manuscripts and publications that can be detected by gene researchers and/or journal staff. As improved awareness and detection could drive the further evolution of paper mill-supported publications, we also propose changes to academic publishing to more effectively deter and correct problematic publications at scale. In summary, the threat of paper mill-supported gene research highlights the need for all researchers to approach the literature with a more critical mindset, and demand publications that are underpinned by plausible research justifications, rigorous experiments and fully transparent reporting.
Collapse
Affiliation(s)
- Jennifer A Byrne
- School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, NSW, Australia
- NSW Health Statewide Biobank, NSW Health Pathology, Camperdown, NSW, Australia
| | - Yasunori Park
- School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, NSW, Australia
| | - Reese A K Richardson
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, USA
| | - Pranujan Pathmendra
- School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, NSW, Australia
| | - Mengyi Sun
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, USA
| | - Thomas Stoeger
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, USA
- Successful Clinical Response in Pneumonia Therapy (SCRIPT) Systems Biology Center, Northwestern University, Evanston, USA
- Center for Genetic Medicine, Northwestern University School of Medicine, Chicago, USA
| |
Collapse
|
12
|
David S, Dorado G, Duarte EL, David-Bosne S, Trigueiro-Louro J, Rebelo-de-Andrade H. COVID-19: impact on Public Health and hypothesis-driven investigations on genetic susceptibility and severity. Immunogenetics 2022; 74:381-407. [PMID: 35348847 PMCID: PMC8961091 DOI: 10.1007/s00251-022-01261-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 03/14/2022] [Indexed: 12/12/2022]
Abstract
COVID-19 is a new complex multisystem disease caused by the novel coronavirus SARS-CoV-2. In slightly over 2 years, it infected nearly 500 million and killed 6 million human beings worldwide, causing an unprecedented coronavirus pandemic. Currently, the international scientific community is engaged in elucidating the molecular mechanisms of the pathophysiology of SARS-CoV-2 infection as a basis of scientific developments for the future control of COVID-19. Global exome and genome analysis efforts work to define the human genetics of protective immunity to SARS-CoV-2 infection. Here, we review the current knowledge regarding the SARS-CoV-2 infection, the implications of COVID-19 to Public Health and discuss genotype to phenotype association approaches that could be exploited through the selection of candidate genes to identify the genetic determinants of severe COVID-19.
Collapse
Affiliation(s)
- Susana David
- Departamento de Genética Humana, Instituto Nacional de Saúde Doutor Ricardo Jorge (INSA,IP), Lisboa, Portugal.
- Instituto de Investigação do Medicamento (iMed.ULisboa), Faculdade de Farmácia, Universidade de Lisboa, Lisboa, Portugal.
| | - Guillermo Dorado
- Atlántida Centro de Investigación y Desarrollo de Estudios Profesionales (CIDEP), Granada, Spain
| | - Elsa L Duarte
- MED-Instituto Mediterrâneo para a Agricultura, Ambiente e Desenvolvimento, Escola de Ciências e Tecnologia, Universidade de Évora, Évora, Portugal
| | | | - João Trigueiro-Louro
- Departamento de Doenças Infeciosas, INSA, IP, Lisboa, Portugal
- Host-Pathogen Interaction Unit, Instituto de Investigação do Medicamento (iMed.ULisboa), Faculdade de Farmácia, Universidade de Lisboa, Lisboa, Portugal
- Hospital Egas Moniz, Centro Hospitalar Lisboa Ocidental, Lisboa, Portugal
| | - Helena Rebelo-de-Andrade
- Departamento de Doenças Infeciosas, INSA, IP, Lisboa, Portugal
- Host-Pathogen Interaction Unit, Instituto de Investigação do Medicamento (iMed.ULisboa), Faculdade de Farmácia, Universidade de Lisboa, Lisboa, Portugal
| |
Collapse
|
13
|
Kustatscher G, Collins T, Gingras AC, Guo T, Hermjakob H, Ideker T, Lilley KS, Lundberg E, Marcotte EM, Ralser M, Rappsilber J. An open invitation to the Understudied Proteins Initiative. Nat Biotechnol 2022; 40:815-817. [PMID: 35534555 DOI: 10.1038/s41587-022-01316-z] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Georg Kustatscher
- Institute of Quantitative Biology, Biochemistry and Biotechnology, University of Edinburgh, Edinburgh, UK.
| | | | - Anne-Claude Gingras
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Sinai Health System, Toronto, Ontario, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Tiannan Guo
- Zhejiang Provincial Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou, China
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | - Trey Ideker
- Division of Genetics, Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Kathryn S Lilley
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Emma Lundberg
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH-Royal Institute of Technology, Stockholm, Sweden.,Department of Bioengineering, Stanford University, Stanford, CA, USA.,Department of Pathology, Stanford University, Stanford, CA, USA.,Chan Zuckerberg Biohub, San Francisco, CA, USA
| | - Edward M Marcotte
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, University of Texas at Austin, Austin, TX, USA
| | - Markus Ralser
- Department of Biochemistry, Charité University Medicine, Berlin, Germany.,The Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, London, UK
| | - Juri Rappsilber
- Institute of Quantitative Biology, Biochemistry and Biotechnology, University of Edinburgh, Edinburgh, UK. .,Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany. .,Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
14
|
Stoeger T, Nunes Amaral LA. The characteristics of early-stage research into human genes are substantially different from subsequent research. PLoS Biol 2022; 20:e3001520. [PMID: 34990452 PMCID: PMC8769369 DOI: 10.1371/journal.pbio.3001520] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 01/19/2022] [Accepted: 12/21/2021] [Indexed: 11/19/2022] Open
Abstract
Throughout the last 2 decades, several scholars observed that present day research into human genes rarely turns toward genes that had not already been extensively investigated in the past. Guided by hypotheses derived from studies of science and innovation, we present here a literature-wide data-driven meta-analysis to identify the specific scientific and organizational contexts that coincided with early-stage research into human genes throughout the past half century. We demonstrate that early-stage research into human genes differs in team size, citation impact, funding mechanisms, and publication outlet, but that generalized insights derived from studies of science and innovation only partially apply to early-stage research into human genes. Further, we demonstrate that, presently, genome biology accounts for most of the initial early-stage research, while subsequent early-stage research can engage other life sciences fields. We therefore anticipate that the specificity of our findings will enable scientists and policymakers to better promote early-stage research into human genes and increase overall innovation within the life sciences.
Collapse
Affiliation(s)
- Thomas Stoeger
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois, United States of America
- Northwestern Institute on Complex Systems (NICO), Northwestern University, Evanston, Illinois, United States of America
- Center for Genetic Medicine, Northwestern University, Chicago, Illinois, United States of America
| | - Luís A. Nunes Amaral
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois, United States of America
- Northwestern Institute on Complex Systems (NICO), Northwestern University, Evanston, Illinois, United States of America
- Department of Molecular Bioscience, Northwestern University, Evanston, Illinois, United States of America
- Department of Physics and Astronomy, Northwestern University, Evanston, Illinois, United States of America
- Department of Medicine, Northwestern University School of Medicine, Chicago, Illinois, United States of America
| |
Collapse
|
15
|
Law JN, Akers K, Tasnina N, Santina CMD, Deutsch S, Kshirsagar M, Klein-Seetharaman J, Crovella M, Rajagopalan P, Kasif S, Murali TM. Interpretable network propagation with application to expanding the repertoire of human proteins that interact with SARS-CoV-2. Gigascience 2021; 10:giab082. [PMID: 34966926 PMCID: PMC8716363 DOI: 10.1093/gigascience/giab082] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 09/21/2021] [Accepted: 11/28/2021] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND Network propagation has been widely used for nearly 20 years to predict gene functions and phenotypes. Despite the popularity of this approach, little attention has been paid to the question of provenance tracing in this context, e.g., determining how much any experimental observation in the input contributes to the score of every prediction. RESULTS We design a network propagation framework with 2 novel components and apply it to predict human proteins that directly or indirectly interact with SARS-CoV-2 proteins. First, we trace the provenance of each prediction to its experimentally validated sources, which in our case are human proteins experimentally determined to interact with viral proteins. Second, we design a technique that helps to reduce the manual adjustment of parameters by users. We find that for every top-ranking prediction, the highest contribution to its score arises from a direct neighbor in a human protein-protein interaction network. We further analyze these results to develop functional insights on SARS-CoV-2 that expand on known biology such as the connection between endoplasmic reticulum stress, HSPA5, and anti-clotting agents. CONCLUSIONS We examine how our provenance-tracing method can be generalized to a broad class of network-based algorithms. We provide a useful resource for the SARS-CoV-2 community that implicates many previously undocumented proteins with putative functional relationships to viral infection. This resource includes potential drugs that can be opportunistically repositioned to target these proteins. We also discuss how our overall framework can be extended to other, newly emerging viruses.
Collapse
Affiliation(s)
- Jeffrey N Law
- Interdisciplinary Ph.D. Program in Genetics, Bioinformatics, and Computational Biology, Virginia Tech, Blacksburg, VA 24061, USA
| | - Kyle Akers
- Interdisciplinary Ph.D. Program in Genetics, Bioinformatics, and Computational Biology, Virginia Tech, Blacksburg, VA 24061, USA
| | - Nure Tasnina
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| | | | - Shay Deutsch
- Department of Mathematics, University of California, Los Angeles, CA 90095, USA
| | | | | | - Mark Crovella
- Department of Computer Science, Boston University, Boston, MA 02215, USA
| | | | - Simon Kasif
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - T M Murali
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| |
Collapse
|
16
|
Meta-Analysis of Gene Popularity: Less Than Half of Gene Citations Stem from Gene Regulatory Networks. Genes (Basel) 2021; 12:genes12020319. [PMID: 33672419 PMCID: PMC7926953 DOI: 10.3390/genes12020319] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 02/14/2021] [Accepted: 02/20/2021] [Indexed: 12/04/2022] Open
Abstract
The reasons for selecting a gene for further study might vary from historical momentum to funding availability, thus leading to unequal attention distribution among all genes. However, certain biological features tend to be overlooked in evaluating a gene’s popularity. Here we present a meta-analysis of the reasons why different genes have been studied and to what extent, with a focus on the gene-specific biological features. From unbiased datasets we can define biological properties of genes that reasonably may affect their perceived importance. We make use of both linear and nonlinear computational approaches for estimating gene popularity to then compare their relative importance. We find that roughly 25% of the studies are the result of a historical positive feedback, which we may think of as social reinforcement. Of the remaining features, gene family membership is the most indicative followed by disease relevance and finally regulatory pathway association. Disease relevance has been an important driver until the 1990s, after which the focus shifted to exploring every single gene. We also present a resource that allows one to study the impact of reinforcement, which may guide our research toward genes that have not yet received proportional attention.
Collapse
|
17
|
Grigorescu F, Lautier C. HOW GENETICISTS CONTRIBUTE TO UNDERSTANDING OF COVID-19 DISEASE PATHOGENICITY. ACTA ENDOCRINOLOGICA (BUCHAREST, ROMANIA : 2005) 2020; 16:346-352. [PMID: 33363658 PMCID: PMC7748221 DOI: 10.4183/aeb.2020.346] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Human populations are faced to the COVID-19 pandemic due to the emerging SARS-CoV-2 coronavirus originating from Wuhan (China) and with dramatic Public Health consequences. Despite periods of panic, the scientific community demonstrated an incredible innovation potential and energy ending up in one year with new vaccines to be used in population. Researchers are interrogating on how individual genetic differences contribute to the diversity of clinical manifestations or ethnic and geographic disparities of COVID-19. While efforts were spent to understand mechanistically the infectious potential of the virus, recent progresses in molecular genetics and bioinformatics allowed the characterization of viral sequence and construction of phylogeographical maps of viral dispersion worldwide. These data will help understanding epidemiological disparities among continents and ethnic populations. Much effort was also spent in analyzing host genetics by studying individual genes involved in innate and immune responses or explaining pathogenesis of comorbidities that complicate the fate of elderly patients. Several international consortia launched already Genome wide Association Studies (GWAS) and whole genome sequencing strategies to identify genetic markers with immediate application in patients at risk of respiratory failure. These new genetic data are important not only for understanding susceptibility factors for COVID-19 but they also contain an important message of hope for mankind warranting our survival and health.
Collapse
Affiliation(s)
- F. Grigorescu
- Direction of Clinical Research and Innovation (DCRI), Montpellier Cancer Institute, University of Montpellier, Montpellier, France
- Institut Convergences Migrations, Collège de France, Paris, France
| | - C. Lautier
- Nutrition & Genome, UMR204 NUTRIPASS (IRD, UM, SupAgro), University of Montpellier, Montpellier, France
| |
Collapse
|