1
|
Buer B, Dönitz J, Milner M, Mehlhorn S, Hinners C, Siemanowski‐Hrach J, Ulrich JK, Großmann D, Cedden D, Nauen R, Geibel S, Bucher G. Superior target genes and pathways for RNAi-mediated pest control revealed by genome-wide analysis in the beetle Tribolium castaneum. PEST MANAGEMENT SCIENCE 2025; 81:1026-1036. [PMID: 39498580 PMCID: PMC11716340 DOI: 10.1002/ps.8505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 10/03/2024] [Accepted: 10/12/2024] [Indexed: 01/11/2025]
Abstract
BACKGROUND An increasing human population, the emergence of resistances against pesticides and their potential impact on the environment call for the development of new eco-friendly pest control strategies. RNA interference (RNAi)-based pesticides have emerged as a new option with the first products entering the market. Essentially, double-stranded RNAs targeting essential genes of pests are either expressed in the plants or sprayed on their surface. Upon feeding, pests mount an RNAi response and die. However, it has remained unclear whether RNAi-based insecticides should target the same pathways as classic pesticides or whether the different mode-of-action would favor other processes. Moreover, there is no consensus on the best genes to be targeted. RESULTS We performed a genome-wide screen in the red flour beetle to identify 905 RNAi target genes. Based on a validation screen and clustering, we identified the 192 most effective target genes in that species. The transfer to oral application in other beetle pests revealed a list of 34 superior target genes, which are an excellent starting point for application in other pests. Gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) analyses of our genome-wide dataset revealed that genes with high efficacy belonged mainly to basic cellular processes such as gene expression and protein homeostasis - processes not targeted by classic insecticides. CONCLUSION Our work revealed the best target genes and target processes for RNAi-based pest control and we propose a procedure to transfer our short list of superior target genes to other pests. © 2024 The Author(s). Pest Management Science published by John Wiley & Sons Ltd on behalf of Society of Chemical Industry.
Collapse
Affiliation(s)
- Benjamin Buer
- Crop Science Division, Bayer AG, R&D, Pest ControlMonheimGermany
| | - Jürgen Dönitz
- Department of Evolutionary Developmental GeneticsUniversity of Göttingen, Johann‐Friedrich‐Blumenbach Institute, GZMBGöttingenGermany
- Department of Medical BioinformaticsUniversity Medical Center GöttingenGöttingenGermany
| | - Martin Milner
- Department of Evolutionary Developmental GeneticsUniversity of Göttingen, Johann‐Friedrich‐Blumenbach Institute, GZMBGöttingenGermany
| | - Sonja Mehlhorn
- Crop Science Division, Bayer AG, R&D, Pest ControlMonheimGermany
- Department of Evolutionary Developmental GeneticsUniversity of Göttingen, Johann‐Friedrich‐Blumenbach Institute, GZMBGöttingenGermany
| | - Claudia Hinners
- Department of Evolutionary Developmental GeneticsUniversity of Göttingen, Johann‐Friedrich‐Blumenbach Institute, GZMBGöttingenGermany
| | - Janna Siemanowski‐Hrach
- Department of Evolutionary Developmental GeneticsUniversity of Göttingen, Johann‐Friedrich‐Blumenbach Institute, GZMBGöttingenGermany
| | - Julia K. Ulrich
- Crop Science Division, Bayer AG, R&D, Pest ControlMonheimGermany
| | - Daniela Großmann
- Department of Evolutionary Developmental GeneticsUniversity of Göttingen, Johann‐Friedrich‐Blumenbach Institute, GZMBGöttingenGermany
- Department of Medical BioinformaticsUniversity Medical Center GöttingenGöttingenGermany
| | - Doga Cedden
- Department of Evolutionary Developmental GeneticsUniversity of Göttingen, Johann‐Friedrich‐Blumenbach Institute, GZMBGöttingenGermany
| | - Ralf Nauen
- Crop Science Division, Bayer AG, R&D, Pest ControlMonheimGermany
| | - Sven Geibel
- Crop Science Division, Bayer AG, R&D, Pest ControlMonheimGermany
| | - Gregor Bucher
- Department of Evolutionary Developmental GeneticsUniversity of Göttingen, Johann‐Friedrich‐Blumenbach Institute, GZMBGöttingenGermany
| |
Collapse
|
2
|
MacNish TR, Danilevicz MF, Bayer PE, Bestry MS, Edwards D. Application of machine learning and genomics for orphan crop improvement. Nat Commun 2025; 16:982. [PMID: 39856113 PMCID: PMC11760368 DOI: 10.1038/s41467-025-56330-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Accepted: 01/15/2025] [Indexed: 01/27/2025] Open
Abstract
Orphan crops are important sources of nutrition in developing regions and many are tolerant to biotic and abiotic stressors; however, modern crop improvement technologies have not been widely applied to orphan crops due to the lack of resources available. There are orphan crop representatives across major crop types and the conservation of genes between these related species can be used in crop improvement. Machine learning (ML) has emerged as a promising tool for crop improvement. Transferring knowledge from major crops to orphan crops and using machine learning to improve accuracy and efficiency can be used to improve orphan crops.
Collapse
Affiliation(s)
- Tessa R MacNish
- School of Biological Sciences, The University of Western Australia, Perth, Australia
- Centre for Applied Bioinformatics, The University of Western Australia, Perth, Australia
| | - Monica F Danilevicz
- School of Biological Sciences, The University of Western Australia, Perth, Australia
- Centre for Applied Bioinformatics, The University of Western Australia, Perth, Australia
- Australian Herbicide Resistance Initiative, The University of Western Australia, Perth, Australia
| | - Philipp E Bayer
- Centre for Applied Bioinformatics, The University of Western Australia, Perth, Australia
- The UWA Oceans Institute, The University of Western Australia, Perth, Australia
- Minderoo Foundation, Perth, Australia
| | - Mitchell S Bestry
- School of Biological Sciences, The University of Western Australia, Perth, Australia
- Centre for Applied Bioinformatics, The University of Western Australia, Perth, Australia
| | - David Edwards
- School of Biological Sciences, The University of Western Australia, Perth, Australia.
- Centre for Applied Bioinformatics, The University of Western Australia, Perth, Australia.
| |
Collapse
|
3
|
Granata I, Maddalena L, Manzo M, Guarracino MR, Giordano M. HELP: A computational framework for labelling and predicting human common and context-specific essential genes. PLoS Comput Biol 2024; 20:e1012076. [PMID: 39331694 PMCID: PMC11463781 DOI: 10.1371/journal.pcbi.1012076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 10/09/2024] [Accepted: 08/19/2024] [Indexed: 09/29/2024] Open
Abstract
Machine learning-based approaches are particularly suitable for identifying essential genes as they allow the generation of predictive models trained on features from multi-source data. Gene essentiality is neither binary nor static but determined by the context. The databases for essential gene annotation do not permit the personalisation of the context, and their update can be slower than the publication of new experimental data. We propose HELP (Human Gene Essentiality Labelling & Prediction), a computational framework for labelling and predicting essential genes. Its double scope allows for identifying genes based on dependency or not on experimental data. The effectiveness of the labelling method was demonstrated by comparing it with other approaches in overlapping the reference sets of essential gene annotations, where HELP demonstrated the best compromise between false and true positive rates. The gene attributes, including multi-omics and network embedding features, lead to high-performance prediction of essential genes while confirming the existence of essentiality nuances.
Collapse
Affiliation(s)
- Ilaria Granata
- Institute for High-Performance Computing and Networking, National Research Council, Naples, Italy
| | - Lucia Maddalena
- Institute for High-Performance Computing and Networking, National Research Council, Naples, Italy
| | - Mario Manzo
- Information Technology Services, University of Naples “L’Orientale”, Naples, Italy
| | - Mario Rosario Guarracino
- Laboratory of Algorithms and Technologies for Network Analysis, National Research University Higher School of Economics, Nizhny Novgorod, Russia
- Department of Economics and Law, University of Cassino and Southern Lazio, Cassino, Frosinone, Italy
| | - Maurizio Giordano
- Institute for High-Performance Computing and Networking, National Research Council, Naples, Italy
| |
Collapse
|
4
|
Adedeji EO, Beder T, Damiani C, Cappelli A, Accoti A, Tapanelli S, Ogunlana OO, Fatumo S, Favia G, Koenig R, Adebiyi E. Combination of computational techniques and RNAi reveal targets in Anopheles gambiae for malaria vector control. PLoS One 2024; 19:e0305207. [PMID: 38968330 PMCID: PMC11226046 DOI: 10.1371/journal.pone.0305207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 05/25/2024] [Indexed: 07/07/2024] Open
Abstract
Increasing reports of insecticide resistance continue to hamper the gains of vector control strategies in curbing malaria transmission. This makes identifying new insecticide targets or alternative vector control strategies necessary. CLassifier of Essentiality AcRoss EukaRyote (CLEARER), a leave-one-organism-out cross-validation machine learning classifier for essential genes, was used to predict essential genes in Anopheles gambiae and selected predicted genes experimentally validated. The CLEARER algorithm was trained on six model organisms: Caenorhabditis elegans, Drosophila melanogaster, Homo sapiens, Mus musculus, Saccharomyces cerevisiae and Schizosaccharomyces pombe, and employed to identify essential genes in An. gambiae. Of the 10,426 genes in An. gambiae, 1,946 genes (18.7%) were predicted to be Cellular Essential Genes (CEGs), 1716 (16.5%) to be Organism Essential Genes (OEGs), and 852 genes (8.2%) to be essential as both OEGs and CEGs. RNA interference (RNAi) was used to validate the top three highly expressed non-ribosomal predictions as probable vector control targets, by determining the effect of these genes on the survival of An. gambiae G3 mosquitoes. In addition, the effect of knockdown of arginase (AGAP008783) on Plasmodium berghei infection in mosquitoes was evaluated, an enzyme we computationally inferred earlier to be essential based on chokepoint analysis. Arginase and the top three genes, AGAP007406 (Elongation factor 1-alpha, Elf1), AGAP002076 (Heat shock 70kDa protein 1/8, HSP), AGAP009441 (Elongation factor 2, Elf2), had knockdown efficiencies of 91%, 75%, 63%, and 61%, respectively. While knockdown of HSP or Elf2 significantly reduced longevity of the mosquitoes (p<0.0001) compared to control groups, Elf1 or arginase knockdown had no effect on survival. However, arginase knockdown significantly reduced P. berghei oocytes counts in the midgut of mosquitoes when compared to LacZ-injected controls. The study reveals HSP and Elf2 as important contributors to mosquito survival and arginase as important for parasite development, hence placing them as possible targets for vector control.
Collapse
Affiliation(s)
- Eunice O. Adedeji
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
- Department of Biochemistry, Covenant University, Ota, Ogun State, Nigeria
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
- Department of Biology, University of York, York, United Kingdom
| | - Thomas Beder
- Medical Department II, Hematology and Oncology, University Medical Center Schleswig-Holstein, Kiel, Germany
- University Cancer Center Schleswig-Holstein, University Medical Center Schleswig-Holstein, Kiel and Lübeck, Germany
- Institute for Infectious Diseases and Infection Control (IIMK, RG Systemsbiology), Jena University Hospital, Jena, Germany
| | - Claudia Damiani
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
| | - Alessia Cappelli
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
| | - Anastasia Accoti
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
| | - Sofia Tapanelli
- Department of Life Sciences, Imperial College, London, United Kingdom
| | - Olubanke O. Ogunlana
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
- Department of Biochemistry, Covenant University, Ota, Ogun State, Nigeria
- African Center of Excellence in Bioinformatics & Data Intensive Science, Makerere University, Kampala, Uganda
| | - Segun Fatumo
- Department of Non-Communicable Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Guido Favia
- School of Biosciences & Veterinary Medicine, University of Camerino, Camerino, Italy
| | - Rainer Koenig
- Institute for Infectious Diseases and Infection Control (IIMK, RG Systemsbiology), Jena University Hospital, Jena, Germany
| | - Ezekiel Adebiyi
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
- African Center of Excellence in Bioinformatics & Data Intensive Science, Makerere University, Kampala, Uganda
- Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
5
|
Campos TL, Korhonen PK, Young ND, Wang T, Song J, Marhoefer R, Chang BCH, Selzer PM, Gasser RB. Inference of Essential Genes of the Parasite Haemonchus contortus via Machine Learning. Int J Mol Sci 2024; 25:7015. [PMID: 39000124 PMCID: PMC11240989 DOI: 10.3390/ijms25137015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Revised: 06/19/2024] [Accepted: 06/21/2024] [Indexed: 07/16/2024] Open
Abstract
Over the years, comprehensive explorations of the model organisms Caenorhabditis elegans (elegant worm) and Drosophila melanogaster (vinegar fly) have contributed substantially to our understanding of complex biological processes and pathways in multicellular organisms generally. Extensive functional genomic-phenomic, genomic, transcriptomic, and proteomic data sets have enabled the discovery and characterisation of genes that are crucial for life, called 'essential genes'. Recently, we investigated the feasibility of inferring essential genes from such data sets using advanced bioinformatics and showed that a machine learning (ML)-based workflow could be used to extract or engineer features from DNA, RNA, protein, and/or cellular data/information to underpin the reliable prediction of essential genes both within and between C. elegans and D. melanogaster. As these are two distantly related species within the Ecdysozoa, we proposed that this ML approach would be particularly well suited for species that are within the same phylum or evolutionary clade. In the present study, we cross-predicted essential genes within the phylum Nematoda (evolutionary clade V)-between C. elegans and the pathogenic parasitic nematode H. contortus-and then ranked and prioritised H. contortus proteins encoded by these genes as intervention (e.g., drug) target candidates. Using strong, validated predictors, we inferred essential genes of H. contortus that are involved predominantly in crucial biological processes/pathways including ribosome biogenesis, translation, RNA binding/processing, and signalling and which are highly transcribed in the germline, somatic gonad precursors, sex myoblasts, vulva cell precursors, various nerve cells, glia, or hypodermis. The findings indicate that this in silico workflow provides a promising avenue to identify and prioritise panels/groups of drug target candidates in parasitic nematodes for experimental validation in vitro and/or in vivo.
Collapse
Affiliation(s)
- Túlio L Campos
- Department of Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, VIC 3010, Australia
- Bioinformatics Core Facility, Aggeu Magalhães Institute (Fiocruz), Recife 50740-465, PE, Brazil
| | - Pasi K Korhonen
- Department of Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Neil D Young
- Department of Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Tao Wang
- Department of Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Jiangning Song
- Department of Data Science and AI, Faculty of IT, Monash University, Melbourne, VIC 3800, Australia
- Biomedicine Discovery Institute, Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Clayton, VIC 3800, Australia
| | - Richard Marhoefer
- Boehringer Ingelheim Animal Health, Binger Strasse 173, 55216 Ingelheim am Rhein, Germany
| | - Bill C H Chang
- Department of Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Paul M Selzer
- Boehringer Ingelheim Animal Health, Binger Strasse 173, 55216 Ingelheim am Rhein, Germany
| | - Robin B Gasser
- Department of Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, VIC 3010, Australia
| |
Collapse
|
6
|
Garmire LX, Li Y, Huang Q, Xu C, Teichmann SA, Kaminski N, Pellegrini M, Nguyen Q, Teschendorff AE. Challenges and perspectives in computational deconvolution of genomics data. Nat Methods 2024; 21:391-400. [PMID: 38374264 DOI: 10.1038/s41592-023-02166-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 12/26/2023] [Indexed: 02/21/2024]
Abstract
Deciphering cell-type heterogeneity is crucial for systematically understanding tissue homeostasis and its dysregulation in diseases. Computational deconvolution is an efficient approach for estimating cell-type abundances from a variety of omics data. Despite substantial methodological progress in computational deconvolution in recent years, challenges are still outstanding. Here we enlist four important challenges related to computational deconvolution: the quality of the reference data, generation of ground truth data, limitations of computational methodologies, and benchmarking design and implementation. Finally, we make recommendations on reference data generation, new directions of computational methodologies, and strategies to promote rigorous benchmarking.
Collapse
Affiliation(s)
- Lana X Garmire
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| | - Yijun Li
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Qianhui Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Chuan Xu
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | | | - Naftali Kaminski
- Pulmonary, Critical Care & Sleep Medicine, Yale University School of Medicine, New Haven, CT, USA
| | - Matteo Pellegrini
- Molecular, Cell and Developmental Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Quan Nguyen
- Institute for Molecular Bioscience, The University of Queensland and QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Andrew E Teschendorff
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- UCL Cancer Institute, University College London, London, UK
| |
Collapse
|
7
|
Giordano M, Falbo E, Maddalena L, Piccirillo M, Granata I. Untangling the Context-Specificity of Essential Genes by Means of Machine Learning: A Constructive Experience. Biomolecules 2023; 14:18. [PMID: 38254618 PMCID: PMC10813179 DOI: 10.3390/biom14010018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 11/29/2023] [Accepted: 12/20/2023] [Indexed: 01/24/2024] Open
Abstract
Gene essentiality is a genetic concept crucial for a comprehensive understanding of life and evolution. In the last decade, many essential genes (EGs) have been determined using different experimental and computational approaches, and this information has been used to reduce the genomes of model organisms. A growing amount of evidence highlights that essentiality is a property that depends on the context. Because of their importance in vital biological processes, recognising context-specific EGs (csEGs) could help for identifying new potential pharmacological targets and to improve precision therapeutics. Since most of the computational procedures proposed to identify and predict EGs neglect their context-specificity, we focused on this aspect, providing a theoretical and experimental overview of the literature, data and computational methods dedicated to recognising csEGs. To this end, we adapted existing computational methods to exploit a specific context (the kidney tissue) and experimented with four different prediction methods using the labels provided by four different identification approaches. The considerations derived from the analysis of the obtained results, confirmed and validated also by further experiments for a different tissue context, provide the reader with guidance on exploiting existing tools for achieving csEGs identification and prediction.
Collapse
Affiliation(s)
- Maurizio Giordano
- Institute for High-Performance Computing and Networking (ICAR), National Research Council (CNR), V. Pietro Castellino 111, 80131 Naples, Italy; (E.F.); (L.M.); (M.P.); (I.G.)
| | | | | | | | | |
Collapse
|
8
|
Ma J, Song J, Young ND, Chang BCH, Korhonen PK, Campos TL, Liu H, Gasser RB. 'Bingo'-a large language model- and graph neural network-based workflow for the prediction of essential genes from protein data. Brief Bioinform 2023; 25:bbad472. [PMID: 38152979 PMCID: PMC10753293 DOI: 10.1093/bib/bbad472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/22/2023] [Accepted: 11/28/2023] [Indexed: 12/29/2023] Open
Abstract
The identification and characterization of essential genes are central to our understanding of the core biological functions in eukaryotic organisms, and has important implications for the treatment of diseases caused by, for example, cancers and pathogens. Given the major constraints in testing the functions of genes of many organisms in the laboratory, due to the absence of in vitro cultures and/or gene perturbation assays for most metazoan species, there has been a need to develop in silico tools for the accurate prediction or inference of essential genes to underpin systems biological investigations. Major advances in machine learning approaches provide unprecedented opportunities to overcome these limitations and accelerate the discovery of essential genes on a genome-wide scale. Here, we developed and evaluated a large language model- and graph neural network (LLM-GNN)-based approach, called 'Bingo', to predict essential protein-coding genes in the metazoan model organisms Caenorhabditis elegans and Drosophila melanogaster as well as in Mus musculus and Homo sapiens (a HepG2 cell line) by integrating LLM and GNNs with adversarial training. Bingo predicts essential genes under two 'zero-shot' scenarios with transfer learning, showing promise to compensate for a lack of high-quality genomic and proteomic data for non-model organisms. In addition, the attention mechanisms and GNNExplainer were employed to manifest the functional sites and structural domain with most contribution to essentiality. In conclusion, Bingo provides the prospect of being able to accurately infer the essential genes of little- or under-studied organisms of interest, and provides a biological explanation for gene essentiality.
Collapse
Affiliation(s)
- Jiani Ma
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Jiangning Song
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Victoria 3800, Australia
| | - Neil D Young
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Bill C H Chang
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Pasi K Korhonen
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Tulio L Campos
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
- Bioinformatics Core Facility, Instituto Aggeu Magalhaes, Fundaçao Oswaldo Cruz (IAM-Fiocruz), Recife, Pernambuco, Brazil
| | - Hui Liu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Robin B Gasser
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| |
Collapse
|
9
|
Kelch MA, Vera-Guapi A, Beder T, Oswald M, Hiemisch A, Beil N, Wajda P, Ciesek S, Erfle H, Toptan T, Koenig R. Machine learning on large scale perturbation screens for SARS-CoV-2 host factors identifies β-catenin/CBP inhibitor PRI-724 as a potent antiviral. Front Microbiol 2023; 14:1193320. [PMID: 37342561 PMCID: PMC10277617 DOI: 10.3389/fmicb.2023.1193320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 05/16/2023] [Indexed: 06/23/2023] Open
Abstract
Expanding antiviral treatment options against SARS-CoV-2 remains crucial as the virus evolves under selection pressure which already led to the emergence of several drug resistant strains. Broad spectrum host-directed antivirals (HDA) are promising therapeutic options, however the robust identification of relevant host factors by CRISPR/Cas9 or RNA interference screens remains challenging due to low consistency in the resulting hits. To address this issue, we employed machine learning, based on experimental data from several knockout screens and a drug screen. We trained classifiers using genes essential for virus life cycle obtained from the knockout screens. The machines based their predictions on features describing cellular localization, protein domains, annotated gene sets from Gene Ontology, gene and protein sequences, and experimental data from proteomics, phospho-proteomics, protein interaction and transcriptomic profiles of SARS-CoV-2 infected cells. The models reached a remarkable performance suggesting patterns of intrinsic data consistency. The predicted HDF were enriched in sets of genes particularly encoding development, morphogenesis, and neural processes. Focusing on development and morphogenesis-associated gene sets, we found β-catenin to be central and selected PRI-724, a canonical β-catenin/CBP disruptor, as a potential HDA. PRI-724 limited infection with SARS-CoV-2 variants, SARS-CoV-1, MERS-CoV and IAV in different cell line models. We detected a concentration-dependent reduction in cytopathic effects, viral RNA replication, and infectious virus production in SARS-CoV-2 and SARS-CoV-1-infected cells. Independent of virus infection, PRI-724 treatment caused cell cycle deregulation which substantiates its potential as a broad spectrum antiviral. Our proposed machine learning concept supports focusing and accelerating the discovery of host dependency factors and identification of potential host-directed antivirals.
Collapse
Affiliation(s)
- Maximilian A. Kelch
- Institute for Medical Virology, University Hospital Frankfurt, Goethe University Frankfurt, Frankfurt, Germany
| | | | - Thomas Beder
- Medical Department II, Hematology and Oncology, University Hospital Schleswig-Holstein, Kiel, Germany
| | - Marcus Oswald
- Institute for Infectious Diseases and Infection Control, Jena University Hospital, Jena, Germany
| | - Alicia Hiemisch
- Institute for Infectious Diseases and Infection Control, Jena University Hospital, Jena, Germany
| | - Nina Beil
- Advanced Biological Screening Facility (ABSF), High-Content Analysis of the Cell (HiCell), BioQuant, Heidelberg University, Heidelberg, Germany
| | - Piotr Wajda
- Advanced Biological Screening Facility (ABSF), High-Content Analysis of the Cell (HiCell), BioQuant, Heidelberg University, Heidelberg, Germany
| | - Sandra Ciesek
- Institute for Medical Virology, University Hospital Frankfurt, Goethe University Frankfurt, Frankfurt, Germany
- German Centre for Infection Research (DZIF), External Partner Site Frankfurt, Frankfurt, Germany
- Fraunhofer Institute for Translational Medicine and Pharmacology ITMP, Frankfurt, Germany
| | - Holger Erfle
- Advanced Biological Screening Facility (ABSF), High-Content Analysis of the Cell (HiCell), BioQuant, Heidelberg University, Heidelberg, Germany
| | - Tuna Toptan
- Institute for Medical Virology, University Hospital Frankfurt, Goethe University Frankfurt, Frankfurt, Germany
| | - Rainer Koenig
- Institute for Infectious Diseases and Infection Control, Jena University Hospital, Jena, Germany
| |
Collapse
|
10
|
Baisya D, Ramesh A, Schwartz C, Lonardi S, Wheeldon I. Genome-wide functional screens enable the prediction of high activity CRISPR-Cas9 and -Cas12a guides in Yarrowia lipolytica. Nat Commun 2022; 13:922. [PMID: 35177617 PMCID: PMC8854577 DOI: 10.1038/s41467-022-28540-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 02/01/2022] [Indexed: 12/15/2022] Open
Abstract
Genome-wide functional genetic screens have been successful in discovering genotype-phenotype relationships and in engineering new phenotypes. While broadly applied in mammalian cell lines and in E. coli, use in non-conventional microorganisms has been limited, in part, due to the inability to accurately design high activity CRISPR guides in such species. Here, we develop an experimental-computational approach to sgRNA design that is specific to an organism of choice, in this case the oleaginous yeast Yarrowia lipolytica. A negative selection screen in the absence of non-homologous end-joining, the dominant DNA repair mechanism, was used to generate single guide RNA (sgRNA) activity profiles for both SpCas9 and LbCas12a. This genome-wide data served as input to a deep learning algorithm, DeepGuide, that is able to accurately predict guide activity. DeepGuide uses unsupervised learning to obtain a compressed representation of the genome, followed by supervised learning to map sgRNA sequence, genomic context, and epigenetic features with guide activity. Experimental validation, both genome-wide and with a subset of selected genes, confirms DeepGuide's ability to accurately predict high activity sgRNAs. DeepGuide provides an organism specific predictor of CRISPR guide activity that with retraining could be applied to other fungal species, prokaryotes, and other non-conventional organisms.
Collapse
Affiliation(s)
- Dipankar Baisya
- Department of Computer Science and Engineering, University of California, Riverside, CA, 92521, USA
| | - Adithya Ramesh
- Department of Chemical and Environmental Engineering, University of California, Riverside, CA, 92521, USA
| | - Cory Schwartz
- Department of Chemical and Environmental Engineering, University of California, Riverside, CA, 92521, USA
- iBio Inc., San Diego, CA, USA
| | - Stefano Lonardi
- Department of Computer Science and Engineering, University of California, Riverside, CA, 92521, USA.
- Integrative Institute for Genome Biology, University of California, Riverside, CA, 92521, USA.
| | - Ian Wheeldon
- Department of Chemical and Environmental Engineering, University of California, Riverside, CA, 92521, USA.
- Integrative Institute for Genome Biology, University of California, Riverside, CA, 92521, USA.
- Center for Industrial Biotechnology, University of California, Riverside, CA, 92521, USA.
| |
Collapse
|
11
|
Mehlhorn S, Hunnekuhl VS, Geibel S, Nauen R, Bucher G. Establishing RNAi for basic research and pest control and identification of the most efficient target genes for pest control: a brief guide. Front Zool 2021; 18:60. [PMID: 34863212 PMCID: PMC8643023 DOI: 10.1186/s12983-021-00444-7] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 11/04/2021] [Indexed: 11/14/2022] Open
Abstract
RNA interference (RNAi) has emerged as a powerful tool for knocking-down gene function in diverse taxa including arthropods for both basic biological research and application in pest control. The conservation of the RNAi mechanism in eukaryotes suggested that it should-in principle-be applicable to most arthropods. However, practical hurdles have been limiting the application in many taxa. For instance, species differ considerably with respect to efficiency of dsRNA uptake from the hemolymph or the gut. Here, we review some of the most frequently encountered technical obstacles when establishing RNAi and suggest a robust procedure for establishing this technique in insect species with special reference to pests. Finally, we present an approach to identify the most effective target genes for the potential control of agricultural and public health pests by RNAi.
Collapse
Affiliation(s)
- Sonja Mehlhorn
- Crop Science Division, Bayer AG, R&D, Pest Control, Alfred-Nobel-Straße 50, 40789, Monheim, Germany
- Department of Evolutionary Developmental Genetics, Johann-Friedrich-Blumenbach Institute, GZMB, University of Göttingen, Göttingen, Germany
| | - Vera S Hunnekuhl
- Department of Evolutionary Developmental Genetics, Johann-Friedrich-Blumenbach Institute, GZMB, University of Göttingen, Göttingen, Germany
| | - Sven Geibel
- Crop Science Division, Bayer AG, R&D, Pest Control, Alfred-Nobel-Straße 50, 40789, Monheim, Germany
| | - Ralf Nauen
- Crop Science Division, Bayer AG, R&D, Pest Control, Alfred-Nobel-Straße 50, 40789, Monheim, Germany
| | - Gregor Bucher
- Department of Evolutionary Developmental Genetics, Johann-Friedrich-Blumenbach Institute, GZMB, University of Göttingen, Göttingen, Germany.
| |
Collapse
|