Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Gurumayum S, Jiang P, Hao X, Campos TL, Young ND, Korhonen PK, Gasser RB, Bork P, Zhao XM, He LJ, Chen WH. OGEE v3: Online GEne Essentiality database with increased coverage of organisms and human cell lines. Nucleic Acids Res 2021;49:D998-D1003. [PMID: 33084874 PMCID: PMC7779042 DOI: 10.1093/nar/gkaa884] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 09/24/2020] [Accepted: 09/28/2020] [Indexed: 12/17/2022] Open

Number

Cited by Other Article(s)

Ma S, Su T, Lu X, Qi Q. Bacterial genome reduction for optimal chassis of synthetic biology: a review. Crit Rev Biotechnol 2024;44:660-673. [PMID: 37380345 DOI: 10.1080/07388551.2023.2208285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 10/13/2022] [Accepted: 02/20/2023] [Indexed: 06/30/2023]

Tiani KA, Stover PJ. DTYMK is an essential gene in mice and heterozygosity does not cause neural tube defects. Arch Biochem Biophys 2024;755:109991. [PMID: 38621447 DOI: 10.1016/j.abb.2024.109991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 04/03/2024] [Accepted: 04/11/2024] [Indexed: 04/17/2024]

Yin R, Gutierrez A, Kobren SN, Avillach P. VarPPUD: Variant post prioritization developed for undiagnosed genetic disorders. medRxiv 2024:2024.04.15.24305876. [PMID: 38699371 PMCID: PMC11065012 DOI: 10.1101/2024.04.15.24305876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2024]

Abstract

Rare and ultra-rare genetic conditions are estimated to impact nearly 1 in 17 people worldwide, yet accurately pinpointing the diagnostic variants underlying each of these conditions remains a formidable challenge. Because comprehensive, in vivo functional assessment of all possible genetic variants is infeasible, clinicians instead consider in silico variant pathogenicity predictions to distinguish plausibly disease-causing from benign variants across the genome. However, in the most difficult undiagnosed cases, such as those accepted to the Undiagnosed Diseases Network (UDN), existing pathogenicity predictions cannot reliably discern true etiological variant(s) from other deleterious candidate variants that were prioritized through N-of-1 efforts. Pinpointing the disease-causing variant from a pool of plausible candidates remains a largely manual effort requiring extensive clinical workups, functional and experimental assays, and eventual identification of genotype- and phenotype-matched individuals. Here, we introduce VarPPUD, a tool trained on prioritized variants from UDN cases, that leverages gene-, amino acid-, and nucleotide-level features to discern pathogenic variants from other deleterious variants that are unlikely to be confirmed as disease relevant. VarPPUD achieves a cross-validated accuracy of 79.3% and precision of 77.5% on a held-out subset of uniquely challenging UDN cases, respectively representing an average 18.6% and 23.4% improvement over nine traditional pathogenicity prediction approaches on this task. We validate VarPPUD's ability to discriminate likely from unlikely pathogenic variants on synthetic, GAN-generated candidate variants as well. Finally, we show how VarPPUD can be probed to evaluate each input feature's importance and contribution toward prediction-an essential step toward understanding the distinct characteristics of newly-uncovered disease-causing variants.

Significance Statement

Patients with chronic, undiagnosed and underdiagnosed genetic conditions often endure expensive and excruciating years-long diagnostic odysseys without clear results. In many instances, clinical genome sequencing of patients and their family members fails to reveal known disease-causing variants, although compelling variants of uncertain significance are frequently encountered. Existing computational tools struggle to reliably differentiate truly disease-causing variants from other plausible candidate variants within these prioritized sets. Consequently, the confirmation of disease-causing variants often necessitates extensive experimental follow-up, including studies in model organisms and identification of other similarly presenting genotype-matched individuals, a process that can extend for several years. Here, we present VarPPUD, a tool trained specifically to distinguish likely from unlikely to be confirmed pathogenic variants that were prioritized across cases in the Undiagnosed Diseases Network. By evaluating the importance and impact of different input feature values on prediction, we gain deeper insights into the distinctive attributes of difficult-to-identify diagnostic variants. For patients who remain undiagnosed following comprehensive whole genome sequencing, our new method VarPPUD may reveal pathogenic variants amid a pool of candidate variants, thereby advancing diagnostic efforts where progress has otherwise stalled.

Collapse

Mu W, Luo T, Barrera A, Bounds LR, Klann TS, Ter Weele M, Bryois J, Crawford GE, Sullivan PF, Gersbach CA, Love MI, Li Y. Machine learning methods for predicting guide RNA effects in CRISPR epigenome editing experiments. bioRxiv 2024:2024.04.18.590188. [PMID: 38659894 PMCID: PMC11042384 DOI: 10.1101/2024.04.18.590188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]

Affiliation(s)

Wancen Mu Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Tianyou Luo Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Alejandro Barrera Center for Genomic and Computational Biology, Duke University, Durham, NC, USA Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA
Lexi R Bounds Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA Department of Biomedical Engineering, Duke University, Durham, NC, USA
Tyler S Klann Center for Genomic and Computational Biology, Duke University, Durham, NC, USA Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA Department of Biomedical Engineering, Duke University, Durham, NC, USA
Maria Ter Weele Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA Department of Biomedical Engineering, Duke University, Durham, NC, USA
Julien Bryois Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Gregory E Crawford Center for Genomic and Computational Biology, Duke University, Durham, NC, USA Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA Department of Pediatrics, Division of Medical Genetics, Duke University Medical Center, Durham, NC, USA
Patrick F Sullivan Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Charles A Gersbach Center for Genomic and Computational Biology, Duke University, Durham, NC, USA Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA Department of Biomedical Engineering, Duke University, Durham, NC, USA
Michael I Love Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Yun Li Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA

Collapse

Wang HT, Xiao FH, Gao ZL, Guo LY, Yang LQ, Li GH, Kong QP. Methylation entropy landscape of Chinese long-lived individuals reveals lower epigenetic noise related to human healthy aging. Aging Cell 2024:e14163. [PMID: 38566438 DOI: 10.1111/acel.14163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 03/12/2024] [Accepted: 03/15/2024] [Indexed: 04/04/2024] Open

Affiliation(s)

Hao-Tian Wang Key Laboratory of Genetic Evolution & Animal Models (Chinese Academy of Sciences), Key Laboratory of Healthy Aging Research of Yunnan Province, Kunming Key Laboratory of Healthy Aging Study, KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
Fu-Hui Xiao Key Laboratory of Genetic Evolution & Animal Models (Chinese Academy of Sciences), Key Laboratory of Healthy Aging Research of Yunnan Province, Kunming Key Laboratory of Healthy Aging Study, KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
Zong-Liang Gao Key Laboratory of Genetic Evolution & Animal Models (Chinese Academy of Sciences), Key Laboratory of Healthy Aging Research of Yunnan Province, Kunming Key Laboratory of Healthy Aging Study, KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
Li-Yun Guo Key Laboratory of Genetic Evolution & Animal Models (Chinese Academy of Sciences), Key Laboratory of Healthy Aging Research of Yunnan Province, Kunming Key Laboratory of Healthy Aging Study, KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
Li-Qin Yang Key Laboratory of Genetic Evolution & Animal Models (Chinese Academy of Sciences), Key Laboratory of Healthy Aging Research of Yunnan Province, Kunming Key Laboratory of Healthy Aging Study, KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
Gong-Hua Li Key Laboratory of Genetic Evolution & Animal Models (Chinese Academy of Sciences), Key Laboratory of Healthy Aging Research of Yunnan Province, Kunming Key Laboratory of Healthy Aging Study, KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
Qing-Peng Kong Key Laboratory of Genetic Evolution & Animal Models (Chinese Academy of Sciences), Key Laboratory of Healthy Aging Research of Yunnan Province, Kunming Key Laboratory of Healthy Aging Study, KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China

Collapse

Shanley HT, Taki AC, Nguyen N, Wang T, Byrne JJ, Ang CS, Leeming MG, Williamson N, Chang BCH, Jabbar A, Sleebs BE, Gasser RB. Comparative structure activity and target exploration of 1,2-diphenylethynes in Haemonchus contortus and Caenorhabditis elegans. Int J Parasitol Drugs Drug Resist 2024;25:100534. [PMID: 38554597 PMCID: PMC10992699 DOI: 10.1016/j.ijpddr.2024.100534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 03/14/2024] [Accepted: 03/17/2024] [Indexed: 04/01/2024]

Abstract

Infections and diseases caused by parasitic nematodes have a major adverse impact on the health and productivity of animals and humans worldwide. The control of these parasites often relies heavily on the treatment with commercially available chemical compounds (anthelmintics). However, the excessive or uncontrolled use of these compounds in livestock animals has led to major challenges linked to drug resistance in nematodes. Therefore, there is a need to develop new anthelmintics with novel mechanism(s) of action. Recently, we identified a small molecule, designated UMW-9729, with nematocidal activity against the free-living model organism Caenorhabditis elegans. Here, we evaluated UMW-9729's potential as an anthelmintic in a structure-activity relationship (SAR) study in C. elegans and the highly pathogenic, blood-feeding Haemonchus contortus (barber's pole worm), and explored the compound-target relationship using thermal proteome profiling (TPP). First, we synthesised and tested 25 analogues of UMW-9729 for their nematocidal activity in both H. contortus (larvae and adults) and C. elegans (young adults), establishing a preliminary nematocidal pharmacophore for both species. We identified several compounds with marked activity against either H. contortus or C. elegans which had greater efficacy than UMW-9729, and found a significant divergence in compound bioactivity between these two nematode species. We also identified a UMW-9729 analogue, designated 25, that moderately inhibited the motility of adult female H. contortus in vitro. Subsequently, we inferred three H. contortus proteins (HCON_00134350, HCON_00021470 and HCON_00099760) and five C. elegans proteins (F30A10.9, F15B9.8, B0361.6, DNC-4 and UNC-11) that interacted directly with UMW-9729; however, no conserved protein target was shared between the two nematode species. Future work aims to extend the SAR investigation in these and other parasitic nematode species, and validate individual proteins identified here as possible targets of UMW-9729. Overall, the present study evaluates this anthelmintic candidate and highlights some challenges associated with early anthelmintic investigation.

Collapse

Affiliation(s)

Harrison T Shanley Department of Veterinary Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, Victoria, 3010, Australia; Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia
Aya C Taki Department of Veterinary Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, Victoria, 3010, Australia
Nghi Nguyen Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia
Tao Wang Department of Veterinary Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, Victoria, 3010, Australia
Joseph J Byrne Department of Veterinary Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, Victoria, 3010, Australia
Ching-Seng Ang Melbourne Mass Spectrometry and Proteomics Facility, The Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, Victoria, 3010, Australia
Michael G Leeming Melbourne Mass Spectrometry and Proteomics Facility, The Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, Victoria, 3010, Australia
Nicholas Williamson Melbourne Mass Spectrometry and Proteomics Facility, The Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, Victoria, 3010, Australia
Bill C H Chang Department of Veterinary Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, Victoria, 3010, Australia
Abdul Jabbar Department of Veterinary Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, Victoria, 3010, Australia
Brad E Sleebs Department of Veterinary Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, Victoria, 3010, Australia; Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia.
Robin B Gasser Department of Veterinary Biosciences, Melbourne Veterinary School, Faculty of Science, The University of Melbourne, Parkville, Victoria, 3010, Australia.

Collapse

Ye C, Wu Q, Chen S, Zhang X, Xu W, Wu Y, Zhang Y, Yue Y. ECDEP: identifying essential proteins based on evolutionary community discovery and subcellular localization. BMC Genomics 2024;25:117. [PMID: 38279081 PMCID: PMC10821549 DOI: 10.1186/s12864-024-10019-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 01/15/2024] [Indexed: 01/28/2024] Open

Abstract

BACKGROUND

In cellular activities, essential proteins play a vital role and are instrumental in comprehending fundamental biological necessities and identifying pathogenic genes. Current deep learning approaches for predicting essential proteins underutilize the potential of gene expression data and are inadequate for the exploration of dynamic networks with limited evaluation across diverse species.

RESULTS

We introduce ECDEP, an essential protein identification model based on evolutionary community discovery. ECDEP integrates temporal gene expression data with a protein-protein interaction (PPI) network and employs the 3-Sigma rule to eliminate outliers at each time point, constructing a dynamic network. Next, we utilize edge birth and death information to establish an interaction streaming source to feed into the evolutionary community discovery algorithm and then identify overlapping communities during the evolution of the dynamic network. SVM recursive feature elimination (RFE) is applied to extract the most informative communities, which are combined with subcellular localization data for classification predictions. We assess the performance of ECDEP by comparing it against ten centrality methods, four shallow machine learning methods with RFE, and two deep learning methods that incorporate multiple biological data sources on Saccharomyces. Cerevisiae (S. cerevisiae), Homo sapiens (H. sapiens), Mus musculus, and Caenorhabditis elegans. ECDEP achieves an AP value of 0.86 on the H. sapiens dataset and the contribution ratio of community features in classification reaches 0.54 on the S. cerevisiae (Krogan) dataset.

CONCLUSIONS

Our proposed method adeptly integrates network dynamics and yields outstanding results across various datasets. Furthermore, the incorporation of evolutionary community discovery algorithms amplifies the capacity of gene expression data in classification.

Collapse

Affiliation(s)

Chen Ye School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China Anhui Beidou Precision Agriculture Information Engineering Research Center, Anhui Agricultural University, Hefei, 230036, China
Qi Wu School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China Anhui Beidou Precision Agriculture Information Engineering Research Center, Anhui Agricultural University, Hefei, 230036, China
Shuxia Chen School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China Anhui Beidou Precision Agriculture Information Engineering Research Center, Anhui Agricultural University, Hefei, 230036, China
Xuemei Zhang School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China Anhui Beidou Precision Agriculture Information Engineering Research Center, Anhui Agricultural University, Hefei, 230036, China
Wenwen Xu School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China Anhui Beidou Precision Agriculture Information Engineering Research Center, Anhui Agricultural University, Hefei, 230036, China
Yunzhi Wu School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China Anhui Beidou Precision Agriculture Information Engineering Research Center, Anhui Agricultural University, Hefei, 230036, China
Youhua Zhang School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China Anhui Beidou Precision Agriculture Information Engineering Research Center, Anhui Agricultural University, Hefei, 230036, China
Yi Yue School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, 230036, China. Anhui Beidou Precision Agriculture Information Engineering Research Center, Anhui Agricultural University, Hefei, 230036, China.

Collapse

Cacheiro P, Lawson S, Van den Veyver IB, Marengo G, Zocche D, Murray SA, Duyzend M, Robinson PN, Smedley D. Lethal phenotypes in Mendelian disorders. medRxiv 2024:2024.01.12.24301168. [PMID: 38260283 PMCID: PMC10802756 DOI: 10.1101/2024.01.12.24301168] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]

Abstract

Essential genes are those whose function is required for cell proliferation and/or organism survival. A gene's intolerance to loss-of-function can be allocated within a spectrum, as opposed to being considered a binary feature, since this function might be essential at different stages of development, genetic backgrounds or other contexts. Existing resources that collect and characterise the essentiality status of genes are based on either proliferation assessment in human cell lines, embryonic and postnatal viability evaluation in different model organisms, and gene metrics such as intolerance to variation scores derived from human population sequencing studies. There are also several repositories available that document phenotypic annotations for rare disorders in humans such as the Online Mendelian Inheritance in Man (OMIM) and the Human Phenotype Ontology (HPO) knowledgebases. This raises the prospect of being able to use clinical data, including lethality as the most severe phenotypic manifestation, to further our characterisation of gene essentiality. Here we queried OMIM for terms related to lethality and classified all Mendelian genes into categories, according to the earliest age of death recorded for the associated disorders, from prenatal death to no reports of premature death. To showcase this curated catalogue of human essential genes, we developed the Lethal Phenotypes Portal (https://lethalphenotypes.research.its.qmul.ac.uk), where we also explore the relationships between these lethality categories, constraint metrics and viability in cell lines and mouse. Further analysis of the genes in these categories reveals differences in the mode of inheritance of the associated disorders, physiological systems affected and disease class. We highlight how the phenotypic similarity between genes in the same lethality category combined with gene family/group information can be used for novel disease gene discovery. Finally, we explore the overlaps and discrepancies between the lethal phenotypes observed in mouse and human and discuss potential explanations that include differences in transcriptional regulation, functional compensation and molecular disease mechanisms. We anticipate that this resource will aid clinicians in the diagnosis of early lethal conditions and assist researchers in investigating the properties that make these genes essential for human development.

Collapse

Giordano M, Falbo E, Maddalena L, Piccirillo M, Granata I. Untangling the Context-Specificity of Essential Genes by Means of Machine Learning: A Constructive Experience. Biomolecules 2023;14:18. [PMID: 38254618 PMCID: PMC10813179 DOI: 10.3390/biom14010018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 11/29/2023] [Accepted: 12/20/2023] [Indexed: 01/24/2024] Open

Wang J, Shi A, Lyu J. A comprehensive atlas of epigenetic regulators reveals tissue-specific epigenetic regulation patterns. Epigenetics 2023;18:2139067. [PMID: 36305095 PMCID: PMC9980636 DOI: 10.1080/15592294.2022.2139067] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open

Ma J, Song J, Young ND, Chang BCH, Korhonen PK, Campos TL, Liu H, Gasser RB. 'Bingo'-a large language model- and graph neural network-based workflow for the prediction of essential genes from protein data. Brief Bioinform 2023;25:bbad472. [PMID: 38152979 PMCID: PMC10753293 DOI: 10.1093/bib/bbad472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/22/2023] [Accepted: 11/28/2023] [Indexed: 12/29/2023] Open

Bay ÖF, Hayes KS, Schwartz JM, Grencis RK, Roberts IS. A genome-scale metabolic model of parasitic whipworm. Nat Commun 2023;14:6937. [PMID: 37907472 PMCID: PMC10618284 DOI: 10.1038/s41467-023-42552-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 10/13/2023] [Indexed: 11/02/2023] Open

Cacheiro P, Smedley D. Essential genes: a cross-species perspective. Mamm Genome 2023;34:357-363. [PMID: 36897351 PMCID: PMC10382395 DOI: 10.1007/s00335-023-09984-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 02/17/2023] [Indexed: 03/11/2023]

Zhou JB, Tang D, He L, Lin S, Lei JH, Sun H, Xu X, Deng CX. Machine learning model for anti-cancer drug combinations: Analysis, prediction, and validation. Pharmacol Res 2023;194:106830. [PMID: 37343647 DOI: 10.1016/j.phrs.2023.106830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 06/10/2023] [Accepted: 06/17/2023] [Indexed: 06/23/2023]

Affiliation(s)

Jing-Bo Zhou Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Precision Medicine Research and Training, Faculty of Health Sciences, University of Macau, Macau SAR, China
Dongyang Tang Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Precision Medicine Research and Training, Faculty of Health Sciences, University of Macau, Macau SAR, China
Lin He Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Precision Medicine Research and Training, Faculty of Health Sciences, University of Macau, Macau SAR, China
Shiqi Lin Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Precision Medicine Research and Training, Faculty of Health Sciences, University of Macau, Macau SAR, China
Josh Haipeng Lei Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Precision Medicine Research and Training, Faculty of Health Sciences, University of Macau, Macau SAR, China
Heng Sun Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Precision Medicine Research and Training, Faculty of Health Sciences, University of Macau, Macau SAR, China
Xiaoling Xu Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Precision Medicine Research and Training, Faculty of Health Sciences, University of Macau, Macau SAR, China; MOE Frontier Science Center for Precision Oncology, University of Macau, Macau SAR, China
Chu-Xia Deng Cancer Center, Faculty of Health Sciences, University of Macau, Macau SAR, China; Centre for Precision Medicine Research and Training, Faculty of Health Sciences, University of Macau, Macau SAR, China; MOE Frontier Science Center for Precision Oncology, University of Macau, Macau SAR, China.

Collapse

Cesur MF, Basile A, Patil KR, Çakır T. A new metabolic model of Drosophila melanogaster and the integrative analysis of Parkinson's disease. Life Sci Alliance 2023;6:e202201695. [PMID: 37236669 PMCID: PMC10215973 DOI: 10.26508/lsa.202201695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2022] [Revised: 05/11/2023] [Accepted: 05/12/2023] [Indexed: 05/28/2023] Open

Itai T, Jia P, Dai Y, Chen J, Chen X, Zhao Z. De novo mutations disturb early brain development more frequently than common variants in schizophrenia. Am J Med Genet B Neuropsychiatr Genet 2023;192:62-70. [PMID: 36863698 DOI: 10.1002/ajmg.b.32932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 12/08/2022] [Accepted: 01/29/2023] [Indexed: 03/04/2023]

Gorlov IP, Conway K, Edmiston SN, Parrish EA, Hao H, Amos CI, Tsavachidis S, Gorlova OY, Begg C, Hernando E, Cheng C, Shen R, Orlow I, Luo L, Ernstoff MS, Kuan PF, Ollila DW, Tsai YS, Berwick M, Thomas NE. Methylation of nonessential genes in cutaneous melanoma - Rule Out hypothesis. Melanoma Res 2023;33:163-172. [PMID: 36805567 PMCID: PMC10148896 DOI: 10.1097/cmr.0000000000000881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]

Affiliation(s)

Ivan P Gorlov Department of Medicine, Baylor College of Medicine, Houston, Texas
Kathleen Conway Department of Dermatology, University of North Carolina Department of Epidemiology Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
Sharon N Edmiston Department of Dermatology, University of North Carolina Department of Epidemiology Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
Eloise A Parrish Department of Applied Mathematics and Statistics, State University of New York, Stony Brook
Honglin Hao Department of Dermatology, University of North Carolina
Christopher I Amos Department of Medicine, Baylor College of Medicine, Houston, Texas
Spiridon Tsavachidis Department of Medicine, Baylor College of Medicine, Houston, Texas
Olga Y Gorlova Department of Medicine, Baylor College of Medicine, Houston, Texas
Colin Begg Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York
Eva Hernando Department of Pathology, New York University School of Medicine, New York
Chao Cheng Department of Medicine, Baylor College of Medicine, Houston, Texas
Ronglai Shen Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York
Irene Orlow Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York
Li Luo Department of Internal Medicine, University of New Mexico, Albuquerque, New Maxico
Marc S Ernstoff Roswell Park Comprehensive Cancer Center, Elm and Carlton, Buffalo
Pei Fen Kuan Department of Applied Mathematics and Statistics, State University of New York, Stony Brook and
David W Ollila Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina Department of Surgery, University of North Carolina, Chapel Hill, North Carolina, USA
Yihsuan S Tsai Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
Marianne Berwick Department of Internal Medicine, University of New Mexico, Albuquerque, New Maxico
Nancy E Thomas Department of Dermatology, University of North Carolina Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina

Collapse

Mugwanda K, Hamese S, Van Zyl WF, Prinsloo E, Du Plessis M, Dicks LMT, Thimiri Govinda Raj DB. Recent advances in genetic tools for engineering probiotic lactic acid bacteria. Biosci Rep 2023;43. [PMID: 36597861 DOI: 10.1042/BSR20211299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 12/19/2022] [Accepted: 01/03/2023] [Indexed: 01/05/2023] Open

Liao W, Nie W, Ahmad I, Chen G, Zhu B. The occurrence, characteristics, and adaptation of A-to-I RNA editing in bacteria: A review. Front Microbiol 2023;14:1143929. [PMID: 36960293 PMCID: PMC10027721 DOI: 10.3389/fmicb.2023.1143929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 02/15/2023] [Indexed: 03/09/2023] Open

Manzo M, Giordano M, Maddalena L, Guarracino MR, Granata I. Novel Data Science Methodologies for Essential Genes Identification Based on Network Analysis. Studies in Computational Intelligence 2023:117-145. [DOI: 10.1007/978-3-031-24453-7_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]

Möller S, Saul N, Projahn E, Barrantes I, Gézsi A, Walter M, Antal P, Fuellen G. Gene co-expression analyses of health(span) across multiple species. NAR Genom Bioinform 2022;4:lqac083. [PMID: 36458022 PMCID: PMC9706456 DOI: 10.1093/nargab/lqac083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2021] [Revised: 08/20/2022] [Accepted: 10/31/2022] [Indexed: 12/03/2022] Open

Abstract

Health(span)-related gene clusters/modules were recently identified based on knowledge about the cross-species genetic basis of health, to interpret transcriptomic datasets describing health-related interventions. However, the cross-species comparison of health-related observations reveals a lot of heterogeneity, not least due to widely varying health(span) definitions and study designs, posing a challenge for the exploration of conserved healthspan modules and, specifically, their transfer across species. To improve the identification and exploration of conserved/transferable healthspan modules, here we apply an established workflow based on gene co-expression network analyses employing GEO/ArrayExpress data for human and animal models, and perform a comprehensive meta-study of the resulting modules related to health(span), yielding a small set of literature backed health(span) candidate genes. For each experiment, WGCNA (weighted gene correlation network analysis) was used to infer modules of genes which correlate in their expression with a 'health phenotype score' and to determine the most-connected (hub) genes (and their interactions) for each such module. After mapping these hub genes to their human orthologs, 12 health(span) genes were identified in at least two species (ACTN3, ANK1, MRPL18, MYL1, PAXIP1, PPP1CA, SCN3B, SDCBP, SKIV2L, TUBG1, TYROBP, WIPF1), for which enrichment analysis by g:profiler found an association with actin filament-based movement and associated organelles, as well as muscular structures. We conclude that a meta-study of hub genes from co-expression network analyses for the complex phenotype health(span), across multiple species, can yield molecular-mechanistic insights and can direct experimentalists to further investigate the contribution of individual genes and their interactions to health(span).

Collapse

Banik A, Podder S, Saha S, Chatterjee P, Halder AK, Nasipuri M, Basu S, Plewczynski D. Rule-Based Pruning and In Silico Identification of Essential Proteins in Yeast PPIN. Cells 2022;11:2648. [PMID: 36078056 PMCID: PMC9454873 DOI: 10.3390/cells11172648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 08/18/2022] [Accepted: 08/22/2022] [Indexed: 11/25/2022] Open

Yue Y, Ye C, Peng PY, Zhai HX, Ahmad I, Xia C, Wu YZ, Zhang YH. A deep learning framework for identifying essential proteins based on multiple biological information. BMC Bioinformatics 2022;23:318. [PMID: 35927611 PMCID: PMC9351218 DOI: 10.1186/s12859-022-04868-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 07/29/2022] [Indexed: 11/15/2022] Open

Abstract

Background

Essential Proteins are demonstrated to exert vital functions on cellular processes and are indispensable for the survival and reproduction of the organism. Traditional centrality methods perform poorly on complex protein–protein interaction (PPI) networks. Machine learning approaches based on high-throughput data lack the exploitation of the temporal and spatial dimensions of biological information.

Results

We put forward a deep learning framework to predict essential proteins by integrating features obtained from the PPI network, subcellular localization, and gene expression profiles. In our model, the node2vec method is applied to learn continuous feature representations for proteins in the PPI network, which capture the diversity of connectivity patterns in the network. The concept of depthwise separable convolution is employed on gene expression profiles to extract properties and observe the trends of gene expression over time under different experimental conditions. Subcellular localization information is mapped into a long one-dimensional vector to capture its characteristics. Additionally, we use a sampling method to mitigate the impact of imbalanced learning when training the model. With experiments carried out on the data of Saccharomyces cerevisiae, results show that our model outperforms traditional centrality methods and machine learning methods. Likewise, the comparative experiments have manifested that our process of various biological information is preferable.

Conclusions

Our proposed deep learning framework effectively identifies essential proteins by integrating multiple biological data, proving a broader selection of subcellular localization information significantly improves the results of prediction and depthwise separable convolution implemented on gene expression profiles enhances the performance.

Collapse

Affiliation(s)

Yi Yue Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, 230036, China. .,School of Information and Computer, Anhui Agricultural University, Hefei, 230036, China. .,School of Life Sciences, Anhui Agricultural University, Hefei, 230036, China. .,State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, 230036, China.
Chen Ye Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, 230036, China.,School of Information and Computer, Anhui Agricultural University, Hefei, 230036, China
Pei-Yun Peng Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, 230036, China.,School of Information and Computer, Anhui Agricultural University, Hefei, 230036, China
Hui-Xin Zhai Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, 230036, China.,School of Information and Computer, Anhui Agricultural University, Hefei, 230036, China
Iftikhar Ahmad Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, 230036, China.,School of Information and Computer, Anhui Agricultural University, Hefei, 230036, China
Chuan Xia Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, 230036, China.,School of Information and Computer, Anhui Agricultural University, Hefei, 230036, China
Yun-Zhi Wu Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, 230036, China.,School of Information and Computer, Anhui Agricultural University, Hefei, 230036, China.,State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, 230036, China
You-Hua Zhang Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, 230036, China. .,School of Information and Computer, Anhui Agricultural University, Hefei, 230036, China. .,School of Life Sciences, Anhui Agricultural University, Hefei, 230036, China.

Collapse

Trastulla L, Noorbakhsh J, Vazquez F, McFarland J, Iorio F. Computational estimation of quality and clinical relevance of cancer cell lines. Mol Syst Biol 2022;18:e11017. [PMID: 35822563 PMCID: PMC9277610 DOI: 10.15252/msb.202211017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 06/10/2022] [Accepted: 06/13/2022] [Indexed: 12/12/2022] Open

Zhang Y, Zhang W, Xin X, Du P. dbEssLnc: A manually curated database of human and mouse essential lncRNA genes. Comput Struct Biotechnol J 2022. [PMID: 35685362 PMCID: PMC9162909 DOI: 10.1016/j.csbj.2022.05.043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 05/20/2022] [Accepted: 05/21/2022] [Indexed: 02/07/2023] Open

Hütter CVR, Sin C, Müller F, Menche J. Network cartographs for interpretable visualizations. Nat Comput Sci 2022;2:84-89. [PMID: 38177513 PMCID: PMC10766564 DOI: 10.1038/s43588-022-00199-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 01/20/2022] [Indexed: 01/06/2024]

Soubise B, Jiang Y, Douet-Guilbert N, Troadec MB. RBM22, a Key Player of Pre-mRNA Splicing and Gene Expression Regulation, Is Altered in Cancer. Cancers (Basel) 2022;14:cancers14030643. [PMID: 35158909 PMCID: PMC8833553 DOI: 10.3390/cancers14030643] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 01/19/2022] [Accepted: 01/22/2022] [Indexed: 01/05/2023] Open

Krauze AV, Camphausen K. Molecular Biology in Treatment Decision Processes-Neuro-Oncology Edition. Int J Mol Sci 2021;22:13278. [PMID: 34948075 PMCID: PMC8703419 DOI: 10.3390/ijms222413278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 12/02/2021] [Accepted: 12/03/2021] [Indexed: 11/30/2022] Open

Beder T, Aromolaran O, Dönitz J, Tapanelli S, Adedeji E, Adebiyi E, Bucher G, Koenig R. Identifying essential genes across eukaryotes by machine learning. NAR Genom Bioinform 2021;3:lqab110. [PMID: 34859210 PMCID: PMC8634067 DOI: 10.1093/nargab/lqab110] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 10/09/2021] [Accepted: 11/29/2021] [Indexed: 02/07/2023] Open

Chu V, Feng Q, Lim Y, Shao S. Selective destabilization of polypeptides synthesized from NMD-targeted transcripts. Mol Biol Cell 2021;32:ar38. [PMID: 34586879 PMCID: PMC8694075 DOI: 10.1091/mbc.e21-08-0382] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open

Ringwald M, Richardson JE, Baldarelli RM, Blake JA, Kadin JA, Smith C, Bult CJ. Mouse Genome Informatics (MGI): latest news from MGD and GXD. Mamm Genome 2021;33:4-18. [PMID: 34698891 PMCID: PMC8913530 DOI: 10.1007/s00335-021-09921-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 09/21/2021] [Indexed: 12/01/2022]

Campos TL, Korhonen PK, Hofmann A, Gasser RB, Young ND. Harnessing model organism genomics to underpin the machine learning-based prediction of essential genes in eukaryotes - Biotechnological implications. Biotechnol Adv 2021;54:107822. [PMID: 34461202 DOI: 10.1016/j.biotechadv.2021.107822] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Revised: 08/17/2021] [Accepted: 08/24/2021] [Indexed: 12/17/2022]

Abstract

The availability of high-quality genomes and advances in functional genomics have enabled large-scale studies of essential genes in model eukaryotes, including the 'elegant worm' (Caenorhabditis elegans; Nematoda) and the 'vinegar fly' (Drosophila melanogaster; Arthropoda). However, this is not the case for other, much less-studied organisms, such as socioeconomically important parasites, for which functional genomic platforms usually do not exist. Thus, there is a need to develop innovative techniques or approaches for the prediction, identification and investigation of essential genes. A key approach that could enable the prediction of such genes is machine learning (ML). Here, we undertake an historical review of experimental and computational approaches employed for the characterisation of essential genes in eukaryotes, with a particular focus on model ecdysozoans (C. elegans and D. melanogaster), and discuss the possible applicability of ML-approaches to organisms such as socioeconomically important parasites. We highlight some recent results showing that high-performance ML, combined with feature engineering, allows a reliable prediction of essential genes from extensive, publicly available 'omic data sets, with major potential to prioritise such genes (with statistical confidence) for subsequent functional genomic validation. These findings could 'open the door' to fundamental and applied research areas. Evidence of some commonality in the essential gene-complement between these two organisms indicates that an ML-engineering approach could find broader applicability to ecdysozoans such as parasitic nematodes or arthropods, provided that suitably large and informative data sets become/are available for proper feature engineering, and for the robust training and validation of algorithms. This area warrants detailed exploration to, for example, facilitate the identification and characterisation of essential molecules as novel targets for drugs and vaccines against parasitic diseases. This focus is particularly important, given the substantial impact that such diseases have worldwide, and the current challenges associated with their prevention and control and with drug resistance in parasite populations.

Collapse

Zahra NUA, Jamil F, Uddin R. Protein Integrated Network Analysis to Reveal Potential Drug Targets Against Extended Drug-Resistant Mycobacterium tuberculosis XDR1219. Mol Biotechnol 2021. [PMID: 34382159 DOI: 10.1007/s12033-021-00377-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 07/30/2021] [Indexed: 10/20/2022]

Xu D, Lyon S, Bu CH, Hildebrand S, Choi JH, Zhong X, Liu A, Turer EE, Zhang Z, Russell J, Ludwig S, Mahrt E, Nair-Gill E, Shi H, Wang Y, Zhang D, Yue T, Wang KW, SoRelle JA, Su L, Misawa T, McAlpine W, Sun L, Wang J, Zhan X, Choi M, Farokhnia R, Sakla A, Schneider S, Coco H, Coolbaugh G, Hayse B, Mazal S, Medler D, Nguyen B, Rodriguez E, Wadley A, Tang M, Li X, Anderton P, Keller K, Press A, Scott L, Quan J, Cooper S, Collie T, Qin B, Cardin J, Simpson R, Tadesse M, Sun Q, Wise CA, Rios JJ, Moresco EMY, Beutler B. Thousands of induced germline mutations affecting immune cells identified by automated meiotic mapping coupled with machine learning. Proc Natl Acad Sci U S A 2021;118:e2106786118. [PMID: 34260399 PMCID: PMC8285956 DOI: 10.1073/pnas.2106786118] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

Affiliation(s)

Darui Xu Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Stephen Lyon Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Chun Hui Bu Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Sara Hildebrand Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Jin Huk Choi Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390 Department of Immunology, University of Texas Southwestern Medical Center, Dallas, TX 75390
Xue Zhong Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Aijie Liu Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Emre E Turer Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390 Department of Internal Medicine, Division of Gastroenterology, University of Texas Southwestern Medical Center, Dallas, TX 75390
Zhao Zhang Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Jamie Russell Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Sara Ludwig Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Elena Mahrt Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Evan Nair-Gill Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Hexin Shi Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Ying Wang Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Duanwu Zhang Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Tao Yue Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Kuan-Wen Wang Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Jeffrey A SoRelle Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Lijing Su Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Takuma Misawa Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
William McAlpine Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Lei Sun Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Jianhui Wang Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Xiaoming Zhan Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Mihwa Choi Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Roxana Farokhnia Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Andrew Sakla Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Sara Schneider Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Hannah Coco Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Gabrielle Coolbaugh Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Braden Hayse Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Sara Mazal Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Dawson Medler Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Brandon Nguyen Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Edward Rodriguez Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Andrew Wadley Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Miao Tang Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Xiaohong Li Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Priscilla Anderton Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Katie Keller Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Amanda Press Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Lindsay Scott Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Jiexia Quan Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Sydney Cooper Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Tiffany Collie Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Baifang Qin Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Jennifer Cardin Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Rochelle Simpson Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Meron Tadesse Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Qihua Sun Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Carol A Wise Center for Pediatric Bone Biology and Translational Research, Scottish Rite for Children, Dallas, TX 75219 McDermott Center for Human Growth & Development, University of Texas Southwestern Medical Center, Dallas, TX 75390 Department of Orthopaedic Surgery, University of Texas Southwestern Medical Center, Dallas, TX 75390 Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX 75390
Jonathan J Rios Center for Pediatric Bone Biology and Translational Research, Scottish Rite for Children, Dallas, TX 75219 McDermott Center for Human Growth & Development, University of Texas Southwestern Medical Center, Dallas, TX 75390 Department of Orthopaedic Surgery, University of Texas Southwestern Medical Center, Dallas, TX 75390 Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, TX 75390
Eva Marie Y Moresco Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390
Bruce Beutler Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX 75390;

Collapse

Daniels MW, Dvorkin D, Powers RK, Kechris K. Semi-Supervised Learning Using Hierarchical Mixture Models: Gene Essentiality Case Study. MCA 2021;26:40. [DOI: 10.3390/mca26020040] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Galperin MY, Wolf YI, Garushyants SK, Vera Alvarez R, Koonin EV. Non-essential ribosomal proteins in bacteria and archaea identified using COGs. J Bacteriol 2021;203:JB. [PMID: 33753464 DOI: 10.1128/JB.00058-21] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

Abstract

Ribosomal proteins (RPs) are highly conserved across the bacterial and archaeal domains. Although many RPs are essential for survival, genome analysis demonstrates the absence of some RP genes in many bacterial and archaeal genomes. Furthermore, global transposon mutagenesis and/or targeted deletion showed that elimination of some RP genes had only a moderate effect on the bacterial growth rate. Here, we systematically analyze the evolutionary conservation of RPs in prokaryotes by compiling the list of the ribosomal genes that are missing from one or more genomes in the recently updated version of the Clusters of Orthologous Genes (COG) database. Some of these absences occurred because the respective genes carried frameshifts, presumably, resulting from sequencing errors, while others were overlooked and not translated during genome annotation. Apart from these annotation errors, we identified multiple genuine losses of RP genes in a variety of bacteria and archaea. Some of these losses are clade-specific, whereas others occur in symbionts and parasites with dramatically reduced genomes. The lists of computationally and experimentally defined non-essential ribosomal genes show a substantial overlap, revealing a common trend in prokaryote ribosome evolution that could be linked to the architecture and assembly of the ribosomes. Thus, RPs that are located at the surface of the ribosome and/or are incorporated at a late stage of ribosome assembly are more likely to be non-essential and to be lost during microbial evolution, particularly, in the course of genome compaction.IMPORTANCEIn many prokaryote genomes, one or more ribosomal protein (RP) genes are missing. Analysis of 1,309 prokaryote genomes included in the COG database shows that only about half of the RPs are universally conserved in bacteria and archaea. In contrast, up to 16 other RPs are missing in some genomes, primarily, tiny (<1 Mb) genomes of host-associated bacteria and archaea. Ten universal and nine archaea-specific ribosomal proteins show clear patterns of lineage-specific gene loss. Most of the RPs that are frequently lost from bacterial genomes are located on the ribosome periphery and are non-essential in Escherichia coli and Bacillus subtilis These results reveal general trends and common constraints in the architecture and evolution of ribosomes in prokaryotes.

Collapse