Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Detlefsen NS, Hauberg S, Boomsma W. Learning meaningful representations of protein sequences. Nat Commun 2022;13:1914. [PMID: 35395843 PMCID: PMC8993921 DOI: 10.1038/s41467-022-29443-w] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 03/15/2022] [Indexed: 01/27/2023] Open

For:	Detlefsen NS, Hauberg S, Boomsma W. Learning meaningful representations of protein sequences. Nat Commun 2022;13:1914. [PMID: 35395843 PMCID: PMC8993921 DOI: 10.1038/s41467-022-29443-w] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 03/15/2022] [Indexed: 01/27/2023] Open

Number

Cited by Other Article(s)

Ji D, Frkic RL, Delyami J, Larsen JS, Spence MA, Jackson CJ. A Thermostable Bacterial Metallohydrolase that Degrades Organophosphate Plasticizers. Chembiochem 2025:e2500055. [PMID: 40364453 DOI: 10.1002/cbic.202500055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2025] [Revised: 05/08/2025] [Accepted: 05/12/2025] [Indexed: 05/15/2025]

Pandey A, Chen W, Keten S. COLOR: A Compositional Linear Operation-Based Representation of Protein Sequences for Identification of Monomer Contributions to Properties. J Chem Inf Model 2025;65:4320-4333. [PMID: 40272990 DOI: 10.1021/acs.jcim.5c00205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2025]

Thompson M, Martín M, Olmo TS, Rajesh C, Koo PK, Bolognesi B, Lehner B. Massive experimental quantification allows interpretable deep learning of protein aggregation. SCIENCE ADVANCES 2025;11:eadt5111. [PMID: 40305601 PMCID: PMC12042874 DOI: 10.1126/sciadv.adt5111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2024] [Accepted: 03/26/2025] [Indexed: 05/02/2025]

Merdler-Rabinowicz R, Omar M, Ganesh J, Morava E, Nadkarni GN, Klang E. The role of large language models in medical genetics. Mol Genet Metab 2025;145:109098. [PMID: 40154187 DOI: 10.1016/j.ymgme.2025.109098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/01/2025]

Magateshvaren Saras MA, Mitra MK, Tyagi S. Navigating the Multiverse: a Hitchhiker's guide to selecting harmonization methods for multimodal biomedical data. Biol Methods Protoc 2025;10:bpaf028. [PMID: 40308831 PMCID: PMC12043205 DOI: 10.1093/biomethods/bpaf028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2025] [Revised: 03/20/2025] [Accepted: 04/15/2025] [Indexed: 05/02/2025] Open

Abstract

The application of machine learning (ML) techniques in predictive modelling has greatly advanced our comprehension of biological systems. There is a notable shift in the trend towards integration methods that specifically target the simultaneous analysis of multiple modes or types of data, showcasing superior results compared to individual analyses. Despite the availability of diverse ML architectures for researchers interested in embracing a multimodal approach, the current literature lacks a comprehensive taxonomy that includes the pros and cons of these methods to guide the entire process. Closing this gap is imperative, necessitating the creation of a robust framework. This framework should not only categorize the diverse ML architectures suitable for multimodal analysis but also offer insights into their respective advantages and limitations. Additionally, such a framework can serve as a valuable guide for selecting an appropriate workflow for multimodal analysis. This comprehensive taxonomy would provide a clear guidance and support informed decision-making within the progressively intricate landscape of biomedical and clinical data analysis. This is an essential step towards advancing personalized medicine. The aims of the work are to comprehensively study and describe the harmonization processes that are performed and reported in the literature and present a working guide that would enable planning and selecting an appropriate integrative model. We present harmonization as a dual process of representation and integration, each with multiple methods and categories. The taxonomy of the various representation and integration methods are classified into six broad categories and detailed with the advantages, disadvantages and examples. A guide flowchart describing the step-by-step processes that are needed to adopt a multimodal approach is also presented along with examples and references. This review provides a thorough taxonomy of methods for harmonizing multimodal data and introduces a foundational 10-step guide for newcomers to implement a multimodal workflow.

Collapse

Bjerregaard A, Groth PM, Hauberg S, Krogh A, Boomsma W. Foundation models of protein sequences: A brief overview. Curr Opin Struct Biol 2025;91:103004. [PMID: 39983412 DOI: 10.1016/j.sbi.2025.103004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Revised: 01/24/2025] [Accepted: 01/26/2025] [Indexed: 02/23/2025]

Refahi M, Sokhansanj BA, Mell JC, Brown JR, Yoo H, Hearne G, Rosen GL. Enhancing nucleotide sequence representations in genomic analysis with contrastive optimization. Commun Biol 2025;8:517. [PMID: 40155693 PMCID: PMC11953366 DOI: 10.1038/s42003-025-07902-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2024] [Accepted: 03/07/2025] [Indexed: 04/01/2025] Open

Chang DH, Richardson JD, Lee MR, Lynn DM, Palecek SP, Van Lehn RC. Machine learning-driven discovery of highly selective antifungal peptides containing non-canonical β-amino acids. Chem Sci 2025;16:5579-5594. [PMID: 40028619 PMCID: PMC11867109 DOI: 10.1039/d4sc06689h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2024] [Accepted: 02/19/2025] [Indexed: 03/05/2025] Open

Abstract

Antimicrobial peptides (AMPs) are promising compounds for the treatment and prevention of multidrug-resistant infections because of their ability to directly disrupt microbial membranes, a mechanism that is less likely to lead to resistance compared to antibiotics. Unfortunately, natural AMPs are prone to proteolytic cleavage in vivo and have relatively low selectivity for microbial versus human cells, motivating the development of synthetic peptidomimetics of AMPs with improved peptide stability, activity, and selectivity. However, a lack of understanding of structure-activity relationships for peptidomimetics constrains development to rational design or experimental predictors, both of which are cost and time prohibitive, especially when the design space of possible sequences scales exponentially with the number of amino acids. To address these challenges, we developed an iterative Gaussian process regression (GPR) approach to explore a large design space of 336 000 synthetic α/β-peptide analogues of a natural AMP, aurein 1.2, based on an initial training set of 147 sequences and their biological activities against microbial pathogens and selectivity for microbes vs. mammalian cells. We show that the quantification of prediction uncertainty provided by GPR can guide the exploration of this design space via iterative experimental measurements to efficiently discover novel sequences with up to a 52-fold increase in antifungal selectivity compared to aurein 1.2. The highest selectivity peptide discovered using this approach features an unconventional substitution of cationic amino acids in the hydrophobic face and would be unlikely to be explored by conventional rational design. Overall, this work demonstrates a generalizable approach that integrates computation and experiment to accurately predict the selectivity of AMPs containing synthetic amino acids, which we employed to discover new α/β-peptides that hold promise as selective antifungal agents to combat the antimicrobial resistance crisis.

Collapse

Shukla D, Martin J, Morcos F, Potoyan DA. Thermal Adaptation of Cytosolic Malate Dehydrogenase Revealed by Deep Learning and Coevolutionary Analysis. J Chem Theory Comput 2025;21:3277-3287. [PMID: 40079215 PMCID: PMC11948321 DOI: 10.1021/acs.jctc.4c01774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2024] [Revised: 03/06/2025] [Accepted: 03/07/2025] [Indexed: 03/14/2025]

NaderiAlizadeh N, Singh R. Aggregating residue-level protein language model embeddings with optimal transport. BIOINFORMATICS ADVANCES 2025;5:vbaf060. [PMID: 40170888 PMCID: PMC11961220 DOI: 10.1093/bioadv/vbaf060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Revised: 02/13/2025] [Accepted: 03/17/2025] [Indexed: 04/03/2025]

Kohout P, Vasina M, Majerova M, Novakova V, Damborsky J, Bednar D, Marek M, Prokop Z, Mazurenko S. Engineering Dehalogenase Enzymes Using Variational Autoencoder-Generated Latent Spaces and Microfluidics. JACS AU 2025;5:838-850. [PMID: 40017771 PMCID: PMC11862945 DOI: 10.1021/jacsau.4c01101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2024] [Revised: 01/23/2025] [Accepted: 01/30/2025] [Indexed: 03/01/2025]

Affiliation(s)

Pavel Kohout Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic
Michal Vasina Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic
Marika Majerova Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic
Veronika Novakova Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic
Jiri Damborsky Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic
David Bednar Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic
Martin Marek Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic
Zbynek Prokop Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic
Stanislav Mazurenko Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic International Clinical Research Centre, St. Anne’s Hospital, Brno 656 91, Czech Republic

Collapse

Adams E, Bai L, Lee M, Yu Y, AlQuraishi M. From Mechanistic Interpretability to Mechanistic Biology: Training, Evaluating, and Interpreting Sparse Autoencoders on Protein Language Models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.06.636901. [PMID: 39975216 PMCID: PMC11839115 DOI: 10.1101/2025.02.06.636901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]

Bowyer S, Allen DJ, Furnham N. Unveiling the ghost: machine learning's impact on the landscape of virology. J Gen Virol 2025;106. [PMID: 39804261 DOI: 10.1099/jgv.0.002067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2025] Open

Ghazikhani H, Butler G. Ion channel classification through machine learning and protein language model embeddings. J Integr Bioinform 2024;21:jib-2023-0047. [PMID: 39572876 PMCID: PMC11698620 DOI: 10.1515/jib-2023-0047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 09/04/2024] [Indexed: 01/06/2025] Open

Dong B, Liu Z, Xu D, Hou C, Dong G, Zhang T, Wang G. SERT-StructNet: Protein secondary structure prediction method based on multi-factor hybrid deep model. Comput Struct Biotechnol J 2024;23:1364-1375. [PMID: 38596312 PMCID: PMC11001767 DOI: 10.1016/j.csbj.2024.03.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 03/20/2024] [Accepted: 03/21/2024] [Indexed: 04/11/2024] Open

Harding-Larsen D, Funk J, Madsen NG, Gharabli H, Acevedo-Rocha CG, Mazurenko S, Welner DH. Protein representations: Encoding biological information for machine learning in biocatalysis. Biotechnol Adv 2024;77:108459. [PMID: 39366493 DOI: 10.1016/j.biotechadv.2024.108459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 09/19/2024] [Accepted: 09/29/2024] [Indexed: 10/06/2024]

Abstract

Enzymes offer a more environmentally friendly and low-impact solution to conventional chemistry, but they often require additional engineering for their application in industrial settings, an endeavour that is challenging and laborious. To address this issue, the power of machine learning can be harnessed to produce predictive models that enable the in silico study and engineering of improved enzymatic properties. Such machine learning models, however, require the conversion of the complex biological information to a numerical input, also called protein representations. These inputs demand special attention to ensure the training of accurate and precise models, and, in this review, we therefore examine the critical step of encoding protein information to numeric representations for use in machine learning. We selected the most important approaches for encoding the three distinct biological protein representations - primary sequence, 3D structure, and dynamics - to explore their requirements for employment and inductive biases. Combined representations of proteins and substrates are also introduced as emergent tools in biocatalysis. We propose the division of fixed representations, a collection of rule-based encoding strategies, and learned representations extracted from the latent spaces of large neural networks. To select the most suitable protein representation, we propose two main factors to consider. The first one is the model setup, which is influenced by the size of the training dataset and the choice of architecture. The second factor is the model objectives such as consideration about the assayed property, the difference between wild-type models and mutant predictors, and requirements for explainability. This review is aimed at serving as a source of information and guidance for properly representing enzymes in future machine learning models for biocatalysis.

Collapse

Kantroo P, Wagner GP, Machta BB. High fitness paths can connect proteins with low sequence overlap. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.13.623265. [PMID: 39605533 PMCID: PMC11601429 DOI: 10.1101/2024.11.13.623265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]

Chen Z, Li H, Zhang C, Zhang H, Zhao Y, Cao J, He T, Xu L, Xiao H, Li Y, Shao H, Yang X, He X, Fang G. Crystal Structure Prediction Using Generative Adversarial Network with Data-Driven Latent Space Fusion Strategy. J Chem Theory Comput 2024;20:9627-9641. [PMID: 39454048 DOI: 10.1021/acs.jctc.4c01096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2024]

Affiliation(s)

Zian Chen Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Haichao Li Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Chen Zhang Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Hongbin Zhang Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Yongxiao Zhao Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Jian Cao Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Tao He Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Lina Xu Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Hongping Xiao Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China
Yi Li College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325035, China
Hezhu Shao College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou 325035, China
Xiaoyu Yang Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China
Xiao He Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, Shanghai Frontiers Science Center of Molecule Intelligent Syntheses, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China Chongqing Key Laboratory of Precision Optics, Chongqing Institute of East China Normal University, Chongqing 401120, China New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 200062, China
Guoyong Fang Key Laboratory of Carbon Materials of Zhejiang Province, College of Chemistry and Materials Engineering, Wenzhou University, Wenzhou 325035, China

Collapse

Alazmi M. Enzyme catalytic efficiency prediction: employing convolutional neural networks and XGBoost. Front Artif Intell 2024;7:1446063. [PMID: 39498388 PMCID: PMC11532030 DOI: 10.3389/frai.2024.1446063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Accepted: 10/07/2024] [Indexed: 11/07/2024] Open

Abstract

Introduction

In the intricate realm of enzymology, the precise quantification of enzyme efficiency, epitomized by the turnover number (k cat), is a paramount yet elusive objective. Existing methodologies, though sophisticated, often grapple with the inherent stochasticity and multifaceted nature of enzymatic reactions. Thus, there arises a necessity to explore avant-garde computational paradigms.

Methods

In this context, we introduce "enzyme catalytic efficiency prediction (ECEP)," leveraging advanced deep learning techniques to enhance the previous implementation, TurNuP, for predicting the enzyme catalase k cat. Our approach significantly outperforms prior methodologies, incorporating new features derived from enzyme sequences and chemical reaction dynamics. Through ECEP, we unravel the intricate enzyme-substrate interactions, capturing the nuanced interplay of molecular determinants.

Results

Preliminary assessments, compared against established models like TurNuP and DLKcat, underscore the superior predictive capabilities of ECEP, marking a pivotal shift in silico enzymatic turnover number estimation. This study enriches the computational toolkit available to enzymologists and lays the groundwork for future explorations in the burgeoning field of bioinformatics. This paper suggested a multi-feature ensemble deep learning-based approach to predict enzyme kinetic parameters using an ensemble convolution neural network and XGBoost by calculating weighted-average of each feature-based model's output to outperform traditional machine learning methods. The proposed "ECEP" model significantly outperformed existing methodologies, achieving a mean squared error (MSE) reduction of 0.35 from 0.81 to 0.46 and R-squared score from 0.44 to 0.54, thereby demonstrating its superior accuracy and effectiveness in enzyme catalytic efficiency prediction.

Discussion

This improvement underscores the model's potential to enhance the field of bioinformatics, setting a new benchmark for performance.

Collapse

Susanty M, Mursalim MKN, Hertadi R, Purwarianti A, LE Rajab T. Leveraging protein language model embeddings and logistic regression for efficient and accurate in-silico acidophilic proteins classification. Comput Biol Chem 2024;112:108163. [PMID: 39098138 DOI: 10.1016/j.compbiolchem.2024.108163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 07/02/2024] [Accepted: 07/24/2024] [Indexed: 08/06/2024]

Thompson M, Martín M, Olmo TS, Rajesh C, Koo PK, Bolognesi B, Lehner B. Massive experimental quantification of amyloid nucleation allows interpretable deep learning of protein aggregation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.13.603366. [PMID: 39071305 PMCID: PMC11275847 DOI: 10.1101/2024.07.13.603366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]

Dong B, Liu Z, Xu D, Hou C, Niu N, Wang G. Impact of Multi-Factor Features on Protein Secondary Structure Prediction. Biomolecules 2024;14:1155. [PMID: 39334921 PMCID: PMC11430196 DOI: 10.3390/biom14091155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Revised: 09/05/2024] [Accepted: 09/10/2024] [Indexed: 09/30/2024] Open

Struski L, Sadowski M, Danel T, Tabor J, Podolak IT. Feature-Based Interpolation and Geodesics in the Latent Spaces of Generative Models. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024;35:12068-12082. [PMID: 37028296 DOI: 10.1109/tnnls.2023.3251848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]

Concha-Eloko R, Stock M, De Baets B, Briers Y, Sanjuán R, Domingo-Calap P, Boeckaerts D. DepoScope: Accurate phage depolymerase annotation and domain delineation using large language models. PLoS Comput Biol 2024;20:e1011831. [PMID: 39102416 PMCID: PMC11326577 DOI: 10.1371/journal.pcbi.1011831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 08/15/2024] [Accepted: 07/20/2024] [Indexed: 08/07/2024] Open

Madan S, Lentzen M, Brandt J, Rueckert D, Hofmann-Apitius M, Fröhlich H. Transformer models in biomedicine. BMC Med Inform Decis Mak 2024;24:214. [PMID: 39075407 PMCID: PMC11287876 DOI: 10.1186/s12911-024-02600-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 07/08/2024] [Indexed: 07/31/2024] Open

Cuturello F, Celoria M, Ansuini A, Cazzaniga A. Enhancing predictions of protein stability changes induced by single mutations using MSA-based Language Models. Bioinformatics 2024;40:btae447. [PMID: 39012369 PMCID: PMC11269464 DOI: 10.1093/bioinformatics/btae447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/19/2024] [Accepted: 07/10/2024] [Indexed: 07/17/2024] Open

Li MM, Huang Y, Sumathipala M, Liang MQ, Valdeolivas A, Ananthakrishnan AN, Liao K, Marbach D, Zitnik M. Contextual AI models for single-cell protein biology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.18.549602. [PMID: 37503080 PMCID: PMC10370131 DOI: 10.1101/2023.07.18.549602] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]

Randall JR, Vieira LC, Wilke CO, Davies BW. Deep mutational scanning and machine learning for the analysis of antimicrobial-peptide features driving membrane selectivity. Nat Biomed Eng 2024;8:842-853. [PMID: 39085646 PMCID: PMC12044605 DOI: 10.1038/s41551-024-01243-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Accepted: 05/12/2024] [Indexed: 08/02/2024]

Norton-Baker B, Denton MCR, Murphy NP, Fram B, Lim S, Erickson E, Gauthier NP, Beckham GT. Enabling high-throughput enzyme discovery and engineering with a low-cost, robot-assisted pipeline. Sci Rep 2024;14:14449. [PMID: 38914665 PMCID: PMC11196671 DOI: 10.1038/s41598-024-64938-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 06/14/2024] [Indexed: 06/26/2024] Open

Martínez Gascueña A, Wu H, Wang R, Owen CD, Hernando PJ, Monaco S, Penner M, Xing K, Le Gall G, Gardner R, Ndeh D, Urbanowicz PA, Spencer DIR, Walsh M, Angulo J, Juge N. Exploring the sequence-function space of microbial fucosidases. Commun Chem 2024;7:137. [PMID: 38890439 PMCID: PMC11189522 DOI: 10.1038/s42004-024-01212-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 05/28/2024] [Indexed: 06/20/2024] Open

Affiliation(s)

Ana Martínez Gascueña The Gut Microbes and Health Institute Strategic Programme, Quadram Institute Bioscience, Norwich Research Park, Norwich, NR4 7UQ, UK
Haiyang Wu The Gut Microbes and Health Institute Strategic Programme, Quadram Institute Bioscience, Norwich Research Park, Norwich, NR4 7UQ, UK GuangDong Engineering Technology Research Center of Enzyme and Biocatalysis, Institute of Biological and Medical Engineering, Guangdong Academy of Sciences, Guangzhou, China
Rui Wang Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China Collaborative Innovation Center of Railway Traffic Safety, Beijing Jiaotong University, Beijing, China School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China
C David Owen Diamond Light Source Ltd, Diamond House, Harwell Science and Innovation Campus, Didcot, OX11 0FA, UK Research Complex at Harwell, Rutherford Appleton Laboratory, Harwell Oxford, Didcot, OX11 0FA, UK
Pedro J Hernando The Gut Microbes and Health Institute Strategic Programme, Quadram Institute Bioscience, Norwich Research Park, Norwich, NR4 7UQ, UK Iceni Glycoscience Ltd., Norwich Research Park, Norwich, NR4 7JG, UK
Serena Monaco School of Pharmacy, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
Matthew Penner Diamond Light Source Ltd, Diamond House, Harwell Science and Innovation Campus, Didcot, OX11 0FA, UK Research Complex at Harwell, Rutherford Appleton Laboratory, Harwell Oxford, Didcot, OX11 0FA, UK
Ke Xing School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China
Gwenaelle Le Gall Norwich Medical School, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
Richard Gardner Ludger Ltd, Culham Science Centre, Abingdon, OX14 3EB, UK
Didier Ndeh The Gut Microbes and Health Institute Strategic Programme, Quadram Institute Bioscience, Norwich Research Park, Norwich, NR4 7UQ, UK University of Dundee, School of Life Sciences, Dundee, DD1 5EH, Scotland, UK
Paulina A Urbanowicz Ludger Ltd, Culham Science Centre, Abingdon, OX14 3EB, UK
Daniel I R Spencer Ludger Ltd, Culham Science Centre, Abingdon, OX14 3EB, UK
Martin Walsh Diamond Light Source Ltd, Diamond House, Harwell Science and Innovation Campus, Didcot, OX11 0FA, UK Research Complex at Harwell, Rutherford Appleton Laboratory, Harwell Oxford, Didcot, OX11 0FA, UK
Jesus Angulo School of Pharmacy, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK Departamento de Química Orgánica, Universidad de Sevilla, 41012, Sevilla, Spain Instituto de Investigaciones Químicas (CSIC-US), 41092, Sevilla, Spain
Nathalie Juge The Gut Microbes and Health Institute Strategic Programme, Quadram Institute Bioscience, Norwich Research Park, Norwich, NR4 7UQ, UK.

Collapse

Fooladi H, Hirte S, Kirchmair J. Quantifying the Hardness of Bioactivity Prediction Tasks for Transfer Learning. J Chem Inf Model 2024;64:4031-4046. [PMID: 38739465 PMCID: PMC11134514 DOI: 10.1021/acs.jcim.4c00160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 04/24/2024] [Accepted: 04/24/2024] [Indexed: 05/16/2024]

Leary AY, Scott D, Gupta NT, Waite JC, Skokos D, Atwal GS, Hawkins PG. Designing meaningful continuous representations of T cell receptor sequences with deep generative models. Nat Commun 2024;15:4271. [PMID: 38769289 PMCID: PMC11106309 DOI: 10.1038/s41467-024-48198-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 04/24/2024] [Indexed: 05/22/2024] Open

García Sánchez N, Ugarte Carro E, Prieto-Santamaría L, Rodríguez-González A. Protein sequence analysis in the context of drug repurposing. BMC Med Inform Decis Mak 2024;24:122. [PMID: 38741115 DOI: 10.1186/s12911-024-02531-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 05/08/2024] [Indexed: 05/16/2024] Open

Hu F, Zhang W, Huang H, Li W, Li Y, Yin P. A Transferability-Based Method for Evaluating the Protein Representation Learning. IEEE J Biomed Health Inform 2024;28:3158-3166. [PMID: 38416611 DOI: 10.1109/jbhi.2024.3370680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2024]

Michael R, Kæstel-Hansen J, Mørch Groth P, Bartels S, Salomon J, Tian P, Hatzakis NS, Boomsma W. A systematic analysis of regression models for protein engineering. PLoS Comput Biol 2024;20:e1012061. [PMID: 38701099 PMCID: PMC11095727 DOI: 10.1371/journal.pcbi.1012061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 05/15/2024] [Accepted: 04/10/2024] [Indexed: 05/05/2024] Open

Vitale R, Bugnon LA, Fenoy EL, Milone DH, Stegmayer G. Evaluating large language models for annotating proteins. Brief Bioinform 2024;25:bbae177. [PMID: 38706315 PMCID: PMC11070647 DOI: 10.1093/bib/bbae177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 03/15/2024] [Accepted: 03/27/2024] [Indexed: 05/07/2024] Open

Chu HY, Fong JHC, Thean DGL, Zhou P, Fung FKC, Huang Y, Wong ASL. Accurate top protein variant discovery via low-N pick-and-validate machine learning. Cell Syst 2024;15:193-203.e6. [PMID: 38340729 DOI: 10.1016/j.cels.2024.01.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 10/11/2023] [Accepted: 01/18/2024] [Indexed: 02/12/2024]

Wang M, Patsenker J, Li H, Kluger Y, Kleinstein S. Language model-based B cell receptor sequence embeddings can effectively encode receptor specificity. Nucleic Acids Res 2024;52:548-557. [PMID: 38109302 PMCID: PMC10810273 DOI: 10.1093/nar/gkad1128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 10/18/2023] [Accepted: 11/11/2023] [Indexed: 12/20/2023] Open

Bravi B. Development and use of machine learning algorithms in vaccine target selection. NPJ Vaccines 2024;9:15. [PMID: 38242890 PMCID: PMC10798987 DOI: 10.1038/s41541-023-00795-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 12/07/2023] [Indexed: 01/21/2024] Open

James JK, Norland K, Johar AS, Kullo IJ. Deep generative models of LDLR protein structure to predict variant pathogenicity. J Lipid Res 2023;64:100455. [PMID: 37821076 PMCID: PMC10696256 DOI: 10.1016/j.jlr.2023.100455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 09/16/2023] [Accepted: 10/05/2023] [Indexed: 10/13/2023] Open

Xie WJ, Warshel A. Harnessing generative AI to decode enzyme catalysis and evolution for enhanced engineering. Natl Sci Rev 2023;10:nwad331. [PMID: 38299119 PMCID: PMC10829072 DOI: 10.1093/nsr/nwad331] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 09/27/2023] [Accepted: 10/13/2023] [Indexed: 02/02/2024] Open

Kouba P, Kohout P, Haddadi F, Bushuiev A, Samusevich R, Sedlar J, Damborsky J, Pluskal T, Sivic J, Mazurenko S. Machine Learning-Guided Protein Engineering. ACS Catal 2023;13:13863-13895. [PMID: 37942269 PMCID: PMC10629210 DOI: 10.1021/acscatal.3c02743] [Citation(s) in RCA: 45] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/20/2023] [Indexed: 11/10/2023]

Affiliation(s)

Petr Kouba Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic Faculty of Electrical Engineering, Czech Technical University in Prague, Technicka 2, 166 27 Prague 6, Czech Republic
Pavel Kohout Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Faraneh Haddadi Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Anton Bushuiev Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
Raman Samusevich Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
Jiri Sedlar Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
Jiri Damborsky Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Tomas Pluskal Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
Josef Sivic Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
Stanislav Mazurenko Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic

Collapse

Markus B, C GC, Andreas K, Arkadij K, Stefan L, Gustav O, Elina S, Radka S. Accelerating Biocatalysis Discovery with Machine Learning: A Paradigm Shift in Enzyme Engineering, Discovery, and Design. ACS Catal 2023;13:14454-14469. [PMID: 37942268 PMCID: PMC10629211 DOI: 10.1021/acscatal.3c03417] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/29/2023] [Accepted: 10/03/2023] [Indexed: 11/10/2023]

Xie WJ, Warshel A. Harnessing Generative AI to Decode Enzyme Catalysis and Evolution for Enhanced Engineering. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.10.561808. [PMID: 37873334 PMCID: PMC10592750 DOI: 10.1101/2023.10.10.561808] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]

Qiu Y, Wei GW. Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models. Brief Bioinform 2023;24:bbad289. [PMID: 37580175 PMCID: PMC10516362 DOI: 10.1093/bib/bbad289] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 07/14/2023] [Accepted: 07/26/2023] [Indexed: 08/16/2023] Open

Randall JR, Vieira LC, Wilke CO, Davies BW. Deep mutational scanning and machine learning uncover antimicrobial peptide features driving membrane selectivity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.28.551017. [PMID: 37547010 PMCID: PMC10402124 DOI: 10.1101/2023.07.28.551017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]

Koludarov I, Senoner T, Jackson TNW, Dashevsky D, Heinzinger M, Aird SD, Rost B. Domain loss enabled evolution of novel functions in the snake three-finger toxin gene superfamily. Nat Commun 2023;14:4861. [PMID: 37567881 PMCID: PMC10421932 DOI: 10.1038/s41467-023-40550-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 07/28/2023] [Indexed: 08/13/2023] Open

Wang H, Fu T, Du Y, Gao W, Huang K, Liu Z, Chandak P, Liu S, Van Katwyk P, Deac A, Anandkumar A, Bergen K, Gomes CP, Ho S, Kohli P, Lasenby J, Leskovec J, Liu TY, Manrai A, Marks D, Ramsundar B, Song L, Sun J, Tang J, Veličković P, Welling M, Zhang L, Coley CW, Bengio Y, Zitnik M. Scientific discovery in the age of artificial intelligence. Nature 2023;620:47-60. [PMID: 37532811 DOI: 10.1038/s41586-023-06221-2] [Citation(s) in RCA: 271] [Impact Index Per Article: 135.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 05/16/2023] [Indexed: 08/04/2023]

Affiliation(s)

Hanchen Wang Department of Engineering, University of Cambridge, Cambridge, UK Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA Department of Research and Early Development, Genentech Inc, South San Francisco, CA, USA Department of Computer Science, Stanford University, Stanford, CA, USA
Tianfan Fu Department of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, USA
Yuanqi Du Department of Computer Science, Cornell University, Ithaca, NY, USA
Wenhao Gao Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
Kexin Huang Department of Computer Science, Stanford University, Stanford, CA, USA
Ziming Liu Department of Physics, Massachusetts Institute of Technology, Cambridge, MA, USA
Payal Chandak Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA, USA
Shengchao Liu Mila - Quebec AI Institute, Montreal, Quebec, Canada Université de Montréal, Montreal, Quebec, Canada
Peter Van Katwyk Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, RI, USA Data Science Institute, Brown University, Providence, RI, USA
Andreea Deac Mila - Quebec AI Institute, Montreal, Quebec, Canada Université de Montréal, Montreal, Quebec, Canada
Anima Anandkumar Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA NVIDIA, Santa Clara, CA, USA
Karianne Bergen Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, RI, USA Data Science Institute, Brown University, Providence, RI, USA
Carla P Gomes Department of Computer Science, Cornell University, Ithaca, NY, USA
Shirley Ho Center for Computational Astrophysics, Flatiron Institute, New York, NY, USA Department of Astrophysical Sciences, Princeton University, Princeton, NJ, USA Department of Physics, Carnegie Mellon University, Pittsburgh, PA, USA Department of Physics and Center for Data Science, New York University, New York, NY, USA
Pushmeet Kohli Google DeepMind, London, UK
Joan Lasenby Department of Engineering, University of Cambridge, Cambridge, UK
Jure Leskovec Department of Computer Science, Stanford University, Stanford, CA, USA
Tie-Yan Liu Microsoft Research, Beijing, China
Arjun Manrai Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Debora Marks Department of Systems Biology, Harvard Medical School, Boston, MA, USA Broad Institute of MIT and Harvard, Cambridge, MA, USA
Bharath Ramsundar Deep Forest Sciences, Palo Alto, CA, USA
Le Song BioMap, Beijing, China Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates
Jimeng Sun University of Illinois at Urbana-Champaign, Champaign, IL, USA
Jian Tang Mila - Quebec AI Institute, Montreal, Quebec, Canada HEC Montréal, Montreal, Quebec, Canada CIFAR AI Chair, Toronto, Ontario, Canada
Petar Veličković Google DeepMind, London, UK Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
Max Welling University of Amsterdam, Amsterdam, Netherlands Microsoft Research Amsterdam, Amsterdam, Netherlands
Linfeng Zhang DP Technology, Beijing, China AI for Science Institute, Beijing, China
Connor W Coley Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
Yoshua Bengio Mila - Quebec AI Institute, Montreal, Quebec, Canada Université de Montréal, Montreal, Quebec, Canada
Marinka Zitnik Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA. Broad Institute of MIT and Harvard, Cambridge, MA, USA. Harvard Data Science Initiative, Cambridge, MA, USA. Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA.

Collapse

Qiu Y, Wei GW. Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models. ARXIV 2023:arXiv:2307.14587v1. [PMID: 37547662 PMCID: PMC10402185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]

Saar KL, Qian D, Good LL, Morgunov AS, Collepardo-Guevara R, Best RB, Knowles TPJ. Theoretical and Data-Driven Approaches for Biomolecular Condensates. Chem Rev 2023;123:8988-9009. [PMID: 37171907 PMCID: PMC10375482 DOI: 10.1021/acs.chemrev.2c00586] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Indexed: 05/14/2023]