1
|
Deng P, Zhang Y, Xu L, Lyu J, Li L, Sun F, Zhang WB, Gao H. Computational discovery and systematic analysis of protein entangling motifs in nature: from algorithm to database. Chem Sci 2025; 16:8998-9009. [PMID: 40271025 PMCID: PMC12013726 DOI: 10.1039/d4sc08649j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2024] [Accepted: 03/29/2025] [Indexed: 04/25/2025] Open
Abstract
Nontrivial protein topology has the potential to revolutionize protein engineering by enabling the manipulation of proteins' stability and dynamics. However, the rarity of topological proteins in nature poses a challenge for their design, synthesis and application, primarily due to the limited number of available entangling motifs as synthetic templates. Discovering these motifs is particularly difficult, as entanglement is a subtle structural feature that is not readily discernible from protein sequences. In this study, we developed a streamlined workflow enabling efficient and accurate identification of structurally reliable and applicable entangling motifs from protein sequences. Through this workflow, we automatically curated a database of 1115 entangling protein motifs from over 100 thousand sequences in the UniProt Knowledgebase. In our database, 73.3% of C2 entangling motifs and 80.1% of C3 entangling motifs exhibited low structural similarity to known protein structures. The entangled structures in the database were categorized into different groups and their functional and biological significance were analyzed. The results were summarized in an online database accessible through a user-friendly web platform, providing researchers with an expanded toolbox of entangling motifs. This resource is poised to significantly advance the field of protein topology engineering and inspire new research directions in protein design and application.
Collapse
Affiliation(s)
- Puqing Deng
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology Clear Water Bay Hong Kong
| | - Yuxuan Zhang
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology Clear Water Bay Hong Kong
| | - Lianjie Xu
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University Beijing 100871 P. R. China
| | - Jinyu Lyu
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology Clear Water Bay Hong Kong
| | - Linyan Li
- Department of Data Science, City University of Hong Kong Kowloon Hong Kong
| | - Fei Sun
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology Clear Water Bay Hong Kong
| | - Wen-Bin Zhang
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University Beijing 100871 P. R. China
- AI for Science (AI4S)-Preferred Program, Shenzhen Graduate School, Peking University Shenzhen 518055 P. R. China
| | - Hanyu Gao
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology Clear Water Bay Hong Kong
| |
Collapse
|
2
|
Deng P, Xu L, Wei Y, Sun F, Li L, Zhang WB, Gao H. Deep Learning-Assisted Discovery of Protein Entangling Motifs. Biomacromolecules 2025; 26:1520-1529. [PMID: 39937127 DOI: 10.1021/acs.biomac.4c01243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2025]
Abstract
Natural topological proteins exhibit unique properties including enhanced stability, controlled quaternary structures, and dynamic switching properties, highlighting topology as a unique dimension in protein engineering. Although artificial design and synthesis of topological proteins have achieved certain success, their diversity and complexity remain rather limited due to the scarcity of available entangling motifs essential for the construction of nontrivial protein topologies. In this work, we developed a deep-learning model to predict the entanglement features of a homodimer based solely on its amino acid sequence via the Gauss linking number matrices. The model achieved a search speed that was dozens of times faster than AlphaFold-Multimer, while maintaining comparable mean squared error. It was used to screen for entangling motifs from the genome of a hyperthermophilic archaeon. We demonstrated the effectiveness of our model by successful wet-lab synthesis of protein catenanes using two candidate entangling motifs. These findings show the great potential of our model for advancing the design and synthesis of novel topological proteins.
Collapse
Affiliation(s)
- Puqing Deng
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Clear Water Bay 999077, Hong Kong
| | - Lianjie Xu
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, P. R. China
| | - Ying Wei
- College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, P. R. China
| | - Fei Sun
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Clear Water Bay 999077, Hong Kong
| | - Linyan Li
- Department of Data Science, City University of Hong Kong, Kowloon 999077, Hong Kong
| | - Wen-Bin Zhang
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, P. R. China
- AI for Science (AI4S)-Preferred Program, Shenzhen Graduate School, Peking University, Shenzhen 518055, P. R. China
| | - Hanyu Gao
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Clear Water Bay 999077, Hong Kong
| |
Collapse
|
3
|
Qu Z, Xu L, Jiang F, Liu Y, Zhang WB. Folds from fold: Exploring topological isoforms of a single-domain protein. Proc Natl Acad Sci U S A 2024; 121:e2407355121. [PMID: 39405345 PMCID: PMC11513978 DOI: 10.1073/pnas.2407355121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Accepted: 09/06/2024] [Indexed: 10/30/2024] Open
Abstract
Expanding the protein fold space beyond linear chains is of fundamental significance, yet remains largely unexplored. Herein, we report the creation of seven topological isoforms (i.e., linear, cyclic, knot, lasso, pseudorotaxane, and catenane) from a single protein fold precursor by rewiring the connectivity of secondary structure elements of the SpyTag-SpyCatcher complex and mutating the reactive residue on SpyTag to abolish the isopeptide bonding. These topological isoforms can be directly expressed in cells. Their topologies were confirmed by combined techniques of proteolytic digestion, fluorescence correlation spectroscopy (FCS), size-exclusion chromatography (SEC), and topological transformation. To study the effects of topology on their structures and properties, their biophysical properties were characterized by differential scanning calorimetry (DSC), heteronuclear single quantum coherence nuclear magnetic resonance spectroscopy (HSQC-NMR), and circular dichroism (CD) spectroscopy. Molecular dynamics (MD) simulations were further performed to reveal the atomic details of structural changes upon unfolding. Both experimental and simulation results suggest that they share a similar, well-folded hydrophobic core but exhibit distinct folding/unfolding dynamic behaviors. These results shed light onto the folding landscape of topological isoforms derived from the same protein fold. As a model system, this work improves our understanding of protein structure and dynamics beyond linear chains and suggests that protein folds are highly amenable to topological variation.
Collapse
Affiliation(s)
- Zhiyu Qu
- Beijing National Laboratory for Molecular Sciences, Department of Polymer Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing100871, People’s Republic of China
- Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, Peking University, Beijing100871, People’s Republic of China
| | - Lianjie Xu
- Beijing National Laboratory for Molecular Sciences, Department of Polymer Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing100871, People’s Republic of China
- Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, Peking University, Beijing100871, People’s Republic of China
| | - Fengyi Jiang
- Beijing National Laboratory for Molecular Sciences, Department of Polymer Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing100871, People’s Republic of China
- Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, Peking University, Beijing100871, People’s Republic of China
| | - Yuan Liu
- Beijing National Laboratory for Molecular Sciences, Department of Polymer Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing100871, People’s Republic of China
| | - Wen-Bin Zhang
- Beijing National Laboratory for Molecular Sciences, Department of Polymer Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing100871, People’s Republic of China
- Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, Peking University, Beijing100871, People’s Republic of China
- Artificial Intelligence for Science-Preferred Program, Shenzhen Graduate School, Peking University, Shenzhen518055, People’s Republic of China
| |
Collapse
|
4
|
Liu Y, Tian X, Zhang F, Zhang WB. Probing the Topological Effects on Stability Enhancement and Therapeutic Performance of Protein Bioconjugates: Tadpole, Macrocycle versus Figure-of-Eight. Adv Healthc Mater 2024:e2400466. [PMID: 39091049 DOI: 10.1002/adhm.202400466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 07/22/2024] [Indexed: 08/04/2024]
Abstract
Chemical topology provides a unique dimension for making therapeutic protein bioconjugates with native structure and intact function, yet the effects of topology remain elusive. Herein, the design, synthesis, and characterization of therapeutic protein bioconjugates in three topologies (i.e., tadpole, macrocycle, and figure-of-eight), are reported. The interferon α2b (IFN) and albumin binding domain (ABD) are selected as the model proteins for bioconjugation and proof-of-concept. The biosynthesis of these topological isoforms is accomplished via direct expression in cells using SpyTag-SpyCatcher chemistry and/or split-intein-mediated ligation for topology diversification. The corresponding topologies are proven with combined techniques of LC-MS, SDS-PAGE, and controlled proteolytic digestion. While the properties of these topological isoforms are similar in most cases, the figure-of-eight-shaped bioconjugate, f8-IFN-ABD, exhibits the best thermal stability and anti-aggregation properties along with prolonged half-life and enhanced tumor retention relative to the tadpole-shaped control, tadp-IFN-ABD, and the macrocyclic control, c-IFN-ABD, showcasing considerable topological effects. The work expands the topological diversity of proteins and demonstrates the potential advantages of leveraging chemical topology for functional benefits beyond multi-function integration in protein therapeutics.
Collapse
Affiliation(s)
- Yajie Liu
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, P. R. China
| | - Xibao Tian
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, P. R. China
| | - Fan Zhang
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, P. R. China
| | - Wen-Bin Zhang
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, P. R. China
| |
Collapse
|
5
|
Castells-Graells R, Yeates TO. Making topological protein links using enzymatic reactions. Natl Sci Rev 2024; 11:nwae071. [PMID: 38572076 PMCID: PMC10990160 DOI: 10.1093/nsr/nwae071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 02/25/2024] [Indexed: 04/05/2024] Open
Affiliation(s)
- Roger Castells-Graells
- Department of Chemistry and Biochemistry, University of California, USA
- UCLA-DOE Institute for Genomics and Proteomics, USA
| | - Todd O Yeates
- Department of Chemistry and Biochemistry, University of California, USA
- UCLA-DOE Institute for Genomics and Proteomics, USA
| |
Collapse
|